We introduce a new algorithm for sampling under novel constraints from pre-trained diffusion models.
Instead of performing gradient descent steps (as in DPS), which require expensive backward passes through the denoiser network, we propose inexact Newton steps, which can be computed with forward passes alone and produce results of comparable quality.
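The computational trick can be illustrated with a toy example: when the denoiser Jacobian is symmetric, the backward-mode product J&#8314;v that gradient descent needs can be replaced by a forward-difference Jacobian-vector product, costing only one extra forward pass. The sketch below is our own illustration (the `forward_only_direction` helper and the toy denoiser are assumptions for demonstration, not the repository's code):

```python
import torch

def forward_only_direction(denoiser, x, v, eps=1e-3):
    # Finite-difference JVP: (D(x + eps*v) - D(x)) / eps ~= J(x) @ v.
    # If the denoiser Jacobian is (approximately) symmetric, this equals
    # the VJP J^T v that gradient descent would need a backward pass for.
    return (denoiser(x + eps * v) - denoiser(x)) / eps

# Toy denoiser with an exactly symmetric Jacobian (a linear gradient field)
torch.manual_seed(0)
A = torch.randn(8, 8)
W = 0.5 * (A + A.T)              # symmetric weight => symmetric Jacobian
denoiser = lambda x: x - W @ x

x = torch.randn(8, requires_grad=True)
v = torch.randn(8)

# Backward-pass VJP (what DPS-style gradient descent computes)
out = denoiser(x)
vjp = torch.autograd.grad(out, x, grad_outputs=v)[0]

# Forward-only JVP (the forward-pass-only ingredient)
jvp = forward_only_direction(denoiser, x.detach(), v)
print(torch.allclose(vjp, jvp, atol=1e-3))  # True for symmetric Jacobians
```

For a real denoiser the Jacobian is only approximately symmetric, which is why the resulting step is an *inexact* Newton direction.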
| Masked image | Inpainting using Stable Diffusion (~15s) |
|---|---|
| ![]() | ![]() |
We provide two implementations of the proposed inexact Newton sampling algorithm for linear and non-linear tasks:
- In `stable-diffusion` we provide an implementation based on the LDM repository.
  - `inpaint.ipynb` performs inpainting on a given image and mask.
  - `superres.ipynb` performs super-resolution on an image for a given downsampling rate.
  - `style.ipynb` generates an image from a given caption, following the style provided in the reference image. We utilize the second-layer features from a CLIP ViT-B/16 to compare the style between the generated and reference images.
- In `diffusers` we provide an implementation using the diffusers library.
  - `inpaint.ipynb` performs inpainting on a given image and mask.
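For the linear tasks above, the measurement operator is just a simple forward map; the sketch below shows what such operators typically look like (the names `inpaint_A` / `superres_A` are hypothetical, not the notebooks' API):

```python
import torch
import torch.nn.functional as F

def inpaint_A(x, mask):
    """Inpainting measurement: keep observed pixels (mask == 1)."""
    return x * mask

def superres_A(x, factor):
    """Super-resolution measurement: average-pool downsampling."""
    return F.avg_pool2d(x, factor)

x = torch.randn(1, 3, 64, 64)
mask = (torch.rand(1, 1, 64, 64) > 0.5).float()
y_inp = inpaint_A(x, mask)   # same shape as x, masked-out pixels zeroed
y_lr = superres_A(x, 4)      # downsampled by a factor of 4
print(y_lr.shape)            # torch.Size([1, 3, 16, 16])
```

Constrained sampling then seeks a latent whose decoded image matches the measurement `y` under the chosen operator.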
We have also experimented with rectified flow models (e.g., InstaFlow, Stable Diffusion 3). The extension is straightforward, and we will add a code implementation for such models as well.
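For context, rectified flow models replace the noise predictor with a velocity field that is integrated with simple Euler steps; a constrained correction can be applied after each step in the same spirit. A minimal sketch with a toy velocity field (our own illustration, not the forthcoming implementation):

```python
import torch

def euler_rectified_flow(velocity, x0, steps=10):
    """Integrate dx/dt = v(x, t) from t=0 (noise) to t=1 (data).
    A constrained correction would be inserted after each Euler step."""
    x, dt = x0, 1.0 / steps
    for i in range(steps):
        t = torch.full((x.shape[0],), i * dt)
        x = x + dt * velocity(x, t)
    return x

# Toy linear velocity field: contracts the state by 0.9 per step
v = lambda x, t: -x
x = euler_rectified_flow(v, torch.ones(2, 4))
print(x[0, 0].item())  # ~ 0.9**10 ~ 0.349
```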
In `mnist/train_diffusion.ipynb` we showcase the comparison between the inexact and exact Newton steps on a small-scale MNIST diffusion model.
In `stable-diffusion/jacobian_exact_vs_gd.ipynb` we demonstrate the qualitative differences between the proposed inexact Newton step and gradient descent.
In theory, the denoiser Jacobian should be symmetric, making the two update directions equivalent. In practice, we find fundamental differences between them: the inexact Newton direction retains shapes better and shows stronger global coherence.
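The (a)symmetry can be probed numerically by comparing the forward-mode product Jv against the backward-mode product J&#8314;v along the same direction; the `jacobian_asymmetry` helper below is our own illustration (not the notebook's code):

```python
import torch
from torch.autograd.functional import jvp, vjp

def jacobian_asymmetry(f, x, v):
    """Relative gap between J v (forward mode) and J^T v (backward mode).
    Zero along v iff the Jacobian acts symmetrically on v."""
    _, Jv = jvp(f, x, v)
    _, JTv = vjp(f, x, v)
    return ((Jv - JTv).norm() / JTv.norm()).item()

torch.manual_seed(0)
A = torch.randn(6, 6)
sym = lambda x: (A + A.T) @ x    # symmetric Jacobian
skew = lambda x: (A - A.T) @ x   # maximally asymmetric Jacobian

x, v = torch.randn(6), torch.randn(6)
print(jacobian_asymmetry(sym, x, v))   # ~ 0
print(jacobian_asymmetry(skew, x, v))  # = 2, since J^T = -J
```

Applied to a real denoiser, this measures how far the network deviates from the theoretically symmetric Jacobian.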
We provide the code for our convergence analysis in `stable-diffusion/jacobian_analysis.ipynb`. We use Arnoldi iteration to compute the eigenvalues of the Jacobian (and of its symmetric and skew-symmetric parts).
Using the maximum computed eigenvalue, we test the convergence of different learning rates, showing that our dynamically chosen step size stays within the convergent regime.
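Arnoldi iteration needs only matrix-vector products, so it applies to a Jacobian accessed through JVPs without ever materializing the matrix. The standalone NumPy sketch below illustrates the idea on a diagonal test operator (our own illustration, not the notebook's code):

```python
import numpy as np

def arnoldi_eigs(matvec, n, k=30, seed=0):
    """Arnoldi iteration: estimate leading eigenvalues of an n x n
    linear operator available only through matrix-vector products."""
    rng = np.random.default_rng(seed)
    Q = np.zeros((n, k + 1))
    H = np.zeros((k + 1, k))
    q = rng.standard_normal(n)
    Q[:, 0] = q / np.linalg.norm(q)
    for j in range(k):
        w = matvec(Q[:, j])
        for i in range(j + 1):            # Gram-Schmidt against the basis
            H[i, j] = Q[:, i] @ w
            w -= H[i, j] * Q[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        if H[j + 1, j] < 1e-12:           # invariant subspace found
            break
        Q[:, j + 1] = w / H[j + 1, j]
    # Ritz values of the small Hessenberg matrix approximate the
    # operator's extremal eigenvalues.
    return np.linalg.eigvals(H[:j + 1, :j + 1])

A = np.diag(np.arange(1.0, 51.0))         # known spectrum 1..50
eigs = arnoldi_eigs(lambda v: A @ v, 50)
print(np.max(eigs.real))                  # ~ 50, the largest eigenvalue
```

For a symmetric operator, step sizes below 2/&lambda;_max keep a gradient-style update on the induced quadratic stable; a bound of this kind is what motivates testing learning rates against the maximum computed eigenvalue.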
In `stable-diffusion/superres_vae_newton.ipynb` we show an implementation of super-resolution that also avoids backpropagating through the Stable Diffusion decoder, using a second Newton approximation in the VAE latent space.
This is a central idea of our paper ZoomLDM, where backpropagating through the VAE is prohibitive due to memory constraints.
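The intuition behind a Newton approximation in VAE space can be seen with a linear toy "VAE", where re-encoding an image-space correction recovers the least-squares latent update with forward passes only. This is a sketch under our own simplifying assumptions (a real decoder/encoder pair is only approximately inverse, and nonlinear):

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((16, 4))   # toy linear "decoder": x = W z
Wp = np.linalg.pinv(W)             # toy "encoder": approximate inverse of W

z = rng.standard_normal(4)         # current latent
y = rng.standard_normal(16)        # image-space target

# Newton-style update without decoder backprop: decode, apply the
# image-space correction, then re-encode the corrected image.
x = W @ z
z_new = Wp @ (x + (y - x))

# For a linear decoder this lands exactly on the least-squares solution.
z_star = np.linalg.lstsq(W, y, rcond=None)[0]
print(np.allclose(z_new, z_star))  # True
```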
@article{graikos2024fast,
  title={Fast constrained sampling in pre-trained diffusion models},
  author={Graikos, Alexandros and Jojic, Nebojsa and Samaras, Dimitris},
  journal={arXiv preprint arXiv:2410.18804},
  year={2024}
}