Annealed Importance Sampling

Radford M. Neal

arXiv: physics/9803008 · v2 · submitted 1998-03-08 · physics.comp-ph · physics.data-an



keywords: importance, sampling, chain, markov, annealed, annealing, allows, constants


Abstract

Simulated annealing - moving from a tractable distribution to a distribution of interest via a sequence of intermediate distributions - has traditionally been used as an inexact method of handling isolated modes in Markov chain samplers. Here, it is shown how one can use the Markov chain transitions for such an annealing sequence to define an importance sampler. The Markov chain aspect allows this method to perform acceptably even for high-dimensional problems, where finding good importance sampling distributions would otherwise be very difficult, while the use of importance weights ensures that the estimates found converge to the correct values as the number of annealing runs increases. This annealed importance sampling procedure resembles the second half of the previously-studied tempered transitions, and can be seen as a generalization of a recently-proposed variant of sequential importance sampling. It is also related to thermodynamic integration methods for estimating ratios of normalizing constants. Annealed importance sampling is most attractive when isolated modes are present, or when estimates of normalizing constants are required, but it may also be more generally useful, since its independent sampling allows one to bypass some of the problems of assessing convergence and autocorrelation in Markov chain samplers.
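The procedure the abstract describes can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the one-dimensional Gaussian start and target densities (`log_f0`, `log_f1`), the geometric path between them, the linear `betas` schedule, the random-walk Metropolis transitions, and the function names `ais` and `log_mean_exp`. Each annealing run starts from a draw of the tractable distribution, accumulates a log importance weight as it moves through the intermediate distributions, and the average weight estimates the ratio of normalizing constants.

```python
import numpy as np


def log_f0(x):
    # Tractable starting density (assumed): standard normal, normalized, so Z_0 = 1.
    return -0.5 * x**2 - 0.5 * np.log(2.0 * np.pi)


def log_f1(x):
    # Unnormalized density of interest (assumed): Gaussian shape, mean 3, std 0.5.
    # Its true normalizing constant is 0.5 * sqrt(2 * pi).
    return -0.5 * ((x - 3.0) / 0.5) ** 2


def log_fj(x, beta):
    # Intermediate distribution on a geometric path: f0^(1 - beta) * f1^beta.
    return (1.0 - beta) * log_f0(x) + beta * log_f1(x)


def log_mean_exp(a):
    # Numerically stable log of the mean of exp(a).
    m = a.max()
    return m + np.log(np.mean(np.exp(a - m)))


def ais(n_runs=2000, n_temps=500, n_mh=3, step=0.5, seed=0):
    """Run n_runs independent annealing runs in parallel.

    Returns the final points and their log importance weights.
    """
    rng = np.random.default_rng(seed)
    betas = np.linspace(0.0, 1.0, n_temps + 1)
    x = rng.standard_normal(n_runs)  # exact draws from the starting density
    log_w = np.zeros(n_runs)
    for b_prev, b in zip(betas[:-1], betas[1:]):
        # Weight update: ratio of consecutive intermediate densities at the current point.
        log_w += (b - b_prev) * (log_f1(x) - log_f0(x))
        # Markov chain transition that leaves the distribution at beta = b invariant.
        for _ in range(n_mh):
            prop = x + step * rng.standard_normal(n_runs)
            log_ratio = log_fj(prop, b) - log_fj(x, b)
            accept = np.log(rng.random(n_runs)) < log_ratio
            x = np.where(accept, prop, x)
    return x, log_w


if __name__ == "__main__":
    x, log_w = ais()
    z_hat = np.exp(log_mean_exp(log_w))      # estimate of Z_1 / Z_0
    w = np.exp(log_w - log_w.max())
    mean_hat = np.sum(w * x) / np.sum(w)     # weighted estimate of the target mean
    print("estimated Z:", z_hat, "true Z:", 0.5 * np.sqrt(2.0 * np.pi))
    print("estimated mean:", mean_hat)
```

In this sketch the runs are independent of one another, so the spread of the weights across runs gives a direct indication of estimator quality. A finer `betas` schedule or more Metropolis steps per temperature should reduce that spread, at proportionally higher cost.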


Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Tempered Guided Diffusion
stat.ML · 2026-05 · unverdicted · novelty 7.0

Tempered Guided Diffusion uses annealed SMC to produce consistent particle approximations to the posterior for training-free conditional diffusion sampling, outperforming independent guided trajectories in experiments.

Scalable Inference-Time Annealing with Surrogate Likelihood Estimators
cs.LG · 2026-05 · unverdicted · novelty 6.0

SITA performs scalable inference-time annealing of flow-based models on molecular systems by substituting energy-based surrogate likelihoods for divergence-based importance weights.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing
cs.LG · 2026-05 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Scaling flow-based approaches for topology sampling in $\mathrm{SU}(3)$ gauge theory
hep-lat · 2025-10 · unverdicted · novelty 6.0

Out-of-equilibrium simulations with open-to-periodic boundary switching plus a tailored stochastic normalizing flow enable efficient topology sampling in the continuum limit of four-dimensional SU(3) Yang-Mills theory.