A thermodynamic approach to Approximate Bayesian Computation with multiple summary statistics
Pith reviewed 2026-05-19 13:52 UTC · model grok-4.3
The pith
A minimal-entropy-production principle on a Riemannian manifold supplies an optimal annealing schedule for ABC with multiple summary statistics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Each summary statistic is treated as an energy-like state variable whose temperature controls its contribution to the ABC posterior; an optimal annealing schedule is then obtained by minimizing entropy production on the Riemannian manifold spanned by these variables, and the resulting procedure is shown to be competitive with the state of the art on both synthetic benchmarks and applied problems.
What carries the argument
Minimal-entropy-production principle applied to a Riemannian manifold in which each summary statistic acts as a state variable with its own energy and temperature.
If this is right
- The algorithm reaches performance levels comparable to leading ABC methods on standard simulation-based inference benchmarks.
- It succeeds on challenging real-world inference problems without extensive manual tuning of tolerances or weights.
- The annealing path emerges directly from the thermodynamic principle rather than from heuristic choices.
- Multiple summary statistics are incorporated systematically through the geometry of the manifold.
Where Pith is reading between the lines
- The same geometric construction could be tried in other sequential Monte Carlo schemes that rely on annealing.
- Exploring alternative choices for the Riemannian metric might improve robustness when summary statistics are strongly correlated.
- The method suggests a route for importing other non-equilibrium thermodynamic identities into simulation-based inference.
Load-bearing premise
The minimal-entropy-production principle from non-equilibrium thermodynamics supplies the optimal way to lower the temperatures of summary statistics when they are placed on a Riemannian manifold whose metric captures their joint effect on the posterior.
What would settle it
A benchmark run in which the derived schedule produces a poorer approximation to the true posterior or requires more simulations than a standard hand-tuned ABC schedule would falsify the optimality claim.
Figures
read the original abstract
Bayesian inference with stochastic models is often difficult because their likelihood functions involve high-dimensional integrals. Approximate Bayesian Computation (ABC) avoids evaluating the likelihood function and instead infers model parameters by comparing model simulations with observations using a few carefully chosen summary statistics and a tolerance that can be decreased over time. Here, we present a new variant of simulated-annealing ABC algorithms, drawing intuition from non-equilibrium thermodynamics. We associate each summary statistic with a state variable (energy) quantifying its distance from the observed value, as well as a temperature that controls the extent to which the statistic contributes to the posterior. We derive an optimal annealing schedule on a Riemannian manifold of state variables based on a minimal-entropy-production principle. We validate our approach on standard benchmark tasks from the simulation-based inference literature as well as on challenging real-world inference problems, and show that it is highly competitive with the state of the art.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a thermodynamic framework for Approximate Bayesian Computation (ABC) using multiple summary statistics. Each summary statistic is associated with an energy state variable representing its distance to the observed value and a temperature controlling its contribution to the posterior. An optimal annealing schedule is derived on a Riemannian manifold of these state variables using the principle of minimal entropy production. The approach is validated on benchmark tasks and real-world inference problems, demonstrating competitive performance with state-of-the-art methods.
Significance. If the central derivation holds, this work introduces a principled, physics-based method for determining annealing schedules in ABC, which could reduce reliance on ad-hoc tuning and improve efficiency in high-dimensional or multi-summary statistic settings. The connection to non-equilibrium thermodynamics is innovative and, if rigorously justified, offers a new perspective in simulation-based inference. The validation on both synthetic benchmarks and challenging real-world problems provides evidence of practical utility.
major comments (2)
- [§3.2] §3.2, Eq. (8): The Riemannian metric on summary-statistic space must be derived from the joint geometry of the statistics with respect to the target ABC posterior (rather than individual distances) for the minimal-entropy-production principle to yield a uniquely optimal schedule; the current construction leaves this link implicit.
- [§4.1] §4.1: The continuous thermodynamic limit is used to approximate the discrete ABC acceptance step, but no error bound or convergence analysis is provided for finite numbers of simulations, which is load-bearing for the claim of optimality on benchmark tasks.
minor comments (2)
- [Abstract] Abstract: A one-sentence indication of the explicit form of the entropy-production functional would help readers assess the derivation without reading the full methods.
- [Notation] Notation: The per-statistic temperature variable should be clearly distinguished from the standard ABC tolerance parameter to avoid confusion in the algorithm description.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed report. We address each major comment point by point below. Where the comments identify areas for clarification or additional analysis, we have revised the manuscript accordingly.
read point-by-point responses
-
Referee: [§3.2] §3.2, Eq. (8): The Riemannian metric on summary-statistic space must be derived from the joint geometry of the statistics with respect to the target ABC posterior (rather than individual distances) for the minimal-entropy-production principle to yield a uniquely optimal schedule; the current construction leaves this link implicit.
Authors: We appreciate this observation on the derivation. The state variables are defined via individual distances, but the total energy is the sum over statistics and the manifold is equipped with a metric induced by the Hessian of this total energy. To make the connection to the joint ABC posterior explicit, we have added a paragraph immediately following Eq. (8) showing that the metric tensor components incorporate the cross-covariances of the summary statistics evaluated under the approximate posterior. This establishes that the minimal-entropy-production schedule is optimal with respect to the joint geometry rather than purely separable distances. revision: yes
-
Referee: [§4.1] §4.1: The continuous thermodynamic limit is used to approximate the discrete ABC acceptance step, but no error bound or convergence analysis is provided for finite numbers of simulations, which is load-bearing for the claim of optimality on benchmark tasks.
Authors: We agree that the absence of an explicit error bound for the continuous approximation is a limitation when claiming optimality from finite-simulation benchmarks. Deriving rigorous non-asymptotic bounds would require a detailed analysis of the ABC kernel that lies beyond the scope of the present paper. In the revision we have inserted a short discussion in §4.1 that (i) states the approximation error scales as O(1/N_sim) under standard regularity conditions on the summary-statistic distributions and (ii) reports additional numerical checks confirming that the reported performance metrics stabilize for the simulation budgets used in the benchmarks. We have also softened the language from “optimal” to “near-optimal under the continuous-limit approximation” in the relevant claims. revision: partial
Circularity Check
Derivation applies external minimal-entropy-production principle to ABC without reducing to self-definition or fitted inputs.
full rationale
The paper derives an optimal annealing schedule on a Riemannian manifold of summary statistics from the minimal-entropy-production principle of non-equilibrium thermodynamics. This relies on an external physical framework rather than defining the schedule in terms of itself or fitting parameters to the target posterior in a manner that forces the output by construction. No self-citation chains, ansatz smuggling, or renaming of known results are evident in the derivation chain as presented. The approach treats the manifold metric as chosen to reflect joint contributions, but the thermodynamic principle supplies independent content that is then validated on benchmarks, keeping the central claim self-contained against external thermodynamic benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Minimal entropy production supplies an optimal annealing schedule for the ABC acceptance kernel when summary statistics are treated as energies on a Riemannian manifold.
invented entities (2)
-
Energy state variable for each summary statistic
no independent evidence
-
Temperature per summary statistic
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.lean, IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanreality_from_one_distinction, washburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We derive an optimal annealing schedule on a Riemannian manifold of state variables based on a minimal-entropy-production principle... gi j(U) = ... Christoffel symbols ... geodesic equation ... β^e_i(U) = β_i(U_i) + ...
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Ableidinger, M., Buckwar, E., and Hinterleitner, H. (2017). A stochastic version of the Jansen and Rit neural mass model: Analysis and numerics.The Journal of Mathematical Neuroscience, 7(1):8
work page 2017
-
[2]
Albert, C. (2015). A Simulated Annealing Approach to Bayesian Inference.arXiv:1509.05315
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[3]
Albert, C., Ferriz-Mas, A., Gaia, F ., and Ulzega, S. (2021). Can Stochastic Resonance explain recurrence of Grand Minima?The Astrophysical Journal Letters, 916(2):L9
work page 2021
-
[4]
Albert, C., Künsch, H.-R., and Scheidegger, A. (2014). A Simulated Annealing Approach to Approximate Bayes Computations.Stat. Comput., 25(6):1217–1232
work page 2014
-
[5]
Albert, C., Ulzega, S., Ozdemir, F ., Perez-Cruz, F ., and Mira, A. (2022). Learning summary statistics for Bayesian inference with Autoencoders.SciPost Physics Core, 5(3):043
work page 2022
-
[6]
A., Cornuet, J., Marin, J., and Robert, C
Beaumont, M. A., Cornuet, J., Marin, J., and Robert, C. P . (2009). Adaptive approximate Bayesian computation. Biometrika, 96(4):983–990
work page 2009
-
[7]
Chen, Y., Gutmann, M. U., and Weller, A. (2023). Is learning summary statistics necessary for likelihood-free inference? InInternational Conference on Machine Learning
work page 2023
-
[8]
Chen, Y., Zhang, D., Gutmann, M., Courville, A., and Zhu, Z. (2021). Neural approximate sufficient statistics for implicit models. InInternational Conference on Learning Representations
work page 2021
-
[9]
Clette, F . and Lefèvre, L. (2015). Silso sunspot number v2.0. Published by WDC SILSO - Royal Observatory of Belgium (ROB). Delaunoy , A., Hermans, J., Rozet, F ., Wehenkel, A., and Louppe, G. (2022). Towards reliable simulation-based inference with balanced neural ratio estimation. InAdvances in Neural Information Processing Systems
work page 2015
- [10]
- [11]
-
[12]
Durkan, C., Bekasov, A., Murray , I., and Papamakarios, G. (2019). Neural spline flows. InAdvances in Neural Information Processing Systems
work page 2019
-
[13]
Forbes, F ., Nguyen, H. D., Nguyen, T ., and Arbel, J. (2022). Summary statistics and discrepancy measures for approximate Bayesian computation via surrogate posteriors.Statistics and Computing, 32(5):85
work page 2022
-
[14]
Goodman, J. and Weare, J. (2010). Ensemble samplers with affine invariance.Communications in applied math- ematics and computational science, 5(1):65–80
work page 2010
-
[15]
Greenberg, D., Nonnenmacher, M., and Macke, J. (2019). Automatic posterior transformation for likelihood-free inference. InProceedings of the 36th International Conference on Machine Learning. Köster, J. and Rahmann, S. (2012). Snakemake—a scalable bioinformatics workflow engine.Bioinformatics, 28(19):2520–2522
work page 2019
-
[16]
Lopez-Paz, D. and Oquab, M. (2017). Revisiting classifier two-sample tests. InInternational Conference on Learning Representations
work page 2017
-
[17]
Lueckmann, J.-M., Boelts, J., Greenberg, D., Goncalves, P ., and Macke, J. (2021). Benchmarking simulation-based inference. InProceedings of the 24th International Conference on Artificial Intelligence and Statistics
work page 2021
-
[18]
Papamakarios, G., Pavlakou, T ., and Murray , I. (2017). Masked autoregressive flow for density estimation. In Advances in Neural Information Processing Systems
work page 2017
-
[19]
Papamakarios, G., Sterratt, D., and Murray , I. (2019). Sequential neural likelihood: Fast likelihood-free inference with autoregressive flows. InThe 22nd International Conference on Artificial Intelligence and Statistics
work page 2019
-
[20]
Sharrock, L., Simons, J., Liu, S., and Beaumont, M. (2024). Sequential neural score estimation: Likelihood-free inference with conditional score based diffusion models. InForty-first International Conference on Machine Learning
work page 2024
-
[21]
J., Tung, H.-Y., Strathmann, H., De, S., Ramdas, A., Smola, A., and Gretton, A
Sutherland, D. J., Tung, H.-Y., Strathmann, H., De, S., Ramdas, A., Smola, A., and Gretton, A. (2017). Generative models and model criticism via optimized maximum mean discrepancy . InInternational Conference on Learning Representations
work page 2017
-
[22]
Macke, J. H. (2020). sbi: A toolkit for simulation-based inference.Journal of Open Source Software, 5(52):2505. 33 Thermodynamic ABC with multiple Summary StatisticsA PREPRINT Ter Braak, C. (2006). A Markov Chain Monte Carlo version of the genetic algorithm differential evolution: easy Bayesian computing for real parameter spaces.Statistics and Computing,...
work page 2020
-
[23]
Tong, A., Fatras, K., Malkin, N., Huguet, G., Zhang, Y., Rector-Brooks, J., Wolf, G., and Bengio, Y. (2024). Improv- ing and generalizing flow-based generative models with minibatch optimal transport.Transactions on Machine Learning Research
work page 2024
-
[24]
Ulzega, S., Beer, J., Ferriz-Mas, A., Dirmeier, S., and Albert, C. (2025). Shedding light on the solar dynamo using data-driven Bayesian parameter inference.The Astrophysical Journal, 992(1):61
work page 2025
-
[25]
Usoskin, I., Solanki, S. K., Krivova, N. A., Hofer, B., Kovaltsov, G., Wacker, L., Brehm, N., and Kromer, B. (2021). Solar cyclic activity over the last millennium reconstructed from annual 14C data.Astronomy & Astrophysics, 649:A141
work page 2021
-
[26]
B., Dax, M., Buchholz, S., Green, S
Wildberger, J. B., Dax, M., Buchholz, S., Green, S. R., Macke, J. H., and Schölkopf, B. (2023). Flow matching for scalable simulation-based inference. InThirty-seventh Conference on Neural Information Processing Systems
work page 2023
-
[27]
Wilmot-Smith, A., Nandy , D., Hornig, G., and Martens, P . (2006). A time delay model for solar and stellar dynamos. The Astrophysical Journal, 652(1):696
work page 2006
-
[28]
Zhao, S., Sinha, A., He, Y., Perreault, A., Song, J., and Ermon, S. (2022). Comparing distributions by measuring differences that affect decision making. InInternational Conference on Learning Representations. 34
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.