Bring the noise: exact inference from noisy simulations in collider physics
Pith reviewed 2026-05-23 03:32 UTC · model grok-4.3
The pith
A pseudo-marginal MCMC method returns exact inferences from noisy Monte Carlo simulations in collider physics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that an unbiased estimator for the Poisson likelihood, constructed by drawing a Poisson-distributed number of Monte Carlo events, can be used within pseudo-marginal MCMC to produce exact posterior inferences despite the noise in the simulations. The method is demonstrated on a simplified model search for neutralinos and charginos at the LHC, showing that exact inferences are obtained for a similar computational cost to approximate ones and that the results are robust with respect to the number of events generated per point.
What carries the argument
An unbiased estimator for a Poisson likelihood that uses a Poisson-distributed number of Monte Carlo events, placed inside pseudo-marginal MCMC.
If this is right
- Exact inferences are obtained for a similar computational cost to approximate ones from existing methods.
- Inferences remain robust with respect to the number of events generated per point.
- The unbiased estimator uses a Poisson-distributed number of MC events.
- A biased estimator is also possible whose bias decays factorially with increasing number of MC events.
Where Pith is reading between the lines
- Similar techniques could be applied to other fields that rely on Monte Carlo simulations for likelihoods, such as cosmology or epidemiology.
- If widely adopted, this would allow experimental collaborations to reduce the number of simulated events needed for reliable results.
- The method preserves the exactness property without introducing new convergence issues, suggesting it can be integrated into existing analysis pipelines.
Load-bearing premise
The Monte Carlo simulations can produce estimators that are made unbiased for the Poisson likelihood by using a Poisson-distributed number of events.
What would settle it
Running the MCMC with the new estimator on a problem where the true posterior is known exactly from analytic calculation or infinite statistics, and checking whether the recovered posterior matches within sampling error.
Figures
read the original abstract
We rely on Monte Carlo (MC) simulations to interpret searches for new physics at the Large Hadron Collider (LHC) and elsewhere. These simulations result in noisy and approximate estimators of selection efficiencies and likelihoods. In this context we pioneer an exact-approximate computational method - exact-approximate Markov Chain Monte Carlo (MCMC), also known as pseudo-marginal MCMC - that returns exact inferences despite noisy simulations. To do so, we introduce an unbiased estimator for a Poisson likelihood. We demonstrate the new estimator and new techniques in examples based on a search for neutralinos and charginos at the LHC using a simplified model. We find attractive performance characteristics - exact inferences are obtained for a similar computational cost to approximate ones from existing methods and inferences are robust with respect to the number of events generated per point. The unbiased estimator uses a Poisson-distributed number of MC events; it is also possible to construct a biased estimator whose bias decays factorially with increasing number of MC events.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to pioneer the use of pseudo-marginal MCMC (exact-approximate MCMC) for collider physics by constructing an unbiased estimator of the Poisson likelihood from Monte Carlo simulations that employ a Poisson-distributed number of events. This enables exact Bayesian inferences despite noisy simulations. The method is demonstrated on a simplified model for neutralino/chargino searches at the LHC, with reported performance that matches the cost of approximate methods while remaining robust to the number of events generated per parameter point. A secondary biased estimator with factorial bias decay is also presented.
Significance. If the unbiasedness construction and its integration into pseudo-marginal MCMC hold, the work supplies a practical route to exact inference in LHC analyses that avoids the systematic biases from finite-MC approximations. Credit is due for the explicit estimator, the direct calculation of its expectation, and the numerical demonstrations on a realistic search channel. The robustness result and the factorial-bias alternative are useful additions that could affect how statistical interpretations are performed in future searches.
minor comments (2)
- [Abstract] Abstract and §2: the phrase 'exact inferences' should be accompanied by a brief qualifier that exactness holds for the target posterior in the limit of infinite MCMC iterations, to prevent misreading by experimental readers.
- The manuscript would benefit from an explicit statement in the methods section of how the Poisson number of events is implemented in standard event generators that normally produce fixed-N samples.
Simulated Author's Rebuttal
We thank the referee for their positive assessment of the manuscript, recognition of the method's potential utility for LHC analyses, and recommendation for minor revision. We appreciate the credit given to the unbiased estimator construction, its integration with pseudo-marginal MCMC, and the numerical demonstrations.
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper constructs an explicit unbiased estimator for the Poisson likelihood by drawing a Poisson-distributed number of Monte Carlo events and shows via direct expectation calculation that its mean recovers the target likelihood. This estimator is then inserted into the standard pseudo-marginal MCMC framework. No step reduces by definition to a fitted parameter, no self-citation supplies a load-bearing uniqueness theorem, and the central claim (exactness preserved under the unbiased estimator) is verified by algebraic expectation rather than by renaming or circular re-use of the target quantity. The numerical demonstrations on the neutralino/chargino example serve only as validation, not as the source of the estimator itself.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Markov chains constructed with unbiased likelihood estimators converge to the correct posterior
Forward citations
Cited by 1 Pith paper
-
Many Wrongs Make a Right: Leveraging Biased Simulations Towards Unbiased Parameter Inference
Template-Adapted Mixture Model uses many biased simulations for data-driven estimates of signal and background distributions, yielding unbiased signal fraction estimates with well-calibrated uncertainties.
Reference graph
Works this paper leans on
-
[1]
That is, on average, the estimator equals the exact likelihood
The estimator is unbiased (at least up to a constant factor), 〈 ˆL〉 = C L (1) for constant C . That is, on average, the estimator equals the exact likelihood
-
[2]
Bring the noise: exact inference from noisy simulations in collider physics
The estimator is never negative, ˆL ≥ 0. (2) 2 Though note that other algorithms have exact-approximate variants, e.g., pseudo-marginal slice-sampling [10]. arXiv:2502.08157v1 [hep-ph] 12 Feb 2025 2 We later discuss adaptations of MCMC that remove this requirement. If the estimator satisfies these requirements, the MCMC algo- rithm converges to the exactl...
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[3]
We take a flat prior on ϵ and fix the cross section to σ = 1000 fb
First, we construct a simple one-dimensional model in which the selection efficiency, ϵ, is an input parameter. We take a flat prior on ϵ and fix the cross section to σ = 1000 fb
-
[4]
Second, we construct a simplified model based on the TChiWZ topology: mass-degenerate χ± 1 and χ2 particles are pair produced and decay exclusively through χ± 1 → W χ1 and χ2 → Z χ1, respectively. We fix the production cross section to σ = 1000fb and compute a selection efficiency for the SRWZ_15 signal region as a function of (m1,m2) through SModelS [30]...
-
[5]
Lastly, we use the same simplified model, but compute the production cross-section σ(pp → χ2 χ± 1 ) as a func- tion of (m1,m2) using tabulated wino-like neutralino- chargino production cross-sections [31–33]. The tabulated cross sections were computed at NLO + NLL precision assuming mass-degenerate wino-like χ2 and χ± 1 , and a bino-like χ1 and use an env...
work page 2019
-
[6]
S.S. AbdusSalam et al., Simple and statistically sound recommendations for analysing physical theories, Rept. Prog. Phys. 85 (2022) 052201 [2012.09874]
-
[7]
J. Brehmer and K. Cranmer, Simulation-based inference methods for particle physics, in Artificial Intelligence for High Energy Physics, pp. 579–611 (2020), DOI [2010.06439]
-
[8]
Matrix Element Method in HEP: Transfer Functions, Efficiencies, and Likelihood Normalization
I. Volobouev, Matrix Element Method in HEP: Transfer Functions, Efficiencies, and Likelihood Normalization, 1101.2259
work page internal anchor Pith review Pith/arXiv arXiv
-
[9]
T . Sjöstrand, S. Ask, J.R. Christiansen, R. Corke, N. Desai, P . Ilten et al.,An introduction to PYTHIA 8.2, Comput. Phys. Commun. 191 (2015) 159 [1410.3012]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[10]
M. Bahr et al., Herwig++ Physics and Manual, Eur . Phys. J. C58 (2008) 639 [0803.0883]
work page internal anchor Pith review Pith/arXiv arXiv 2008
-
[11]
S HERPA collaboration, Event Generation with Sherpa 2.2, SciPost Phys. 7 (2019) 034 [1905.09127]
-
[12]
DELPHES 3 collaboration, DELPHES 3, A modular framework for fast simulation of a generic collider experiment, JHEP 02 (2014) 057 [1307.6346]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[13]
R. Brun, F . Bruyant, F . Carminati, S. Giani, M. Maire, A. McPherson et al., GEANT Detector Description and Simulation Tool, CERN Program Library Long Writeup (1994)
work page 1994
- [14]
-
[15]
Pseudo-Marginal Slice Sampling
I. Murray and M.M. Graham, Pseudo-Marginal Slice Sampling, in Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, A. Gretton and C.C. Robert, eds., vol. 51 of Proc. Mach. Learn. Res., (Cadiz, Spain), pp. 911–919, Oct., 2016 [1510.02958]
work page internal anchor Pith review Pith/arXiv arXiv 2016
- [16]
- [17]
-
[18]
Overview of Approximate Bayesian Computation
S.A. Sisson, Y. Fan and M.A. Beaumont, Overview of Approximate Bayesian Computation, in Handbook of Approximate Bayesian Computation, Chapman and Hall/CRC (2018), DOI [1802.09720]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[19]
M.A. Beaumont, Estimation of Population Growth or Decline in Genetically Monitored Populations, Genetics 164 (2003) 1139
work page 2003
-
[20]
J. Møller, A.N. Pettitt, R. Reeves and K.K. Berthelsen, An efficient Markov chain Monte Carlo method for distributions with intractable normalising constants, Biometrika 93 (2006) 451
work page 2006
-
[21]
The pseudo-marginal approach for efficient Monte Carlo computations
C. Andrieu and G.O. Roberts, The pseudo-marginal approach for efficient Monte Carlo computations, Ann. Stat. 37 (2009) [0903.5480]
work page internal anchor Pith review Pith/arXiv arXiv 2009
-
[22]
L. Lin, K.F . Liu and J.H. Sloan,A Stochastic Monte Carlo algorithm, Phys. Rev. D 61 (2000) 074505 [hep-lat/9905033]
work page internal anchor Pith review Pith/arXiv arXiv 2000
-
[23]
H. Dembinski and M. Schmelling, Bias, variance, and confidence intervals for efficiency estimators in particle physics experiments, 2110.00294
-
[24]
Efficient implementation of Markov chain Monte Carlo when using an unbiased likelihood estimator
A. Doucet, M. Pitt, G. Deligiannidis and R. Kohn, Efficient implementation of Markov chain Monte Carlo when using an unbiased likelihood estimator, Biometrika 102 (2015) 295 [1210.1871]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[25]
On the efficiency of pseudo-marginal random walk Metropolis algorithms
C. Sherlock, A.H. Thiery, G.O. Roberts and J.S. Rosenthal, On the efficiency of pseudo-marginal random walk Metropolis algorithms, Ann. Stat. 43 (2015) [ 1309.7209]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[26]
Convergence properties of pseudo-marginal Markov chain Monte Carlo algorithms
C. Andrieu and M. Vihola, Convergence properties of pseudo-marginal Markov chain Monte Carlo algorithms, Ann. Appl. Probab. 25 (2015) [ 1210.1484]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[27]
C. Sherlock, Optimal Scaling for the Pseudo-Marginal Random Walk Metropolis: Insensitivity to the Noise Generating Mechanism, Methodol. Comput. In Appl. Probab. 18 (2015) 869 [1408.4344]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[28]
Glasser, Minimum Variance Unbiased Estimators for Poisson Probabilities, Technometrics 4 (1962) 409
G.J. Glasser, Minimum Variance Unbiased Estimators for Poisson Probabilities, Technometrics 4 (1962) 409
work page 1962
-
[29]
A.-M. Lyne, M. Girolami, Y. Atchadé, H. Strathmann and D. Simpson, On Russian Roulette Estimates for Bayesian Inference with Doubly-Intractable Likelihoods, Stat. Sci. 30 (2015) [ 1306.4032]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[30]
Establishing some order amongst exact approximations of MCMCs
C. Andrieu and M. Vihola, Establishing some order amongst exact approximations of MCMCs, Ann. Appl. Probab. 26 (2016) [1404.6909]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[31]
J. Goodman and J. Weare, Ensemble samplers with affine invariance, Commun. In Appl. Math. Comput. Sci. 5 (2010) 65
work page 2010
-
[32]
D. Foreman-Mackey, D.W . Hogg, D. Lang and J. Goodman, emcee: The MCMC Hammer, Publ. Astron. Soc. Pac. 125 (2013) 306 [1202.3665]
work page internal anchor Pith review Pith/arXiv arXiv 2013
- [33]
- [34]
-
[35]
G. Alguero, J. Heisig, C.K. Khosa, S. Kraml, S. Kulkarni, A. Lessa et al., Constraining new physics with SModelS version 2, JHEP 08 (2022) 068 [2112.00769]
-
[36]
LHC SUSY Cross Section Working Group, NLO-NLL wino-like chargino-neutralino (N2C1) cross sections, 2017
work page 2017
-
[37]
B. Fuks, M. Klasen, D.R. Lamprea and M. Rothering, Gaugino production in proton-proton collisions at a center-of-mass energy of 8 TeV, JHEP 10 (2012) 081 [1207.2159]
work page internal anchor Pith review Pith/arXiv arXiv 2012
-
[38]
B. Fuks, M. Klasen, D.R. Lamprea and M. Rothering, Precision predictions for electroweak superpartner production at hadron colliders with RESUMMINO , Eur . Phys. J. C73 (2013) 2480 [1304.0790]
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[39]
On nonnegative unbiased estimators
P .E. Jacob and A.H. Thiery,On nonnegative unbiased estimators, Ann. Stat. 43 (2015) 769 [ 1309.6473]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[40]
Bias in parametric estimation: reduction and useful side-effects
I. Kosmidis, Bias in parametric estimation: reduction and useful side-effects, WIREs Comput. Stat. 6 (2014) 185–196 [1311.6311]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[41]
Zhenhua, Advanced Statistical Theory I, Lecture Notes (2019)
L. Zhenhua, Advanced Statistical Theory I, Lecture Notes (2019)
work page 2019
-
[42]
W . Jakob, J. Rhinelander, D. Moldovan and others,pybind11 – Seamless operability between C++11 and Python, 2017. 8 Appendix A: Proof that estimator is UMVUE We wish to show that our estimator is the uniform minimum variance unbiased estimator (UMVUE). That is, it is expected to equal the true likelihood when k ∼ Po(ϵnMC), and that the variance of our est...
work page 2017
-
[43]
C++ implementation We provide a simple header-only library ideal.hpp providing the function: double umvue_poisson_like(int k, double b, int o, int n_mc, double n_exp) in the namespace ideal. This function returns an unbiased estimate of a Poisson likelihood when k events were simulated from an expected n_mc simulations, and o events were observed in a sam...
-
[44]
Python bindings We supply Python bindings for the C++ functions umvue_poisson_like and umvue_draw_n_mc using pybind11 [37]. The ideal module can be built and installed by make. The equivalent example program would be: 1 import ideal 2 3 if __name__ == "__main__": 4 print(ideal.umvue_poisson_like(1000, 10, 20, 10000, 100)) 5 print(ideal.umvue_draw_n_mc(1000))
-
[45]
Implementation in GAMBIT We use code similar to that in Appendix C 1 as part of the ColliderBit module of GAMBIT and implement an option to enable the UMVUE. The UMVUE may be turned on by setting the estimator in the rules governing the MC generation of collider events in the yaml input file: 1 Rules: 2 - capability: RunMC 3 function: operateLHCLoop 4 opt...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.