On the reconstruction of kinematic distributions computed with Monte Carlo methods using orthogonal basis functions
Pith reviewed 2026-05-21 15:03 UTC · model grok-4.3
The pith
Kinematic distributions from Monte Carlo calculations can be rebuilt as smooth sums of orthogonal basis functions rather than binned histograms.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The target kinematic distribution is approximated by a finite sum of orthogonal basis functions whose coefficients are computed as Monte Carlo averages of the integrand multiplied by each basis function. This yields a smooth reconstruction that eliminates the statistical fluctuations between histogram bins, which are especially pronounced in perturbative calculations with local subtractions.
What carries the argument
The central mechanism is the orthonormal expansion of the distribution, where each coefficient is evaluated by Monte Carlo integration over the high-dimensional phase space and the basis is optionally optimized using a leading-order approximation to the same distribution.
If this is right
- Perturbative predictions at next-to-leading order and beyond become less sensitive to Monte Carlo statistical errors when plotted as continuous functions.
- The reconstructed distributions can be differentiated or integrated analytically for use in further calculations without re-binning.
- Optimized bases built from leading-order results accelerate convergence of the expansion for higher-order corrections.
- The method applies directly to processes with local subtraction terms, such as Higgs production in weak boson fusion.
Where Pith is reading between the lines
- The same expansion idea could be extended to selected multi-dimensional distributions if suitable tensor-product bases are available.
- Combining the basis reconstruction with adaptive Monte Carlo sampling might further reduce the number of events needed for a given precision.
- The approach could be tested on distributions that contain sharp resonances or thresholds to quantify any truncation effects near those features.
Load-bearing premise
A finite truncation of the orthogonal basis expansion can represent the target kinematic distribution with sufficient accuracy and without introducing significant bias or truncation artifacts that affect the final physics results.
What would settle it
Generate a high-statistics reference histogram for the same distribution and check whether the basis-function reconstruction deviates systematically from it in any kinematic region or introduces oscillations absent from the reference.
Figures
read the original abstract
Reconstruction of one-dimensional kinematic distributions from calculations based on high-dimensional Monte-Carlo integration is a standard problem in high-energy physics. Traditionally, this is done by collecting randomly-generated events in histograms. In this article, we explore an alternative approach, whose main idea is to approximate the target distribution by a weighted sum of orthogonal basis functions whose coefficients are calculated using the Monte-Carlo integration. This method has the advantage of directly yielding smooth approximations to target distributions. Furthermore, in the context of high-order perturbative calculations with local subtractions, it eliminates the so-called bin-to-bin fluctuations, which often severely affect the quality of conventional histograms. We also demonstrate that the availability of a high-quality approximation to the target distribution, for example the leading-order result in the perturbative expansion, can be exploited to construct an optimized orthonormal basis. We compare the performance of this method to conventional histograms in both toy-model and real Monte-Carlo settings, applying it to Higgs boson production in weak boson fusion as an example.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes reconstructing one-dimensional kinematic distributions from high-dimensional Monte Carlo integrations by expanding the target in a sum of orthogonal basis functions, with coefficients obtained directly from the MC integrator. This yields smooth approximations and, in the presence of local subtractions at higher perturbative orders, removes bin-to-bin fluctuations that plague conventional histograms. The authors further show how a high-quality approximation (e.g., the leading-order result) can be used to construct an optimized orthonormal basis and compare the method against histograms in toy models and in a realistic Higgs weak-boson-fusion calculation.
Significance. If the truncation error can be shown to remain below the statistical and perturbative precision of the underlying calculation, the technique would supply a practical tool for producing smooth, fluctuation-free differential distributions in NLO and NNLO Monte Carlo programs that employ local subtraction schemes. The availability of an optimized basis derived from a lower-order result is a concrete strength that could be adopted more broadly.
major comments (3)
- [§4] §4 (toy-model section): the comparisons are presented without quantitative tables or plots that report the integrated absolute deviation or Kolmogorov-Smirnov distance between the truncated basis reconstruction and the known analytic target, making it impossible to verify that truncation artifacts remain smaller than the Monte Carlo statistical uncertainty.
- [§5] §5 (WBF Higgs example): the claim that bin-to-bin fluctuations are eliminated must be accompanied by a direct comparison, for at least one distribution, of the basis-reconstructed result against a high-statistics reference histogram or against the unsubtracted integrand; without such a benchmark it is unclear whether the global smoothing introduced by a finite basis order distorts the large positive/negative cancellations that characterize local subtraction terms.
- [§3.2] §3.2 (basis optimization): the procedure for choosing the truncation order N is not specified; the manuscript should state an a-priori criterion (e.g., monitoring the size of the last few coefficients or the change in the reconstructed distribution when N is increased by one) that guarantees the residual bias lies below the target precision of the physics analysis.
minor comments (2)
- [§2] The notation for the orthogonal polynomials (or other basis functions) and the inner-product measure should be introduced once in §2 and used consistently thereafter.
- Figure captions should explicitly state the truncation order N employed for each curve and whether error bands include only Monte Carlo statistics or also an estimate of truncation uncertainty.
Simulated Author's Rebuttal
We thank the referee for the careful reading and the constructive suggestions. We address each major comment below and will revise the manuscript to incorporate the requested quantitative validations, benchmarks, and explicit selection criteria.
read point-by-point responses
-
Referee: [§4] §4 (toy-model section): the comparisons are presented without quantitative tables or plots that report the integrated absolute deviation or Kolmogorov-Smirnov distance between the truncated basis reconstruction and the known analytic target, making it impossible to verify that truncation artifacts remain smaller than the Monte Carlo statistical uncertainty.
Authors: We agree that quantitative error metrics would make the validation more rigorous. In the revised manuscript we will add a table in §4 that reports both the integrated absolute deviation and the Kolmogorov-Smirnov distance between the truncated basis expansion and the known analytic target for several values of N. These numbers will be compared directly to the Monte Carlo statistical uncertainties quoted in the same section. revision: yes
-
Referee: [§5] §5 (WBF Higgs example): the claim that bin-to-bin fluctuations are eliminated must be accompanied by a direct comparison, for at least one distribution, of the basis-reconstructed result against a high-statistics reference histogram or against the unsubtracted integrand; without such a benchmark it is unclear whether the global smoothing introduced by a finite basis order distorts the large positive/negative cancellations that characterize local subtraction terms.
Authors: We will include the requested benchmark in the revised §5. For the leading differential distribution we will add a direct overlay of the basis-reconstructed result against both (i) a high-statistics reference histogram generated with an independent, much larger Monte Carlo sample and (ii) the unsubtracted integrand. The comparison will be shown at the same binning used in the original figure so that any residual distortion from the finite-order smoothing can be quantified. revision: yes
-
Referee: [§3.2] §3.2 (basis optimization): the procedure for choosing the truncation order N is not specified; the manuscript should state an a-priori criterion (e.g., monitoring the size of the last few coefficients or the change in the reconstructed distribution when N is increased by one) that guarantees the residual bias lies below the target precision of the physics analysis.
Authors: We will expand §3.2 to state an explicit a-priori criterion. The truncation order N is chosen such that the absolute value of the highest retained coefficient satisfies |c_N| < ε, where ε is set to 10 % of the target statistical precision of the distribution. In addition, we will verify that increasing N by one changes the reconstructed distribution by less than the same tolerance. Both the coefficient threshold and the stability check will be documented with a short algorithmic description and illustrated for the toy-model case. revision: yes
Circularity Check
No circularity: coefficients computed directly via Monte Carlo integration on independent basis
full rationale
The paper's core method computes expansion coefficients for an orthogonal basis directly from Monte Carlo integration of the target integrand. This is a standard, non-self-referential numerical procedure equivalent to projecting the distribution onto the basis. The use of a leading-order approximation to optimize the basis is an external input from perturbation theory, not derived from the method itself. Comparisons to histograms in toy models and WBF examples are direct numerical tests without fitted parameters renamed as predictions or load-bearing self-citations. The derivation chain remains independent of its outputs.
Axiom & Free-Parameter Ledger
free parameters (1)
- truncation order of the basis expansion
axioms (1)
- domain assumption The target distribution lies in the span of the selected orthogonal basis to within acceptable truncation error
Reference graph
Works this paper leans on
-
[1]
James, Monte Carlo Theory and Practice, Rept
F. James, Monte Carlo Theory and Practice, Rept. Prog. Phys.43, 1145 (1980)
work page 1980
-
[2]
G. P. Lepage, A New Algorithm for Adaptive Multidi- mensional Integration, J. Comput. Phys.27, 192 (1978)
work page 1978
-
[3]
Navaset al.(Particle Data Group), Review of particle physics, Phys
S. Navaset al.(Particle Data Group), Review of particle physics, Phys. Rev. D110, 030001 (2024)
work page 2024
-
[4]
General-purpose event generators for LHC physics
A. Buckleyet al., General-purpose event genera- tors for LHC physics, Phys. Rept.504, 145 (2011), arXiv:1101.2599 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[5]
B. W. Silverman,Density estimation for statistics and data analysis(Chapman and Hall, London, 1986)
work page 1986
-
[6]
D. G. Truhlar and N. C. Blais, Legendre moment method for calculating differential scattering cross sections from classical trajectories with monte carlo initial conditions, The Journal of Chemical Physics 67, 1532 (1977), https://pubs.aip.org/aip/jcp/article- pdf/67/4/1532/18907999/1532 1 online.pdf
work page 1977
-
[7]
Three-jet cross sections to next-to-leading order
S. Frixione, Z. Kunszt, and A. Signer, Three jet cross- sections to next-to-leading order, Nucl. Phys. B467, 399 (1996), arXiv:hep-ph/9512328
work page internal anchor Pith review Pith/arXiv arXiv 1996
-
[8]
A General Algorithm for Calculating Jet Cross Sections in NLO QCD
S. Catani and M. H. Seymour, A General algorithm for calculating jet cross-sections in NLO QCD, Nucl. Phys. B485, 291 (1997), [Erratum: Nucl.Phys.B 510, 503–504 (1998)], arXiv:hep-ph/9605323
work page internal anchor Pith review Pith/arXiv arXiv 1997
-
[9]
A. Gehrmann-De Ridder, T. Gehrmann, and E. W. N. Glover, Antenna subtraction at NNLO, JHEP09, 056, arXiv:hep-ph/0505111
work page internal anchor Pith review Pith/arXiv arXiv
-
[10]
A novel subtraction scheme for double-real radiation at NNLO
M. Czakon, A novel subtraction scheme for double-real radiation at NNLO, Phys. Lett. B693, 259 (2010), arXiv:1005.0274 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2010
-
[11]
Fully differential VBF Higgs production at NNLO
M. Cacciari, F. A. Dreyer, A. Karlberg, G. P. Salam, and G. Zanderighi, Fully Differential Vector-Boson- Fusion Higgs Production at Next-to-Next-to-Leading Or- der, Phys. Rev. Lett.115, 082002 (2015), [Erratum: Phys.Rev.Lett. 120, 139901 (2018)], arXiv:1506.02660 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[12]
Jet production in the CoLoRFulNNLO method: event shapes in electron-positron collisions
V. Del Duca, C. Duhr, A. Kardos, G. Somogyi, Z. Sz˝ or, Z. Tr´ ocs´ anyi, and Z. Tulip´ ant, Jet production in the CoLoRFulNNLO method: event shapes in electron- positron collisions, Phys. Rev. D94, 074019 (2016), arXiv:1606.03453 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[13]
Nested soft-collinear subtractions in NNLO QCD computations
F. Caola, K. Melnikov, and R. R¨ ontsch, Nested soft- collinear subtractions in NNLO QCD computations, Eur. Phys. J. C77, 248 (2017), arXiv:1702.01352 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[14]
Local Analytic Sector Subtraction at NNLO
L. Magnea, E. Maina, G. Pelliccioli, C. Signorile- Signorile, P. Torrielli, and S. Uccirati, Local analytic sector subtraction at NNLO, JHEP12, 107, [Erratum: JHEP 06, 013 (2019)], arXiv:1806.09570 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2019
- [15]
-
[16]
Szeg¨ o,Orthogonal Polynomials(American Mathemat- ical Society, New York City, 1939)
G. Szeg¨ o,Orthogonal Polynomials(American Mathemat- ical Society, New York City, 1939)
work page 1939
-
[17]
J. P. Boyd,Chebyshev and Fourier Spectral Methods, 2nd ed. (Dover, Mineola, New York, 2001)
work page 2001
-
[18]
Treating the b quark distribution function with reliable uncertainties
Z. Ligeti, I. W. Stewart, and F. J. Tackmann, Treating the b quark distribution function with reliable uncertain- ties, Phys. Rev. D78, 114014 (2008), arXiv:0807.1926 [hep-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2008
-
[19]
C. W. Clenshaw, A note on the summation of chebyshev series, Mathematics of Computation9, 118 (1955)
work page 1955
-
[20]
H. Wang and S. Xiang, On the convergence rates of leg- endre approximation, Mathematics of computation81, 13 861 (2012)
work page 2012
-
[21]
K. Asteriadis, F. Caola, K. Melnikov, and R. R¨ ontsch, NNLO QCD corrections to weak boson fusion Higgs bo- son production in the H→b band H→WW ∗ →4l decay channels, JHEP02, 046 (2022), arXiv:2110.02818 [hep-ph]
-
[22]
C. Bierlichet al., Robust Independent Validation of Ex- periment and Theory: Rivet version 3, SciPost Phys.8, 026 (2020), arXiv:1912.05451 [hep-ph]
-
[23]
Bothmannet al.(Sherpa), Event generation with Sherpa 3, JHEP12, 156, arXiv:2410.22148 [hep-ph]
E. Bothmannet al.(Sherpa), Event generation with Sherpa 3, JHEP12, 156, arXiv:2410.22148 [hep-ph]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.