pith. sign in

arxiv: 2512.01761 · v2 · submitted 2025-12-01 · 🧮 math.ST · math.OC· stat.ME· stat.TH

A novel sequential method for building upper and lower bounds of moments of distributions

Pith reviewed 2026-05-17 03:15 UTC · model grok-4.3

classification 🧮 math.ST math.OCstat.MEstat.TH
keywords moment boundsmajorization-minimizationunnormalized distributionssequential methodBayesian inferencepower diagramsintegral approximationupper lower bounds
0
0 comments X

The pith

Sequential method constructs upper and lower bounds on moments of unnormalized distributions

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a sequential method to construct upper and lower bounds on integrals that represent moments of scalar unnormalized distributions. It leverages the majorization-minimization framework and an envelope principle to iteratively refine the bounds. The method comes with proofs of convergence and controlled accuracy under mild conditions, and it generalizes to multi-dimensional settings using power diagrams. This matters in statistics because many tasks require guaranteed inequalities for expectations rather than point estimates alone.

Core claim

We introduce a sequential method for constructing upper and lower bounds on the sought integral. Our approach leverages the majorization-minimization framework to iteratively refine these bounds using an envelope principle. The method has proven convergence and controlled accuracy under mild conditions. We then generalize the method to the multi-dimensional setting, along with an effective implementation strategy based on power diagrams. We demonstrate the effectiveness through a numerical example of estimating Monte Carlo sampler variance in a Bayesian inference problem.

What carries the argument

The envelope principle within the majorization-minimization framework that iteratively refines upper and lower bounds on the integral

Load-bearing premise

The convergence and accuracy of the bounds depend on mild conditions whose precise statement and applicability to general unnormalized distributions are not fully detailed.

What would settle it

A concrete unnormalized distribution where the iterative bounds fail to converge or lose controlled accuracy under the stated mild conditions would disprove the central claims.

Figures

Figures reproduced from arXiv: 2512.01761 by Emilie Chouzenoux, Solal Martin, Victor Elvira.

Figure 1
Figure 1. Figure 1: Example of function π (blue thick line) and minorant/majorant functions (black/red thin lines) at various tangency points (black/red circles) With these definitions, we have the following identities: ∀x ∈ R, f(x) = f +(x) − f −(x), |f(x)| = f +(x) + f −(x). (20) Let us denote, for every t ∈ R, I(t) = Z R f +(x) b(x;t) dx − Z R f −(x) b(x;t) dx = C(t) Z R f +(x) g(x; u(t), σ(t)) dx − C(t) Z R f −(x) g(x; u(… view at source ↗
Figure 2
Figure 2. Figure 2: Example of function π (blue thick line), set of tangent minorant/majorant functions (black/red thin lines) at various tangency points, and associated piecewise minorant/majorant function (black/red thick line). 3.2. Minorant/majorant envelopes Let us introduce the notation that will be required below. We set M ≥ 2 tangency points, and T = [t1 · · ·tm · · ·tM] ∈ RM as the vector that contains the M values o… view at source ↗
Figure 3
Figure 3. Figure 3: Densities p and p 2/q as a function of x. We set m(x) = x 2 430 . The final goal is to compute bounds on the variance estimator (95). We run Alg. 2 to construct bounds for the three intermediary quantities (96),(97), and (98), leading to the following bounds for the integral (95): V = 1 NZ 2 J − I 2 N , V = 1 NZ 2 J − I 2 N . (104) The bounds are compared to empirical variance estimation, obtained by runni… view at source ↗
Figure 4
Figure 4. Figure 4: Evolution of the bounds along iterations [PITH_FULL_IMAGE:figures/full_fig_p031_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Obtained bounds (red, blue) and empirical varianc [PITH_FULL_IMAGE:figures/full_fig_p033_5.png] view at source ↗
read the original abstract

Approximating integrals is a fundamental task in probability theory and statistical inference, and their applied fields of signal processing, and Bayesian learning, as soon as expectations over probability distributions must be computed efficiently and accurately. When these integrals lack closedform expressions, numerical methods must be used, from the Newton-Cotes formulas and Gaussian quadrature, to Monte Carlo and variational approximation techniques. Despite these numerous tools, few are guaranteed to preserve majoration/minoration inequalities, while this feature is fundamental in certain applications in statistics. In this paper, we focus on the integration problem arising in the estimation of moments of scalar unnormalized distributions. We introduce a sequential method for constructing upper and lower bounds on the sought integral. Our approach leverages the majorization-minimization framework to iteratively refine these bounds using an envelope principle. The method has proven convergence and controlled accuracy under mild conditions. We then generalize the method to the multi-dimensional setting, along with an effective implementation strategy based on power diagrams. We demonstrate the effectiveness of the proposed approach through a detailed numerical example of the estimation of a Monte Carlo sampler variance in a Bayesian inference problem, in one- and two-dimensional cases.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a sequential majorization-minimization (MM) procedure that iteratively constructs upper and lower bounds on moments (integrals) of scalar and multi-dimensional unnormalized densities. The method relies on an envelope principle, claims proven convergence and controlled accuracy under mild conditions, extends the construction to the multi-dimensional case via power diagrams, and illustrates performance on a Monte Carlo variance estimation task arising from a Bayesian posterior in one and two dimensions.

Significance. If the convergence result holds under the stated conditions and the bounds remain usefully tight, the approach supplies a constructive way to obtain inequality-preserving approximations to expectations, a property that is valuable in certain statistical applications but is not automatically guaranteed by standard quadrature or Monte Carlo methods. The explicit use of the MM framework and the power-diagram implementation for the multi-dimensional case are technically interesting and could be reusable.

major comments (2)
  1. [§3] §3 (convergence theorem): the statement that convergence and accuracy hold under 'mild conditions' is load-bearing for the central claim, yet the precise hypotheses (e.g., requirements on log-concavity, tail decay, or envelope tightness) are not shown to be satisfied by the unnormalized Bayesian posterior used in the numerical example; without this verification the guarantees do not automatically transfer to the claimed setting.
  2. [§4] §4 (multi-dimensional extension): the reduction to power diagrams is presented as an effective implementation strategy, but the error analysis that would quantify how the bound tightness degrades with dimension is missing; this omission weakens the claim that the method scales beyond the two-dimensional illustration.
minor comments (2)
  1. [Abstract] The abstract asserts 'proven convergence' without even a one-sentence indication of the key hypotheses; adding a brief qualifier would improve readability.
  2. [§2] Notation for the envelope functions and the sequence of bounds is introduced without a consolidated table or diagram; a single schematic would help readers track the iterative refinement.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate the planned revisions.

read point-by-point responses
  1. Referee: [§3] §3 (convergence theorem): the statement that convergence and accuracy hold under 'mild conditions' is load-bearing for the central claim, yet the precise hypotheses (e.g., requirements on log-concavity, tail decay, or envelope tightness) are not shown to be satisfied by the unnormalized Bayesian posterior used in the numerical example; without this verification the guarantees do not automatically transfer to the claimed setting.

    Authors: We agree that making the verification explicit strengthens the paper. Theorem 3.1 requires the target density to admit a suitable majorizing envelope and to satisfy mild tail integrability conditions. The Bayesian posterior in the numerical example is formed from a Gaussian likelihood and a log-concave prior, which guarantees log-concavity and exponential tail decay. In the revised manuscript we will add a short paragraph in §3 (or a dedicated appendix subsection) that confirms these hypotheses hold for the specific posterior, including a brief analytic check on the envelope tightness and tail behavior. revision: yes

  2. Referee: [§4] §4 (multi-dimensional extension): the reduction to power diagrams is presented as an effective implementation strategy, but the error analysis that would quantify how the bound tightness degrades with dimension is missing; this omission weakens the claim that the method scales beyond the two-dimensional illustration.

    Authors: We acknowledge that a quantitative error analysis describing the degradation of bound tightness with dimension is absent from the current version. The power-diagram construction yields an efficient computational procedure, yet deriving explicit dimension-dependent rates would require further geometric analysis of cell volumes. In the revision we will expand §4 with a discussion of the expected scaling behavior derived from the properties of power diagrams, together with a remark on the practical limitations for dimensions substantially higher than two. We will also note that the two-dimensional numerical results already illustrate the method’s viability in moderate dimensions. revision: partial

Circularity Check

0 steps flagged

No circularity: constructive MM-based bounding procedure with independent convergence claims

full rationale

The paper presents a novel sequential method that applies the majorization-minimization framework and an envelope principle to iteratively construct upper and lower bounds on moments of unnormalized distributions. Convergence and accuracy are asserted under explicitly invoked mild conditions whose statement is independent of the target result itself. No equations reduce a claimed prediction to a fitted parameter by construction, no self-citation chain supplies the uniqueness or convergence guarantee, and the multi-dimensional generalization via power diagrams is presented as an implementation choice rather than a renaming of a prior result. The numerical Bayesian example functions as validation, not as the source of the bounds. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, axioms, or invented entities can be identified from the given text.

pith-pipeline@v0.9.0 · 5511 in / 1008 out tokens · 26876 ms · 2026-05-17T03:15:17.290163+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

34 extracted references · 34 canonical work pages · 1 internal anchor

  1. [1]

    C. P. Robert, G. Casella, Monte Carlo Statistical Method s, Springer, 2004

  2. [2]

    J. S. Liu, J. S. Liu, Monte Carlo strategies in scientific c omputing, Vol. 10, Springer, 2001.565

  3. [3]

    V. I. Krylov, A. H. Stroud, Approximate calculation of in tegrals, Courier Corporation, 2006

  4. [4]

    P. J. Davis, P. Rabinowitz, Methods of Numerical Integra tion, Academic Press, 1984

  5. [5]

    Demailly, Analyse numérique et équations différen tielles, Grenoble Sciences, EDP Sci- ences, Les Ulis, France, 2006

    J.-P. Demailly, Analyse numérique et équations différen tielles, Grenoble Sciences, EDP Sci- ences, Les Ulis, France, 2006

  6. [6]

    Brass, K

    H. Brass, K. Petras, Quadrature theory: the theory of num erical integration on a compact570 interval, no. 178, American Mathematical Soc., 2011

  7. [7]

    Elvira, L

    V. Elvira, L. Martino, P. Closas, Importance gaussian qu adrature, IEEE Transactions on Signal Processing 69 (2020) 474–488

  8. [8]

    The MathWorks, MATLAB Integral Function Documentati on, https://fr.mathworks.com/help/matlab/ref/integral.html (2025).575

    I. The MathWorks, MATLAB Integral Function Documentati on, https://fr.mathworks.com/help/matlab/ref/integral.html (2025).575

  9. [9]

    Kahaner, C

    D. Kahaner, C. Moler, S. Nash, Numerical Methods and Soft ware, Prentice Hall, 1989

  10. [10]

    A. B. Owen, Monte Carlo theory, methods and examples, https://artowen.su.domains/mc/, 2013

  11. [11]

    arXiv preprint arXiv:2102.05407 , year=

    V. Elvira, L. Martino, Advances in importance sampling , arXiv preprint arXiv:2102.05407 (2021).580

  12. [12]

    Novak, K

    E. Novak, K. Ritter, High dimensional integration of sm ooth functions over cubes, Numerische Mathematik 75 (1996) 79–97. 39

  13. [13]

    Evans, T

    M. Evans, T. Swartz, Approximating Integrals via Monte Carlo and Deterministic Methods, Vol. 20 of Oxford Statistical Science Series, Oxford Univer sity Press, Oxford, UK, 2000

  14. [14]

    Martino, V

    L. Martino, V. Elvira, F. Louzada, Effective sample size for importance sampling based on585 discrepancy measures, Signal Processing 131 (2017) 386–40 1

  15. [15]

    Elvira, L

    V. Elvira, L. Martino, C. P. Robert, Rethinking the effec tive sample size, International Statis- tical Review 90 (3) (2022) 525–550

  16. [16]

    A. Owen, Y. Zhou, Safe and effective importance sampling , Journal of the American Statistical Association 95 (449) (2000) 135–143.590

  17. [17]

    Elvira, L

    V. Elvira, L. Martino, D. Luengo, M. F. Bugallo, General ized multiple importance sampling, Statistical Science 34 (1) (2019) 129–155

  18. [18]

    Chouzenoux, A

    E. Chouzenoux, A. Jezierska, J.-C. Pesquet, H. Talbot, A Majorize-Minimize subspace approach for l2-l0 image regu larization, SIAM Journal on Imaging Sciences 6 (1) (2013) 563–591. doi:10.1137/11085997X.595 URL https://hal.science/hal-00789962

  19. [19]

    Chouzenoux, J.-C

    E. Chouzenoux, J.-C. Pesquet, Optimization Methods fo r Signal Processing, in: C.Jutten, L. Duarte, S. Moussaoui (Eds.), Source Separation in Physic al-Chemical Sensing, no. Chapter 2, IEEE Press, 2023. URL https://inria.hal.science/hal-04250055600

  20. [20]

    Y. Sun, P. Babu, D. P. Palomar, Majorization-minimizat ion algorithms in signal processing, communications, and machine learning, IEEE Transactions o n Signal Processing 65 (3) (2017) 794–816. doi:10.1109/TSP.2016.2601299

  21. [21]

    Bauschke, P

    H. Bauschke, P. Combettes, Convex Analysis and Monoton e Operator Theory in Hilbert Spaces, 2nd Edition, Springer, New York, 2017.605

  22. [22]

    Boissonnat, M

    J.-D. Boissonnat, M. Yvinec, Algorithmic Geometry, Ca mbridge University Press, 1998

  23. [23]

    D. P. Bertsekas, J. N. Tsitsiklis, Introduction to Prob ability, Athena Scientific, 2008

  24. [24]

    Rudin, Principles of Mathematical Analysis, McGraw -Hill, 1976

    W. Rudin, Principles of Mathematical Analysis, McGraw -Hill, 1976. 40

  25. [25]

    Abramowitz, I

    M. Abramowitz, I. A. Stegun, Handbook of mathematical f unctions: with formulas, graphs, and mathematical tables, Dover Pub., number 55., 1972.610

  26. [26]

    The MathWorks, Inc., norminv: Inverse of the normal cum u- lative distribution function, mATLAB R2025a, Retrieved fr om https://fr.mathworks.com/help/stats/norminv.html#References (2025)

  27. [27]

    Owen, Monte Carlo Theory, Methods and Examples, http://statweb.stanford.edu/∼ owen/mc/, 2013.615

    A. Owen, Monte Carlo Theory, Methods and Examples, http://statweb.stanford.edu/∼ owen/mc/, 2013.615

  28. [28]

    H. Kahn, A. W. Marshall, Methods of reducing sample size in Monte Carlo computations, Journal of the Operations Research Society of America 1 (5) ( 1953) 263–278

  29. [29]

    Evans, T

    M. Evans, T. Swartz, An algorithm for the approximation of integrals with exact error bounds (11 1997)

  30. [30]

    G. Bouchard, Efficient bounds for the softmax function an d applications to approximate infer-620 ence in hybrid models, in: Proceedings of the Neural Informa tion Processing Systems (NIPS 2008), Vol. 31, Vancouver, B.C., Canada, Dec. 2008

  31. [31]

    Chouzenoux, J.-B

    E. Chouzenoux, J.-B. Fest, SABRINA: A Stochastic Subsp ace Majorization-Minimization Algorithm, Journal of Optimization Theory and Applications 195 (2022) 919–952. URL https://hal.science/hal-03793623625

  32. [32]

    J. F. de Cossio-Diaz, Moments of the univariate truncat ed normal distribution, https://arxiv.org/abs/1803.09541, march 26 (2018)

  33. [33]

    Orjebin, A recursive formula for the moments of a trun cated univariate normal distribution (September 26 2014)

    E. Orjebin, A recursive formula for the moments of a trun cated univariate normal distribution (September 26 2014)

  34. [34]

    Chevillard, The functions erf and erfc computed with arbitrary precision and explicit error630 bounds, Information and Computation 216 (2012) 72–95

    S. Chevillard, The functions erf and erfc computed with arbitrary precision and explicit error630 bounds, Information and Computation 216 (2012) 72–95. doi:10.1016/j.ic.2011.09.001. 41