Estimating a probability of failure with the convex order in computer experiments
Pith reviewed 2026-05-25 09:50 UTC · model grok-4.3
The pith
A convex-order inequality between two bias-equivalent Kriging estimators ranks their efficiency for black-box failure probability estimation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
There exists a convex order inequality between the Kriging-based estimator of the failure probability and the proposed alternative estimator; the inequality can be used both to compare efficiency and to quantify uncertainty, and it yields a sequential design procedure for computer experiments.
What carries the argument
The convex order relation between the two estimators of the failure probability; it ranks them by the expected value of any convex function applied to the estimator.
If this is right
- The alternative estimator has lower or equal variance for any convex loss, hence is more efficient under the same bias.
- Bounds on the variability of the estimated failure probability follow directly from the convex-order relation.
- A sequential design algorithm can be constructed that selects the next simulation point to shrink the convex-order gap.
Where Pith is reading between the lines
- The same ordering technique may apply to other threshold functionals of Gaussian processes, such as expected excursion volume.
- On low-dimensional analytic test cases the inequality can be verified by direct Monte Carlo replication of both estimators.
Load-bearing premise
The alternative estimator matches the Kriging estimator in bias and the convex-order comparison extends to the failure-probability functional.
What would settle it
A numerical check on an analytic test function that records whether one estimator consistently shows smaller variance than the other while their means remain equal.
Figures
read the original abstract
This paper deals with the estimation of a failure probability of an industrial product. To be more specific, it is defined as the probability that the output of a physical model, with random input variables, exceeds a threshold. The model corresponds with an expensive to evaluate black-box function, so that classical Monte Carlo simulation methods cannot be applied. Bayesian principles of the Kriging method are then used to design an estimator of the failure probability. From a numerical point of view, the practical use of this estimator is restricted. An alternative estimator is proposed, which is equivalent in term of bias. The main result of this paper concerns the existence of a convex order inequality between these two estimators. This inequality allows to compare their efficiency and to quantify the uncertainty on the results that these estimators provide. A sequential procedure for the construction of a design of computer experiments, based on the principle of the Stepwise Uncertainty Reduction strategies, also results of the convex order inequality. The interest of this approach is highlighted through the study of a real case from the company STMicroelectronics.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper defines a Kriging-based estimator of failure probability for an expensive black-box simulator and proposes an alternative estimator shown to have identical bias. The central result is a convex-order inequality between the two random variables, proved using properties of the Gaussian-process posterior. This inequality is used to rank estimator efficiency, bound uncertainty, and derive a Stepwise Uncertainty Reduction (SUR) sequential design. The approach is illustrated on an industrial case from STMicroelectronics.
Significance. If the convex-order result holds, the paper supplies a rigorous, assumption-light tool for comparing the two estimators and for driving adaptive designs in rare-event estimation. The explicit bias calculation and the use of convex order on the failure-probability functional constitute a clear technical contribution; the manuscript also provides the construction and proof, which are positive features.
minor comments (3)
- The notation for the two estimators (Kriging-based and alternative) should be introduced with explicit formulas in §2 or §3 so that the bias-equivalence statement can be checked without back-referencing the abstract.
- Figure captions for the industrial example should state the dimension of the input space and the number of design points used, to allow direct comparison with other SUR strategies.
- A short remark on the computational cost of evaluating the convex-order bound itself would help readers assess practicality for larger designs.
Simulated Author's Rebuttal
We thank the referee for the positive summary, the recognition of the technical contribution of the convex-order result, and the recommendation of minor revision. We are pleased that the bias calculation, the use of convex order on the failure-probability functional, and the resulting SUR design are viewed favorably.
Circularity Check
No significant circularity detected
full rationale
The manuscript supplies explicit constructions of both the Kriging-based estimator and the alternative estimator, derives their bias equivalence from the Gaussian process posterior, and proves the convex-order inequality directly from properties of that posterior. These steps constitute an internally supported derivation chain with no reduction to fitted inputs, self-citations, or ansatzes that would render the central claim equivalent to its own premises by construction.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel (J is the unique calibrated reciprocal cost) echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
Sn ≤cx Rn … E[ϕ(Rn)] ≤ E[ϕ(Sn)] for convex ϕ (Prop. 4.1); Var[Sn] ≤ Var[Rn]; quantile bounds via convex-order integrals (Prop. 5.2)
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection (coupling combiner forces bilinear branch) unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
SUR criterion JRn(x) = En[Var[Rn+1] | Xn+1=x] derived from the same convex-order inequality
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
P. Abrahamsen. A review of Gaussian random fields and correlation functions-2nd edition. Norwegian Computing Center , 1997
work page 1997
- [2]
- [3]
-
[4]
D. Azzimonti, J. Bect, C. Chevalier, and D. Ginsbourger. Quantifying uncertainties on excursion sets under a Gaussian random field prior. SIAM/ASA Journal of Uncertainty Quantification, 4(1):850–874, 2016
work page 2016
-
[5]
N. B¨ auerle and A. M¨ uller. Stochastic orders and risk measures: consistency and bounds. Insurance Mathematics and Economics , 38(1):132:148, 2006
work page 2006
-
[6]
J. Bect, F. Bachoc, and D. Ginsbourger. A supermartingale approach to Gaussian process based sequential design of experiments. Bernoulli, 2019. 26
work page 2019
-
[7]
J. Bect, D. Ginsbourger, L. Li, V. Picheny, and E. Vazquez. Sequential design of computer experiments for the estimation of a probability of failure. Statistics and Computing, 22(3):773–793, 2012
work page 2012
-
[8]
L. Bernard. M´ ethodes probabilistes pour l’estimation de probabilit´ es de d´ efaillance. PhD thesis, Universit´ e de Tours, 2019
work page 2019
-
[9]
B. J. Bichon, M. S. Eldred, L. P. Swiler, S. Mahadevan, and J. M. McFarland. Efficient global reliability analysis for nonlinear implicit performance functions. AIAA Journal, 46:2459–2468, 2008
work page 2008
-
[10]
M. Boutsikas and E. Vaggelatou. On the distance between convex-ordered random variables, with applications. Advances in Applied Probability, 34:349–374, 2002
work page 2002
-
[11]
V. Brazauskas, B. L. Jones, M. L. Puri, and R. Zitikis. Estimating conditional tail expectation with acturial applications in view. Journal of Statistical Planning and Inference, 138:3590–3604, 2008
work page 2008
- [12]
-
[13]
C. Chevalier, J. Bect, D. Ginsbourger, E. Vazquez, V. Picheny, and Y. Richet. Fast parallel kriging-based stepwise uncertainty reduction with application to the identification of an excursion set. Technometrics, 56(4):455–465, 2014
work page 2014
-
[14]
C. Chevalier, V. Picheny, and D. Ginsbourger. KrigInv: An efficient and user-friendly R implementation of Kriging-based inversion algorithms. Computational Statistics & Data Analysis , 71:1021–1034, 2014
work page 2014
-
[15]
J. Chil` es and P. Delfiner.Geostatistis: modeling spatial uncertainty, volume 2. Wiley series in probability and statistics, 1999
work page 1999
- [16]
-
[17]
M. Davis. Consistency of risk measure estimates. 2013
work page 2013
-
[18]
H. Dette and A. Pepelyshev. Generalized latin hypercube design for computer experiments. Technometrics, 52(4):421–429, 2010
work page 2010
- [19]
-
[20]
P. Diggle and P.-J. Ribeiro. Model-based geostatistics. Springer Series in Statistics. 2007
work page 2007
-
[21]
V. Dubourg, F. Deheeger, and B. Sudret. Metamodel-based importance sampling for structural reliability analysis. Probabilistic Engineering Mechanics, 33:47–57, 2013
work page 2013
- [22]
-
[23]
M. R. El Amri, C. Helbert, O. Lepreux, M. Munoz Zuniga, C. Prieur, and D. Sinoquet. Data-driven stochastic inversion under functional uncertainties. working paper or preprint, Feb. 2018
work page 2018
-
[24]
M. J. Shervish. Theory of statistics . Springer, 2010
work page 2010
-
[25]
R. Kaas, J. Dhaene, D. Vyncke, M. Goovaerts, and M. Denuit. A simple geometric proof that comonotonic risks have the convex-largest sum. Astin Bulletin , 32:71–80, 2002. 27
work page 2002
-
[26]
I. Kaymaz. Application of Kriging method for structural reliability problems. Struct. Safety, 27:133–151, 2005
work page 2005
-
[27]
L. Le Gratiet. Multi-fidelity Gaussian process regression for computer experiments . PhD thesis, Universit´ e Paris-Diderot Paris VII, 2013
work page 2013
- [28]
- [29]
- [30]
-
[31]
A. M¨ uller and D. Stoyan.Comparison Methods for Stochastic Models and Risks. Wiley Series in Probability and Statistics. Hohn Wiley & Sons Ltd, Chichester, 2002
work page 2002
-
[32]
J. Oger, P. Leduc, and E. Lesigne. A random field model and decision support in industrial production. J. SFdS, 156(3):1–26, 2015
work page 2015
-
[33]
V. Picheny, D. Ginsbourger, O. Roustant, R. Haftka, and N. Kim. Adaptive designs of experiments for accurate approximation of a target region. Journal of Mechanical Design, 132, 2010
work page 2010
-
[34]
M. Pitera and T. Schmidt. Unbiased estimation of risk. Journal of Banking and Finance, 91, 2018
work page 2018
-
[35]
C. E. Rasmussen and C. K. I. Williams. Gaussian processes for machine learning . Adaptive Computation and Machine Learning. MIT Press, Cambridge, MA, 2006
work page 2006
-
[36]
C. Robert. The Bayesian Choice . Springer, 2007
work page 2007
-
[37]
O. Roustant, D. Ginsbourger, and Y. Deville. DiceKriging, DiceOptim: Two R packages for the analysis of computer experiments by Kriging-based metamodelling and optimization. Journal of Statistical Software , 51, 2012
work page 2012
- [38]
- [39]
- [40]
-
[41]
T. J. Santner, B. J. Williams, and W. I. Notz. The design and analysis of computer experiments. Springer Science & Business Media. 2003
work page 2003
-
[42]
M. Shaked and J. Shanthikumar. Stochastic orders. Springer Series in Statistics. 2007
work page 2007
- [43]
-
[44]
M. Stein. Large sample properties of simulations using Latin hypercube sampling. Technometrics, 29:143–151, 1987
work page 1987
-
[45]
M. L. Stein. Interpolation of spatial data . Springer Series in Statistics. 1999
work page 1999
-
[46]
D. Tasche. Expected shortall and beyond. Journal of Banking and Finance , 26:1519– 1533, 2002
work page 2002
-
[47]
E. Vestrup. The Theory of Measures and Integration . Wiley, 2003
work page 2003
-
[48]
H. Wackernagel. Multivariate Geostatistics. Springer-Verlag, Berlin, 2003
work page 2003
-
[49]
A. Wu, M. Aoi, and J. Pillow. Exploiting gradients and Hessians in Bayesian optimization and Bayesian quadrature. 2018. 28
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.