Balancing the exploration-exploitation trade-off in active learning for surrogate model-based reliability analysis via multi-objective optimization
Pith reviewed 2026-05-18 21:05 UTC · model grok-4.3
The pith
Casting acquisition as multi-objective optimization lets adaptive rules balance exploration against exploitation for surrogate reliability analysis.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By formulating sample acquisition as a multi-objective optimization problem in which exploration (global uncertainty reduction) and exploitation (accuracy near the failure boundary) are explicit competing objectives, the method produces a Pareto set that quantifies the trade-off; principled criteria and adaptive rules, including a scheduled exploration-to-exploitation shift and a reliability-aware selection rule, then select the next sample, yielding strategies whose relative failure-probability error trajectories, sample-efficiency comparisons, and global rankings demonstrate robust overall performance that consistently meets strict error targets across diverse limit-state functions.
What carries the argument
The multi-objective optimization formulation of the acquisition step that returns a Pareto set of candidate samples from which adaptive trade-off rules choose the next limit-state evaluation.
If this is right
- Relative failure-probability error trajectories converge to targets with fewer high-fidelity evaluations than single-score baselines.
- Sample-efficiency rankings remain favorable across varied limit-state functions without per-problem retuning.
- Strict error targets are met consistently by the scheduled-shift and reliability-aware selection rules.
- The Pareto-set representation supplies an explicit, quantifiable view of the exploration-exploitation balance at each iteration.
Where Pith is reading between the lines
- The same Pareto-based acquisition could be ported to other surrogate tasks such as Bayesian optimization or global sensitivity analysis that also face exploration-exploitation decisions.
- Higher-dimensional input spaces may require new normalization or selection heuristics to keep the Pareto front computationally tractable.
- Comparing the overhead of Pareto-front generation against portfolio-based acquisition would clarify whether the added machinery pays off in wall-clock time on very expensive models.
Load-bearing premise
The proposed adaptive trade-off rules will select samples from the Pareto set in a way that generalizes to unseen limit-state functions and surrogate models without extensive per-problem tuning.
What would settle it
A new limit-state function or surrogate model on which the adaptive MOO strategies either fail to reach the target error level or require substantially more evaluations than the conventional single-score methods.
Figures
read the original abstract
Reliability assessment of engineering systems often requires repeated evaluations of limit-state functions that may rely on computationally expensive high-fidelity models, rendering direct sampling-based reliability analysis impractical. An effective solution is to approximate the limit-state function with a surrogate model that can be iteratively refined through active learning, thereby reducing the number of model evaluations. At each iteration, an acquisition strategy selects the next sample for evaluation by balancing two competing objectives: exploration, to reduce global predictive uncertainty, and exploitation, to improve accuracy near the failure boundary. Conventional strategies such as the U-function, EFF, ERF, REIF, and portfolio-based schemes encode this balance through single pointwise scores, concealing the underlying trade-off. In this work, we formulate sample acquisition as a multi-objective optimization (MOO) problem in which exploration and exploitation are explicit competing objectives, yielding a compact Pareto set that provides a quantifiable trade-off representation. To select samples from the Pareto set, we investigate principled MOO criteria and propose adaptive trade-off rules, including a scheduled exploration-to-exploitation shift and a reliability-aware selection rule. Across diverse limit-state functions, we evaluate all tested strategies through relative failure-probability error trajectories, sample-efficiency comparisons, and global rankings, showing that the adaptive MOO-based strategies achieve robust overall performance while consistently meeting strict error targets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper formulates sample acquisition in active learning for surrogate-based reliability analysis as a multi-objective optimization (MOO) problem with explicit exploration (global uncertainty reduction) and exploitation (accuracy near the failure boundary) objectives. It generates a Pareto set of candidate samples and proposes adaptive selection rules, including a scheduled exploration-to-exploitation shift and a reliability-aware criterion, to choose the next evaluation point. Evaluations across multiple limit-state functions compare these strategies to conventional single-score methods (U-function, EFF, etc.) using relative failure-probability error trajectories, sample-efficiency metrics, and global rankings, claiming that the adaptive MOO approaches deliver robust performance and consistently meet strict error targets.
Significance. If the adaptive MOO rules generalize without per-problem retuning, the work offers a more transparent and quantifiable representation of the exploration-exploitation trade-off than single-score heuristics. This could improve sample efficiency and reliability in engineering applications involving expensive high-fidelity models, particularly when the Pareto front provides interpretable options for balancing the two objectives.
major comments (2)
- The central claim of robust generalization across diverse limit-state functions rests on evaluations using a finite collection of analytic benchmarks. No cross-validation on held-out functions or additional test cases with differing smoothness, dimensionality, or discontinuity characteristics is described, leaving open whether the observed rankings and error trajectories are driven by the adaptive rules themselves or by characteristics shared with the benchmark suite.
- The proposed adaptive trade-off rules (scheduled exploration-to-exploitation shift and reliability-aware Pareto selection) contain scheduling hyperparameters. The manuscript does not report sensitivity analysis on these parameters or demonstrate that performance remains stable under modest perturbations, which is necessary to substantiate that the adaptivity, rather than benchmark-specific tuning, produces the reported gains in sample efficiency and error-target compliance.
minor comments (2)
- Clarify the exact definition and implementation of the reliability-aware selection rule when multiple points lie on the Pareto front; a short pseudocode or explicit tie-breaking procedure would improve reproducibility.
- Ensure that all reported error trajectories include uncertainty bands (e.g., standard deviation over repeated runs) so that apparent differences between strategies can be assessed for statistical significance.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments, which help clarify the scope and limitations of our evaluation. We address each major comment below and outline revisions that will be incorporated into the next version of the manuscript.
read point-by-point responses
-
Referee: The central claim of robust generalization across diverse limit-state functions rests on evaluations using a finite collection of analytic benchmarks. No cross-validation on held-out functions or additional test cases with differing smoothness, dimensionality, or discontinuity characteristics is described, leaving open whether the observed rankings and error trajectories are driven by the adaptive rules themselves or by characteristics shared with the benchmark suite.
Authors: The benchmark functions were deliberately chosen from the standard set used in surrogate-based reliability analysis to span a range of dimensionalities, degrees of smoothness, and discontinuity features. The adaptive MOO strategies exhibited consistent ranking and error-target compliance across this collection without requiring function-specific retuning. We agree that a broader set of test cases would further strengthen the generalization claim. In the revision we will add results on two additional limit-state functions with higher dimensionality and more pronounced discontinuities, together with an expanded discussion of benchmark selection criteria and acknowledged limitations for functions outside the tested class. revision: yes
-
Referee: The proposed adaptive trade-off rules (scheduled exploration-to-exploitation shift and reliability-aware Pareto selection) contain scheduling hyperparameters. The manuscript does not report sensitivity analysis on these parameters or demonstrate that performance remains stable under modest perturbations, which is necessary to substantiate that the adaptivity, rather than benchmark-specific tuning, produces the reported gains in sample efficiency and error-target compliance.
Authors: The scheduling parameters were selected according to a principled linear decay of the exploration weight as surrogate accuracy improves, rather than by exhaustive per-benchmark optimization. While internal checks during development indicated stability, we acknowledge that a formal sensitivity study was omitted from the submitted manuscript. The revised version will include a new subsection that perturbs the scheduling and reliability-aware thresholds by ±20 % and reports the resulting variation in sample-efficiency metrics and failure-probability error trajectories for all benchmarks, thereby confirming that the reported gains arise from the adaptive structure itself. revision: yes
Circularity Check
No circularity: formulation and adaptive rules are presented as independent proposals evaluated empirically
full rationale
The paper formulates acquisition as an explicit MOO problem yielding a Pareto set, then proposes new adaptive selection rules (scheduled shift and reliability-aware) that are not derived from or fitted to the target performance metrics. Evaluation proceeds via direct empirical comparison of error trajectories and rankings on benchmark limit-state functions. No load-bearing step reduces by the paper's equations to a self-defined quantity, fitted input renamed as prediction, or self-citation chain; the central claims rest on the proposed heuristics and their observed behavior rather than tautological re-expression of inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Pareto optimality defines the set of non-dominated solutions for the two objectives
Reference graph
Works this paper leans on
-
[1]
R. Teixeira, M. Nogal, A. O’Connor, Adaptive approaches in metamodel-based reliability analysis: A review, Structural Safety 89 (2021) 102019
work page 2021
-
[2]
R. Teixeira, B. Martinez-Pastor, M. Nogal, A. O’Connor, Reliability analysis using a multi-metamodel complement-basis approach, Reliability Engineering & System Safety 205 (2021) 107248
work page 2021
-
[3]
M. Moustapha, S. Marelli, B. Sudret, Active learning for structural reliability: Survey, general framework and benchmark, Structural Safety 96 (2022) 102174
work page 2022
-
[4]
B. Sudret, Global sensitivity analysis using polynomial chaos expansions, Reliability Engineering & System Safety 93 (7) (2008) 964–979
work page 2008
-
[5]
B. Bichon, S. Mahadevan, M. Eldred, Reliability-based design optimization using e fficient global reliability analysis, in: 50th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics, and Materials Conference 17th AIAA/ASME/AHS Adaptive Structures Conference 11th AIAA No, 2009, p. 2261
work page 2009
-
[6]
M. Ehre, I. Papaioannou, B. Sudret, D. Straub, Sequential active learning of low-dimensional model repre- sentations for reliability analysis, SIAM Journal on Scientific Computing 44 (3) (2022) B558–B584
work page 2022
-
[7]
P. G. Morato, K. G. Papakonstantinou, C. P. Andriotis, J. S. Nielsen, P. Rigo, Optimal inspection and main- tenance planning for deteriorating structural components through dynamic Bayesian networks and Markov decision processes, Structural Safety 94 (2022) 102140
work page 2022
-
[8]
J. Moran A., P. G. Morato, P. Rigo, Active learning for structural reliability analysis with multiple limit-state functions through variance-enhanced PC-Kriging surrogate models, in: Proceedings of the 14th International Conference on Applications of Statistics and Probability in Civil Engineering (ICASP14), 2023, pp. 1–8
work page 2023
- [9]
-
[10]
S. Marelli, B. Sudret, An active-learning algorithm that combines sparse polynomial chaos expansions and bootstrap for structural reliability analysis, Structural Safety 75 (2018) 67–74
work page 2018
- [11]
-
[12]
Y . Bao, Z. Xiang, H. Li, Adaptive subset searching-based deep neural network method for structural reliability analysis, Reliability Engineering & System Safety 213 (2021) 107778
work page 2021
-
[13]
M. S. Chauhan, M. Ojeda-Tuz, R. A. Catarelli, K. R. Gurley, D. Tsapetis, M. D. Shields, On active learning for Gaussian process-based global sensitivity analysis, Reliability Engineering & System Safety 245 (2024) 109945
work page 2024
-
[14]
M. Moustapha, P. Parisi, S. Marelli, B. Sudret, Reliability analysis of arbitrary systems based on active learn- ing and global sensitivity analysis, Reliability Engineering & System Safety 248 (2024) 110150
work page 2024
-
[15]
S. L. Dhulipala, M. D. Shields, P. Chakroborty, W. Jiang, B. W. Spencer, J. D. Hales, V . M. Laboure, Z. M. Prince, C. Bolisetti, Y . Che, Reliability estimation of an advanced nuclear fuel using coupled active learning, multifidelity modeling, and subset simulation, Reliability Engineering & System Safety 226 (2022) 108693
work page 2022
- [16]
-
[17]
B. J. Bichon, M. S. Eldred, L. P. Swiler, S. Mahadevan, J. M. McFarland, E fficient global reliability analysis for nonlinear implicit performance functions, AIAA Journal 46 (10) (2008) 2459–2468
work page 2008
- [18]
-
[19]
I. Das, J. E. Dennis, A closer look at drawbacks of minimizing weighted sums of objectives for Pareto set generation in multicriteria optimization problems, Structural Optimization 14 (1) (1997) 63–69
work page 1997
-
[20]
R. T. Marler, J. S. Arora, The weighted sum method for multi-objective optimization: new insights, Structural and Multidisciplinary Optimization 41 (2010) 853–862
work page 2010
-
[21]
Pareto, Manuale di economia politica, Piccola biblioteca scientifica, Societa Editrice, 1906
V . Pareto, Manuale di economia politica, Piccola biblioteca scientifica, Societa Editrice, 1906
work page 1906
-
[22]
R. T. Marler, J. S. Arora, Survey of multi-objective optimization methods for engineering, Structural and Multidisciplinary Optimization 26 (2004) 369–395
work page 2004
-
[23]
Miettinen, Nonlinear multiobjective optimization, V ol
K. Miettinen, Nonlinear multiobjective optimization, V ol. 12, Springer Science & Business Media, 1999
work page 1999
-
[24]
Y . Shi, C. Lin, Y . Ma, J. Shen, Active learning Kriging-based multi-objective modeling and optimization for system reliability-based robust design, Reliability Engineering & System Safety 245 (2024) 110007
work page 2024
-
[25]
Z. Lv, L. Wang, Z. Han, J. Zhao, W. Wang, Surrogate-assisted particle swarm optimization algorithm with Pareto active learning for expensive multi-objective optimization, IEEE /CAA Journal of Automatica Sinica 6 (3) (2019) 838–849
work page 2019
-
[26]
M. Xiao, J. Zhang, L. Gao, A system active learning Kriging method for system reliability-based design optimization with a multiple response model, Reliability Engineering & System Safety 199 (2020) 106935
work page 2020
-
[27]
J. Chen, Z. Chen, W. Jiang, H. Guo, L. Chen, A reliability-based design optimization strategy using quantile surrogates by improved PC-Kriging, Reliability Engineering & System Safety 253 (2025) 110491
work page 2025
-
[28]
V . Dubourg, B. Sudret, J.-M. Bourinet, Reliability-based design optimization using Kriging surrogates and subset simulation, Structural and Multidisciplinary Optimization 44 (2011) 673–690
work page 2011
-
[29]
R. Liu, G. Ma, F. Kong, Z. Ai, K. Xiong, W. Zhou, X. Wang, X. Chang, Pareto-guided active learning for accelerating surrogate-assisted multi-objective optimization of arch dam shape, Engineering Structures 326 (2025) 119541
work page 2025
-
[30]
R. S. de Oliveira, M. F. de L.O. Santos, S. M. Afonso, R. de S. Motta, Successive Pareto simulation method for efficient structural reliability analysis, Probabilistic Engineering Mechanics 81 (2025) 103819
work page 2025
-
[31]
S. Bechikh, L. Ben Said, K. Gh ´edira, Searching for knee regions in multi-objective optimization using mobile reference points, in: Proceedings of the 2010 ACM symposium on applied computing, 2010, pp. 1118–1125
work page 2010
-
[32]
Zeleny, Compromise programming, Multiple Criteria Decision Making (1973)
M. Zeleny, Compromise programming, Multiple Criteria Decision Making (1973)
work page 1973
-
[33]
D. D. Cox, S. John, A statistical method for global optimization, in: IEEE international conference on systems, man, and cybernetics, IEEE, 1992, pp. 1241–1246
work page 1992
- [34]
- [35]
-
[36]
Proceedings 8, Springer, 2004, pp. 722–731
work page 2004
-
[37]
K. Deb, S. Gupta, Understanding knee points in bicriteria problems and their implications as preferred solution principles, Engineering optimization 43 (11) (2011) 1175–1204
work page 2011
-
[38]
C. Rasmussen, C. Williams, Gaussian processes for machine learning, Adaptive Computation and Machine Learning series, MIT Press, 2005
work page 2005
-
[39]
Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization
J. L. Morales, J. Nocedal, Remark on “Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization”, ACM Transactions on Mathematical Software (TOMS) 38 (1) (2011) 1–4
work page 2011
-
[40]
M. D. McKay, R. J. Beckman, W. J. Conover, A comparison of three methods for selecting values of input variables in the analysis of output from a computer code, Technometrics 21 (2) (1979) 239–245
work page 1979
-
[41]
Consortium des ´Equipements de Calcul Intensif (C ´ECI), C ´ECI High-performance computing clusters, https://www.ceci-hpc.be/clusters.html, accessed: August 21, 2025 (2025)
work page 2025
-
[42]
A. B. Yoo, M. A. Jette, M. Grondona, SLURM: Simple Linux Utility for Resource Management, in: D. Fei- telson, L. Rudolph, U. Schwiegelshohn (Eds.), Job Scheduling Strategies for Parallel Processing, Springer Berlin Heidelberg, Berlin, Heidelberg, 2003, pp. 44–60
work page 2003
-
[43]
P. Waarts, Structural reliability using finite element methods: an appraisal of directional adaptive response surface sampling (DARS), Ph.D. thesis, Technische Universiteit Delft (2000)
work page 2000
-
[44]
L. Schueremans, D. Van Gemert, Benefit of splines and neural networks in simulation based structural relia- bility analysis, Structural Safety 27 (3) (2005) 246–261
work page 2005
-
[45]
L. Schueremans, D. Van Gemert, Use of Kriging as meta-model in simulation procedures for structural relia- bility, in: 9th International Conference on Structural Safety and Reliability, 2005, pp. 2483–2490
work page 2005
-
[46]
D. M. Himmelblau, et al., Applied nonlinear programming, McGraw-Hill, 2018
work page 2018
-
[47]
K. G. Papakonstantinou, H. Nikbakht, E. Eshra, Hamiltonian MCMC methods for estimating rare events probabilities in high-dimensional problems, Probabilistic Engineering Mechanics 74 (2023) 103485
work page 2023
- [48]
-
[49]
M. H. F. Nielsen, Benefit of splines and neural networks in simulation based structural reliability analysis, Structural Safety 27 (2005) 403–404
work page 2005
- [50]
-
[51]
R. Rackwitz, Reliability analysis — A review and some perspectives, Structural Safety 23 (4) (2001) 365–395. 29 Appendix A. Proofs Appendix A.1. Proof of Proposition 1 by contradiction Proof. Suppose xU is the minimizer of U(x) but is not Pareto optimal, then: ∃ x ∈ X : h ∀i ∈ {µ, σ}, fi(x) ≥ fi(xU) i ∧ h ∃i ∈ {µ, σ}, fi(x) > fi(xU) i . (A.1) By explicitl...
work page 2001
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.