A Model-Agnostic Bootstrap for Macro-Level Claims Reserving Under the Conditioning Principle

Robin Van Oirbeek; Tim Verdonck

arxiv: 2605.15896 · v1 · pith:7WOIEKAZnew · submitted 2026-05-15 · 📊 stat.ME · stat.AP

A Model-Agnostic Bootstrap for Macro-Level Claims Reserving Under the Conditioning Principle

Robin Van Oirbeek , Tim Verdonck This is my paper

Pith reviewed 2026-05-20 15:57 UTC · model grok-4.3

classification 📊 stat.ME stat.AP

keywords claims reservingbootstrapconditioning principleDirichlet-Gamma hierarchypredictive distributiondevelopment factorsmacro-level reservingcredibility estimation

0 comments

The pith

A bootstrap for claims reserving satisfies the conditioning principle exactly by fixing the observed triangle and sampling only allocation proportions from their predictive distribution.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that existing bootstraps in macro-level claims reserving violate the conditioning principle because they resample functions of the observed data inside the predictive loop, which produces a coverage error of order one that does not shrink as the claims triangle grows. It shows that a Dirichlet-Gamma hierarchy supplies an exact predictive distribution for the allocation proportion W_i, allowing a bootstrap that holds the observed triangle fixed and simulates only W_i from a Beta distribution whose parameters depend on the fitted development factor. This construction inherits calibration from any development-proportion method such as Chain-Ladder or Bornhuetter-Ferguson and reduces the coverage deficit to order I to the minus one half, independent of the number of development periods. A sympathetic reader would care because the method offers a structural explanation for persistent miscalibration in standard approaches and delivers conservative intervals under compound Poisson data-generating processes.

Core claim

The Dirichlet-Gamma hierarchy admits a bootstrap that satisfies the conditioning principle exactly: the simulated increment equals the observed value times (1 minus W_i) over W_i, where W_i is drawn directly from its predictive Beta distribution with parameters c times the fitted development factor and c times one minus that factor. Only the allocation proportion is simulated while the observed triangle remains fixed, so the procedure inherits calibration from any development-proportion estimator and produces a coverage deficit of order I to the minus one half.

What carries the argument

The Dirichlet-Gamma hierarchy that supplies an exact predictive Beta distribution for the allocation proportion W_i conditional on the fitted development factors.

Load-bearing premise

The observed claims triangle is generated from or well approximated by a Dirichlet-Gamma hierarchy that supplies an exact predictive distribution for the allocation proportion W_i conditional on the fitted development factors.

What would settle it

Generate claims triangles from a Dirichlet-Gamma process with known development factors and check whether the proportion of bootstrap intervals that contain the true reserve approaches the nominal level at rate I to the minus one half as the number of accident years increases.

read the original abstract

The correct inferential object in claims reserving is the conditional predictive distribution $p(R \mid \mathcal{D}, \hat\theta)$, where $\mathcal{D}$ is the observed triangle held fixed. We refer to this as the conditioning principle. All existing bootstraps violate it by resampling functions of $\mathcal{D}$ inside the predictive loop, producing an $O(1)$ coverage error that does not vanish as the triangle grows. The Dirichlet-Gamma hierarchy admits a bootstrap that satisfies the principle exactly: $S^{IBNP}_i = X^{obs}_i (1-W_i)/W_i$ with $W_i \sim \mathrm{Beta}(c\hat{F}_{I-i}, c(1-\hat{F}_{I-i}))$ sampled directly from its predictive distribution. Only the allocation proportion $W_i$ is simulated; the observed triangle is held fixed. It thus inherits calibration from any development-proportion method (Chain-Ladder, Bornhuetter-Ferguson, Cape Cod, or other), making it model-agnostic. The coverage deficit is $O(I^{-1/2})$, independent of the number of development periods. Under compound Poisson data-generating processes the bootstrap is conservative for every $F_{I-i} \in (0,1)$: the predictive standard deviation analytically exceeds the true value by the factor $1/\sqrt{F_{I-i}}$. The ODP bootstrap violates the principle through two mechanisms in opposite directions: re-estimation inflates bootstrap variance under the ODP DGP, while missing accident-year frailty deflates it under frailty DGPs. The resulting coverage discrepancy is $\Omega(1)$ regardless of $I$, providing a structural explanation for the cross-portfolio miscalibration heterogeneity documented by Meyers (2015). Chain-Ladder, Bornhuetter-Ferguson and Cape Cod emerge as credibility estimators under diffuse, informative and pooling priors respectively, with identical structure for counts and amounts. The concentration $c$ serves as a diagnostic: $\hat{c} < 30$ signals non-stationary development.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a clean way to satisfy the conditioning principle exactly by sampling only the Beta allocation proportion while fixing the observed triangle, but the model-agnostic claim and O(I^{-1/2}) coverage do not obviously extend beyond the Dirichlet-Gamma hierarchy.

read the letter

The central contribution is a bootstrap that keeps the entire observed triangle fixed and samples only the allocation proportion W_i from Beta(c hat F, c(1-hat F)). This produces the predictive S_i^{IBNP} = X_obs (1-W_i)/W_i and satisfies p(R | D, hat theta) exactly under the Dirichlet-Gamma hierarchy. The coverage deficit is stated as O(I^{-1/2}) and independent of the number of development periods. Under compound Poisson data the bootstrap is conservative by the explicit factor 1/sqrt(F). That is new relative to the ODP and other resampling schemes cited in the abstract, and the structural explanation for why the ODP bootstrap produces Omega(1) coverage error in both directions is useful. The link from Chain-Ladder, Bornhuetter-Ferguson and Cape Cod to credibility estimators under different priors is also a tidy unification, and the concentration parameter c as a diagnostic for non-stationary development is practical. The math is formally grounded in the hierarchy and the coverage claims are given with explicit rates rather than simulation only. The soft spot is the model-agnostic extension. The Beta predictive law and the O(I^{-1/2}) rate are derived under the Dirichlet-Gamma asymptotics; when hat F comes from an arbitrary method such as plain Chain-Ladder or Cape Cod, nothing in the abstract guarantees that the sampled distribution equals the true conditional law given that hat theta. The conservativeness factor and independence from development periods likewise rest on model-specific variance calculations. The stress-test note therefore lands. The circularity burden from conditioning on fitted hat F and c is moderate but acknowledged. For readers working in actuarial reserving or regulatory capital this is worth attention because it targets a known structural flaw with a direct fix. It deserves a serious referee because the central construction is reproducible from the stated hierarchy and the coverage order is stated sharply enough to be checked.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes a bootstrap for macro-level claims reserving that adheres to the conditioning principle by holding the observed triangle D fixed and sampling only the allocation proportions W_i ~ Beta(c hat F_{I-i}, c(1-hat F_{I-i})), yielding S_i^{IBNP} = X_i^{obs} (1-W_i)/W_i. It claims this procedure is model-agnostic (compatible with any development-proportion estimator such as Chain-Ladder, Bornhuetter-Ferguson or Cape Cod), produces coverage deficit O(I^{-1/2}) independent of the number of development periods, and is conservative under compound Poisson DGPs by the factor 1/sqrt(F_{I-i}). It further contrasts the approach with the ODP bootstrap, which incurs Omega(1) coverage error, and interprets standard methods as credibility estimators under different priors.

Significance. If the central claims hold, the work is significant for providing an exact satisfaction of the conditioning principle p(R | D, hat theta) together with an analytical coverage-rate result and a structural account of ODP miscalibration heterogeneity. The unification of Chain-Ladder, BF and Cape Cod as credibility estimators under diffuse, informative and pooling priors, and the diagnostic role of the concentration parameter c, are additional strengths. The model-agnostic framing, if justified beyond the Dirichlet-Gamma hierarchy, would broaden applicability in reserving practice.

major comments (2)

[§3] §3 (Bootstrap Construction): The exact predictive law W_i ~ Beta(c hat F_{I-i}, c(1-hat F_{I-i})) and the resulting O(I^{-1/2}) coverage deficit are derived under the Dirichlet-Gamma hierarchy. When hat F is instead supplied by a non-hierarchical estimator (Chain-Ladder, BF, etc.), the manuscript does not supply a separate argument showing that the sampled distribution still equals the true conditional law p(W_i | D, hat theta) or that the coverage order remains O(I^{-1/2}); this is load-bearing for the model-agnostic claim.
[§4] §4 (Asymptotics and Conservativeness): The independence of the coverage deficit from the number of development periods and the explicit conservativeness factor 1/sqrt(F_{I-i}) rest on variance calculations performed under the hierarchy asymptotics. A concrete statement of the conditions under which these properties transfer to arbitrary hat F, or a brief simulation check with non-hierarchy estimators, is needed to support the central assertion.

minor comments (2)

[Abstract] Abstract: The term 'model-agnostic' appears in the title and abstract; a parenthetical clarification that it means 'compatible with any fitted hat F' would improve immediate readability.
[§2] Notation: The predictive distribution p(R | D, hat theta) is introduced in the abstract but could be restated once more explicitly in the first paragraph of §2 to anchor the subsequent development.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments. The two major comments correctly identify that the exact conditional sampling and coverage results are derived under the Dirichlet-Gamma hierarchy, while the model-agnostic claim relies on plugging in hat F from arbitrary estimators. We address each point below and indicate the revisions we will make.

read point-by-point responses

Referee: [§3] §3 (Bootstrap Construction): The exact predictive law W_i ~ Beta(c hat F_{I-i}, c(1-hat F_{I-i})) and the resulting O(I^{-1/2}) coverage deficit are derived under the Dirichlet-Gamma hierarchy. When hat F is instead supplied by a non-hierarchical estimator (Chain-Ladder, BF, etc.), the manuscript does not supply a separate argument showing that the sampled distribution still equals the true conditional law p(W_i | D, hat theta) or that the coverage order remains O(I^{-1/2}); this is load-bearing for the model-agnostic claim.

Authors: The bootstrap is constructed to sample exactly from the conditional predictive distribution of the allocation proportions under the Dirichlet-Gamma hierarchy, with hat F treated as a fixed plug-in estimator of the development proportions. By resampling only W_i while holding the observed triangle D fixed, the procedure satisfies p(R | D, hat theta) exactly under that hierarchy, regardless of the source of hat F. The model-agnostic property therefore refers to compatibility with any consistent development-proportion estimator rather than claiming that the Beta law remains the true conditional law outside the hierarchy. The O(I^{-1/2}) coverage deficit is proved under the hierarchy asymptotics. We will add a clarifying paragraph in §3 that distinguishes the exact conditional sampling (which holds under DG with any plugged-in hat F) from the coverage-rate result (which is hierarchy-specific) and note that the procedure remains a principled conditional bootstrap when external estimators are used. revision: partial
Referee: [§4] §4 (Asymptotics and Conservativeness): The independence of the coverage deficit from the number of development periods and the explicit conservativeness factor 1/sqrt(F_{I-i}) rest on variance calculations performed under the hierarchy asymptotics. A concrete statement of the conditions under which these properties transfer to arbitrary hat F, or a brief simulation check with non-hierarchy estimators, is needed to support the central assertion.

Authors: The stated independence from the number of development periods and the conservativeness factor 1/sqrt(F_{I-i}) are obtained from explicit variance calculations under the Dirichlet-Gamma hierarchy as I → ∞. These properties transfer to arbitrary hat F when the estimator is consistent for the true development proportions at rate o(I^{-1/2}). We agree that the manuscript would benefit from either a precise statement of these transfer conditions or a small simulation study. We will include a brief simulation appendix that applies the bootstrap with Chain-Ladder and Bornhuetter-Ferguson estimators on data generated from both hierarchical and non-hierarchical processes, reporting empirical coverage to illustrate robustness. revision: yes

Circularity Check

0 steps flagged

No circularity: bootstrap samples from explicit predictive law under stated hierarchy while holding data fixed

full rationale

The paper defines the IBNP bootstrap by sampling only the allocation proportion W_i from the Beta(c hat F, c(1-hat F)) predictive distribution supplied by the Dirichlet-Gamma hierarchy, with the observed triangle X^obs held fixed. This construction directly implements the conditioning principle p(R | D, hat theta) by design. The O(I^{-1/2}) coverage deficit and 1/sqrt(F) conservativeness are derived analytically from the hierarchy asymptotics and compound-Poisson DGP rather than recovered by fitting to the target coverage quantities. Although hat F and c are estimated from the same triangle, this is an explicit modeling assumption that supplies the predictive law; it does not render the bootstrap output equivalent to its inputs by construction. No load-bearing self-citation, uniqueness theorem, or ansatz smuggling is required for the central claim, and the model-agnostic inheritance from external estimators (Chain-Ladder, BF, Cape Cod) is presented as a consequence of using any fitted development proportions inside the hierarchy-supplied predictive. The derivation chain therefore remains self-contained against the stated assumptions.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The method rests on the Dirichlet-Gamma hierarchy as the generative model that supplies the exact predictive distribution for the allocation proportions; concentration c and development factors hat F are estimated from the triangle.

free parameters (2)

c
Concentration parameter of the Dirichlet-Gamma hierarchy, estimated from the triangle and used both for sampling and as a non-stationarity diagnostic.
hat F_{I-i}
Development proportion estimated from the observed triangle and plugged into the Beta parameters.

axioms (1)

domain assumption Claims data admit a Dirichlet-Gamma hierarchy whose predictive distribution for the allocation proportion W_i is exactly Beta(c hat F, c(1-hat F)).
This assumption is invoked to justify sampling W_i directly from its predictive distribution while holding the observed triangle fixed.

pith-pipeline@v0.9.0 · 5919 in / 1249 out tokens · 62144 ms · 2026-05-20T15:57:49.596218+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

37 extracted references · 37 canonical work pages

[1]

Bar-Lev, S. K. and Enis, P. , title =. The Annals of Statistics , year =

work page
[2]

Barndorff-Nielsen, O. E. , title =. Biometrika , year =

work page
[3]

B. Glaubw. Mitteilungen der Vereinigung Schweizerischer Versicherungsmathematiker , year =

work page
[4]

Bornhuetter, R. L. and Ferguson, R. E. , title =. Proceedings of the Casualty Actuarial Society , year =

work page
[5]

A Course in Credibility Theory and its Applications , publisher =

B. A Course in Credibility Theory and its Applications , publisher =

work page
[6]

and Murphy, D

Gesmann, M. and Murphy, D. and Zhang, Y. and Carrato, A. and Wuthrich, M. and Concina, F. and Dal Moro, E. , year =

work page
[7]

Clark, D. R. , title =. CAS Forum , year =

work page
[8]

Cox, D. R. and Reid, N. , title =. Journal of the Royal Statistical Society, Series B , year =

work page
[9]

Davison, A. C. and Hinkley, D. V. , title =

work page
[10]

and Charpentier, A

Dutang, C. and Charpentier, A. , year =

work page
[11]

, title =

De Jong, P. , title =. North American Actuarial Journal , year =

work page
[12]

England, P. D. and Verrall, R. J. , title =. British Actuarial Journal , year =

work page
[13]

Gisler, A. and W. Credibility for the chain ladder reserving method , journal =. 2008 , volume =

work page 2008
[14]

and Arjas, E

Haastrup, S. and Arjas, E. , title =. ASTIN Bulletin , year =

work page
[15]

Exponential dispersion models , journal =

J. Exponential dispersion models , journal =. 1987 , volume =

work page 1987
[16]

, title =

Kremer, E. , title =. Scandinavian Actuarial Journal , year =

work page
[17]

, title =

Mack, T. , title =. ASTIN Bulletin , year =

work page
[18]

, title =

Mack, T. , title =. Insurance: Mathematics and Economics , year =

work page
[19]

and Venter, G

Mack, T. and Venter, G. , title =. Insurance: Mathematics and Economics , year =

work page
[20]

and Nelder, J

McCullagh, P. and Nelder, J. A. , title =. 1989 , edition =

work page 1989
[21]

Merz, M. and W. Prediction error of the chain ladder reserving method applied to correlated run-off triangles , journal =. 2007 , volume =

work page 2007
[22]

Meyers, G. G. , title =. 2015 , series =

work page 2015
[23]

Patterson, H. D. and Thompson, R. , title =. Biometrika , year =

work page
[24]

Renshaw, A. E. and Verrall, R. J. , title =. British Actuarial Journal , year =

work page
[25]

Self, S. G. and Liang, K.-Y. , title =. Journal of the American Statistical Association , year =

work page
[26]

Smyth, G. K. and J. Fitting Tweedie's Compound Poisson Model to Insurance Claims Data: Dispersion Modelling , journal =. 2002 , volume =

work page 2002
[27]

and Shi, P

Sriram, K. and Shi, P. , title =. Journal of Risk and Insurance , year =

work page
[28]

and Ashe, F

Taylor, G. and Ashe, F. , title =. Journal of Econometrics , year =

work page
[29]

, title =

Taylor, G. , title =

work page
[30]

, title =

Taylor, G. , title =. ASTIN Bulletin , year =

work page
[31]

Tweedie, M. C. K. , title =. Statistics: Applications and New Directions , publisher =. 1984 , pages =

work page 1984
[32]

van der Vaart, A. W. , title =

work page
[33]

, title =

Van Oirbeek, R. , title =. 2026 , eprint =

work page 2026
[34]

, title =

Van Oirbeek, R. , title =. 2026 , note =

work page 2026
[35]

Verrall, R. J. , title =. Insurance: Mathematics and Economics , year =

work page
[36]

Willmot, G. E. , title =. Scandinavian Actuarial Journal , year =

work page
[37]

Stochastic Claims Reserving Methods in Insurance , publisher =

W. Stochastic Claims Reserving Methods in Insurance , publisher =

work page

[1] [1]

Bar-Lev, S. K. and Enis, P. , title =. The Annals of Statistics , year =

work page

[2] [2]

Barndorff-Nielsen, O. E. , title =. Biometrika , year =

work page

[3] [3]

B. Glaubw. Mitteilungen der Vereinigung Schweizerischer Versicherungsmathematiker , year =

work page

[4] [4]

Bornhuetter, R. L. and Ferguson, R. E. , title =. Proceedings of the Casualty Actuarial Society , year =

work page

[5] [5]

A Course in Credibility Theory and its Applications , publisher =

B. A Course in Credibility Theory and its Applications , publisher =

work page

[6] [6]

and Murphy, D

Gesmann, M. and Murphy, D. and Zhang, Y. and Carrato, A. and Wuthrich, M. and Concina, F. and Dal Moro, E. , year =

work page

[7] [7]

Clark, D. R. , title =. CAS Forum , year =

work page

[8] [8]

Cox, D. R. and Reid, N. , title =. Journal of the Royal Statistical Society, Series B , year =

work page

[9] [9]

Davison, A. C. and Hinkley, D. V. , title =

work page

[10] [10]

and Charpentier, A

Dutang, C. and Charpentier, A. , year =

work page

[11] [11]

, title =

De Jong, P. , title =. North American Actuarial Journal , year =

work page

[12] [12]

England, P. D. and Verrall, R. J. , title =. British Actuarial Journal , year =

work page

[13] [13]

Gisler, A. and W. Credibility for the chain ladder reserving method , journal =. 2008 , volume =

work page 2008

[14] [14]

and Arjas, E

Haastrup, S. and Arjas, E. , title =. ASTIN Bulletin , year =

work page

[15] [15]

Exponential dispersion models , journal =

J. Exponential dispersion models , journal =. 1987 , volume =

work page 1987

[16] [16]

, title =

Kremer, E. , title =. Scandinavian Actuarial Journal , year =

work page

[17] [17]

, title =

Mack, T. , title =. ASTIN Bulletin , year =

work page

[18] [18]

, title =

Mack, T. , title =. Insurance: Mathematics and Economics , year =

work page

[19] [19]

and Venter, G

Mack, T. and Venter, G. , title =. Insurance: Mathematics and Economics , year =

work page

[20] [20]

and Nelder, J

McCullagh, P. and Nelder, J. A. , title =. 1989 , edition =

work page 1989

[21] [21]

Merz, M. and W. Prediction error of the chain ladder reserving method applied to correlated run-off triangles , journal =. 2007 , volume =

work page 2007

[22] [22]

Meyers, G. G. , title =. 2015 , series =

work page 2015

[23] [23]

Patterson, H. D. and Thompson, R. , title =. Biometrika , year =

work page

[24] [24]

Renshaw, A. E. and Verrall, R. J. , title =. British Actuarial Journal , year =

work page

[25] [25]

Self, S. G. and Liang, K.-Y. , title =. Journal of the American Statistical Association , year =

work page

[26] [26]

Smyth, G. K. and J. Fitting Tweedie's Compound Poisson Model to Insurance Claims Data: Dispersion Modelling , journal =. 2002 , volume =

work page 2002

[27] [27]

and Shi, P

Sriram, K. and Shi, P. , title =. Journal of Risk and Insurance , year =

work page

[28] [28]

and Ashe, F

Taylor, G. and Ashe, F. , title =. Journal of Econometrics , year =

work page

[29] [29]

, title =

Taylor, G. , title =

work page

[30] [30]

, title =

Taylor, G. , title =. ASTIN Bulletin , year =

work page

[31] [31]

Tweedie, M. C. K. , title =. Statistics: Applications and New Directions , publisher =. 1984 , pages =

work page 1984

[32] [32]

van der Vaart, A. W. , title =

work page

[33] [33]

, title =

Van Oirbeek, R. , title =. 2026 , eprint =

work page 2026

[34] [34]

, title =

Van Oirbeek, R. , title =. 2026 , note =

work page 2026

[35] [35]

Verrall, R. J. , title =. Insurance: Mathematics and Economics , year =

work page

[36] [36]

Willmot, G. E. , title =. Scandinavian Actuarial Journal , year =

work page

[37] [37]

Stochastic Claims Reserving Methods in Insurance , publisher =

W. Stochastic Claims Reserving Methods in Insurance , publisher =

work page