Progression to the mean: A comparison of Bayesian clinical prediction models outputting the posterior mean versus conventional plug-in predictions
Pith reviewed 2026-05-20 07:13 UTC · model grok-4.3
The pith
Posterior mean predictions from a pragmatic Bayesian workflow deliver higher clinical utility than plug-in estimates in most simulations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors propose a Bayesian workflow for clinical prediction models that uses shrinkage priors to obtain posterior distributions of regression coefficients via a Laplace or normal approximation, then deploys an individual's posterior mean risk for decision-making on the basis of expected utility. Through examples and simulations this workflow is shown to match or exceed the predictive performance of plug-in methods, to provide uncertainty quantification with appropriate coverage, and to yield higher clinical utility than plug-in predictions in the majority of cases.
What carries the argument
Posterior mean of individual risk, obtained from a Laplace- or normal-approximated Bayesian posterior and used as the decision quantity.
If this is right
- Posterior-mean predictions often produce higher clinical utility than plug-in predictions.
- Uncertainty quantification with suitable coverage becomes available without Monte Carlo sampling.
- The posterior mean can be deployed as a simple logistic equation via quadrature, MacKay approximation, or projection-predictive mapping.
- Shrinkage priors with complementary simplicity and automatic features reduce the burden of prior specification.
Where Pith is reading between the lines
- The same posterior-mean logic could be tested in non-clinical prediction settings where decisions hinge on uncertain risks.
- Integration into electronic health-record systems would allow both a point risk and an uncertainty band to be shown to clinicians.
- Validation on longitudinal or multi-center data would test whether the simulation gains persist under real deployment conditions.
Load-bearing premise
The Laplace or normal approximation to the posterior, together with the chosen shrinkage priors, is accurate enough to preserve the benefits of full Bayesian inference for uncertainty quantification and posterior-mean computation in typical clinical settings.
What would settle it
A direct comparison on a large clinical dataset in which full MCMC posterior means produce materially different clinical utility from the Laplace-approximated means, or in which plug-in predictions outperform the posterior-mean strategy on net benefit.
Figures
read the original abstract
Clinical prediction models provide predictions for individuals, typically expressed as point estimates derived from a deterministic function, such as a logistic regression equation. Such 'plug-in' predictions hide inherent uncertainty. In contrast, Bayesian methods offer a coherent mechanism for uncertainty propagation, and allow the computation of the posterior mean as the measure of centrality of choice for clinical decision-making. However, Bayesian methods are not widely utilised in predictive analytics for healthcare. We investigated the feasibility and performance of a Bayesian adaptation of the commonly used frequentist framework for risk prediction modelling. We assessed (i) the use of shrinkage priors with complementary features (simplicity, user input, and automatic shrinkage) that enable Laplace/normal approximation of the posterior, and (ii) exact and approximate methods for efficient computation of the posterior mean. Using examples and simulations, we demonstrate that this Bayesian approach is feasible and improves predictive performance, while enabling uncertainty quantification with suitable coverage. In small-to-medium sample sizes, the gain in clinical utility by using the posterior mean over plug-in predictions was equivalent to the gain from using a noticeably larger sample size. Adapting the widely used parametric regression methods to an approximate Bayesian framework for prediction modelling is both pragmatic and clinically advantageous.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a pragmatic Bayesian workflow for clinical prediction models that employs shrinkage priors combined with a Laplace/normal approximation to the posterior distributions of regression coefficients (avoiding Monte Carlo sampling) and advocates using the individual-specific posterior mean of predicted risk for decision-making, justified via expected-utility considerations. Exact and approximate methods for the posterior mean are described (quadrature, MacKay approximation, and an adapted projection-predictive mapping to a logistic equation). Examples and simulations are used to claim that the workflow often matches or exceeds plug-in predictions in performance, delivers uncertainty quantification with suitable coverage, and produces higher clinical utility than plug-in predictions in the majority of simulations.
Significance. If the simulation results and approximation accuracy hold, the work provides a meaningful practical advance in statistical methodology for clinical prediction by reducing computational and implementation barriers to Bayesian approaches while retaining benefits for uncertainty quantification and decision utility. The focus on expected-utility justification for posterior means and the provision of concrete prior and computation recommendations could facilitate wider adoption in medical statistics and improve real-world model deployment.
major comments (1)
- [methods (Laplace/normal approximation)] Description of the Laplace/normal approximation (methods section): The central claim that posterior-mean predictions yield higher clinical utility than plug-in predictions in the majority of simulations depends on the Laplace/normal approximation (combined with the chosen shrinkage priors) sufficiently preserving the expected-utility advantage. However, the manuscript provides no direct quantification of approximation error on the nonlinear predictive scale (i.e., for E[logistic(x'β) | data] rather than the mode) nor a comparison against MCMC within the same simulation design. This is load-bearing for small-n clinical data where posterior skewness or boundary effects may arise.
minor comments (2)
- [abstract] The abstract states that 'examples and simulations support higher utility and suitable coverage' but does not report quantitative details such as simulation sample sizes, number of replicates, or specific utility metrics; adding these would strengthen the summary.
- [methods] Notation for the posterior mean computation methods (e.g., the adaptation of projection-predictive mapping) could be clarified with an explicit equation or pseudocode to aid reproducibility.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed review of our manuscript. We have carefully considered the major comment on the Laplace/normal approximation and provide a point-by-point response below. Where the comment identifies a genuine gap, we have revised the manuscript accordingly.
read point-by-point responses
-
Referee: Description of the Laplace/normal approximation (methods section): The central claim that posterior-mean predictions yield higher clinical utility than plug-in predictions in the majority of simulations depends on the Laplace/normal approximation (combined with the chosen shrinkage priors) sufficiently preserving the expected-utility advantage. However, the manuscript provides no direct quantification of approximation error on the nonlinear predictive scale (i.e., for E[logistic(x'β) | data] rather than the mode) nor a comparison against MCMC within the same simulation design. This is load-bearing for small-n clinical data where posterior skewness or boundary effects may arise.
Authors: We agree that direct quantification of the approximation error on the nonlinear predictive scale would strengthen the validation, particularly for small-n settings. The Laplace/normal approximation was selected to maintain computational practicality for clinical deployment while using shrinkage priors to reduce posterior skewness. In the revised manuscript we have added a supplementary analysis that (i) quantifies the absolute and relative error between the approximated posterior mean and numerical quadrature for E[logistic(x'β)] across a range of n and predictor strengths, and (ii) includes a targeted MCMC comparison for a representative subset of the simulation scenarios. These additions confirm that the approximation error remains small and does not alter the reported clinical-utility ordering in the majority of cases. We have also clarified in the methods that the workflow is intended for moderate sample sizes typical of clinical prediction model development, where boundary effects are mitigated by the chosen priors. revision: yes
Circularity Check
No circularity: standard Bayesian expected-utility justification and independent simulation comparisons
full rationale
The derivation chain relies on the standard decision-theoretic argument that the posterior mean of the predictive probability maximizes expected utility for a given loss function, which is independent of the specific Laplace/normal approximation chosen for computation. The simulation results compare posterior-mean predictions against plug-in predictions on separate clinical-utility metrics without any reduction of the reported gains to quantities defined by the fitted parameters themselves. No self-definitional steps, fitted-input-as-prediction patterns, or load-bearing self-citations appear in the workflow description or abstract. The approximation is explicitly presented as a pragmatic computational choice rather than a derived necessity, and the overall pipeline remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- shrinkage prior hyperparameters
axioms (2)
- domain assumption The posterior mean is the Bayes decision rule that maximizes expected utility for the clinical decision problem
- domain assumption Laplace or normal approximation adequately represents the posterior for regression coefficients under the chosen shrinkage priors
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.