pith. sign in

arxiv: 2604.06034 · v2 · submitted 2026-04-07 · 📊 stat.ME

Efficient Bayesian Inference in the Cox Model via Rank-Ordered Likelihood

Pith reviewed 2026-05-10 18:27 UTC · model grok-4.3

classification 📊 stat.ME
keywords Cox modelBayesian inferenceGibbs samplingPlackett-Luce modelPólya-Gamma augmentationpartial likelihoodsurvival analysisshared frailty
0
0 comments X

The pith

Two Gibbs samplers let Bayesian Cox models use the partial likelihood directly while handling ties and frailty.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops two Gibbs sampling algorithms for Bayesian inference in the Cox proportional hazards model. It recasts the data as rank-ordered observations and models them with Plackett-Luce distributions that incorporate Pólya-Gamma augmentation. This lets the methods work directly from the partial likelihood without separately modeling the baseline hazard. Simulations across continuous, discrete, and tied-time settings show stable performance, while real-data examples confirm computational feasibility even for large samples and shared frailty extensions.

Core claim

In the Cox proportional hazards model, Bayesian inference can proceed directly from the partial likelihood by representing the data in rank-ordered form and fitting Plackett-Luce or generalized Plackett-Luce models. Pólya-Gamma data augmentation then produces two Gibbs samplers, PL-Cox and GPL-Cox, that generate posterior samples without correction, accommodate tied event times, and extend naturally to shared frailty models.

What carries the argument

Rank-ordered data representation of the Cox partial likelihood, realized through Plackett-Luce models with Pólya-Gamma augmentation to enable Gibbs sampling.

If this is right

  • Posterior samples require no post-hoc correction for validity.
  • Tied event times are handled automatically without special adjustments.
  • The samplers extend directly to shared frailty models with good computational efficiency.
  • PL-Cox performs stably across varying tie levels; GPL-Cox scales better on large datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same rank-order device could simplify Bayesian analysis in other semiparametric models that rely on partial likelihoods.
  • These samplers might combine with existing software for high-dimensional covariate screening in survival data.
  • Extensions to interval-censored or competing-risks settings appear straightforward but remain untested.

Load-bearing premise

The rank-ordered Plackett-Luce representation exactly preserves the posterior distribution implied by the original Cox partial likelihood.

What would settle it

Run both proposed samplers and an established partial-likelihood MCMC on the same dataset; any systematic difference in the posterior means or credible intervals for the regression coefficients would indicate the representation has altered the target distribution.

Figures

Figures reproduced from arXiv: 2604.06034 by Shonosuke Sugasawa, Shunichiro Orihara, Tomohiro Ohigashi.

Figure 1
Figure 1. Figure 1: Forest plot for hazard ratios for a selected subset of covariates on the original [PITH_FULL_IMAGE:figures/full_fig_p022_1.png] view at source ↗
Figure 1
Figure 1. Figure 1: Forest plot for hazard ratios of covariates on the original and coarsened versions [PITH_FULL_IMAGE:figures/full_fig_p045_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Trace plot for hazard for PL-Cox 18 [PITH_FULL_IMAGE:figures/full_fig_p046_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Trace plot for hazard for GPL-Cox 19 [PITH_FULL_IMAGE:figures/full_fig_p047_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Forest plot for hazard ratios. 20 [PITH_FULL_IMAGE:figures/full_fig_p048_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Posterior interval plot for PL-Cox. 21 [PITH_FULL_IMAGE:figures/full_fig_p049_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Posterior interval plot for GPL-Cox. 22 [PITH_FULL_IMAGE:figures/full_fig_p050_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Trace plot for PL-Cox. 23 [PITH_FULL_IMAGE:figures/full_fig_p051_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Trace plot for GPL-Cox. 24 [PITH_FULL_IMAGE:figures/full_fig_p052_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Caterpillar plot for log-frailty for each method. [PITH_FULL_IMAGE:figures/full_fig_p053_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Distribution of tie block sizes. resp1 crea1 temp1 meanbp1 wblc1 hrt1 treat age female 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000 −0.1 0.0 0.1 0.000 0.001 0.002 0.003 0.004 −0.050 −0.025 0.000 0.025 0.006 0.008 0.010 0.012 0.014 0.016 −0.003 0.… view at source ↗
Figure 11
Figure 11. Figure 11: Trace plot for PL-Cox. 28 [PITH_FULL_IMAGE:figures/full_fig_p056_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Trace plot for GPL-Cox. 29 [PITH_FULL_IMAGE:figures/full_fig_p057_12.png] view at source ↗
read the original abstract

In Bayesian inference for the Cox proportional hazards model, modeling the baseline hazard function is challenging. Recently, direct Bayesian inference using the partial likelihood is considered in the framework of general Bayesian inference. In terms of posterior computation, several studies have examined sampling algorithms under the Cox model. In this study, we propose two Gibbs sampling algorithms for Bayesian inference in the Cox proportional hazards model, motivated by a rank-ordered data representation and based on the Plackett--Luce and generalized Plackett--Luce models with P'{o}lya--Gamma data augmentation, referred to as PL-Cox and GPL-Cox, respectively. The two proposed methods offer practical advantages, as they do not require correction of posterior samples, naturally handle tied event times, and are readily extensible to shared frailty models. In simulation study, we considered multiple survival model settings, including continuous and discrete survival time models, as well as scenarios with varying degrees of ties, and found that the PL-Cox model exhibited relatively stable performance. In analyses of a large real dataset, the proposed methods remained computationally feasible, and the GPL-Cox model showed more favorable computational scalability than the PL-Cox model. In analyses of real data incorporating shared frailty, both methods demonstrated good computational efficiency.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents two Gibbs sampling algorithms called PL-Cox and GPL-Cox for performing Bayesian inference in the Cox proportional hazards model. The approach is based on a rank-ordered representation of the data and employs the Plackett-Luce model and its generalization, augmented with Pólya-Gamma variables, to facilitate sampling. The authors highlight that these methods do not require post-processing of samples, handle tied event times naturally, and can be extended to shared frailty models. They provide simulation results under various settings including ties and apply the methods to a large real-world dataset, noting good computational performance.

Significance. If the proposed rank-ordered likelihood exactly corresponds to the Cox partial likelihood, the paper offers an efficient and practical Bayesian inference framework for survival analysis that builds on established techniques. The extensibility to frailty models and the handling of ties are significant advantages for applied work. The simulation study covering continuous and discrete times with varying ties provides useful evidence of performance.

major comments (2)
  1. [§2.2] §2.2: The construction of the rank-ordered likelihood for right-censored data is not shown to explicitly truncate the rankings at the censoring times to match the risk sets in the standard partial likelihood. This is load-bearing for the claim that the resulting posterior is identical to that from the Cox partial likelihood; a derivation or proof of equivalence should be added.
  2. [§4.3] §4.3: In the real data analysis with shared frailty, the computational times are reported but without comparison to alternative Bayesian methods for frailty Cox models, making it hard to evaluate the claimed efficiency gains.
minor comments (2)
  1. [Abstract] Abstract: The abstract states that 'the PL-Cox model exhibited relatively stable performance' but does not specify the metrics (e.g., bias, coverage, MSE) used to reach this conclusion.
  2. [§5] §5: Some equations use notation for the Plackett-Luce parameters that could be better aligned with standard survival analysis notation to improve readability for the target audience.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which have helped us identify areas where the manuscript can be strengthened. We address each major comment below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [§2.2] §2.2: The construction of the rank-ordered likelihood for right-censored data is not shown to explicitly truncate the rankings at the censoring times to match the risk sets in the standard partial likelihood. This is load-bearing for the claim that the resulting posterior is identical to that from the Cox partial likelihood; a derivation or proof of equivalence should be added.

    Authors: We agree that an explicit derivation is required to establish the equivalence. In the revised manuscript we will expand Section 2.2 with a step-by-step derivation showing that the rank-ordered Plackett-Luce likelihood, when applied to right-censored data, truncates the rankings precisely at the censoring times. This truncation reproduces the risk sets of the standard Cox partial likelihood, thereby confirming that the resulting posterior is identical to the one obtained from the usual partial-likelihood formulation. revision: yes

  2. Referee: [§4.3] §4.3: In the real data analysis with shared frailty, the computational times are reported but without comparison to alternative Bayesian methods for frailty Cox models, making it hard to evaluate the claimed efficiency gains.

    Authors: We acknowledge that the absence of direct comparisons limits the strength of the efficiency claims. In the revision we will add a new paragraph in Section 4.3 that benchmarks the reported run times against at least one established Bayesian frailty Cox sampler (e.g., the MCMC implementation available in the survival or frailtypack packages) on the same dataset and hardware. This will provide readers with a clearer quantitative assessment of the computational advantages of PL-Cox and GPL-Cox. revision: yes

Circularity Check

0 steps flagged

No significant circularity; standard application of Plackett-Luce to rank-ordered Cox data

full rationale

The paper motivates Gibbs samplers (PL-Cox, GPL-Cox) by representing Cox partial likelihood via rank-ordered data and Plackett-Luce models with Pólya-Gamma augmentation. This is an application of established models rather than a self-definitional loop or fitted-input prediction. No load-bearing self-citations, uniqueness theorems imported from the authors, or ansatzes smuggled via prior work are present in the abstract or described chain. The central claim (equivalence for posterior inference) is presented as a modeling choice whose statistical properties are asserted to be preserved, not derived tautologically from the samplers themselves. Minor self-citation risk at most (score 1) but central derivation remains independent.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption that rank-ordered representations of survival data can be modeled via Plackett-Luce distributions while preserving the Cox partial likelihood; no free parameters or new entities are introduced in the abstract.

axioms (2)
  • domain assumption The partial likelihood of the Cox proportional hazards model admits an equivalent rank-ordered data representation.
    This equivalence is the stated motivation for connecting the Cox model to Plackett-Luce ranking models.
  • domain assumption Pólya-Gamma data augmentation yields tractable conditional distributions for Gibbs sampling under the Plackett-Luce model.
    This augmentation is invoked to derive the proposed Gibbs updates.

pith-pipeline@v0.9.0 · 5528 in / 1511 out tokens · 68633 ms · 2026-05-10T18:27:02.088211+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages

  1. [1]

    Baker, R. (2020). New order-statistics-based ranking models and faster computation of outcome probabilities. IMA Journal of Management Mathematics 31, 33--48

  2. [2]

    and Scarf, P

    Baker, R. and Scarf, P. (2021). Modifying Bradley -- Terry and other ranking models to allow ties. IMA Journal of Management Mathematics 32, 451--463

  3. [3]

    G., Holmes, C

    Bissiri, P. G., Holmes, C. C., and Walker, S. G. (2016). A general framework for updating belief distributions. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 78, 1103--1130

  4. [4]

    M., Polson, N

    Carvalho, C. M., Polson, N. G., and Scott, J. G. (2010). The horseshoe estimator for sparse signals. Biometrika 97, 465--480

  5. [5]

    F., Speroff, T., Dawson, N

    Connors, Jr, A. F., Speroff, T., Dawson, N. V., Thomas, C., Harrell, Jr, F. E., Wagner, D., Desbiens, N., Goldman, L., Wu, A. W., Califf, R. M., Fulkerson, Jr, W. J., Vidaillet, H., Broste, S., Bellamy, P., Lynn, J., and Knaus, W. A. (1996). The Effectiveness of Right Heart Catheterization in the Initial Care of Critically III Patients . JAMA 276, 889--897

  6. [6]

    Cox, D. R. (1972). Regression Models and Life-Tables . Journal of the Royal Statistical Society. Series B (Methodological) 34, 187--220

  7. [7]

    Cox, D. R. (1975). Partial likelihood. Biometrika 62, 269--276

  8. [8]

    and Canale, A

    D ' Angelo, L. and Canale, A. (2023). Efficient posterior sampling for bayesian poisson regression. Journal of Computational and Graphical Statistics 32, 917--926

  9. [9]

    Hamura, Y., Irie, K., and Sugasawa, S. (2025). Robust Bayesian Modeling of Counts with Zero Inflation and Outliers : Theoretical Robustness and Efficient Computation . Journal of the American Statistical Association 120, 1545--1557

  10. [10]

    Henderson, D. A. (2025). Modelling and Analysis of Rank Ordered Data with Ties via a Generalized Plackett-Luce Model . Bayesian Analysis 20, 1109--1137

  11. [11]

    Kalbfleisch, J. D. (1978). Non- Parametric Bayesian Analysis of Survival Time Data . Journal of the Royal Statistical Society: Series B (Methodological) 40, 214--221

  12. [12]

    Kalbfleisch, J. D. and Prentice, R. L. (2002). The Statistical Analysis of Failure Time Data . John Wiley & Sons

  13. [13]

    A., Cook, N

    MacFarlane, L. A., Cook, N. R., Kim, E., Lee, I.-M., Iversen, M. D., Gordon, D., Buring, J. E., Katz, J. N., Manson, J. E., and Costenbader, K. H. (2020). The Effects of Vitamin D and Marine Omega -3 Fatty Acid Supplementation on Chronic Knee Pain in Older US Adults : Results From a Randomized Trial . Arthritis & Rheumatology 72, 1836--1844

  14. [14]

    E., Cook, N

    Manson, J. E., Cook, N. R., Lee, I.-M., Christen, W., Bassuk, S. S., Mora, S., Gibson, H., Gordon, D., Copeland, T., D'Agostino, D., Friedenberg, G., Ridge, C., Bubes, V., Giovannucci, E. L., Willett, W. C., and Buring, J. E. (2019). Vitamin D Supplements and Prevention of Cancer and Cardiovascular Disease . New England Journal of Medicine 380, 33--44

  15. [15]

    G., Scott, J

    Polson, N. G., Scott, J. G., and Windle, J. (2013). Bayesian Inference for Logistic Models Using P \'o lya -- Gamma Latent Variables . Journal of the American Statistical Association 108, 1339--1349

  16. [16]

    S., and Barnett, I

    Ren, B., Morris, J. S., and Barnett, I. (2025). The Cox-P \'o lya-Gamma algorithm for flexible Bayesian inference of multilevel survival models. Biometrics 81, ujaf121

  17. [17]

    Rondeau, V., Marzroui, Y., and Gonzalez, J. R. (2012). Frailtypack: An R Package for the Analysis of Correlated Survival Data with Frailty Models Using Penalized Likelihood Estimation or Parametrical Estimation . Journal of Statistical Software 47, 1--28

  18. [18]

    G., and Chen, M.-H

    Sinha, D., Ibrahim, J. G., and Chen, M.-H. (2003). A Bayesian Justification of Cox 's Partial Likelihood . Biometrika 90, 629--641

  19. [19]

    J., Best, N

    Spiegelhalter, D. J., Best, N. G., Carlin, B. P., and Van Der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 64, 583--639

  20. [20]

    and Zhou, M

    Su, Y. and Zhou, M. (2006). On a connection between the Bradley -- Terry model and the Cox proportional hazards model. Statistics & Probability Letters 76, 698--702

  21. [21]

    and Tomo, Y

    Tamano, S. and Tomo, Y. (2025a). Efficient Gibbs Sampling in Cox Regression Models Using Composite Partial Likelihood and P \'o lya-Gamma Augmentation . http://arxiv.org/abs/2506.04675

  22. [22]

    and Tomo, Y

    Tamano, S. and Tomo, Y. (2025b). Location-- Scale Calibration for Generalized Posterior . http://arxiv.org/abs/2511.15320