Archimedean Copula Inference via Taylor-Mode AD
Pith reviewed 2026-05-25 05:00 UTC · model grok-4.3
The pith
Polynomial powering of Taylor-mode AD computes exact nested Archimedean copula likelihoods and gradients under arbitrary censoring.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
acopula evaluates exact nested-copula likelihoods and parameter gradients under arbitrary censoring masks in polynomial time. The mechanism is polynomial powering of Taylor-mode automatic differentiation output, which replaces per-family hand-derived partial Bell polynomial tables with a single differentiable computation that any user-defined generator can drive.
What carries the argument
polynomial powering of Taylor-mode automatic differentiation output, which replaces per-family hand-derived partial Bell polynomial tables with a single differentiable computation
If this is right
- Enables per-variable censoring analysis on d=53 datasets such as MIMIC-IV ICU admissions using both classical and neural generators.
- Supports hierarchical models on d=98 data such as an 11-sector S&P 500 returns model.
- Allows family-agnostic censored maximum likelihood estimation across ten families, including five without prior implementations.
- Delivers approximately 650 times speedup per density evaluation at d=35, with quadratic scaling to d=8000.
Where Pith is reading between the lines
- The same powering approach could be tested on dependence structures outside the Archimedean class where similar polynomial identities arise.
- Integration with other automatic-differentiation libraries beyond JAX would test whether the exactness property transfers without modification.
- Application to even larger censoring patterns in longitudinal studies could reveal whether the polynomial-time bound remains practical in practice.
Load-bearing premise
The polynomial powering of Taylor-mode automatic differentiation output produces exact results for arbitrary nesting trees and user-defined generators without numerical instability or loss of differentiability.
What would settle it
Direct numerical comparison of likelihood values produced by the framework against analytically derived expressions for small nested copulas with known censoring masks.
Figures
read the original abstract
No existing nested Archimedean copula tool handles all three of (a) arbitrary per-variable (right-)censoring in survival analysis, (b) arbitrary nesting trees, and (c) exact parameter gradients. Existing implementations handle only bivariate problems, low dimensional (i.e., $d \leq 10$) cases, two layers of nesting, or only hand-derived copula nestings. We present \textsc{acopula}, a JAX-native framework that, given any Archimedean generator -- classical or neural -- evaluates exact nested-copula likelihoods and parameter gradients under arbitrary censoring masks in polynomial time. The mechanism is polynomial powering of Taylor-mode automatic differentiation output, which replaces per-family hand-derived partial Bell polynomial tables with a single differentiable computation that any user-defined generator can drive. We conduct extensive simulations to verify the correctness of \textsc{acopula}. We then demonstrate (a) per-variable censoring on $85{,}229$ MIMIC-IV ICU admissions in high dimensions with $d{=}53$, fit by both classical Archimedean families and nested neural Archimedean copulas; (b) an 11-sector hierarchical model on S\&P~500 daily returns at $d{=}98$; (c) family-agnostic censored MLE across ten families, five of them with no prior implementation, on a retinopathy study; and (d) a ${\sim}650\times$ per-density speedup over R's \texttt{nacLL} at $d{=}35$, scaling quadratically to $d{=}8{,}000$.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces acopula, a JAX-native framework for nested Archimedean copulas that, given any generator (classical or neural), computes exact likelihoods and parameter gradients under arbitrary nesting trees and per-variable censoring masks. The core mechanism is polynomial powering of Taylor-mode automatic differentiation output, which is claimed to replace hand-derived partial Bell polynomial tables with a single differentiable computation running in polynomial time. Correctness is asserted via extensive simulations, followed by demonstrations on high-dimensional censored data (MIMIC-IV d=53), financial returns (S&P 500 d=98), a retinopathy study across ten families, and a reported ~650x speedup over R's nacLL at d=35.
Significance. If the central mechanism of exact, stable, differentiable polynomial powering holds for arbitrary trees and user-defined generators, the work would enable flexible high-dimensional copula inference with neural generators and censoring, which existing tools do not support at scale. The reported applications and speedups indicate potential practical impact in survival analysis and finance if the exactness claim is substantiated.
major comments (1)
- [Abstract] Abstract: the central claim that polynomial powering of Taylor-mode AD output produces exact nested-copula likelihoods and gradients for arbitrary nesting trees and generators (replacing hand-derived partial Bell polynomials) is load-bearing, yet the provided text supplies no derivation, coefficient-array algebra, or analysis of floating-point accumulation or differentiability preservation under deep compositions.
minor comments (1)
- [Abstract] Abstract: the notation 85{,}229 appears to be a LaTeX artifact for 85,229 and should be rendered consistently.
Simulated Author's Rebuttal
We thank the referee for highlighting the load-bearing nature of the central claim. We agree that the abstract is too terse and that the main text (as submitted) does not contain a self-contained derivation of the polynomial-powering step, the coefficient-array algebra, floating-point analysis, or differentiability preservation. We will revise accordingly.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that polynomial powering of Taylor-mode AD output produces exact nested-copula likelihoods and gradients for arbitrary nesting trees and generators (replacing hand-derived partial Bell polynomials) is load-bearing, yet the provided text supplies no derivation, coefficient-array algebra, or analysis of floating-point accumulation or differentiability preservation under deep compositions.
Authors: We accept the criticism. The submitted manuscript contains only a high-level description of the mechanism. In the revision we will (i) add a concise derivation of the polynomial-powering identity for arbitrary nesting trees in a new subsection of Section 3, including the explicit coefficient-array recurrence that replaces the partial Bell polynomials; (ii) include a short floating-point error analysis (forward-mode Taylor coefficients remain exact up to machine epsilon for the generator evaluations, with quadratic accumulation bounded by tree depth); and (iii) prove that the resulting likelihood and gradient remain differentiable with respect to both parameters and generator weights because the powering operation is a composition of analytic functions. These additions will be placed in the main text rather than the supplement. revision: yes
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper introduces a JAX-based computational method that applies Taylor-mode automatic differentiation followed by polynomial powering to evaluate nested Archimedean copula likelihoods and gradients for arbitrary generators and nesting trees. This replaces hand-derived partial Bell polynomials with a general AD-driven procedure. No equations or claims reduce by construction to fitted inputs, self-definitions, or load-bearing self-citations. The central mechanism is presented as a general technique whose correctness is checked via external simulations and real-data demonstrations rather than being presupposed by the method itself. The derivation chain therefore contains no circular steps.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Pair-copula constructions of multiple dependence
Kjersti Aas, Claudia Czado, Arnoldo Frigessi, and Henrik Bakken. Pair-copula constructions of multiple dependence. Insurance: Mathematics and Economics, 44 0 (2), 2009
work page 2009
-
[2]
A Geometric Theory of Higher-Order Automatic Differentiation
Michael Betancourt. A geometric theory of higher-order automatic differentiation. arXiv preprint arXiv:1812.11592, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[3]
Taylor-mode automatic differentiation for higher-order derivatives in JAX
Jesse Bettencourt, Matthew James Johnson, and David Duvenaud. Taylor-mode automatic differentiation for higher-order derivatives in JAX . In Program Transformations for ML Workshop at NeurIPS, 2019
work page 2019
-
[4]
Generalized autoregressive conditional heteroskedasticity
Tim Bollerslev. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31 0 (3), 1986
work page 1986
-
[5]
JAX : Composable transformations of P ython+ N um P y programs, 2018
James Bradbury, Roy Frostig, Peter Hawkins, Matthew James Johnson, Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake Vander P las, Skye Wanderman-Milne, and Qiao Zhang. JAX : Composable transformations of P ython+ N um P y programs, 2018. URL http://github.com/jax-ml/jax
work page 2018
-
[6]
Hierarchical K endall copulas: Properties and inference
Eike Christian Brechmann. Hierarchical K endall copulas: Properties and inference. Canadian Journal of Statistics, 42 0 (1), 2014
work page 2014
- [7]
-
[8]
Advanced Combinatorics: The Art of Finite and Infinite Expansions
Louis Comtet. Advanced Combinatorics: The Art of Finite and Infinite Expansions. D. Reidel, 1974
work page 1974
-
[9]
Correlation and dependence in risk management: properties and pitfalls
Paul Embrechts, Alexander McNeil, and Daniel Straumann. Correlation and dependence in risk management: properties and pitfalls. In M. A. H. Dempster, editor, Risk Management: Value at Risk and Beyond, pages 176--223. Cambridge University Press, 2002
work page 2002
-
[10]
Analysis of Survival Data with Dependent Censoring: Copula-Based Approaches
Takeshi Emura and Yi-Hau Chen. Analysis of Survival Data with Dependent Censoring: Copula-Based Approaches. Springer, 2018
work page 2018
- [11]
-
[12]
Christian Genest, Kilani Ghoudi, and Louis-Paul Rivest. A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika, 82 0 (3), 1995
work page 1995
-
[13]
Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation
Andreas Griewank and Andrea Walther. Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation. SIAM, 2008
work page 2008
-
[14]
Nicholas J. Higham. Accuracy and Stability of Numerical Algorithms. SIAM, 2002
work page 2002
-
[15]
Marius Hofert. Sampling A rchimedean copulas. Computational Statistics & Data Analysis, 52 0 (12), 2008
work page 2008
-
[16]
Sampling Nested A rchimedean Copulas: With Applications to CDO Pricing
Marius Hofert. Sampling Nested A rchimedean Copulas: With Applications to CDO Pricing . PhD thesis, Ulm University, 2010
work page 2010
-
[17]
Densities of nested A rchimedean copulas
Marius Hofert and David Pham. Densities of nested A rchimedean copulas. Journal of Multivariate Analysis, 118, 2013
work page 2013
-
[18]
Marius Hofert, Martin M \"a chler, and Alexander J. McNeil. Likelihood inference for A rchimedean copulas in high dimensions under known margins. Journal of Multivariate Analysis, 110, 2012
work page 2012
-
[19]
Marius Hofert, Martin M \"a chler, and Alexander J. McNeil. A rchimedean copulas in high dimensions: Estimators and numerical challenges motivated by financial applications. Journal de la Soci \'e t \'e Fran c aise de Statistique , 154 0 (1), 2013
work page 2013
-
[20]
Huster, Ron Brookmeyer, and Steven G
William J. Huster, Ron Brookmeyer, and Steven G. Self. Modelling paired survival data with covariates. Biometrics, 45 0 (1), 1989
work page 1989
-
[21]
Harry Joe and James J. Xu. The estimation method of inference functions for margins for multivariate models. Technical Report 166, Department of Statistics, University of British Columbia, 1996
work page 1996
-
[22]
Alistair E. W. Johnson, Lucas Bulgarelli, Lu Shen, Alvin Gayles, Ayad Shammout, Steven Horng, Tom J. Pollard, Sicheng Hao, Benjamin Moody, Brian Gow, Li wei H. Lehman, Leo A. Celi, and Roger G. Mark. MIMIC-IV , a freely accessible electronic health record dataset. Scientific Data, 10 0 (1), 2023
work page 2023
-
[23]
Copulas.jl: A fully distributions.jl-compliant copula package
Oskar Laverny and Santiago Jimenez. Copulas.jl: A fully distributions.jl-compliant copula package. Journal of Open Source Software, 9 0 (94): 0 6189, 2024
work page 2024
-
[24]
Dongdong Li, X. Joan Hu, Mary L. McBride, and John J. Spinelli. Multiple event times in the presence of informative censoring: modeling and analysis by copulas. Lifetime Data Analysis, 26 0 (3), 2020
work page 2020
-
[25]
Dongdong Li, X. Joan Hu, and Rui Wang. Evaluating association between two event times with observations subject to informative censoring. Journal of the American Statistical Association, 118 0 (542), 2023
work page 2023
-
[26]
Automatic functional differentiation in JAX
Min Lin. Automatic functional differentiation in JAX . In International Conference on Learning Representations, 2024
work page 2024
-
[27]
Chun Kai Ling, Fei Fang, and J. Zico Kolter. Deep A rchimedean copulas. In Advances in Neural Information Processing Systems, 2020
work page 2020
-
[28]
Anji Liu, Oliver Broadrick, Mathias Niepert, and Guy Van den Broeck. Discrete copula diffusion. In International Conference on Learning Representations, 2025 a
work page 2025
-
[29]
HACSurv : A hierarchical copula-based approach for survival analysis with dependent competing risks
Xin Liu, Weijia Zhang, and Min-Ling Zhang. HACSurv : A hierarchical copula-based approach for survival analysis with dependent competing risks. In International Conference on Artificial Intelligence and Statistics, 2025 b
work page 2025
-
[30]
Alexander J. McNeil. Sampling nested A rchimedean copulas. Journal of Statistical Computation and Simulation, 78 0 (6), 2008
work page 2008
-
[31]
McNeil and Johanna Ne s lehov \'a
Alexander J. McNeil and Johanna Ne s lehov \'a . Multivariate A rchimedean copulas, d -monotone functions and _1 -norm symmetric distributions. The Annals of Statistics, 37 0 (5B), 2009
work page 2009
-
[32]
rvinecopulib: High Performance Algorithms for Vine Copula Modeling, 2023
Thomas Nagler and Thibault Vatter. rvinecopulib: High Performance Algorithms for Vine Copula Modeling, 2023. URL https://CRAN.R-project.org/package=rvinecopulib. R package, CRAN
work page 2023
-
[33]
Roger B. Nelsen. An Introduction to Copulas. Springer, 2006
work page 2006
-
[34]
Generative A rchimedean copulas
Yuting Ng, Ali Hasan, Khalil Elkhalil, and Vahid Tarokh. Generative A rchimedean copulas. In Uncertainty in Artificial Intelligence, 2021
work page 2021
-
[35]
Hierarchical A rchimedean copulae: The HAC package
Ostap Okhrin and Alexander Ristig. Hierarchical A rchimedean copulae: The HAC package. Journal of Statistical Software, 58 0 (4), 2014
work page 2014
-
[36]
Composable Effects for Flexible and Accelerated Probabilistic Programming in NumPyro
Du Phan, Neeraj Pradhan, and Martin Jankowiak. Composable effects for flexible and accelerated probabilistic programming in N um P yro. arXiv preprint arXiv:1912.11554, 2019
work page internal anchor Pith review Pith/arXiv arXiv 1912
-
[37]
You only linearize once: Tangents transpose to gradients
Alexey Radul, Adam Paszke, Roy Frostig, Matthew Johnson, and Dougal Maclaurin. You only linearize once: Tangents transpose to gradients. Proceedings of the ACM on Programming Languages, 7 0 (POPL), 2023
work page 2023
-
[38]
On the construction of nested A rchimedean copulas for d -monotone generators
Mohsen Rezapour. On the construction of nested A rchimedean copulas for d -monotone generators. Statistics & Probability Letters, 101, 2015
work page 2015
-
[39]
Fonctions de r \'e partition \`a n dimensions et leurs marges
Abe Sklar. Fonctions de r \'e partition \`a n dimensions et leurs marges. Publications de l'Institut de Statistique de l'Universit \'e de Paris , 8, 1959
work page 1959
-
[40]
Simplified pair copula constructions---limitations and extensions
Jakob St \"o ber, Harry Joe, and Claudia Czado. Simplified pair copula constructions---limitations and extensions. Journal of Multivariate Analysis, 119, 2013
work page 2013
-
[41]
CopulaCenR : Copula based regression models for bivariate censored data in R
Tao Sun and Ying Ding. CopulaCenR : Copula based regression models for bivariate censored data in R . The R Journal, 12 0 (1), 2020
work page 2020
-
[42]
Matthew Tancik, Pratul P. Srinivasan, Ben Mildenhall, Sara Fridovich-Keil, Nithin Raghavan, Utkarsh Singhal, Ravi Ramamoorthi, Jonathan T. Barron, and Ren Ng. Fourier features let networks learn high frequency functions in low dimensional domains. In Advances in Neural Information Processing Systems, 2020
work page 2020
-
[43]
A statistical distribution function of wide applicability
Waloddi Weibull. A statistical distribution function of wide applicability. Journal of Applied Mechanics, 18 0 (3), 1951
work page 1951
-
[44]
Ming Zheng and John P. Klein. Estimates of marginal survival for dependent competing risks based on an assumed copula. Biometrika, 82 0 (1), 1995
work page 1995
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.