Local estimation of transition rates of jump processes through discretization
Pith reviewed 2026-05-07 12:36 UTC · model grok-4.3
The pith
Occurrence/exposure rates for Markov and semi-Markov jump processes are asymptotically normal when discretization intervals shrink with the number of observations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Imposing no structural assumptions on the true intensities, the occurrence/exposure rates are asymptotically normal under appropriate shrinking conditions on the partition lengths for both Markov and semi-Markov models. These results are derived using only classical central limit theorems and elementary results for counting processes.
What carries the argument
The occurrence/exposure rate obtained by Poisson regression on discretized time and duration intervals, which counts transitions in each bin and divides by total exposure time in that bin.
If this is right
- Asymptotic normality holds simultaneously for both Markov and semi-Markov jump processes.
- The estimators require no parametric form for the transition intensities.
- Normal-based confidence intervals and tests follow directly from the limiting distribution.
- The method remains valid for any intensities as long as the partition lengths satisfy the shrinking conditions.
Where Pith is reading between the lines
- The same binning idea could be applied to estimate intensities in non-jump point processes where event counts are still observable.
- Data-driven selection of bin sizes might improve accuracy for moderate sample sizes while preserving the asymptotic guarantee.
- Because the procedure uses only elementary counting-process tools, it can be coded in standard statistical packages without requiring advanced numerical routines.
Load-bearing premise
The lengths of the time and duration intervals must shrink with the number of observations at a rate that satisfies the stated conditions for asymptotic normality.
What would settle it
Generate many independent realizations of a jump process with known intensities, keep partition lengths fixed while increasing the number of observations, and verify whether the properly normalized occurrence/exposure rates still converge in distribution to standard normal.
read the original abstract
We investigate the Poisson regression method for Markov and semi-Markov jump processes from a nonparametric angle, allowing the lengths of the time and duration intervals in the partition to vary with the number of observations. Imposing no structural assumptions on the true intensities, we obtain asymptotic normality of the occurence/exposure rates under appropriate shrinking conditions on the partition lengths. We derive asymptotic normality results for both Markov and semi-Markov models using only classical central limit theorems and elementary results for counting processes. All results are illustrated on both simulated and real data.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops a discretization-based nonparametric estimator for transition rates (occurrence/exposure) in Markov and semi-Markov jump processes. Partition lengths for time and duration are permitted to shrink with sample size; under suitable shrinking conditions the authors derive asymptotic normality of the estimators for both model classes, relying only on classical central limit theorems and elementary counting-process results, without any structural assumptions on the true intensities. The claims are illustrated on simulated and real data.
Significance. If the asymptotic normality results hold under the stated conditions, the work supplies a flexible, assumption-light approach to local rate estimation for jump processes that accommodates irregular intensities and varying bin sizes. The exclusive use of standard probabilistic tools (CLTs and counting-process martingales) is a strength that keeps the derivations elementary and reproducible. The extension to semi-Markov models via joint time-duration discretization is a useful contribution for applications in survival analysis and reliability where duration dependence is present.
major comments (2)
- [§3 and §4] §3 (Markov case) and §4 (semi-Markov case), statements preceding Theorems 3.1 and 4.1: the claim that asymptotic normality follows from 'only classical central limit theorems and elementary results for counting processes' with 'no structural assumptions on the true intensities' is load-bearing. For the normalized increments inside each shrinking bin to satisfy a Lindeberg-type condition, the compensators must still obey uniform integrability or moment bounds; arbitrary (unbounded, discontinuous, or singular) intensities can violate this even when bin lengths shrink at the stated rate. The manuscript does not exhibit an explicit verification that the Lindeberg condition holds uniformly under the sole shrinking hypothesis.
- [§4.2] §4.2, the semi-Markov discretization: the dependence induced by conditioning on both time and duration bins is not automatically covered by the standard Poisson CLT invoked for the Markov case. The paper must show that the joint counting-process martingale still satisfies the required tightness or Lindeberg condition after the additional discretization; the current argument appears to treat the duration bins as independent of the time bins, which is not generally true.
minor comments (2)
- [§2] Notation for the occurrence/exposure ratio is introduced without a clear global definition; a single displayed equation collecting all symbols would improve readability.
- [§5] The simulation section reports coverage probabilities but does not tabulate the exact shrinking rates used for each scenario; adding a small table would make the numerical results directly comparable to the theoretical conditions.
Simulated Author's Rebuttal
We thank the referee for the careful reading, the positive assessment of the contribution, and the specific comments on the rigor of the asymptotic arguments. We address the major comments point by point below and will revise the manuscript to strengthen the proofs where needed.
read point-by-point responses
-
Referee: [§3 and §4] §3 (Markov case) and §4 (semi-Markov case), statements preceding Theorems 3.1 and 4.1: the claim that asymptotic normality follows from 'only classical central limit theorems and elementary results for counting processes' with 'no structural assumptions on the true intensities' is load-bearing. For the normalized increments inside each shrinking bin to satisfy a Lindeberg-type condition, the compensators must still obey uniform integrability or moment bounds; arbitrary (unbounded, discontinuous, or singular) intensities can violate this even when bin lengths shrink at the stated rate. The manuscript does not exhibit an explicit verification that the Lindeberg condition holds uniformly under the sole shrinking hypothesis.
Authors: We agree that an explicit verification of the Lindeberg condition strengthens the claim. The proofs apply the martingale central limit theorem to the normalized increments of the counting processes over the shrinking bins. Under the stated conditions (bin lengths δ_n → 0 with nδ_n → ∞ at appropriate rates), the maximum squared increment is o_p(1) relative to the predictable variation because the exposure per bin remains controlled and jumps are of size 1. However, the manuscript does not spell out the uniform integrability step for completely arbitrary intensities. We will add a short lemma in the appendix (or inline before Theorems 3.1 and 4.1) that verifies the Lindeberg condition directly from the shrinking hypothesis and the elementary properties of counting-process compensators, without imposing extra structural assumptions on the intensities beyond those already implicit in the existence of the compensators. revision: yes
-
Referee: [§4.2] §4.2, the semi-Markov discretization: the dependence induced by conditioning on both time and duration bins is not automatically covered by the standard Poisson CLT invoked for the Markov case. The paper must show that the joint counting-process martingale still satisfies the required tightness or Lindeberg condition after the additional discretization; the current argument appears to treat the duration bins as independent of the time bins, which is not generally true.
Authors: We accept that the joint discretization requires a multivariate martingale argument rather than a direct invocation of the univariate Poisson CLT. In §4.2 the estimator is constructed from the multivariate counting process N_{ij}(t,u) indexed by the product partition of time and duration; the compensator is the integral of the joint intensity over each (time,duration) rectangle. The predictable variation process therefore automatically incorporates the dependence between time and duration bins. We will revise the proof of Theorem 4.1 to state the multivariate martingale CLT explicitly, verify the Lindeberg condition for the vector of increments over the product partition, and add a short paragraph clarifying that the bins are not assumed independent—the joint quadratic variation accounts for any dependence induced by the semi-Markov structure. This keeps the argument within elementary counting-process results. revision: yes
Circularity Check
No circularity: derivation applies external classical CLTs to discretized counting processes
full rationale
The paper states it obtains asymptotic normality of occurrence/exposure rates for Markov and semi-Markov models by applying only classical central limit theorems and elementary counting-process results to partitions whose lengths shrink with sample size, while imposing no structural assumptions on the intensities. No equation or step reduces a claimed prediction to a fitted parameter by construction, no self-citation is invoked as a load-bearing uniqueness theorem, and the central results are not renamed empirical patterns. The derivation chain is therefore self-contained against external probabilistic benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Classical central limit theorems apply to the normalized counting processes under the shrinking partition conditions
- standard math Elementary results for counting processes hold for the occurrence and exposure times
Reference graph
Works this paper leans on
-
[1]
AALEN, O. (1978). Nonparametric Inference for a Family of Counting Processes.The Annals of Statistics6701–726
work page 1978
-
[2]
AALEN, O. O., BORGAN, O. and GJESSING, H. K. (2008).Survival and Event History Analysis.Statistics for Biology and Health. Springer New York
work page 2008
-
[3]
AMSLER, M. H. Les chaines de Markov des assurances vie, invalidité et maladie.Transactions of the 18th International Conference of Actuaries
-
[4]
ANDERSEN, P. K., BORGAN, O., GILL, R. D. and KEIDING, N. (1993).Statistical Models Based on Counting Processes. Springer-Verlag New York
work page 1993
-
[5]
ASMUSSEN, S. and STEFFENSEN, M. (2020).Risk and Insurance, 1 ed.Probability Theory and Stochastic Modelling. Springer Cham
work page 2020
-
[6]
BERAN, R. (1981). Nonparametric Regression with Randomly Censored Survival Data.Technical Report, University of California, Berkeley19
work page 1981
-
[7]
BLADT, M. and FURRER, C. Conditional Aalen–Johansen estimation.Scandinavian Journal of Statistics52873-902
-
[8]
BORGAN, O. (1984). Maximum Likelihood Estimation in Parametric Counting Process Models, with Applications to Censored Failure Time Data.Scandinavian Journal of Statistics111–16
work page 1984
-
[9]
BUCHARDT, K., FURRER, C. and SANDQVIST, O. Estimation for multistate models subject to reporting delays and incomplete event adjudication with application to disability insurance.Annals of Applied Statistics192782-2802
-
[10]
BUCHARDT, K., MØLLER, T. and SCHMIDT, K. B. Cash flows and policyholder behaviour in the semi-Markov life insurance setup.Scandinavian Actuarial Journal2015660-688
-
[11]
COHEN, S. N. and ELLIOTT, R. J. (2010).Stochastic Calculus and Application, 2 ed. Birkhäuser, New York, NY
work page 2010
-
[12]
DABROWSKA, D. M. (1987). Non-Parametric Regression with Censored Survival Time Data.Scandinavian Journal of Statistics14181–197
work page 1987
-
[13]
DABROWSKA, D. M. (1997). Smoothed Cox Regression.The Annals of Statistics251510–1540
work page 1997
-
[14]
GILL, R. D. and JOHANSEN, S. A Survey of Product-Integration with a View Toward Application in Survival Analysis.The Annals of Statistics 41501–1555
-
[15]
(2008).Durational effects and non-smooth semi-Markov models in life insurance, Doctoral dissertation
HELWICH, M. (2008).Durational effects and non-smooth semi-Markov models in life insurance, Doctoral dissertation. University of Rostock
work page 2008
-
[16]
HOEM, J. M. Markov Chain Models in Life Insurance.Blätter der DGVFM991-107
-
[17]
HOEM, J. M. The Versatility of the Markov Chain as a Tool in the Mathematics of Life Insurance.Proceedings, International Congress of Actuaries, Helsinki1988172-202
-
[18]
HOEM, J. M. Inhomogeneous Semi-Markov Processes, Select Actuarial Tables, and Duration-Dependence in Demography.Population Dynamics 1972251-296
-
[19]
HOLFORD, T. R. (1980). The Analysis of Rates and of Survivorship Using Log-Linear Models.Biometrics36299–305
work page 1980
-
[20]
JANSSEN, J. Application des processus semi-markoviens à un problème d’invalidité.Bulletin de l’Association Royale des Actuaires Français63 35-52
-
[21]
JANSSEN, J. andDEDOMINICIS, R. Finite non-homogeneous semi-Markov processes: Theoretical and computational aspects.Insurance: Math- ematics and Economics3157-165
-
[22]
(2021).Foundations of Modern Probability, 3 ed
KALLENBERG, O. (2021).Foundations of Modern Probability, 3 ed. Springer, Switzerland AG
work page 2021
-
[23]
(2012).Stochastic Models in Life Insurance.European Actuarial Academy Series
KOLLER, M. (2012).Stochastic Models in Life Insurance.European Actuarial Academy Series. Springer Berlin Heidelberg
work page 2012
-
[24]
LAIRD, N. and OLIVIER, D. (1981). Covariance Analysis of Censored Survival Data Using Log-Linear Analysis Techniques.Journal of the American Statistical Association76231–240
work page 1981
-
[25]
MCKEAGUE, I. W. and UTIKAL, K. J. (1990). Inference for a Nonlinear Counting Process Regression Model.The Annals of Statistics18 1172–1187
work page 1990
-
[26]
NELSON, W. (2000). Theory and Applications of Hazard Plotting for Censored Failure Data.Technometrics4212–25
work page 2000
-
[27]
NIELSEN, J. P. and LINTON, O. B. Kernel Estimation in a Nonparametric Marker Dependent Hazard Model.Annals of Statistics231735-1748
-
[28]
(1980).Semi-Markovsche Prozesse
NOLLAU, V. (1980).Semi-Markovsche Prozesse. De Gruyter, Berlin, Boston
work page 1980
-
[29]
PRENTICE, R. L. and GLOECKLER, L. A. (1978). Regression Analysis of Grouped Survival Data with Application to Breast Cancer Data. Biometrics3457–67
work page 1978
-
[30]
Smoothing Counting Process Intensities by Means of Kernel Functions.Annals of Statistics11453-466
RAMLAU-HANSEN, H. Smoothing Counting Process Intensities by Means of Kernel Functions.Annals of Statistics11453-466. 18 Appendix A: Additional proofs Proof of Lemma 2.2.We prove each assertion
-
[31]
The asymptotic expression follows by applying a Taylor expansion aroundt
Note that Xi,n = Z In 1(s≤Ri)N i jk(ds) and thus by random right-censoring, µjk n = Z In E[1(s<R)1(Zs−=j)]qjk(ds) = Z In P(s≤R, Z s− =j)µ jk(s)ds = Z In pc j (s)µjk(s)ds. The asymptotic expression follows by applying a Taylor expansion aroundt
-
[32]
We have ηj n = Z In E[1(s<R)1(Zs−=j)]ds= Z In pc j (s)ds= ∆ npc j (t) +O(∆ 2 n), again by a Taylor expansion
-
[33]
Start by noting that Xi,n =N i jk(tm ∧R i)−N i jk(tm−1 ∧R i). DefiningM jk(t) :=N jk(t)− R t 0 1(Zs−=j)µjk(s)ds, we have E[X 2 i,n] =E[(M jk(tm ∧R)−M jk(tm−1 ∧R)) 2] +E h Z In 1(s≤R)1(Zs−=j)µjk(s)ds 2i + 2E h (Mjk(tm ∧R)−M jk(tm−1 ∧R)) Z In 1(s≤R)1(Zs−=j)µjk(s)ds i . Sinceµ jk isC 1 close tot, for large enoughn,µ jk is continuous onI n and thus bounded. F...
-
[34]
Finally, Var(Y1,n) =E[Y 2 1,n]−(η j n)2 =E hZ In Z In 1(s∨u<R)1(Zs=j,Zu=j)dsdu i −(η j n)2 = Z In Z In P(s∨u < R, Z s =j, Z u =j)dsdu−(η j n)2. Applying a two-dimensional Taylor expansion in the point(t, t)yields Var(Y1,n) = ∆2 npc j (t) +O(∆ 2 n) as claimed. Local estimation of transition rates of jump processes through discretization19 Proof of Lemma 2....
-
[35]
By random right-censoring, E[Xi,n] =E hZ I (1) n 1(s≤R)1(um2 −1≤Us−<um2 )1(Zi s−=j)µjk(s, Us−)ds i = Z I (1) n E[1(s≤R)1(um2 −1≤Us−<um2 )1(Zs−=j)µjk(s, Us−)]ds. The joint measure of(Z t, Ut)is forF⊆ ZandG∈ B(R)given by pF (t, G) = X j∈F Z G pj(t,du), wherep j(t, u) =P(Z t =j, U t ≤u)so the inner expectation equals Z I (2) n µjk(s, v)pc j (s,dv) = Z I (2) ...
-
[36]
We have ηj n =E hZ I (1) n 1(um2 −1≤Us−<um2 )1(s≤R)1(Zs−=j)ds i = Z I (1) n pc j (s, um2)−p c j (s, um2−1)ds. Start by doing a second order Taylor expansion arounduin the second coordinate to get pc j (s, v) =p c j (s, u) +∂2pc j (s, u)(v−u) +O ∆(2) n 2 . Due to continuity of the second derivative, we may choose theO-term to be independent ofssince the in...
-
[37]
To computeVar(X i,n), we apply a similar strategy as for the Markov case. We decompose E[X 2 i,n] =E h Z I (1) n 1(s≤R)1(um2 −1≤Us−<um2 )dNjk(s) 2i =E h Z I (1) n 1(s≤R)1(um2 −1≤Us−<um2 )dMjk(s) 2i 22 +E h Z I (1) n 1(s≤R)1(um2 −1≤Us−<um2 )1(Zs−=j)µjk(s, Us−)ds 2i + 2E h Z I (1) n 1(s≤R)1(um2 −1≤Us−<um2 )dMjk(s) · Z I (1) n 1(s≤R)1(um2 −1≤Us−<um2 )1(Zs−=j...
-
[38]
Finally, E[Y 2 1,n] = Z I (1) n Z I (1) n P(s1, s2 ≤R, u m2−1 ≤U(s 1−), U(s2−)< u m2 , Zs1− =j, Z s2− =j)ds 1ds2 = Z I (1) n Z I (1) n pc j (s1, s2, um2)−p c j (s1, s2, um2−1)ds1ds2 withp c j (s1, s2, v)defined in the proof of 3. Now apply a second order Taylor expansion aroundv=uto get pc j (s1, s2, um2)−p c j (s1, s2, um2−1) =∂ 2pc j (s1, s2, u)∆(2) n +...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.