pith. machine review for the scientific record. sign in

arxiv: 2604.18118 · v2 · submitted 2026-04-20 · 💱 q-fin.RM · cond-mat.stat-mech

Recognition: no theorem link

Contagion or Macroeconomic Fluctuations? Identifiability in Aggregated Default Data

Authors on Pith no claims yet

Pith reviewed 2026-05-12 01:46 UTC · model grok-4.3

classification 💱 q-fin.RM cond-mat.stat-mech
keywords default contagionaggregate dataidentifiabilitymacroeconomic fluctuationsdefault modelingvariance decompositionfinancial risk
0
0 comments X

The pith

Most annual default clustering arises from cross-year shifts in conditions rather than within-year contagion after aggregation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper asks whether contagion produces identifiable signatures in yearly aggregated default counts that cannot be explained by macroeconomic fluctuations alone. It tests three dependence structures: cumulative contagion from the Lo-Davis model, threshold contagion from the Torri model, and common-factor dependence from the Vasicek model. In an i.i.d. setup the common-factor model fits the observed counts and tails best. When default probabilities are permitted to vary freely across years, the bulk of variation is absorbed by these macro movements. Threshold contagion disappears into the macro term, while cumulative contagion leaves a small persistent residual visible in both variance and tail statistics.

Core claim

Under the hierarchical specification, cross-year movements in default conditions account for most variation in annual default counts, leaving threshold-type contagion with no separable stable component while cumulative contagion retains a small but detectable persistent element in variance decomposition and tail behavior.

What carries the argument

Variance decomposition and tail comparison across the Lo-Davis cumulative contagion model, Torri threshold contagion model, and Vasicek common-factor model, first under i.i.d. assumptions and then under a hierarchical model that lets default probability vary by year.

If this is right

  • Threshold contagion is absorbed into macro variation and cannot be recovered from aggregate counts.
  • Cumulative contagion leaves a small stable component that survives aggregation.
  • The Vasicek common-factor structure fits observed clustering better than either contagion model under i.i.d. conditions.
  • Identifiability of contagion from coarse data depends on the precise form of the interaction mechanism.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Finer-grained data on individual defaults or timing within years would be needed to separate contagion from macro effects more reliably.
  • Risk models that rely solely on aggregate counts may attribute cumulative-contagion effects to macro shocks instead.
  • The same identifiability limits may apply to other aggregated contagion settings such as interbank lending or supply-chain defaults.

Load-bearing premise

The three chosen dependence structures are the only relevant ones, and letting default probability change freely each year fully captures macroeconomic heterogeneity without leftover confounding.

What would settle it

After fitting the hierarchical macro component to real annual default counts, check whether the remaining variance and extreme tail quantiles match the small persistent residual predicted by the Lo-Davis model or the zero residual predicted by the Torri model.

Figures

Figures reproduced from arXiv: 2604.18118 by Shintaro Mori.

Figure 1
Figure 1. Figure 1: FIG. 1. Comparison of loss distributions under matched mean default probability [PITH_FULL_IMAGE:figures/full_fig_p010_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. Comparison of model-implied distributions and tail behavior at the fixed portfolio size [PITH_FULL_IMAGE:figures/full_fig_p015_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3. Ratio of the model-implied correlation contribution to the empirical variance of the [PITH_FULL_IMAGE:figures/full_fig_p017_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4. Variance decomposition in the hierarchical Lo–Davis and hierarchical Torri models, shown [PITH_FULL_IMAGE:figures/full_fig_p020_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Survival functions of scaled default counts under hierarchical specifications. The hier [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Illustration of the residual degree of freedom in the Torri model under fixed ( [PITH_FULL_IMAGE:figures/full_fig_p028_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7. Time series of annual default rates for ALL, SG, and IG portfolios. The shaded region [PITH_FULL_IMAGE:figures/full_fig_p030_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: FIG. 8. Comparison of model-implied distributions and tail behavior at the fixed portfolio size [PITH_FULL_IMAGE:figures/full_fig_p032_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: FIG. 9. Comparison of hierarchical model fits across classes. [PITH_FULL_IMAGE:figures/full_fig_p033_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: FIG. 10. Variance decomposition by subperiod (1950–1979 vs. 1980–2023). Contributions are [PITH_FULL_IMAGE:figures/full_fig_p035_10.png] view at source ↗
read the original abstract

Can contagion be inferred from aggregated default data? We study this as a problem of identifiability, asking whether contagion generates components in default count distributions that remain distinct from those induced by macroeconomic fluctuations. We compare three dependence structures: cumulative contagion in the Lo-Davis model, threshold-type contagion in the Torri model, and common-factor dependence in the Vasicek model. Under an i.i.d. specification, the Vasicek model provides the best overall fit, especially in the tail, indicating that a smooth mixture structure captures annual default clustering more effectively than threshold-type contagion at the aggregate level. We then allow the default probability to vary across years through a hierarchical specification. Under this extension, most of the variation in annual default counts is explained by cross-year movements in default conditions rather than by within-year contagion. What remains, however, depends on the interaction mechanism. In the Torri model, threshold-type contagion does not leave a stable component that can be separated from macroeconomic heterogeneity after aggregation. In the Lo-Davis model, by contrast, a small but persistent component remains visible in both the variance decomposition and the tail behavior. These results clarify when contagion can still be inferred from coarse-grained data and when it is effectively absorbed into macroeconomic variation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 3 minor

Summary. The paper investigates whether contagion effects can be identified separately from macroeconomic fluctuations in aggregated annual default count data. It compares three dependence structures—cumulative contagion (Lo-Davis model), threshold-type contagion (Torri model), and common-factor dependence (Vasicek model)—first under an i.i.d. specification and then under a hierarchical extension that allows default probabilities to vary freely across years. The central finding is that cross-year macro variation absorbs most clustering, with residual contagion effects being model-dependent: absent after aggregation in the Torri model but leaving a small persistent component visible in variance decompositions and tail behavior for the Lo-Davis model.

Significance. If the results hold, the work clarifies the limits of inferring contagion from coarse-grained data, which is directly relevant to credit risk modeling and regulatory capital calculations. The hierarchical approach combined with explicit variance decompositions and tail comparisons provides a quantitative framework for separating mechanisms, and the model-specific findings (no separable component in Torri versus small persistent in Lo-Davis) offer falsifiable distinctions that can guide empirical practice.

major comments (1)
  1. [Hierarchical specification and variance decomposition] Hierarchical extension (around the variance decomposition): the claim that year-specific default probabilities fully absorb macroeconomic heterogeneity and leave only model-dependent residuals is load-bearing for the conclusion that Torri contagion is absorbed while Lo-Davis retains a persistent component. Without robustness checks that include observable macro covariates (e.g., GDP or interest rates) alongside the year-specific probabilities, residual confounding cannot be ruled out, potentially altering the interpretation of the Lo-Davis tail and variance results.
minor comments (3)
  1. [Abstract and §1] The abstract and introduction would benefit from an explicit statement of the sample period, data source for default counts, and number of years/observations used in the fits.
  2. [Model definitions and estimation] Notation for the contagion intensity parameters and the hierarchical prior on default probabilities should be unified across the model definitions and estimation sections to avoid reader confusion.
  3. [i.i.d. model comparison] The tail behavior comparisons would be strengthened by reporting exact p-values or test statistics rather than qualitative statements about 'especially in the tail.'

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive and insightful report. We address the single major comment below and have incorporated a partial revision to clarify the rationale and limitations of our hierarchical approach.

read point-by-point responses
  1. Referee: Hierarchical extension (around the variance decomposition): the claim that year-specific default probabilities fully absorb macroeconomic heterogeneity and leave only model-dependent residuals is load-bearing for the conclusion that Torri contagion is absorbed while Lo-Davis retains a persistent component. Without robustness checks that include observable macro covariates (e.g., GDP or interest rates) alongside the year-specific probabilities, residual confounding cannot be ruled out, potentially altering the interpretation of the Lo-Davis tail and variance results.

    Authors: We thank the referee for this important observation. The hierarchical specification is deliberately constructed to let default probabilities vary freely across years, thereby capturing all macroeconomic heterogeneity in a nonparametric manner rather than through specific observables. This design isolates the identifiability question: whether within-year contagion mechanisms generate distinguishable components once cross-year macro variation is fully absorbed. We agree that adding observable covariates such as GDP or interest rates would constitute a valuable robustness exercise. However, our focus is on the limits of inference from aggregated data without requiring detailed macro series, which is the typical setting for such models. We have added a new paragraph in Section 4.3 of the revised manuscript that explicitly discusses this point, notes the potential for residual confounding, and clarifies that the model-specific findings (absorption in Torri versus small persistent component in Lo-Davis) are conditional on the flexible year effects. This preserves the core identifiability conclusions while acknowledging the referee's concern. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper compares three posited dependence structures (Lo-Davis cumulative contagion, Torri threshold contagion, Vasicek common factor) via i.i.d. and hierarchical fits to aggregated default counts, then reports variance decompositions and tail behavior under the hierarchical extension. These steps are standard model-based inference: parameters are estimated from data, and conclusions about what remains after cross-year default-probability variation are direct outputs of the fitted models rather than inputs renamed as predictions. No self-citation is invoked as load-bearing justification, no quantity is defined in terms of itself, and no uniqueness theorem or ansatz is smuggled from prior author work. The derivation chain remains self-contained against the external data and the three explicit model specifications.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption that the three chosen models exhaust the relevant dependence structures and that year-specific default probabilities fully isolate macroeconomic effects; free parameters include the year-varying default probabilities and the contagion intensity parameters fitted within each model.

free parameters (2)
  • year-specific default probabilities
    Allowed to vary freely across years in the hierarchical specification to represent macroeconomic fluctuations
  • contagion intensity parameters
    Fitted separately in the Lo-Davis cumulative and Torri threshold models
axioms (2)
  • domain assumption The Lo-Davis, Torri, and Vasicek models correctly represent the possible forms of default dependence
    Invoked when using their fits to decide which mechanism leaves a separable component
  • ad hoc to paper Aggregated annual counts contain enough information to distinguish the mechanisms after hierarchical adjustment
    The identifiability question itself assumes this property of the data

pith-pipeline@v0.9.0 · 5522 in / 1559 out tokens · 44170 ms · 2026-05-12T01:46:08.811198+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages

  1. [1]

    We define the contagion indicator I C i,n := 1− Y j̸=i (1−Y ijXj),(A3) so that Zi =X i + (1−X i)I C i,n.(A4) LetK= Pn i=1 Xi denote the number of idiosyncratic defaults

    A.1 Lo–Davis model In the Lo–Davis model, the final default indicator is given by Zi =X i + (1−X i) 1− Y j̸=i (1−Y ijXj) ! ,(A1) where Xi ∼Bernoulli(p), Y ij ∼Bernoulli(q),(A2) and all variables are independent. We define the contagion indicator I C i,n := 1− Y j̸=i (1−Y ijXj),(A3) so that Zi =X i + (1−X i)I C i,n.(A4) LetK= Pn i=1 Xi denote the number of...

  2. [2]

    Define the global contagion indicator I C n := 1 nX j=1 XjVj >0 ! .(A14) Then Zi =X i + (1−X i)(1−U i)I C n .(A15) 25 a

    A.2 Torri model (infection with immunization) In the Torri model, the final default indicator is Zi =X i + (1−X i)(1−U i) 1− nY j=1 (1−X jVj) ! ,(A12) where Xi ∼Bernoulli(p), U i ∼Bernoulli(u), V i ∼Bernoulli(v),(A13) and all variables are independent. Define the global contagion indicator I C n := 1 nX j=1 XjVj >0 ! .(A14) Then Zi =X i + (1−X i)(1−U i)I ...

  3. [3]

    Torri:1 nX j=1 XjVj >0 ! .(A25) Thus, •Lo–Davis generatessmooth clustering, •Torri generatesdiscrete regime switching

    Structural comparison The two models generate dependence in fundamentally different ways: •Lo–Davis:local cumulative contagion viaI C i,n •Torri:global threshold contagion viaI C n Mathematically, Lo–Davis: 1− Y j̸=i (1−Y ijXj) vs. Torri:1 nX j=1 XjVj >0 ! .(A25) Thus, •Lo–Davis generatessmooth clustering, •Torri generatesdiscrete regime switching. This s...

  4. [4]

    Activation probability and tail-risk trade-off in the Torri model To illustrate the residual degree of freedom in the Torri model, we examine how variation in the parameterpalong the iso-(m, ρ) manifold affects both the activation probabilityπ n and the resulting tail risk. Although the activation probability πn = 1−(1−pv) n 27 0.004 0.006 0.008 0.010 0.0...

  5. [5]

    For each yeart, we observe the total number of obligorsn t and the number of defaultsL t, for three credit classes: ALL, SG (speculative grade), and IG (investment grade)

    Data description The empirical analysis is based on annual default count data. For each yeart, we observe the total number of obligorsn t and the number of defaultsL t, for three credit classes: ALL, SG (speculative grade), and IG (investment grade). The default rate is defined asL t/nt. Since only aggregated default counts are available, the underlying n...

  6. [6]

    The mean default rate corresponds to the simple average of yearly default rates, while the total default rate is computed as the exposure-weighted average across all years

    Summary statistics Table VIII reports summary statistics for two subperiods (1950–1979 and 1980–2023). The mean default rate corresponds to the simple average of yearly default rates, while the total default rate is computed as the exposure-weighted average across all years. TABLE VIII. Summary statistics of annual default counts by subperiod. Class Perio...

  7. [7]

    The shaded region corresponds to the early subperiod (1950–1979)

    Time series properties Figure 7 shows the time series of default rates for ALL, SG, and IG portfolios. The shaded region corresponds to the early subperiod (1950–1979). The series exhibit pronounced clustering of defaults over time, with periods of elevated default activity followed by relatively calm phases. This clustering is particularly evident for SG...

  8. [8]

    The implied pairwise correlation is then defined by ρ= P(Zi = 1, Zj = 1)−m 2 m(1−m)

    Computation of aggregated moments Because the pool sizen t varies across years, the model-implied mean default probability mand joint default probabilityP(Z i = 1, Z j = 1) are first evaluated at each observedn t and then aggregated using the same weighting scheme as in the data: m= P t nt m(nt)P t nt ,P(Z i = 1, Zj = 1) = P t nt(nt −1)P(Z i = 1, Zj = 1|n...

  9. [9]

    For completeness, we report the corresponding results for the SG portfolio in this appendix

    Additional distributional results for the SG class In the main text, we report the PMF and survival function for the ALL and IG portfolios. For completeness, we report the corresponding results for the SG portfolio in this appendix. Figure 8 reports the corresponding PMFs and survival functions for the SG class at the fixed portfolio sizen= ¯n. The same q...

  10. [10]

    For completeness, we also report the corresponding PMFs together with the class-specific survival functions for the ALL, SG, and IG portfolios

    Additional distributional results In the main text, we compare the survival functions under the hierarchical specifications. For completeness, we also report the corresponding PMFs together with the class-specific survival functions for the ALL, SG, and IG portfolios. Figure 9 shows the corresponding PMFs and class-specific survival functions under the hi...

  11. [11]

    The relative ranking of models varies across classes and periods

    Likelihood comparison by subperiod Tables IX and X report the maximum likelihood estimates and the corresponding negative log-likelihood (nll) values for each subperiod. The relative ranking of models varies across classes and periods. In particular, the Torri 32 0 50 100 150 200 h 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09PMF ALL 0 5 10 15 20 25 3...

  12. [12]

    Figure 10 decomposes the empirical variance into contributions from i.i.d

    Variance decomposition by subperiod We therefore turn to the variance decomposition, which is central to the present paper. Figure 10 decomposes the empirical variance into contributions from i.i.d. shocks, contagion, and time variation in the default probabilityp t. A clear structural difference emerges across periods. In the early period (1950–1979), th...

  13. [13]

    R. N. Mantegna and H. E. Stanley,An Introduction to Econophysics: Correlations and Com- plexity in Finance(Cambridge University Press, 1999)

  14. [14]

    Galam, Int

    S. Galam, Int. J. Mod. Phys. C19, 409 (2008)

  15. [15]

    Lux, Econ

    T. Lux, Econ. J.105, 881 (1995)

  16. [16]

    Lux and M

    T. Lux and M. Marchesi, Nature397, 498 (1999)

  17. [17]

    Alfarano, T

    S. Alfarano, T. Lux, and F. Wagner, Comput. Econ.26, 19 (2005)

  18. [18]

    Bouchaud, M

    J.-P. Bouchaud, M. M´ ezard, and M. Potters, Quant. Finance2, 251 (2002)

  19. [19]

    Fernandez-Gracia, K

    J. Fernandez-Gracia, K. Suchecki, J. J. Ramasco, M. SanMiguel, and V. M. Egu´ ıluz, Phys. Rev. Lett.112, 158701 (2014)

  20. [20]

    S. Mori, M. Hisakado, and T. Takahashi, Phys. Rev. E86, 026109 (2012)

  21. [21]

    S. Mori, K. Nakayama, and M. Hisakado, Phys. Rev. E99, 052307 (2019)

  22. [22]

    Smolyak and S

    A. Smolyak and S. Havlin, Entropy24, 271 (2022)

  23. [23]

    P. J. Sch¨ onbucher,Credit Derivatives Pricing Models: Models, Pricing and Implementation (John Wiley & Sons, 2003). 36

  24. [24]

    S. R. Das, D. Duffie, N. Kapadia, and L. Saita, J. Finance62, 93 (2007)

  25. [25]

    M. H. A. Davis and V. Lo, Quant. Finance1, 382 (2001)

  26. [26]

    O. A. Vasicek, KMV Corporation (1991), working paper

  27. [27]

    O. A. Vasicek, Risk15, 160 (2002)

  28. [28]

    Montagna, G

    M. Montagna, G. Torri, and G. Covi,On the Origin of Systemic Risk, Tech. Rep. 2502 (European Central Bank, 2020) ECB Working Paper

  29. [29]

    Torri, R

    G. Torri, R. Giacometti, and G. Farina, Commun. Nonlinear Sci. Numer. Simul.159, 109886 (2026)

  30. [30]

    Sakata, M

    A. Sakata, M. Hisakado, and S. Mori, J. Phys. Soc. Jpn.76, 054801 (2007)

  31. [31]

    Azizpour, K

    S. Azizpour, K. Giesecke, and G. Schwenkler, J. Financ. Econ.129, 154 (2018)

  32. [32]

    A. G. Hawkes, Biometrika58, 83 (1971)

  33. [33]

    Errais, K

    E. Errais, K. Giesecke, and L. R. Goldberg, SIAM J. Financial Math.1, 642 (2010)

  34. [34]

    Kirchner, Quant

    M. Kirchner, Quant. Finance17, 571 (2017)

  35. [35]

    Blanc, J

    P. Blanc, J. Donier, and J.-P. Bouchaud, Quant. Finance17, 171 (2017)

  36. [36]

    de Finetti,Theory of Probability(Wiley, 1974)

    B. de Finetti,Theory of Probability(Wiley, 1974). 37