Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework
Pith reviewed 2026-05-23 02:45 UTC · model grok-4.3
The pith
Multistate models of increasing complexity successively outperform simpler ones in estimating the term-structure of default risk for loans.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes that modelling the transition probabilities of a nonstationary semi-Markov model for loan default behaviour as explicit functions of macroeconomic and loan-level variables produces successively better term-structure estimates as complexity rises from Markov chain to beta regression to multinomial logistic regression, demonstrated on residential mortgage data and aided by new diagnostic tools.
What carries the argument
Multistate regression-based approach that models transition probabilities explicitly as functions of input variables, applied in ascending order of complexity from Markov chain to beta regression to multinomial logistic regression.
If this is right
- Each successive model outperforms the previous due to greater sophistication.
- Novel model diagnostics can be reused to assess sampling representativeness and other modelling techniques.
- Estimation of loss reserves will be more timeous and accurate under IFRS 9.
- The nonstationary semi-Markov behaviour of loans can be captured through regression on loan-level and macroeconomic inputs.
Where Pith is reading between the lines
- The framework may extend to other loan portfolios if the diagnostics confirm representativeness across different data sets.
- Banks could embed the multinomial logistic version directly into IFRS 9 reporting systems to reduce provisioning volatility.
- The diagnostics themselves could serve as a general check for whether any multistate model respects the original loan cohort distribution.
Load-bearing premise
The assumption that outperformance across the three models can be attributed to increasing sophistication rather than differences in how the same data set was prepared or split for each technique.
What would settle it
Re-running all three models on identical data splits and preparations and finding that the performance gains with complexity disappear or reverse.
Figures
read the original abstract
The lifetime behaviour of loans is notoriously difficult to model, which can compromise a bank's financial reserves against future losses, if modelled poorly. Therefore, we present a data-driven comparative study amongst three techniques in modelling a series of default risk estimates over the lifetime of each loan, i.e., its term-structure. The behaviour of loans can be described using a nonstationary and time-dependent semi-Markov model, though we model its elements using a multistate regression-based approach. As such, the transition probabilities are explicitly modelled as a function of a rich set of input variables, including macroeconomic and loan-level inputs. Our modelling techniques are deliberately chosen in ascending order of complexity: 1) a Markov chain; 2) beta regression; and 3) multinomial logistic regression. Using residential mortgage data, our results show that each successive model outperforms the previous, likely as a result of greater sophistication. This finding required devising a novel suite of simple model diagnostics, which can itself be reused in assessing sampling representativeness and the performance of other modelling techniques. These contributions surely advance the current practice within banking when conducting multistate modelling. Consequently, we believe that the estimation of loss reserves will be more timeous and accurate under IFRS 9.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a comparative study of three multistate techniques for modeling the term-structure of default risk under IFRS 9, ordered by ascending complexity (Markov chain, beta regression, multinomial logistic regression). Using residential mortgage data, it claims that each successive model outperforms its predecessor due to greater sophistication and introduces a novel suite of simple model diagnostics for assessing sampling representativeness and performance.
Significance. If the outperformance is demonstrated via consistent out-of-sample protocols and identical data handling, the work could improve the accuracy and timeliness of lifetime expected credit loss estimates required under IFRS 9, while the proposed diagnostics would offer reusable tools for multistate modeling in banking practice.
major comments (2)
- [Abstract] Abstract: The claim that 'each successive model outperforms the previous' supplies no quantitative metrics, error bars, tables, or description of the validation protocol (train/test splits, cross-validation procedure, or hold-out rules), preventing any assessment of whether gains are attributable to model complexity.
- [Abstract] Abstract: Attribution of results to 'greater sophistication' requires that the three techniques were applied under identical conditions (same feature sets, sampling procedures, and performance measures); no such information is provided, so the central causal claim cannot be evaluated.
Simulated Author's Rebuttal
We thank the referee for their detailed review and constructive comments on the abstract. We address each point below and will revise the manuscript accordingly to improve clarity.
read point-by-point responses
-
Referee: [Abstract] Abstract: The claim that 'each successive model outperforms the previous' supplies no quantitative metrics, error bars, tables, or description of the validation protocol (train/test splits, cross-validation procedure, or hold-out rules), preventing any assessment of whether gains are attributable to model complexity.
Authors: We agree that the abstract, due to length constraints, omits quantitative metrics and validation details. The full manuscript provides these in the results and methodology sections, including out-of-sample performance tables, error metrics, and the train/test/hold-out protocol applied consistently across models. We will revise the abstract to incorporate key quantitative summary results and a concise description of the validation protocol. revision: yes
-
Referee: [Abstract] Abstract: Attribution of results to 'greater sophistication' requires that the three techniques were applied under identical conditions (same feature sets, sampling procedures, and performance measures); no such information is provided, so the central causal claim cannot be evaluated.
Authors: The models were applied under identical conditions, using the same feature sets, sampling procedures, and performance measures, as specified in the methodology section. The abstract's brevity did not explicitly restate this. We will revise the abstract to include a statement confirming that all comparisons used identical data handling and evaluation protocols. revision: yes
Circularity Check
No circularity in derivation chain
full rationale
The paper is an empirical comparative study of three regression techniques (Markov chain, beta regression, multinomial logistic regression) applied to residential mortgage data for term-structure default modeling under IFRS 9. The abstract reports observed outperformance across models but contains no equations, first-principles derivations, or predictions that reduce to fitted inputs by construction. No self-citations, uniqueness theorems, ansatzes, or renamings of known results appear in the provided text. The central claim is an empirical finding on data rather than a closed mathematical loop, making the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Our modelling techniques are deliberately chosen in ascending order of complexity: 1) a Markov chain; 2) beta regression; and 3) multinomial logistic regression.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Using residential mortgage data, our results show that each successive model outperforms the previous...
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Adha, M., Nurrohmah, S., & Abdullah, S. (2018). Multinomial logistic regression and spline regression for credit risk modelling.Journal of Physics: Conference Series, 1108(1), 012019.https://doi.org/10. 1088/1742-6596/1108/1/012019
work page 2018
-
[2]
(2007).An introduction to categorical data analysis(2nd ed.)
Agresti, A. (2007).An introduction to categorical data analysis(2nd ed.). John Wiley; Sons.https : //doi.org/10.1002/0470114754
-
[3]
Aitchison, J. (1982). The Statistical Analysis of Compositional Data.Journal of the Royal Statistical Society. Series B (Methodological),44(2), 139–177.http://leg.est.ufpr.br/lib/exe/fetch.php/ pessoais:abtmartins:thestatisticalanalysisofcompositionaldata.pdf
work page 1982
-
[4]
Akaike, H. (1998). Information theory and an extension of the maximum likelihood principle. In E. Parzen, K. Tanabe, & G. Kitagawa (Eds.),Selected papers of hirotugu akaike(pp. 199–213). Springer New York. https://doi.org/10.1007/978-1-4612-1694-0_15
-
[5]
Anderson, T. W., & Goodman, L. A. (1957). Statistical inference about Markov chains.The Annals of Mathematical Statistics, 89–110.https://doi.org/10.1214/aoms/1177707039
-
[6]
Arundina, T., Azmi Omar, M., & Kartiwi, M. (2015). The predictive accuracy of Sukuk ratings; Multinomial logistic and neural network inferences.Pacific-Basin Finance Journal,34, 273–292.https://doi.org/10. 1016/j.pacfin.2015.03.002
work page 2015
-
[7]
(2016).Credit risk analytics: Measurement techniques, applications, and examples in SAS
Baesens, B., Rösch, D., & Scheule, H. (2016).Credit risk analytics: Measurement techniques, applications, and examples in SAS. John Wiley & Sons
work page 2016
-
[8]
BCBS. (2019).The Basel framework. Bank of International Settlements: Basel Committee on Banking Supervision (BCBS). Switzerland.https://www.bis.org/basel_framework/
work page 2019
- [9]
-
[10]
Botha, A., & Verster, T. (2025). Approaches for modelling the term-structure of default risk under IFRS 9: A tutorial using discrete-time survival analysis. Retrieved August 11, 2025, fromhttps://www.resea rchgate.net/publication/393890374_Approaches_for_modelling_the_term-structure_of_ default_risk_under_IFRS_9_A_tutorial_using_discrete-time_survival_analysis
-
[11]
Botha, A. (2021).A procedure for loss-optimising the timing of loan recovery under uncertainty[Doctoral dissertation, University of Pretoria].https://doi.org/10.13140/RG.2.2.12015.30888/2 33 Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework
-
[12]
Botha, A., Beyers, C., & De Villiers, P. (2021). Simulation-based optimisation of the timing of loan recovery across different portfolios.Expert Systems with Applications,177. https://doi.org/10.1016/j.eswa. 2021.114878
-
[13]
Botha, A., & Breedt, R. (2025). Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework [Source Code].https://doi.org/10.5281/zenodo.14899815
-
[14]
Botha, A., Oberholzer, E., Larney, J., & De Jongh, R. (2025). Defining and comparing SICR-events for classifying impaired loans under IFRS 9.Annals of Operations Research. https://doi.org/10.1007/ s10479-025-06546-3
work page 2025
-
[15]
JournalofBanking&Finance ,34(5),903–911.https://doi.org/10.1016/j.jbankfin.2009.10.001
Calabrese,R.,&Zenga,M.(2010).Bankloanrecoveryrates:Measuringandnonparametricdensityestimation. JournalofBanking&Finance ,34(5),903–911.https://doi.org/10.1016/j.jbankfin.2009.10.001
-
[16]
Chamboko, R., & Bravo, J. M. (2020). A multi-state approach to modelling intermediate events and multiple mortgage loan outcomes.Risks,8(2). https://doi.org/10.3390/risks8020064
-
[17]
Corcoran, A. W. (1978). The use of exponentially-smoothed transition matrices to improve forecasting of cash flows from accounts receivable.Management Science, 24(7), 732–739.https://doi.org/10.1287/ mnsc.24.7.732
work page 1978
-
[18]
Cribari-Neto, F., & Zeileis, A. (2010). Beta regression in R.Journal of statistical software,34(2), 1–24
work page 2010
-
[19]
Crook,J.,&Bellotti,T.(2010).Timevaryinganddynamicmodelsfordefaultriskinconsumerloans. Journal of the Royal Statistical Society: Series A (Statistics in Society), 173(2), 283–305.https://doi.org/10. 1111/j.1467-985x.2009.00617.x
-
[20]
Cyert, R. M., Davidson, H. J., & Thompson, G. L. (1962). Estimation of the allowance for doubtful accounts by Markov chains.Management Science, 8(3), 287–303.http://www.jstor.org/stable/2627386
-
[21]
DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach.Biometrics,44(3), 837–845. https://doi.org/10.2307/2531595
-
[22]
Djeundje, V. B., & Crook, J. (2018). Incorporating heterogeneity and macroeconomic variables into multi- state delinquency models for credit cards.European Journal of Operational Research, 271(2), 697–709. https://doi.org/10.1016/j.ejor.2018.05.040
-
[23]
Measuring skewness: A forgotten statistic?
Doane, D. P., & Seward, L. E. (2011). Measuring skewness: A forgotten statistic?Journal of Statistics Education, 19(2). https://doi.org/10.1080/10691898.2011.11889611
-
[24]
European Parliament. (2013). Regulation (EU) No 575/2013 of the European Parliament and of the Council of 26 June 2013 on prudential requirements for credit institutions and investment firms and amending Regulation (EU) No 648/2012 Text with EEA relevance.Official Journal of the European Union. Retrieved October 8, 2019, fromhttps://eur-lex.europa.eu/lega...
work page 2013
-
[25]
(2018).Impairment of financial instruments under IFRS 9(tech
EY. (2018).Impairment of financial instruments under IFRS 9(tech. rep.). Ernst & Young Global Limited. London. https://www.ey.com/en_gl/ifrs-technical-resources/impairment-of-financial- instruments-under-ifrs-9
work page 2018
-
[26]
Fawcett, T. (2006). An introduction to ROC analysis.Pattern recognition letters, 27(8), 861–874.https: //doi.org/10.1016/j.patrec.2005.10.010
-
[27]
Ferrari,S.,&Cribari-Neto,F.(2004).Betaregressionformodellingratesandproportions. Journalofapplied statistics,31(7), 799–815.https://doi.org/10.1080/0266476042000214501 34 Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework
-
[28]
(2024).Unavoidable model risk in expected credit loss models under IFRS 9 and CECL standards (tech
Forrest, A. (2024).Unavoidable model risk in expected credit loss models under IFRS 9 and CECL standards (tech. rep.). Centre for Risk, Credit and Finance, University of Edinburgh. Edinburgh. Retrieved August 11, 2025, fromhttps://crc.business-school.ed.ac.uk/sites/crc/files/2024-09/CECL-IFRS- 9-MRM-AF-report-220924.pdf
work page 2024
-
[29]
Frydman, H., Kallberg, J. G., & Kao, D.-L. (1985). Testing the adequacy of Markov chain and mover- stayer models as representations of credit behavior.Operations Research, 33(6), 1203–1214.https : //doi.org/10.1287/opre.33.6.1203
-
[30]
Frydman, H., & Schuermann, T. (2008). Credit rating dynamics and Markov mixture models.Journal of Banking & Finance, 32(6), 1062–1075.https://doi.org/10.1016/j.jbankfin.2007.09.013
-
[31]
Gaffney, E., Kelly, R., & McCann, F. (2014).A transitions-based framework for estimating expected credit losses(Research Technical Papers No. 16/RT/14). Central Bank of Ireland.https://ideas.repec.org/ p/cbi/wpaper/16-rt-14.html
work page 2014
-
[32]
Grimshaw, S. D., & Alexander, W. P. (2011). Markov chain models for delinquency: Transition matrix estimation and forecasting.Applied Stochastic Models in Business and Industry, 27(3), 267–279.https: //doi.org/10.1002/asmb.827
-
[33]
Hand, D. J., & Henley, W. E. (1997). Statistical classification methods in consumer credit scoring: A review.Journal of the Royal Statistical Society: Series A (Statistics in Society), 160(3), 523–541.https: //doi.org/10.1111/j.1467-985X.1997.00078.x
-
[34]
Hao, C., Alam, M., & Carling, K. (2010). Review of the literature on credit risk modeling: Development of the past 10 years.Banks and Bank Systems, 5(3), 43–60.http://urn.kb.se/resolve?urn=urn:nbn: se:du-4687
work page 2010
-
[35]
Heinze, G., Wallisch, C., & Dunkler, D. (2018). Variable selection–a review and recommendations for the practicing statistician.Biometrical journal, 60(3), 431–449.https://doi.org/10.1002/bimj. 201700067
-
[36]
Hosmer, D. W., & Lemeshow, S. (2000).Applied logistic regression(2nd ed.). John Wiley & Sons, Inc. https://doi.org/10.1002/0471722146
-
[37]
Huang, X., & Oosterlee, C. (2011). Generalized beta regression models for random loss-given-default.The Journal of Credit Risk,7(4), 45–70.https://doi.org/10.21314/jcr.2011.150
-
[38]
(2014).International financial reporting standard (IFRS) 9: Financial instruments
IASB. (2014).International financial reporting standard (IFRS) 9: Financial instruments. IFRS Foundation: International Accounting Standards Board (IASB). London.https://www.ifrs.org/issued-standard s/list-of-standards/ifrs-9-financial-instruments/
work page 2014
-
[39]
Jaber, J. J., Ismail, N., Ramli, S. N. M., Albadareen, B., & Hamadneh, N. N. (2020). Estimating loss given default based on beta regression.Computers, Materials and Continua, 66(3), 3329–3344.https: //doi.org/10.32604/cmc.2021.014509
-
[40]
Jakubík, P., & Teleu, S. (2025). Improving credit risk assessment in uncertain times: Insights from IFRS 9. Risks, 13(2), 38.https://doi.org/10.3390/risks13020038
-
[41]
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013).An introduction to statistical learning: With applications in R. Springer.https://doi.org/10.1007/978-1-4614-7138-7
-
[42]
Kelly,R.,&O’Malley,T.(2016).Thegood,thebadandtheimpaired:AcreditriskmodeloftheIrishmortgage market.Journal of Financial Stability,22, 1–9.https://doi.org/10.1016/j.jfs.2015.09.005
-
[43]
Kiefer, N. M. (2007).The probability approach to default probabilities. Cornell University. OCC Economics Working Paper.https://kiefer.economics.cornell.edu/ProbabilityApproachWP.pdf 35 Modelling the term-structure of default risk under IFRS 9 within a multistate regression framework
work page 2007
-
[44]
Leow, M., & Crook, J. (2014). Intensity models and transition probabilities for credit card loan delinquencies. European Journal of Operational Research, 236(2), 685–694.https://doi.org/10.1016/j.ejor. 2013.12.026
-
[45]
Lin, M., Lucas Jr, H. C., & Shmueli, G. (2013). Too big to fail: Large samples and the p-value problem. Information Systems Research,24(4), 906–917.https://doi.org/10.1287/isre.2013.0480
-
[46]
Louzada,F.,Ara, A.,&Fernandes,G. B.(2016).Classification methodsappliedtocreditscoring:Systematic review and overall comparison.Surveys in Operations Research and Management Science,21(2), 117–134. https://doi.org/10.1016/j.sorms.2016.10.001
-
[47]
McFadden, D. (1972). Conditional logit analysis of qualitative choice behavior.https://escholarship. org/uc/item/61s3q2xr
work page 1972
-
[48]
Menard, S. (2000). Coefficients of determination for multiple logistic regression analysis.The American Statistician,54(1),17–24.https://doi.org/https://doi.org/10.1080/00031305.2000.10474502
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1080/00031305.2000.10474502 2000
-
[49]
Norris, J. R. (1997).Markov chains. Cambridge University Press. https : / / doi . org / 10 . 1017 / CBO9780511810633
work page 1997
-
[51]
Pasricha, P., Selvamuthu, D., & Arunachalam, V. (2017). Markov regenerative credit rating model.Journal of Risk Finance, 18, 311–325.https://doi.org/10.1108/JRF-09-2016-0123
-
[52]
Perperoglou, A., Sauerbrei, W., Abrahamowicz, M., & Schmid, M. (2019). A review of spline function procedures in r.BMC medical research methodology, 19, 1–16.https://doi.org/10.1186/s12874- 019-0666-3
-
[53]
PS6/23:Modelriskmanagementprinciplesforbanks .BankofEngland,PrudentialRegulation Authority (PRA)
PRA.(2023a). PS6/23:Modelriskmanagementprinciplesforbanks .BankofEngland,PrudentialRegulation Authority (PRA). United Kingdom.https://www.bankofengland.co.uk/prudential-regulation/ publication/2023/may/model-risk-management-principles-for-banks
work page 2023
-
[54]
SS1/23:Modelriskmanagementprinciplesforbanks .BankofEngland,PrudentialRegulation Authority (PRA)
PRA.(2023b). SS1/23:Modelriskmanagementprinciplesforbanks .BankofEngland,PrudentialRegulation Authority (PRA). United Kingdom.https://www.bankofengland.co.uk/prudential-regulation/ publication/2023/may/model-risk-management-principles-for-banks-ss
work page 2023
-
[55]
Putter, H., Fiocco, M., & Geskus, R. B. (2007). Tutorial in biostatistics: Competing risks and multi-state models. Statistics in medicine,26(11), 2389–2430.https://doi.org/10.1002/sim.2712
-
[56]
PwC. (2014). IFRS 9: Expected credit losses(tech. rep.). PriceWaterhouseCoopers. London.https : / / www . pwc . com / gx / en / audit - services / ifrs / publications / ifrs - 9 / ifrs - in - depth - expected-credit-losses.pdf
work page 2014
-
[57]
(2005).Credit risk scorecards: Developing and implementing intelligent credit scoring(1st ed.)
Siddiqi, N. (2005).Credit risk scorecards: Developing and implementing intelligent credit scoring(1st ed.). John Wiley & Sons
work page 2005
-
[58]
B., Barreto-Souza, W., & Rocha, A
Simas, A. B., Barreto-Souza, W., & Rocha, A. V. (2010). Improved estimators for a general class of beta regression models.Computational Statistics & Data Analysis, 54(2), 348–366.https://doi.org/10. 1016/j.csda.2009.08.017
work page 2010
-
[59]
Skoglund,J.(2017).Creditriskterm-structuresforlifetimeimpairmentforecasting:Apracticalguide. Journal of Risk Management in Financial Institutions, 10(2), 177–195.https://www.econbiz.de/Record/ credit - risk - term - structures - for - lifetime - impairment - forecasting - a - practical - guide-skoglund-jimmy/10011670671 36 Modelling the term-structure of ...
-
[60]
Smith, L. D., & Lawrence, E. C. (1995). Forecasting losses on a liquidating long-term loan portfolio.Journal of Banking & Finance,19(6), 959–985.https://doi.org/10.1016/0378-4266(94)00065-B
-
[61]
Thomas, L. C. (2009).Consumer credit models: Pricing, profit and portfolios. Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199232130.001.1
-
[62]
Tong, E. N., Mues, C., & Thomas, L. (2013). A zero-adjusted gamma model for mortgage loan loss given default.International Journal of Forecasting, 29(4), 548–562.https://doi.org/10.1016/j. ijforecast.2013.03.003
work page doi:10.1016/j 2013
-
[63]
Van Kuelen, J. A., Spronk, J., & Corcoran, A. W. (1981). Note—on the Cyert-Davidson-Thompson doubtful accounts model.Management science,27(1), 108–112.https://doi.org/10.1287/mnsc.27.1.108
-
[64]
Yashkir, O., & Yashkir, Y. (2013). Loss given default modeling: A comparative analysis.The Journal of Risk Model Validation, 7, 25–59.https://doi.org/10.21314/jrmv.2013.101 37
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.