Climate-Driven Mortality Forecasting Using Deep Learning

Jens Robben; Karim Barigou; Kenrick So

arxiv: 2606.26980 · v1 · pith:KJJSD3JZnew · submitted 2026-06-25 · 📊 stat.AP

Climate-Driven Mortality Forecasting Using Deep Learning

Kenrick So , Karim Barigou , Jens Robben This is my paper

Pith reviewed 2026-06-26 02:17 UTC · model grok-4.3

classification 📊 stat.AP

keywords climate-driven mortalityexcess mortalityLee-Carter modeldeep learning forecastingCNN-LSTMGNN-LSTMquantile LSTMmortality risk management

0 comments

The pith

Deep learning layers on top of Lee-Carter cut climate-driven mortality forecast errors by 24 percent relative to prior networks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out a two-step approach that first applies a regional weekly Lee-Carter model to capture long-term trends and seasonal patterns, then feeds the residuals into either a CNN-LSTM or a GNN-LSTM network that ingests environmental and climate variables to predict excess mortality. On French regional data from 1990 to 2019 the two architectures each lower test mean squared error by roughly 24 percent compared with the earlier MortFCNet model, with the biggest gains appearing at the oldest ages where climate shocks produce the sharpest mortality spikes. The networks are further extended to a quantile LSTM that supplies time-varying prediction intervals. A reader would care because the resulting forecasts give a more realistic picture of tail risks that matter for health planning and for financial contracts tied to longevity.

Core claim

The central claim is that the CNN-LSTM and GNN-LSTM architectures, when added to a Lee-Carter baseline, successfully capture delayed and nonlinear associations between environmental extremes and excess mortality; both models outperform the Lee-Carter baseline and MortFCNet across all regions, each reducing test MSE by approximately 24 percent relative to MortFCNet, with particularly large gains at the oldest ages.

What carries the argument

The two-step framework that fits a regional weekly Lee-Carter model for baseline trends and seasonality, then applies CNN-LSTM or GNN-LSTM networks to the residual excess mortality driven by climate variables, extended via quantile LSTM to produce time-varying prediction intervals.

If this is right

Forecasts become more accurate for sudden mortality spikes triggered by climate extremes.
Time-varying prediction intervals supply a more realistic description of extreme climate-driven mortality risk.
Insurers and pension funds obtain a clearer basis for quantifying climate-related longevity exposure.
The spatial structure in the GNN-LSTM version accounts for how climate impacts can propagate across neighboring regions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same two-step structure could be applied to other countries that maintain weekly regional mortality and climate records to check whether the error reductions hold outside France.
Coupling the quantile outputs with downscaled climate projections would allow scenario-based longevity forecasts under different warming trajectories.
The graph-based spatial component suggests the model could be adapted to study cross-border transmission of climate mortality effects in densely connected regions.

Load-bearing premise

The mortality variation left after the Lee-Carter baseline is driven mainly by measurable climate and environmental factors rather than by other unmodeled influences or data issues.

What would settle it

Retraining the same architectures on post-2019 weekly data that includes documented heat waves and cold spells, then checking whether the 24 percent MSE reduction disappears or whether performance gains remain after climate variables are removed from the deep-learning inputs.

Figures

Figures reproduced from arXiv: 2606.26980 by Jens Robben, Karim Barigou, Kenrick So.

**Figure 2.** Figure 2: Test MSE by NUTS 2 region, sorted in ascending order of Lee–Carter baseline error. The [PITH_FULL_IMAGE:figures/full_fig_p019_2.png] view at source ↗

**Figure 3.** Figure 3: Weekly mortality rate forecasts in Alsace (FRF1), age group 90+, with training period [PITH_FULL_IMAGE:figures/full_fig_p020_3.png] view at source ↗

**Figure 4.** Figure 4: Weekly mortality rate forecasts for region FRK2, age group 90+, with training period [PITH_FULL_IMAGE:figures/full_fig_p020_4.png] view at source ↗

**Figure 5.** Figure 5: CNN–LSTM regional embedding clusters. (A) PCA projection of the trained regional embedding vectors. Each point represents one NUTS 2 region; colours denote k-means cluster assignment (k = 4); line segments connect each region to its cluster centroid. (B) Geographic map of cluster assignments across French NUTS 2 regions [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗

**Figure 6.** Figure 6: GNN–LSTM regional embedding clusters, using the same layout conventions as Figure [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗

**Figure 7.** Figure 7: Permutation feature importance of the top 15 climate predictors for the CNN–LSTM and [PITH_FULL_IMAGE:figures/full_fig_p023_7.png] view at source ↗

**Figure 8.** Figure 8: Temperature response curves for the CNN–LSTM and GNN–LSTM in FRF1 and FRK2, [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗

**Figure 9.** Figure 9: CNN–LSTM 90% prediction interval for the log mortality rate in region FRK2, age group [PITH_FULL_IMAGE:figures/full_fig_p026_9.png] view at source ↗

**Figure 10.** Figure 10: GNN–LSTM 90% prediction interval for the log mortality rate in region FRK2, age group [PITH_FULL_IMAGE:figures/full_fig_p027_10.png] view at source ↗

read the original abstract

Climate extremes have become important drivers of mortality, producing sudden spikes that traditional mortality models fail to predict. To address this gap, we propose a two-step modelling framework that combines a regional weekly Lee-Carter baseline model that captures long-term mortality trends and overall seasonal patterns, with two complementary deep learning architectures designed to model excess mortality driven by environmental conditions and climate shocks. The first, a CNN-LSTM, captures region-specific temporal responses through convolutional filters. The second, a GNN-LSTM, replaces convolutions with graph-based representations to model spatial mortality dependencies and the propagation of climate-related impacts across regions. Both architectures are further extended to a quantile LSTM framework that produces time-varying prediction intervals. We evaluate our models against both the Lee-Carter baseline and MortFCNet (Zheng et al., 2025). Using French regional data over 1990-2019, our models capture delayed and nonlinear associations between environmental extremes and excess mortality. Both proposed architectures outperform the Lee-Carter baseline and MortFCNet across all regions, each reducing test MSE by approximately 24% relative to the MortFCNet, with particularly large gains at the oldest ages where climate-driven mortality spikes are most severe. From a risk management perspective, the proposed framework provides a more realistic characterization of extreme climate-driven mortality risk, with time-varying prediction intervals that offer a more informed basis for the assessment of climate-related longevity exposure by insurers and pension funds.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows a practical 24% MSE gain on French regional mortality data by adding CNN-LSTM and GNN-LSTM residuals to a Lee-Carter baseline, but the abstract leaves the validation steps too thin to judge robustness.

read the letter

The core result is that both the CNN-LSTM and GNN-LSTM versions cut test MSE by roughly 24% versus MortFCNet while also beating the plain Lee-Carter baseline, with the biggest lift at the oldest ages. The two-step structure—weekly regional Lee-Carter for the trend and seasonal part, then DL on the residuals for climate shocks—is straightforward and the quantile extension gives time-varying intervals that insurers and pension funds can actually use.

What the work does cleanly is apply established sequence and graph architectures to a concrete actuarial problem and report out-of-sample numbers on 1990-2019 French data. The GNN version adds spatial propagation that the CNN version lacks, and the focus on delayed nonlinear climate effects matches the stated motivation. That combination is new enough for the specific task even if the building blocks are not.

The main weakness is the missing methodological detail. The abstract states the MSE reductions but says nothing about how the train-test split was performed, whether hyper-parameters were tuned on a separate validation set, or what checks were run for overfitting. Without those, the 24% figure is hard to interpret as genuine generalization rather than residual fitting. The premise that the leftover mortality after Lee-Carter is mainly climate-driven is also taken as given once the performance numbers look good; the paper does not appear to test that assumption directly.

This is for actuaries and public-health modelers who already use Lee-Carter and want a plug-in way to handle climate extremes. A reader in that group would find the setup and the quantile intervals useful even if they later rerun the experiments with stricter controls.

It is worth sending to peer review. The empirical claim is specific and the application matters, but the referee will need to see the full validation protocol before the gains can be trusted.

Referee Report

2 major / 1 minor

Summary. The paper proposes a two-step framework for forecasting climate-driven mortality: a regional weekly Lee-Carter model captures baseline trends and seasonality, while CNN-LSTM and GNN-LSTM architectures model excess mortality attributable to environmental and climate variables. The GNN-LSTM variant incorporates spatial dependencies across regions. Both are extended to quantile LSTM for time-varying prediction intervals. On French regional data 1990-2019, the models are claimed to reduce test MSE by ~24% relative to MortFCNet (and outperform Lee-Carter), with largest gains at oldest ages.

Significance. If the reported out-of-sample gains hold under rigorous validation, the work would demonstrate a practical way to incorporate nonlinear climate effects and spatial propagation into mortality models, improving tail-risk characterization for longevity products. The graph-based spatial component is a distinctive technical choice that could generalize to other multi-region forecasting problems.

major comments (2)

[Abstract] Abstract and (presumably) §3/§4: the central claim of an approximately 24% test-MSE reduction is presented without any description of the train-test split (temporal cutoff, proportion, or stratification by region/age), hyperparameter search procedure, or regularization/overfitting diagnostics. These omissions make it impossible to verify that the numerical improvement reflects genuine generalization rather than in-sample fitting.
[Abstract] Abstract: the attribution of performance gains specifically to the climate-variable inputs rests on the untested premise that residual mortality after the Lee-Carter baseline is driven primarily by the supplied environmental covariates; no ablation removing those covariates or comparison against non-climate residual models is mentioned.

minor comments (1)

The citation Zheng et al. (2025) for MortFCNet appears to be a future or in-preparation reference; its status should be clarified or replaced with a published alternative if possible.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We are grateful to the referee for the constructive comments, which identify key areas where additional transparency will strengthen the manuscript. We respond point by point to the major comments below.

read point-by-point responses

Referee: [Abstract] Abstract and (presumably) §3/§4: the central claim of an approximately 24% test-MSE reduction is presented without any description of the train-test split (temporal cutoff, proportion, or stratification by region/age), hyperparameter search procedure, or regularization/overfitting diagnostics. These omissions make it impossible to verify that the numerical improvement reflects genuine generalization rather than in-sample fitting.

Authors: We agree that the abstract presents the performance claim without these details and that the manuscript body does not supply a complete account of the temporal split, hyperparameter procedure, or overfitting diagnostics. We will revise both the abstract (with a concise reference to the temporal split) and Sections 3/4 to describe the train-test split (1990–2014 training with internal validation, 2015–2019 test, stratified by region and age), the grid-search hyperparameter protocol, regularization (dropout and early stopping), and supporting diagnostics such as training/validation loss curves. These additions will allow readers to assess generalization directly. revision: yes
Referee: [Abstract] Abstract: the attribution of performance gains specifically to the climate-variable inputs rests on the untested premise that residual mortality after the Lee-Carter baseline is driven primarily by the supplied environmental covariates; no ablation removing those covariates or comparison against non-climate residual models is mentioned.

Authors: We agree that an explicit ablation removing the climate covariates would provide stronger evidence that the reported gains are attributable to the environmental inputs rather than to the flexibility of the DL architectures alone. The current manuscript does not contain such an ablation. We will add this analysis in the revision by retraining the CNN-LSTM and GNN-LSTM models without the climate covariates and reporting the resulting change in test MSE relative to the full models. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical test-set results are independent of model construction

full rationale

The paper presents a two-step framework (Lee-Carter baseline plus DL residual models) evaluated via held-out test MSE on French regional data 1990-2019. The central claim of ~24% MSE reduction versus MortFCNet is a direct numerical comparison on unseen data after training; no equation reduces the reported performance metric to a fitted parameter by definition, no self-citation supplies a uniqueness theorem or ansatz that forces the result, and the architectures are standard CNN-LSTM/GNN-LSTM variants whose outputs are not tautological with their inputs. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The framework rests on the assumption that climate variables explain the residual mortality after Lee-Carter fitting and that the deep-learning models can learn these relationships without severe overfitting; both are learned from data rather than derived.

free parameters (2)

Lee-Carter parameters
Fitted to long-term trends and seasonal patterns on the 1990-2019 French data.
CNN-LSTM and GNN-LSTM weights and hyperparameters
Numerous parameters trained to minimize excess-mortality prediction error.

axioms (1)

domain assumption Environmental extremes produce measurable excess mortality not captured by the Lee-Carter baseline
Invoked when the authors attribute performance gains to the DL components that receive climate inputs.

pith-pipeline@v0.9.1-grok · 5782 in / 1364 out tokens · 30491 ms · 2026-06-26T02:17:20.985620+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

45 extracted references · 40 canonical work pages · 4 internal anchors

[1]

Layer Normalization

Ba, J. L., Kiros, J. R., and Hinton, G. E. (2016). Layer normalization. Working paper. doi:10.48550/arXiv.1607.06450

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1607.06450 2016
[2]

and Perron, P

Bai, J. and Perron, P. (2003). Computation and analysis of multiple structural change models. Journal of Applied Econometrics, 18(1):1–22. doi:10.1002/jae.659

work page doi:10.1002/jae.659 2003
[3]

Barigou, K., Patten, M., and Zhou, K. Q. (2025). Mortality modeling and forecasting with the actuaries climate index.arXiv preprint arXiv:2510.16266

arXiv 2025
[4]

Barnett, A., Tong, S., and Clements, A. (2010). What measure of temperature is the best predictor of mortality?Environmental Research, 110(6):604–611. doi:10.1016/j.envres.2010.05.006

work page doi:10.1016/j.envres.2010.05.006 2010
[5]

Braga, A. L. F., Zanobetti, A., and Schwartz, J. (2002). The effect of weather on respiratory and cardiovascular deaths in 12 U.S. cities.Environmental Health Perspectives, 110(9):859–863. doi:10.1289/ehp.02110859

work page doi:10.1289/ehp.02110859 2002
[6]

Breiman, L. (2001). Random forests.Machine Learning, 45(1):5–32. doi: 10.1023/a:1010933404324. Bégin, J.-F., Boudreault, M., and Landry, T. (2025). Modelling seasonal mortality: An age–period–cohort approach.Insurance: Mathematics and Economics, 125:103162. doi:10.1016/j.insmatheco.2025.103162. Bégin, J.-F., Boudreault, M., and Landry, T. (2026). Modellin...

work page doi:10.1023/a:1010933404324 2001
[7]

K., and Zanin, L

Calabrese, R., Dombrowski, T., Mandel, A., Pace, R. K., and Zanin, L. (2024). Impacts of extreme weather events on mortgage risks and their evolution under climate change: A case study on Florida.European Journal of Operational Research, 314(1):377–392. doi:10.1016/j.ejor.2023.11.022

work page doi:10.1016/j.ejor.2023.11.022 2024
[8]

and Khaliq, A

Chen, Y. and Khaliq, A. Q. M. (2023). Mortality rates forecasting with data driven LSTM, Bi-LSTM and GRU: The United States case study.Actuarial Research Clearing House, 28 2023(1).https://www.soa.org/globalassets/assets/files/static-pages/research/arch/ 2023/arch-2023-1-mortality-rates-forecasting.pdf

2023
[9]

Chen, Y.-H., Mukherjee, B., and Berrocal, V. J. (2019). Distributed lag interaction models with two pollutants.Journal of the Royal Statistical Society Series C: Applied Statistics, 68(1):79–97. doi:10.1111/rssc.12297. Copernicus Climate Change Service (2025). E-OBS daily gridded meteorological data for Europe from 1950 to present derived from in-situ obs...

work page doi:10.1111/rssc.12297 2019
[10]

Enchev, V., Kleinow, T., and Cairns, A. J. G. (2017). Multi-population mortality mod- els: fitting, forecasting and comparisons.Scandinavian Actuarial Journal, 2017(4):319–342. doi:10.1080/03461238.2015.1133450

work page doi:10.1080/03461238.2015.1133450 2017
[11]

Armstrong, B. (2015). Mortality risk attributable to high and low ambient temperature: a multi- country observational study.The Lancet, 386(9991):369–375. doi:10.1016/s0140-6736(14)62114-0

work page doi:10.1016/s0140-6736(14)62114-0 2015
[12]

Giglio, S., Kelly, B., and Stroebel, J. (2021). Climate finance.Annual Review of Financial Eco- nomics, 13(Volume 13, 2021):15–36. doi:10.1146/annurev-financial-102620-103311

work page doi:10.1146/annurev-financial-102620-103311 2021
[13]

Goes, J., Barigou, K., and Leucht, A. (2025). Bayesian mortality modelling with pandemics: a vanishing jump approach.Journal of the Royal Statistical Society Series C: Applied Statistics, 74(4):1150–1182

2025
[14]

Guibert, Q., Pincemin, G., and Planchet, F. (2025). Impact of climate change on mortality: An extrapolation of temperature effects based on time series data in France.International Journal of Forecasting. doi:10.1016/j.ijforecast.2025.07.004

work page doi:10.1016/j.ijforecast.2025.07.004 2025
[15]

Hainaut, D. (2018). A neural-network analyzer for mortality forecast.ASTIN Bulletin, 48(02):481–508. doi:10.1017/asb.2017.45

work page doi:10.1017/asb.2017.45 2018
[16]

Hainaut, D. (2026). Explaining regional mortality differences with an economic-neural model: Ev- idence from European NUTS-2 regions.LIDAM Discussion Paper. Working paper.https: //drive.google.com/file/d/1hJku2vojXti0AYkkkLrsQ2xnqXXjlGar/view

2026
[17]

Jo, H., Kim, J., Huang, T.-C., and Ni, Y.-L. (2022). condLSTM-Q: A novel deep learning model for predicting COVID-19 mortality in fine geographical scale.Quantitative Biology, 10(2):125–138. doi:10.15302/J-QB-021-0276

work page doi:10.15302/j-qb-021-0276 2022
[18]

Kleinow, T. (2015). A common age effect model for the mortality of multiple populations.Insurance: Mathematics and Economics, 63:147–152. doi:10.1016/j.insmatheco.2015.03.023

work page doi:10.1016/j.insmatheco.2015.03.023 2015
[19]

In: IEEE International Conference on Communications (ICC)

Li, H. and Tang, Q. (2022). Joint extremes in temperature and mortality: A bivariate POT approach.North American Actuarial Journal, 26(1):43–63. doi:10.1080/10920277.2020.1823236. 29

work page doi:10.1080/10920277.2020.1823236 2022
[20]

and Lee, R

Li, N. and Lee, R. (2005). Coherent mortality forecasts for a group of populations: An extension of the Lee-Carter method.Demography, 42(3):575–594. doi:10.1353/dem.2005.0021

work page doi:10.1353/dem.2005.0021 2005
[21]

Li, R., Zhou, R., and Pitt, D. (2026a). Beyond annual data: Mortality forecast- ing with mixed frequency data.Insurance: Mathematics and Economics, 126:103172. doi:10.1016/j.insmatheco.2025.103172

work page doi:10.1016/j.insmatheco.2025.103172 2025
[22]

Li, R., Zhou, R., and Pitt, D. (2026b). Dynamic mortality forecasting via mixed-frequency state- space models. Working paper. doi:10.48550/arXiv.2601.05702

work page doi:10.48550/arxiv.2601.05702
[23]

and Palmborg, L

Lindholm, M. and Palmborg, L. (2022). Efficient use of data for LSTM mortality forecasting. European Actuarial Journal, 12(2):749–778. doi:10.1007/s13385-022-00307-3

work page doi:10.1007/s13385-022-00307-3 2022
[24]

L., Hannun, A

Maas, A. L., Hannun, A. Y., and Ng, A. Y. (2013). Rectifier nonlinearities improve neural net- work acoustic models. InProceedings of the 30th International Conference on Machine Learning (ICML), volume 28 ofJMLR Workshop and Conference Proceedings, Atlanta, Georgia, USA. https://ai.stanford.edu/~amaas/papers/relu_hybrid_icml2013_final.pdf

2013
[25]

Marino, M., Levantesi, S., and Nigri, A. (2022). A neural approach to improve the Lee-Carter mortality density forecasts.North American Actuarial Journal, 27(1):148–165. doi:10.1080/10920277.2022.2050260

work page doi:10.1080/10920277.2022.2050260 2022
[26]

Miao, Z., Li, H., and Chen, Y. (2026). Gradient boosted multi-population mortality modelling with high-frequency data. Working paper. doi:10.48550/arXiv.2507.09983

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2507.09983 2026
[27]

Min, J., Li, H., Nagler, T., and Li, S. (2025). Assessing climate-driven mortality risk: A stochastic approach with distributed lag non-linear models. Working paper. doi:10.48550/arXiv.2506.00561

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2506.00561 2025
[28]

Perla, F., Richman, R., Scognamiglio, S., and Wüthrich, M. V. (2024). Accurate and explainable mortality forecasting with the LocalGLMnet.Scandinavian Actuarial Journal, 2024(7):739–761. doi:10.1080/03461238.2024.2307620

work page doi:10.1080/03461238.2024.2307620 2024
[29]

Perla, F., Richman, R., Scognamiglio, S., and Wüthrich, M. V. (2021). Time-series forecast- ing of mortality rates using deep learning.Scandinavian Actuarial Journal, 2021(7):572–598. doi:10.1080/03461238.2020.1867232

work page doi:10.1080/03461238.2020.1867232 2021
[30]

and Wüthrich, M

Richman, R. and Wüthrich, M. V. (2019). A neural network extension of the Lee- Carter model to multiple populations.Annals of Actuarial Science, 15(2):346–366. doi:10.1017/s1748499519000071

work page doi:10.1017/s1748499519000071 2019
[31]

Robben, J., Antonio, K., and Kleinow, T. (2026a). The short-term association between environ- mental variables and mortality: evidence from europe.Journal of the Royal Statistical Society Series A: Statistics in Society, 189(2):1131–1153. doi:10.1093/jrsssa/qnaf052

work page doi:10.1093/jrsssa/qnaf052
[32]

A penalized distributed lag non-linear Lee-Carter framework for regional weekly mortality forecasting

Robben, J. and Barigou, K. (2025). A penalized distributed lag non-linear Lee-Carter framework for regional weekly mortality forecasting. Working paper. doi:10.48550/arXiv.2509.24087

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2509.24087 2025
[33]

Robben, J., Barigou, K., and Kleinow, T. (2026b). Granular mortality modeling with temperature and epidemic shocks: A three-state regime-switching approach.Insurance: Mathematics and Economics, 128:103250. doi:10.1016/j.insmatheco.2026.103250

work page doi:10.1016/j.insmatheco.2026.103250 2026
[34]

S., Langseth, H., and Ramampiaro, H

Salem, T. S., Langseth, H., and Ramampiaro, H. (2020). Prediction intervals: Split normal mixture from quality-driven deep ensembles. InConference on Uncertainty in Artificial Intelligence, pages 1179–1187. PMLR. 30 Schnürch, S. and Korn, R. (2022). Point and interval forecasts of death rates using neural networks. ASTIN Bulletin, 52(1):333–360. doi:10.10...

work page doi:10.1017/asb.2021.34 2020
[35]

and Schumacher, B

Shala, I. and Schumacher, B. (2024). The impact of natural disasters on banks’ impairment flow – evidence from Germany.Journal of Climate Finance, 6:100031. doi:10.1016/j.jclimf.2024.100031

work page doi:10.1016/j.jclimf.2024.100031 2024
[36]

Shen, Y., Yang, X., Liu, H., and Li, Z. (2024). Advancing mortality rate prediction in European population clusters: integrating deep learning and multiscale analysis.Scientific Reports, 14(1). doi:10.1038/s41598-024-56390-x

work page doi:10.1038/s41598-024-56390-x 2024
[37]

R., Cruz, S

So, K. R., Cruz, S. C., Marcella, E. A., Briones, J., and Garces, L. P. D. (2025). Uncertainty in pricing and risk measurement of survivor contracts.Risks, 13(2). doi:10.3390/risks13020035

work page doi:10.3390/risks13020035 2025
[38]

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting.Journal of Machine Learning Research, 15(56):1929–1958.http://jmlr.org/papers/v15/srivastava14a.html. Van Berkum, F., Antonio, K., and Vellekoop, M. (2016). The impact of multiple struc- tural changes o...

work page doi:10.1080/03461238.2014.987807 2014
[39]

Wang, C.-W., Zhang, J., and Zhu, W. (2021). Neighbouring prediction for mortality.ASTIN Bulletin, 51(3):689–718. doi:10.1017/asb.2021.13

work page doi:10.1017/asb.2021.13 2021
[40]

Wang, J., Wen, L., Xiao, L., and Wang, C. (2023). Time-series forecasting of mortality rates using transformer.Scandinavian Actuarial Journal, 2024(2):109–123. doi:10.1080/03461238.2023.2218859

work page doi:10.1080/03461238.2023.2218859 2023
[41]

Wang, Y., Gan, D., Sun, M., Zhang, N., Lu, Z., and Kang, C. (2019). Probabilistic individual load forecasting using pinball loss guided LSTM.Applied Energy, 235:10–20. doi:10.1016/j.apenergy.2018.10.078

work page doi:10.1016/j.apenergy.2018.10.078 2019
[42]

Wuthrich, M. V. and Merz, M. (2023).Statistical Foundations of Actuarial Learning and its Appli- cations. Springer International Publishing. doi:10.1007/978-3-031-12409-9

work page doi:10.1007/978-3-031-12409-9 2023
[43]

Zhang, N., Chen, H., and LIU, J. (2022). Mortality forecasting using LSTM-CNN model.SSRN Electronic Journal. Working paper. doi:10.2139/ssrn.4261735

work page doi:10.2139/ssrn.4261735 2022
[44]

Zheng, H., Wang, H., Zhu, R., and Xue, J.-H. (2025). Fine-grained mortality forecasting with deep learning.Annals of Actuarial Science, page 1–27. doi:10.1017/s1748499525100171

work page doi:10.1017/s1748499525100171 2025
[45]

Abriefreviewofdeeplearningmethodsinmor- tality forecasting.Annals of Actuarial Science, 20(1):150–165

Zheng, H., Wang, H., Zhu, R., andXue, J.-H.(2026). Abriefreviewofdeeplearningmethodsinmor- tality forecasting.Annals of Actuarial Science, 20(1):150–165. doi:10.1017/S1748499525100110. 31

work page doi:10.1017/s1748499525100110 2026

[1] [1]

Layer Normalization

Ba, J. L., Kiros, J. R., and Hinton, G. E. (2016). Layer normalization. Working paper. doi:10.48550/arXiv.1607.06450

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1607.06450 2016

[2] [2]

and Perron, P

Bai, J. and Perron, P. (2003). Computation and analysis of multiple structural change models. Journal of Applied Econometrics, 18(1):1–22. doi:10.1002/jae.659

work page doi:10.1002/jae.659 2003

[3] [3]

Barigou, K., Patten, M., and Zhou, K. Q. (2025). Mortality modeling and forecasting with the actuaries climate index.arXiv preprint arXiv:2510.16266

arXiv 2025

[4] [4]

Barnett, A., Tong, S., and Clements, A. (2010). What measure of temperature is the best predictor of mortality?Environmental Research, 110(6):604–611. doi:10.1016/j.envres.2010.05.006

work page doi:10.1016/j.envres.2010.05.006 2010

[5] [5]

Braga, A. L. F., Zanobetti, A., and Schwartz, J. (2002). The effect of weather on respiratory and cardiovascular deaths in 12 U.S. cities.Environmental Health Perspectives, 110(9):859–863. doi:10.1289/ehp.02110859

work page doi:10.1289/ehp.02110859 2002

[6] [6]

Breiman, L. (2001). Random forests.Machine Learning, 45(1):5–32. doi: 10.1023/a:1010933404324. Bégin, J.-F., Boudreault, M., and Landry, T. (2025). Modelling seasonal mortality: An age–period–cohort approach.Insurance: Mathematics and Economics, 125:103162. doi:10.1016/j.insmatheco.2025.103162. Bégin, J.-F., Boudreault, M., and Landry, T. (2026). Modellin...

work page doi:10.1023/a:1010933404324 2001

[7] [7]

K., and Zanin, L

Calabrese, R., Dombrowski, T., Mandel, A., Pace, R. K., and Zanin, L. (2024). Impacts of extreme weather events on mortgage risks and their evolution under climate change: A case study on Florida.European Journal of Operational Research, 314(1):377–392. doi:10.1016/j.ejor.2023.11.022

work page doi:10.1016/j.ejor.2023.11.022 2024

[8] [8]

and Khaliq, A

Chen, Y. and Khaliq, A. Q. M. (2023). Mortality rates forecasting with data driven LSTM, Bi-LSTM and GRU: The United States case study.Actuarial Research Clearing House, 28 2023(1).https://www.soa.org/globalassets/assets/files/static-pages/research/arch/ 2023/arch-2023-1-mortality-rates-forecasting.pdf

2023

[9] [9]

Chen, Y.-H., Mukherjee, B., and Berrocal, V. J. (2019). Distributed lag interaction models with two pollutants.Journal of the Royal Statistical Society Series C: Applied Statistics, 68(1):79–97. doi:10.1111/rssc.12297. Copernicus Climate Change Service (2025). E-OBS daily gridded meteorological data for Europe from 1950 to present derived from in-situ obs...

work page doi:10.1111/rssc.12297 2019

[10] [10]

Enchev, V., Kleinow, T., and Cairns, A. J. G. (2017). Multi-population mortality mod- els: fitting, forecasting and comparisons.Scandinavian Actuarial Journal, 2017(4):319–342. doi:10.1080/03461238.2015.1133450

work page doi:10.1080/03461238.2015.1133450 2017

[11] [11]

Armstrong, B. (2015). Mortality risk attributable to high and low ambient temperature: a multi- country observational study.The Lancet, 386(9991):369–375. doi:10.1016/s0140-6736(14)62114-0

work page doi:10.1016/s0140-6736(14)62114-0 2015

[12] [12]

Giglio, S., Kelly, B., and Stroebel, J. (2021). Climate finance.Annual Review of Financial Eco- nomics, 13(Volume 13, 2021):15–36. doi:10.1146/annurev-financial-102620-103311

work page doi:10.1146/annurev-financial-102620-103311 2021

[13] [13]

Goes, J., Barigou, K., and Leucht, A. (2025). Bayesian mortality modelling with pandemics: a vanishing jump approach.Journal of the Royal Statistical Society Series C: Applied Statistics, 74(4):1150–1182

2025

[14] [14]

Guibert, Q., Pincemin, G., and Planchet, F. (2025). Impact of climate change on mortality: An extrapolation of temperature effects based on time series data in France.International Journal of Forecasting. doi:10.1016/j.ijforecast.2025.07.004

work page doi:10.1016/j.ijforecast.2025.07.004 2025

[15] [15]

Hainaut, D. (2018). A neural-network analyzer for mortality forecast.ASTIN Bulletin, 48(02):481–508. doi:10.1017/asb.2017.45

work page doi:10.1017/asb.2017.45 2018

[16] [16]

Hainaut, D. (2026). Explaining regional mortality differences with an economic-neural model: Ev- idence from European NUTS-2 regions.LIDAM Discussion Paper. Working paper.https: //drive.google.com/file/d/1hJku2vojXti0AYkkkLrsQ2xnqXXjlGar/view

2026

[17] [17]

Jo, H., Kim, J., Huang, T.-C., and Ni, Y.-L. (2022). condLSTM-Q: A novel deep learning model for predicting COVID-19 mortality in fine geographical scale.Quantitative Biology, 10(2):125–138. doi:10.15302/J-QB-021-0276

work page doi:10.15302/j-qb-021-0276 2022

[18] [18]

Kleinow, T. (2015). A common age effect model for the mortality of multiple populations.Insurance: Mathematics and Economics, 63:147–152. doi:10.1016/j.insmatheco.2015.03.023

work page doi:10.1016/j.insmatheco.2015.03.023 2015

[19] [19]

In: IEEE International Conference on Communications (ICC)

Li, H. and Tang, Q. (2022). Joint extremes in temperature and mortality: A bivariate POT approach.North American Actuarial Journal, 26(1):43–63. doi:10.1080/10920277.2020.1823236. 29

work page doi:10.1080/10920277.2020.1823236 2022

[20] [20]

and Lee, R

Li, N. and Lee, R. (2005). Coherent mortality forecasts for a group of populations: An extension of the Lee-Carter method.Demography, 42(3):575–594. doi:10.1353/dem.2005.0021

work page doi:10.1353/dem.2005.0021 2005

[21] [21]

Li, R., Zhou, R., and Pitt, D. (2026a). Beyond annual data: Mortality forecast- ing with mixed frequency data.Insurance: Mathematics and Economics, 126:103172. doi:10.1016/j.insmatheco.2025.103172

work page doi:10.1016/j.insmatheco.2025.103172 2025

[22] [22]

Li, R., Zhou, R., and Pitt, D. (2026b). Dynamic mortality forecasting via mixed-frequency state- space models. Working paper. doi:10.48550/arXiv.2601.05702

work page doi:10.48550/arxiv.2601.05702

[23] [23]

and Palmborg, L

Lindholm, M. and Palmborg, L. (2022). Efficient use of data for LSTM mortality forecasting. European Actuarial Journal, 12(2):749–778. doi:10.1007/s13385-022-00307-3

work page doi:10.1007/s13385-022-00307-3 2022

[24] [24]

L., Hannun, A

Maas, A. L., Hannun, A. Y., and Ng, A. Y. (2013). Rectifier nonlinearities improve neural net- work acoustic models. InProceedings of the 30th International Conference on Machine Learning (ICML), volume 28 ofJMLR Workshop and Conference Proceedings, Atlanta, Georgia, USA. https://ai.stanford.edu/~amaas/papers/relu_hybrid_icml2013_final.pdf

2013

[25] [25]

Marino, M., Levantesi, S., and Nigri, A. (2022). A neural approach to improve the Lee-Carter mortality density forecasts.North American Actuarial Journal, 27(1):148–165. doi:10.1080/10920277.2022.2050260

work page doi:10.1080/10920277.2022.2050260 2022

[26] [26]

Miao, Z., Li, H., and Chen, Y. (2026). Gradient boosted multi-population mortality modelling with high-frequency data. Working paper. doi:10.48550/arXiv.2507.09983

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2507.09983 2026

[27] [27]

Min, J., Li, H., Nagler, T., and Li, S. (2025). Assessing climate-driven mortality risk: A stochastic approach with distributed lag non-linear models. Working paper. doi:10.48550/arXiv.2506.00561

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2506.00561 2025

[28] [28]

Perla, F., Richman, R., Scognamiglio, S., and Wüthrich, M. V. (2024). Accurate and explainable mortality forecasting with the LocalGLMnet.Scandinavian Actuarial Journal, 2024(7):739–761. doi:10.1080/03461238.2024.2307620

work page doi:10.1080/03461238.2024.2307620 2024

[29] [29]

Perla, F., Richman, R., Scognamiglio, S., and Wüthrich, M. V. (2021). Time-series forecast- ing of mortality rates using deep learning.Scandinavian Actuarial Journal, 2021(7):572–598. doi:10.1080/03461238.2020.1867232

work page doi:10.1080/03461238.2020.1867232 2021

[30] [30]

and Wüthrich, M

Richman, R. and Wüthrich, M. V. (2019). A neural network extension of the Lee- Carter model to multiple populations.Annals of Actuarial Science, 15(2):346–366. doi:10.1017/s1748499519000071

work page doi:10.1017/s1748499519000071 2019

[31] [31]

Robben, J., Antonio, K., and Kleinow, T. (2026a). The short-term association between environ- mental variables and mortality: evidence from europe.Journal of the Royal Statistical Society Series A: Statistics in Society, 189(2):1131–1153. doi:10.1093/jrsssa/qnaf052

work page doi:10.1093/jrsssa/qnaf052

[32] [32]

A penalized distributed lag non-linear Lee-Carter framework for regional weekly mortality forecasting

Robben, J. and Barigou, K. (2025). A penalized distributed lag non-linear Lee-Carter framework for regional weekly mortality forecasting. Working paper. doi:10.48550/arXiv.2509.24087

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2509.24087 2025

[33] [33]

Robben, J., Barigou, K., and Kleinow, T. (2026b). Granular mortality modeling with temperature and epidemic shocks: A three-state regime-switching approach.Insurance: Mathematics and Economics, 128:103250. doi:10.1016/j.insmatheco.2026.103250

work page doi:10.1016/j.insmatheco.2026.103250 2026

[34] [34]

S., Langseth, H., and Ramampiaro, H

Salem, T. S., Langseth, H., and Ramampiaro, H. (2020). Prediction intervals: Split normal mixture from quality-driven deep ensembles. InConference on Uncertainty in Artificial Intelligence, pages 1179–1187. PMLR. 30 Schnürch, S. and Korn, R. (2022). Point and interval forecasts of death rates using neural networks. ASTIN Bulletin, 52(1):333–360. doi:10.10...

work page doi:10.1017/asb.2021.34 2020

[35] [35]

and Schumacher, B

Shala, I. and Schumacher, B. (2024). The impact of natural disasters on banks’ impairment flow – evidence from Germany.Journal of Climate Finance, 6:100031. doi:10.1016/j.jclimf.2024.100031

work page doi:10.1016/j.jclimf.2024.100031 2024

[36] [36]

Shen, Y., Yang, X., Liu, H., and Li, Z. (2024). Advancing mortality rate prediction in European population clusters: integrating deep learning and multiscale analysis.Scientific Reports, 14(1). doi:10.1038/s41598-024-56390-x

work page doi:10.1038/s41598-024-56390-x 2024

[37] [37]

R., Cruz, S

So, K. R., Cruz, S. C., Marcella, E. A., Briones, J., and Garces, L. P. D. (2025). Uncertainty in pricing and risk measurement of survivor contracts.Risks, 13(2). doi:10.3390/risks13020035

work page doi:10.3390/risks13020035 2025

[38] [38]

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting.Journal of Machine Learning Research, 15(56):1929–1958.http://jmlr.org/papers/v15/srivastava14a.html. Van Berkum, F., Antonio, K., and Vellekoop, M. (2016). The impact of multiple struc- tural changes o...

work page doi:10.1080/03461238.2014.987807 2014

[39] [39]

Wang, C.-W., Zhang, J., and Zhu, W. (2021). Neighbouring prediction for mortality.ASTIN Bulletin, 51(3):689–718. doi:10.1017/asb.2021.13

work page doi:10.1017/asb.2021.13 2021

[40] [40]

Wang, J., Wen, L., Xiao, L., and Wang, C. (2023). Time-series forecasting of mortality rates using transformer.Scandinavian Actuarial Journal, 2024(2):109–123. doi:10.1080/03461238.2023.2218859

work page doi:10.1080/03461238.2023.2218859 2023

[41] [41]

Wang, Y., Gan, D., Sun, M., Zhang, N., Lu, Z., and Kang, C. (2019). Probabilistic individual load forecasting using pinball loss guided LSTM.Applied Energy, 235:10–20. doi:10.1016/j.apenergy.2018.10.078

work page doi:10.1016/j.apenergy.2018.10.078 2019

[42] [42]

Wuthrich, M. V. and Merz, M. (2023).Statistical Foundations of Actuarial Learning and its Appli- cations. Springer International Publishing. doi:10.1007/978-3-031-12409-9

work page doi:10.1007/978-3-031-12409-9 2023

[43] [43]

Zhang, N., Chen, H., and LIU, J. (2022). Mortality forecasting using LSTM-CNN model.SSRN Electronic Journal. Working paper. doi:10.2139/ssrn.4261735

work page doi:10.2139/ssrn.4261735 2022

[44] [44]

Zheng, H., Wang, H., Zhu, R., and Xue, J.-H. (2025). Fine-grained mortality forecasting with deep learning.Annals of Actuarial Science, page 1–27. doi:10.1017/s1748499525100171

work page doi:10.1017/s1748499525100171 2025

[45] [45]

Abriefreviewofdeeplearningmethodsinmor- tality forecasting.Annals of Actuarial Science, 20(1):150–165

Zheng, H., Wang, H., Zhu, R., andXue, J.-H.(2026). Abriefreviewofdeeplearningmethodsinmor- tality forecasting.Annals of Actuarial Science, 20(1):150–165. doi:10.1017/S1748499525100110. 31

work page doi:10.1017/s1748499525100110 2026