pith. sign in

arxiv: 2605.29952 · v1 · pith:VUNDZHOQnew · submitted 2026-05-28 · 💻 cs.LG

From Short Histories to Long Futures: Horizon-Aware Graph Neural Networks for Long Horizon Forecasting

Pith reviewed 2026-06-29 08:30 UTC · model grok-4.3

classification 💻 cs.LG
keywords graph neural networkslong horizon forecastinggeophysical emulationice sheet modelingmulti-horizon predictionstate increment learningclimate surrogatesautoregressive stability
0
0 comments X

The pith

A multi-horizon graph neural network trained jointly on multiple lead times and state increments produces more stable long-range forecasts for ice sheet dynamics than single-step autoregressive models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tries to establish that representing a geophysical domain as a graph and training one shared network to output state increments at several future horizons at once yields forecasts that hold accuracy over decades without the error buildup seen in conventional one-step predictors. A sympathetic reader would care because full physics simulations of glaciers and similar systems are too slow for routine long-term use, while existing neural emulators drift when rolled out over multi-decadal periods needed for climate and sea-level work. The approach uses a single current graph state, separate output branches per horizon, and a coarse-to-fine rollout that mixes large and small steps to limit drift and computation.

Core claim

The central claim is that a horizon-aware graph neural network with a shared backbone and per-horizon branches, trained to predict ice thickness and velocity increments at multiple lead times from one current state and rolled out with coarse-to-fine jumps, achieves higher long-range accuracy and improved stability on multi-decadal Pine Island Glacier simulations compared with both an initial-state baseline and a standard single-step autoregressive rollout.

What carries the argument

The multi-horizon graph neural network with shared backbone and separate output branches per lead time that predicts state increments rather than absolute states.

If this is right

  • Joint training across horizons limits cumulative error growth relative to autoregressive rollout.
  • Predicting increments rather than full states improves numerical stability over long sequences.
  • Coarse-to-fine inference reduces redundant computation while controlling drift.
  • The resulting emulator supplies more reliable inputs for downstream sea-level studies than prior neural baselines.
  • The same architecture outperforms both direct-from-initial-state prediction and standard one-step GNN rollouts on the tested multi-decadal runs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The architecture could transfer to other nonlinear systems such as atmospheric or ocean models without major redesign.
  • Pairing the emulator with observational data assimilation would test whether real-world noise reduces the observed stability gains.
  • Scaling the underlying graph to higher spatial resolution would reveal whether the joint-training benefit persists under increased computational load.
  • The method might shorten the reliable forecast horizon needed for ensemble climate projections by lowering per-run cost.

Load-bearing premise

That jointly optimizing across multiple lead times while predicting increments will reduce long-term drift without extra loss terms or physical constraints.

What would settle it

If the multi-horizon model accumulates equal or greater error in ice thickness or velocity than the single-step autoregressive baseline after fifty years on the same Pine Island Glacier test simulations, the stability claim is falsified.

Figures

Figures reproduced from arXiv: 2605.29952 by Maryam Rahnemoonfar, Zesheng Liu.

Figure 1
Figure 1. Figure 1: Network architecture of the graph neural network emulator used in this work. to further improve representational capacity, our work focuses on the proposed multi-horizon training and greedy rollout strategy; therefore, we adopt a simpler backbone consisting of five GCN layers. Following [18], we use two task-specific linear heads for velocity and thickness, and train the network to predict state increments… view at source ↗
Figure 2
Figure 2. Figure 2: Per-month pooled RMSE on Pine Island Glacier forecasts (months 61–240) for Vx, Vy, and ice thickness. Each curve corresponds to a different method: the initial-state baseline [16], the one-step autoregressive model [18], and our proposed multi-horizon training and greedy descending-horizon inference with different horizon sets. The figure illustrates how multi-horizon training substantially reduces both th… view at source ↗
read the original abstract

Accurate long-range prediction of geophysical systems is difficult due to strongly nonlinear dynamics, the high computational cost of full-physics simulations, and the error accumulation that arise when one-step autoregressive surrogates are rolled out over decades. Deep neural network can serve as efficient emulators, but most are trained only for next-step prediction and often drift or become unstable as the forecast horizon grows. We propose a multi-horizon graph neural network emulator that learns state-to-state transitions from a single current time to multiple future lead times within one unified model. The physical domain is represented as a graph, where nodes correspond to spatial locations with time-varying geophysical attributes and edges encode local spatial interactions. Given the current graph state, the model predicts the future evolution of key fields, ice thickness and ice velocities at all nodes, using a shared graph backbone with separate output branches for each target variable. To improve stability, the network predicts state increments relative to the current state, which are then added back to reconstruct future states. Training jointly optimizes all lead times with a unified regression objective, and inference uses a coarse-to-fine rollout that advances with larger jumps and selectively refines with shorter jumps to reduce drift and avoid redundant computation. Experiments on multi-decadal Pine Island Glacier simulations show that our approach achieves higher long-range accuracy and improved stability than both (i) an initial-state baseline that predicts each future time directly from the starting state and (ii) a standard single-step autoregressive rollout, producing a more reliable emulator for downstream climate and sea-level studies.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a multi-horizon graph neural network emulator for long-range forecasting of geophysical systems, applied to multi-decadal Pine Island Glacier ice-flow simulations. The domain is modeled as a graph with nodes holding time-varying attributes and edges encoding spatial interactions; a shared GNN backbone with per-variable output heads predicts state increments (ice thickness and velocities) at multiple lead times. Training jointly optimizes a regression loss across horizons, and inference uses a coarse-to-fine rollout. The central claim is that this yields higher long-range accuracy and stability than an initial-state baseline and a standard single-step autoregressive rollout.

Significance. If the experimental improvements hold under scrutiny, the work could supply more stable and computationally efficient surrogates for downstream climate and sea-level applications, directly addressing error accumulation that limits conventional autoregressive emulators.

major comments (2)
  1. [Abstract] Abstract: the claim that the approach 'achieves higher long-range accuracy and improved stability' supplies no quantitative metrics, error bars, dataset sizes, ablation results, or statistical tests; this is load-bearing because the entire contribution rests on the experimental comparison to the two baselines.
  2. [Abstract] Abstract (training and inference paragraphs): joint multi-horizon optimization plus increment prediction is presented as sufficient to 'reduce drift,' yet the text gives no indication of explicit stabilization mechanisms (consistency losses across horizons, physics residuals, or rollout regularization); given that nonlinear ice dynamics are known to amplify discrepancies over decades, this assumption requires explicit verification or counter-evidence.
minor comments (1)
  1. [Abstract] Abstract: 'Deep neural network can serve' should read 'Deep neural networks can serve'; 'the error accumulation that arise' should read 'the error accumulation that arises'.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment below and have made revisions to strengthen the presentation of our claims and methods.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the claim that the approach 'achieves higher long-range accuracy and improved stability' supplies no quantitative metrics, error bars, dataset sizes, ablation results, or statistical tests; this is load-bearing because the entire contribution rests on the experimental comparison to the two baselines.

    Authors: We agree that the abstract would be strengthened by including key quantitative support for the central claims. The full paper reports these details in the experiments section, including error metrics across horizons, the size of the Pine Island Glacier simulation dataset, and direct comparisons to the initial-state and autoregressive baselines. We have revised the abstract to incorporate specific accuracy and stability metrics from those comparisons. revision: yes

  2. Referee: [Abstract] Abstract (training and inference paragraphs): joint multi-horizon optimization plus increment prediction is presented as sufficient to 'reduce drift,' yet the text gives no indication of explicit stabilization mechanisms (consistency losses across horizons, physics residuals, or rollout regularization); given that nonlinear ice dynamics are known to amplify discrepancies over decades, this assumption requires explicit verification or counter-evidence.

    Authors: The design choices of increment prediction and joint multi-horizon training are presented as the core mechanisms for mitigating drift, with the coarse-to-fine rollout providing additional practical stabilization during inference. The multi-decadal experiments on nonlinear ice-flow dynamics serve as the empirical verification, demonstrating reduced error accumulation relative to the baselines. We have clarified the rationale for these choices in an expanded methods discussion and added a brief ablation on the contribution of joint training to long-horizon stability. revision: partial

Circularity Check

0 steps flagged

No significant circularity; claims rest on empirical validation against baselines

full rationale

The paper presents a multi-horizon GNN architecture with joint training across lead times and increment-based prediction as design choices, validated through direct experimental comparison to an initial-state baseline and single-step autoregressive rollout on Pine Island Glacier data. No equations, fitted parameters, or self-citations are described in the provided text that would reduce the reported accuracy or stability gains to the inputs by construction. The derivation chain consists of architectural decisions whose performance is assessed externally via held-out simulation trajectories, making the central claims self-contained against independent benchmarks rather than tautological.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that a graph with local spatial edges plus increment prediction and joint multi-horizon training will capture the necessary dynamics; no free parameters or invented entities are mentioned in the abstract.

axioms (1)
  • domain assumption The physical domain can be represented as a graph where nodes correspond to spatial locations with time-varying geophysical attributes and edges encode local spatial interactions.
    Explicitly stated as the representation used for the emulator.

pith-pipeline@v0.9.1-grok · 5811 in / 1280 out tokens · 30849 ms · 2026-06-29T08:30:21.642837+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. COGENT: Continuous Graph Emulators with Neural Ordinary Differential Equations for Long-Term Physical Forecasting

    cs.LG 2026-06 unverdicted novelty 4.0

    COGENT is a continuous graph emulator using Neural ODEs for stable long-term forecasting on irregular geospatial meshes, evaluated on ice-sheet simulations with improved stability over autoregressive baselines.

Reference graph

Works this paper leans on

25 extracted references · 22 canonical work pages · cited by 1 Pith paper · 2 internal anchors

  1. [1]

    J., Deems, S., Furlani, T

    Boerner, T.J., Deems, S., Furlani, T.R., Knuth, S.L., Towns, J.: Access: Advancing innovation: Nsf’s advanced cyberinfrastructure coordination ecosystem: Services & 14 Liu et al. support.In:PracticeandExperienceinAdvancedResearchComputing2023:Com- puting for the Common Good. p. 173–176. PEARC ’23, Association for Computing Machinery,NewYork,NY,USA(2023).h...

  2. [2]

    Computer Meth- ods in Applied Mechanics and Engineering417, 116351 (2023)

    Dalton, D., Husmeier, D., Gao, H.: Physics-informed graph neu- ral network emulation of soft-tissue mechanics. Computer Meth- ods in Applied Mechanics and Engineering417, 116351 (2023). https://doi.org/https://doi.org/10.1016/j.cma.2023.116351, https://www. sciencedirect.com/science/article/pii/S0045782523004759

  3. [3]

    In: Proceedings of the 30th In- ternational Conference on Neural Information Processing Systems

    Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. In: Proceedings of the 30th In- ternational Conference on Neural Information Processing Systems. p. 3844–3852. NIPS’16, Curran Associates Inc., Red Hook, NY, USA (2016)

  4. [4]

    EGU- sphere2025, 1–27 (2025)

    Fillola, E., Santos-Rodriguez, R., Tunnicliffe, R., Clark, J., Keshtmand, N., Gane- san, A., Rigby, M.: Enabling fast greenhouse gas emissions inference from satellites with gates: agraph-neural-network atmospherictransportemulationsystem. EGU- sphere2025, 1–27 (2025). https://doi.org/10.5194/egusphere-2025-2392, https: //egusphere.copernicus.org/preprint...

  5. [5]

    Forsberg, R., Sorensen, L., Simonsen, S.: Greenland and Antarctica Ice Sheet Mass Changes and Effects on Global Sea Level, pp. 91–106. Springer Inter- national Publishing, Cham (2017). https://doi.org/10.1007/978-3-319-56490-6_5, https://doi.org/10.1007/978-3-319-56490-6_5

  6. [6]

    Applied and Computational Harmonic Analysis30(2), 129–150 (2011)

    Hammond, D.K., Vandergheynst, P., Gribonval, R.: Wavelets on graphs via spec- tral graph theory. Applied and Computational Harmonic Analysis30(2), 129–150 (2011). https://doi.org/https://doi.org/10.1016/j.acha.2010.04.005, https://www. sciencedirect.com/science/article/pii/S1063520310000552

  7. [7]

    In: Practice and Experience in Advanced Research Computing 2021: Evolution Across All Di- mensions

    Hancock, D.Y., Fischer, J., Lowe, J.M., Snapp-Childs, W., Pierce, M., Marru, S., Coulter, J.E., Vaughn, M., Beck, B., Merchant, N., Skidmore, E., Jacobs, G.: Jetstream2: Accelerating cloud computing via jetstream. In: Practice and Experience in Advanced Research Computing 2021: Evolution Across All Di- mensions. PEARC ’21, Association for Computing Machin...

  8. [8]

    Nature Geoscience4(8), 519–523 (Aug 2011)

    Jacobs, S.S., Jenkins, A., Giulivi, C.F., Dutrieux, P.: Stronger ocean circulation and increased melting under pine island glacier ice shelf. Nature Geoscience4(8), 519–523 (Aug 2011). https://doi.org/10.1038/ngeo1188, https://doi.org/10.1038/ ngeo1188

  9. [9]

    Science Advances7(43), eabi5738 (2021)

    Joughin, I., Shapero, D., Dutrieux, P., Smith, B.: Ocean-induced melt vol- ume directly paces ice loss from pine island glacier. Science Advances7(43), eabi5738 (2021). https://doi.org/10.1126/sciadv.abi5738, https://www.science. org/doi/abs/10.1126/sciadv.abi5738

  10. [10]

    Journal of Glaciology69(273), 13–26 (2023)

    Jouvet, G.: Inversion of a stokes glacier flow model emulated by deep learning. Journal of Glaciology69(273), 13–26 (2023). https://doi.org/10.1017/jog.2022.41

  11. [11]

    Journal of Glaciology69(278), 1941–1955 (2023)

    Jouvet, G., Cordonnier, G.: Ice-flow model emulator based on physics- informed deep learning. Journal of Glaciology69(278), 1941–1955 (2023). https://doi.org/10.1017/jog.2023.73

  12. [12]

    Journal of Glaciology68(270), 651–664 (2022)

    Jouvet, G., Cordonnier, G., Kim, B., Lüthi, M., Vieli, A., Aschwanden, A.: Deep learning speeds up ice flow modelling by several orders of magnitude. Journal of Glaciology68(270), 651–664 (2022). https://doi.org/10.1017/jog.2021.120

  13. [13]

    Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization (2017), https: //arxiv.org/abs/1412.6980 Horizon-Aware Graph Neural Networks for Multi-Step Forecasting 15

  14. [14]

    In: International Conference on Learning Representations (ICLR) (2017)

    Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: International Conference on Learning Representations (ICLR) (2017)

  15. [15]

    The Cryosphere19(7), 2583– 2599 (2025)

    Koo, Y., Cheng, G., Morlighem, M., Rahnemoonfar, M.: Calibrating calving parameterizations using graph neural network emulators: applica- tion to helheim glacier, east greenland. The Cryosphere19(7), 2583– 2599 (2025). https://doi.org/10.5194/tc-19-2583-2025, https://tc.copernicus.org/ articles/19/2583/2025/

  16. [16]

    Journal of Glaciology71, e15 (2025)

    Koo, Y., Rahnemoonfar, M.: Graph convolutional network as a fast statistical emulator for numerical ice sheet modeling. Journal of Glaciology71, e15 (2025). https://doi.org/10.1017/jog.2024.93

  17. [17]

    Journal of Geophysical Research: Earth Surface117(F2) (2012)

    Larour, E., Schiermeier, J., Rignot, E., Seroussi, H., Morlighem, M., Paden, J.: Sensitivity analysis of pine island glacier ice flow using issm and dakota. Journal of Geophysical Research: Earth Surface117(F2) (2012). https://doi.org/https://doi.org/10.1029/2011JF002146, https://agupubs. onlinelibrary.wiley.com/doi/abs/10.1029/2011JF002146

  18. [18]

    Liu, Z., Koo, Y., Rahnemoonfar, M.: Kan-gcn: Combining kolmogorov-arnold net- work with graph convolution network for an accurate ice sheet emulator (2025), https://arxiv.org/abs/2510.24926

  19. [19]

    Loshchilov, I., Hutter, F.: Sgdr: Stochastic gradient descent with warm restarts (2017), https://arxiv.org/abs/1608.03983

  20. [20]

    Earth System Science Data12(2), 1367–1383 (2020)

    Mankoff, K.D., Solgaard, A., Colgan, W., Ahlstrøm, A.P., Khan, S.A., Fausto, R.S.: Greenland ice sheet solid ice discharge from 1986 through march 2020. Earth System Science Data12(2), 1367–1383 (2020). https://doi.org/10.5194/essd-12- 1367-2020, https://essd.copernicus.org/articles/12/1367/2020/

  21. [21]

    In: Van der Veen, C.J., Oerlemans, J

    Morland, L.W.: Unconfined ice-shelf flow. In: Van der Veen, C.J., Oerlemans, J. (eds.) Dynamics of the West Antarctic Ice Sheet. pp. 99–116. Springer Netherlands, Dordrecht (1987)

  22. [22]

    Future Internet15(12) (2023)

    Ngo, D.T., Aouedi, O., Piamrat, K., Hassan, T., Raipin-Parvédy, P.: Empow- ering digital twin for future networks with graph neural networks: Overview, enabling technologies, challenges, and opportunities. Future Internet15(12) (2023). https://doi.org/10.3390/fi15120377, https://www.mdpi.com/1999-5903/ 15/12/377

  23. [23]

    IET Microwaves, Antennas & Propagation18(7), 505–515 (2024)

    Noakoasteen, O., Christodoulou, C., Peng, Z., Goudos, S.K.: Physics-informed surrogates for electromagnetic dynamics using transformers and graph neu- ral networks. IET Microwaves, Antennas & Propagation18(7), 505–515 (2024). https://doi.org/https://doi.org/10.1049/mia2.12463, https://ietresearch. onlinelibrary.wiley.com/doi/abs/10.1049/mia2.12463

  24. [24]

    Proceedings of the National Academy of Sciences116(4), 1095– 1103 (2019)

    Rignot, E., Mouginot, J., Scheuchl, B., van den Broeke, M., van Wessem, M.J., Morlighem, M.: Four decades of antarctic ice sheet mass balance from 1979–2017. Proceedings of the National Academy of Sciences116(4), 1095– 1103 (2019). https://doi.org/10.1073/pnas.1812883116, https://www.pnas.org/ doi/abs/10.1073/pnas.1812883116

  25. [25]

    The Cryosphere8(5), 1699– 1710 (2014)

    Seroussi, H., Morlighem, M., Rignot, E., Mouginot, J., Larour, E., Schodlok, M., Khazendar, A.: Sensitivity of the dynamics of pine island glacier, west antarc- tica, to climate forcing for the next 50 years. The Cryosphere8(5), 1699– 1710 (2014). https://doi.org/10.5194/tc-8-1699-2014, https://tc.copernicus.org/ articles/8/1699/2014/