Mixed-Precision in adaptive Runge-Kutta method for large ODE systems

Ars\`ene Marzorati (DRACULA; BEAGLE; BIOTIC); CITI; ICJ; ICJ); INSA Lyon); Jonathan Rouzaud-Cornabas (BIOTIC; MMCS; Mouhamad Al-Sayed

arxiv: 2605.23727 · v1 · pith:EWN74JFKnew · submitted 2026-05-22 · 🧮 math.NA · cs.NA

Mixed-Precision in adaptive Runge-Kutta method for large ODE systems

Mouhamad Al-Sayed , Samuel Bernard (MUSICS , ICJ) , Ars\`ene Marzorati (DRACULA , BEAGLE , MMCS , MUSICS , ICJ

show 4 more authors

BIOTIC) Jonathan Rouzaud-Cornabas (BIOTIC CITI INSA Lyon)

This is my paper

Pith reviewed 2026-05-25 03:13 UTC · model grok-4.3

classification 🧮 math.NA cs.NA

keywords mixed precisionRunge-KuttaODE systemsadaptive methodsnumerical accuracycomputational efficiencyBogacki-Shampine

0 comments

The pith

Mixed-precision Runge-Kutta solvers preserve most high-precision accuracy in large ODE systems

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests mixed-precision adaptations of the Bogacki-Shampine 3(2) Runge-Kutta method on three large ODE benchmark systems to determine if they can retain the accuracy of full high-precision calculations while benefiting from faster low-precision arithmetic. It finds that accuracy is largely preserved across tolerances and actually improves toward high-precision levels as system size increases. The number of function evaluations stays comparable, allowing the speed advantage of low precision to translate into overall gains without extra steps. This approach is presented as adaptable to any finite-precision format and numerical scheme for handling computationally costly large systems with heterogeneous interactions.

Core claim

Mixed-precision versions of the Bogacki-Shampine 3(2) Runge-Kutta pair preserve most of the accuracy of high-precision solvers under a wide range of tolerances. Accuracy improves with system size to match high-precision performance seen in smaller systems. The arithmetic change does not alter evaluation counts enough to offset the speed benefit of low-precision operations, enabling significant speed-up with little accuracy loss in large coupled ODE systems.

What carries the argument

Mixed-precision Bogacki-Shampine 3(2) Runge-Kutta pair applied to large ODE systems, using low precision for speed and high precision selectively for accuracy control.

If this is right

Mixed-precision methods can deliver significant computational speed-up for large coupled ODE systems.
Accuracy remains close to high-precision levels over wide solver tolerances.
Accuracy performance improves as the ODE system size grows.
The number of function evaluations is not substantially affected, maintaining the speed benefit.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This technique could be tested on other families of ODE solvers such as multistep methods.
Hardware architectures optimized for low precision might see even larger gains if error control is also adapted.
The size-dependent accuracy improvement suggests benefits specifically for very large-scale simulations not fully explored in the benchmarks.

Load-bearing premise

The three benchmark systems of coupled oscillators, Kuramoto model, and circadian clock adequately represent large ODE systems containing many heterogeneous interactions.

What would settle it

A demonstration that mixed-precision accuracy drops substantially below high-precision levels for some large ODE system under standard tolerances would challenge the main claim.

Figures

Figures reproduced from arXiv: 2605.23727 by Ars\`ene Marzorati (DRACULA, BEAGLE, BIOTIC), CITI, ICJ, ICJ), INSA Lyon), Jonathan Rouzaud-Cornabas (BIOTIC, MMCS, Mouhamad Al-Sayed, MUSICS, Samuel Bernard (MUSICS.

**Figure 1.** Figure 1: Normalized final error, see Equation (8), with respect to Matlab’s ode45 solution. 1000 agents for Benchmark 1 (a), to 2000 for Benchmark 2 (b) and to 700 for Benchmark 3 (c). The error bars are the 5-th and 95-th percentiles. For tolerances of (10−3 , 10−4 , 10−5 , 10−6 , 10−7 , 10−8 , 10−9 ), there are respectively in (a) (113, 97, 78, 59, 33, 19, 0), in (b) (498, 423, 359, 270, 189, 122, 53) and (c) (28… view at source ↗

**Figure 2.** Figure 2: Distribution of normalized final error, see Equation (8), for all the tests completed by all the solvers. One box plot represents a distribution, with limits at 1-st and 99-th percentiles, for all the tests of one benchmark, indicated by the column (a) Benchmark 1, Equation (3), (b) Benchmark 2, Equation (4) and (c) Benchmark 3, Equation (5). With a unique system size, indicated by the intensity of blue, p… view at source ↗

**Figure 3.** Figure 3: (a) Averaged local error (one value per test) approximated as the difference between the 2 RK schemes (EBS, in Equation (7)). (b) Averaged real local error (one value per test). In each plot the median which is computed over all the tests performed for a fixed tolerance is given with the error bars corresponding to the 5th and 95th percentiles. As the tolerance values tested are discrete, a shift was added… view at source ↗

**Figure 4.** Figure 4: Number of tests successfully solved by all solvers for each couple of system size (abscissa) and relative tolerance (ordinate). (a) Benchmark 1, (b) Benchmark 2 and (c) Benchmark 3. 16 [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗

read the original abstract

Mixed-precision methods combine low and high precision arithmetics to exploit low precision computational speed and high precision accuracy. Large ODE systems that contain many heterogeneous interactions lead to a high computational cost that could be tackled with mixed-precision solvers. We tested mixedprecision versions of the Bogacki-Shampine 3(2) Runge-Kutta pair over three benchmark systems: coupled linear oscillators, the Kuramoto model and a circadian clock model. Our study is performed in a way that can be adapted to any finite-precision format, software architecture and numerical scheme. We found that mixed-precision solvers can preserve most of the high-precision solver accuracy under a wide range of solver tolerances. Moreover, mixed-precision solver accuracy improves with system size, reaching levels equivalent to high-precision solvers in small system size. We also observed that mixed-precision arithmetic does not impact the number of evaluation in a way that balances the benefit of fast operations in low precision. Taken together, these results show that mixed-precision methods can offer significant computational speed-up at little or no loss of accuracy in large coupled ODE systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Mixed-precision Bogacki-Shampine keeps accuracy on three standard ODE benchmarks and may speed up runs, but the test systems are too regular to support claims for large heterogeneous networks.

read the letter

The paper runs mixed-precision versions of the Bogacki-Shampine 3(2) pair on coupled oscillators, Kuramoto, and a circadian clock model. It reports that the mixed-precision solver holds most of the accuracy of the full-precision version across tolerances and that accuracy actually improves as system size grows. The number of function evaluations stays roughly the same, so any speed-up would come from the cheaper low-precision operations. The method is written so it can be swapped into other formats or schemes, which is useful for people who already have an adaptive RK code base.

Referee Report

2 major / 0 minor

Summary. The manuscript presents an empirical study of mixed-precision arithmetic applied to the adaptive Bogacki-Shampine 3(2) Runge-Kutta method for solving large systems of ODEs. It tests this approach on three benchmark systems—coupled linear oscillators, the Kuramoto model, and a circadian clock model—and reports that mixed-precision solvers maintain most of the accuracy of full high-precision solvers across a range of tolerances, with accuracy improving as system size grows, while not significantly affecting the number of function evaluations, thereby offering computational speed-up at little accuracy loss.

Significance. If the empirical findings prove robust upon detailed verification and generalize beyond the tested models, the work could enable meaningful speed-ups in simulating large coupled ODE systems with minimal accuracy trade-offs, which would be valuable for computational applications in physics, biology, and engineering involving high-dimensional dynamical systems.

major comments (2)

[Abstract] Abstract: the abstract states empirical findings but supplies no implementation details, error-bar methodology, statistical tests, or description of how mixed-precision arithmetic was realized inside the adaptive step-size controller; claims such as 'accuracy improves with system size' therefore cannot be verified.
[Numerical experiments] Benchmark systems: the three chosen benchmark systems (coupled oscillators, Kuramoto, circadian clock) have relatively uniform or low-dimensional coupling structures; none is a genuinely large (N ≫ 10^3), stiff, or randomly heterogeneous network. This raises concerns about whether the reported size-dependent accuracy gain is a general property of mixed-precision Runge-Kutta or model-specific.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive report and the recommendation of major revision. We address each major comment below, agreeing to expand the abstract and to add discussion of benchmark limitations while defending the relevance of the chosen systems.

read point-by-point responses

Referee: [Abstract] Abstract: the abstract states empirical findings but supplies no implementation details, error-bar methodology, statistical tests, or description of how mixed-precision arithmetic was realized inside the adaptive step-size controller; claims such as 'accuracy improves with system size' therefore cannot be verified.

Authors: We agree the abstract is too concise for full verification. In the revised manuscript we will add a sentence describing the mixed-precision realization (low-precision arithmetic applied only to the function evaluations inside the Bogacki-Shampine pair while retaining high-precision error estimation and step-size control), specify that accuracy is measured by direct comparison of solution trajectories against a reference high-precision run at each tolerance, and note that the size-dependent trend is shown by systematic variation of N in the numerical experiments section rather than by formal statistical tests. revision: yes
Referee: [Numerical experiments] Benchmark systems: the three chosen benchmark systems (coupled oscillators, Kuramoto, circadian clock) have relatively uniform or low-dimensional coupling structures; none is a genuinely large (N ≫ 10^3), stiff, or randomly heterogeneous network. This raises concerns about whether the reported size-dependent accuracy gain is a general property of mixed-precision Runge-Kutta or model-specific.

Authors: The three systems were selected because they are standard large-scale coupled ODE benchmarks in the literature and exhibit heterogeneous interactions (phase differences in Kuramoto, multiple regulatory loops in the circadian model). Experiments were performed with system sizes reaching several thousand equations, and the accuracy gain with N is observed consistently. We acknowledge that the models are not stiff or randomly heterogeneous; the revised manuscript will include an explicit limitations paragraph stating that generalization to such networks remains to be verified and suggesting this as future work. revision: partial

Circularity Check

0 steps flagged

No circularity: purely empirical benchmark study with no derivations or fitted predictions

full rationale

The manuscript reports direct numerical experiments comparing mixed-precision and high-precision Bogacki-Shampine 3(2) integrators on three concrete ODE systems (coupled oscillators, Kuramoto, circadian clock). No equations, ansatzes, uniqueness theorems, or parameter-fitting steps appear in the abstract or described content. Claims about accuracy preservation and size-dependent improvement are presented as observed outcomes of those runs, not as predictions derived from prior fitted quantities or self-citations. The central result therefore does not reduce to its own inputs by construction; the derivation chain is empty and the work is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no free parameters, axioms, or invented entities can be identified.

pith-pipeline@v0.9.0 · 5769 in / 1077 out tokens · 22429 ms · 2026-05-25T03:13:16.694132+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages · 2 internal anchors

[1]

D., Palmer, T., and Smolarkiewicz, P

Ackmann, J., Dueben, P. D., Palmer, T., and Smolarkiewicz, P. K.Mixed-precision for linear solvers in global geophysical flows. Journal of Advances in Modeling Earth Systems 14 , 9 (2022), e2022MS003148. Publisher: Wiley Online Library

work page 2022
[2]

Approximation of large-scale dynamical systems: An overview

Antoulas, A. Approximation of large-scale dynamical systems: An overview. IFAC Proceedings Volumes 37, 11 (2004), 19–28

work page 2004
[3]

Nonequilibrium dynamical mean-field theory and its applications

Aoki, H., Tsuji, N., Eckstein, M., Kollar, M., Oka, T., and Werner, P. Nonequilibrium dynamical mean-field theory and its applications. Reviews of Modern Physics 86 , 2 (2014), 779–837

work page 2014
[4]

Acceler- ating scientific computations with mixed precision algorithms

Baboulin, M., Buttari, A., Dongarra, J., Kurzak, J., Langou, J., Langou, J., Luszczek, P., and Tomov, S. Acceler- ating scientific computations with mixed precision algorithms. Computer Physics Communications 180, 12 (2009), 2526–2533

work page 2009
[5]

Baker, J., and Christofides, P. D. Finite-dimensional approximation and control of non-linear parabolic PDE systems. International Journal of Control 73 , 5 (2000), 439–456

work page 2000
[6]

Bogacki, P., and Shampine, L. F. A 3 (2) pair of Runge-Kutta formulas. Applied Mathematics Letters 2, 4 (1989), 321–325. Publisher: Elsevier

work page 1989
[7]

A particle system in interaction with a rapidly varying environment: Mean field limits and applications

Bordenave, C., McDonald, D., and Proutiere, A. A particle system in interaction with a rapidly varying environment: Mean field limits and applications. arXiv preprint math/0701363 (2007). 12

work page internal anchor Pith review Pith/arXiv arXiv 2007
[8]

Mean-field limits beyond ordinary differential equations

Bortolussi, L., and Gast, N. Mean-field limits beyond ordinary differential equations. Formal Methods for the Quantita- tive Evaluation of Collective Adaptive Systems: 16th International School on Formal Methods for the Design of Computer, Communication, and Software Systems, SFM 2016, Bertinoro, Italy, June 20-24, 2016, Advanced Lectures 16 (2016), 61–82

work page 2016
[9]

E., and Cerminara, M

Brogi, F., Bn`a, S., Boga, G., Amati, G., Ongaro, T. E., and Cerminara, M. On floating point precision in computational fluid dynamics using openfoam. Future Generation Computer Systems 152 (2024), 1–16

work page 2024
[10]

Burnett, B., Gottlieb, S., and Grant, Z. J. Stability analysis and performance evaluation of mixed-precision Runge-Kutta methods, Dec. 2022

work page 2022
[11]

Butcher, J. C. Numerical methods for ordinary differential equations . John Wiley & Sons, 2016

work page 2016
[12]

Low-storage implicit/explicit Runge-Kutta schemes for the simulation of stiff high- dimensional ODE systems

Cavaglieri, D., and Bewley, T. Low-storage implicit/explicit Runge-Kutta schemes for the simulation of stiff high- dimensional ODE systems. Journal of Computational Physics 286 (2015), 172–193

work page 2015
[13]

Rigorous floating- point mixed-precision tuning

Chiang, W.-F., Baranowski, M., Briggs, I., Solovyev, A., Gopalakrishnan, G., and Rakamari ´c, Z. Rigorous floating- point mixed-precision tuning. ACM SIGPLAN Notices 52 , 1 (2017), 300–315

work page 2017
[14]

Nvidia A100 GPU: Performance & innovation for GPU computing

Choquette, J., and Gandhi, W. Nvidia A100 GPU: Performance & innovation for GPU computing. In 2020 IEEE Hot Chips 32 Symposium (HCS) (2020), IEEE Computer Society, pp. 1–43

work page 2020
[15]

Croci, M., and de Souza, G. R. Mixed-precision explicit stabilized Runge-Kutta methods for single- and multi-scale differential equations. Journal of Computational Physics 464 (Sept. 2022), 111349. arXiv:2109.12153 [cs, math]

work page arXiv 2022
[16]

Reduced order modeling

Dar, Z., Baiges, J., and Codina, R. Reduced order modeling. In Machine Learning in Modeling and Simulation: Methods and Applications. Springer, 2023, pp. 297–339

work page 2023
[17]

Low-order classical Runge-Kutta formulas with stepsize control and their application to some heat transfer problems, vol

Fehlberg, E. Low-order classical Runge-Kutta formulas with stepsize control and their application to some heat transfer problems, vol. 315. National aeronautics and space administration, 1969

work page 1969
[18]

Impulses and physiological states in theoretical models of nerve membrane

FitzHugh, R. Impulses and physiological states in theoretical models of nerve membrane. Biophysical journal 1 , 6 (1961), 445–466

work page 1961
[19]

Goodwin, B. C. Oscillatory behavior in enzymatic control processes. Advances in enzyme regulation 3 (1965), 425–437

work page 1965
[20]

Grant, Z. J. Perturbed Runge-Kutta methods for mixed precision applications, Dec. 2020. arXiv:2012.13055 [cs, math]

work page arXiv 2020
[21]

Solving Ordinary Differential Equations I: Nonstiff Problems , second revised edition ed., vol

Hairer, E., Norsett, S., and Wanner, G. Solving Ordinary Differential Equations I: Nonstiff Problems , second revised edition ed., vol. 8 of Springer Series in Computational Mathematics . Springer, Jan. 1993

work page 1993
[22]

Speeding up and reducing memory usage for scientific machine learning via mixed precision

Hayford, J., Goldman-Wetzler, J., Wang, E., and Lu, L. Speeding up and reducing memory usage for scientific machine learning via mixed precision. Computer Methods in Applied Mechanics and Engineering 428 (2024), 117093

work page 2024
[23]

J., and Mary, T

Higham, N. J., and Mary, T. Mixed precision algorithms in numerical linear algebra. Acta Numerica 31 (May 2022), 347–414

work page 2022
[24]

W., Smale, S., and Devaney, R

Hirsch, M. W., Smale, S., and Devaney, R. L. Differential equations, dynamical systems, and an introduction to chaos . Academic press, 2013

work page 2013
[25]

Activations in low precision with high accuracy

Hubrecht, T., Desrentes, O., and de Dinechin, F. Activations in low precision with high accuracy. hal preprint hal- 04776745 (2024)

work page 2024
[26]

C., Schulthess, T

Ichimura, T., Fujita, K., Yamaguchi, T., Naruse, A., Wells, J. C., Schulthess, T. C., Straatsma, T. P., Zimmer, C. J., Martinasso, M., Nakajima, K., et al. A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing. In SC18: Inte...

work page 2018
[27]

Mixed-precision numerics in scientific applications: survey and perspectives

Kashi, A., Lu, H., Brewer, W., Rogers, D., Matheson, M., Shankar, M., and Wang, F. Mixed-precision numerics in scientific applications: survey and perspectives. arXiv preprint arXiv:2412.19322 (2024)

work page arXiv 2024
[28]

A., Carpenter, M

Kennedy, C. A., Carpenter, M. H., and Lewis, R. M. Low-storage, explicit Runge-Kutta schemes for the compressible Navier-Stokes equations. Applied numerical mathematics 35 , 3 (2000), 177–219

work page 2000
[29]

Chemical turbulence

Kuramoto, Y. Chemical turbulence. Springer, 1984

work page 1984
[30]

Exascale deep learning for climate analytics

Kurth, T., Treichler, S., Romero, J., Mudigonda, M., Luehr, N., Phillips, E., Mahesh, A., Matheson, M., Deslippe, J., Fatica, M., et al. Exascale deep learning for climate analytics. In SC18: International conference for high performance computing, networking, storage and analysis (2018), IEEE, pp. 649–660

work page 2018
[31]

W., and Walker, R

Le Grand, S., G ¨otz, A. W., and Walker, R. C. Spfp: Speed without compromise—a mixed precision model for gpu accelerated molecular dynamics simulations. Computer Physics Communications 184 , 2 (2013), 374–380

work page 2013
[32]

J., Beran, P

Lucia, D. J., Beran, P. S., and Silva, W. A. Reduced-order modeling: New approaches for computational physics. Progress in aerospace sciences 40, 1-2 (2004), 51–117

work page 2004
[33]

Mixed Precision Training

Micikevicius, P., Narang, S., Alben, J., Diamos, G., Elsen, E., Garcia, D., Ginsburg, B., Houston, M., Kuchaiev, O., Venkatesh, G., et al. Mixed precision training. arXiv preprint arXiv:1710.03740 (2017)

work page internal anchor Pith review Pith/arXiv arXiv 2017
[34]

Murray, J. D. Mathematical Biology I: An introduction . Springer, 2002

work page 2002
[35]

Agent-based modeling of microbial communities

Nagarajan, K., Ni, C., and Lu, T. Agent-based modeling of microbial communities. ACS synthetic biology 11 , 11 (2022), 3564–3574

work page 2022
[36]

An active pulse transmission line simulating nerve axon

Nagumo, J., Arimoto, S., and Yoshizawa, S. An active pulse transmission line simulating nerve axon. Proceedings of the IRE 50, 10 (1962), 2061–2070

work page 1962
[37]

Palmer, T. N. More reliable forecasts with less precise computations: A fast-track route to cloud-resolved weather and climate simulators? Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 372 , 2018 (June 2014), 20130391

work page 2018
[38]

J., and Kollmannsberger, P

Paul, T. J., and Kollmannsberger, P. Biological network growth in complex environments: A computational framework. PLOS Computational Biology 16 , 11 (2020), e1008003

work page 2020
[39]

E., Khargonekar, P., and Kurdahi, F

Rakka, M., Fouda, M. E., Khargonekar, P., and Kurdahi, F. Mixed-precision neural networks: A survey. arXiv preprint arXiv:2208.06064 (2022). 13

work page arXiv 2022
[40]

Reduced-precision parametrization: Lessons from an intermediate- complexity atmospheric model

Saffin, L., Hatfield, S., D ¨uben, P., and Palmer, T. Reduced-precision parametrization: Lessons from an intermediate- complexity atmospheric model. Quarterly Journal of the Royal Meteorological Society 146 , 729 (2020), 1590–1607

work page 2020
[41]

F., and Reichelt, M

Shampine, L. F., and Reichelt, M. W. The Matlab ODE suite. SIAM journal on scientific computing 18 , 1 (1997), 1–22. Publisher: SIAM

work page 1997
[42]

Strogatz, S. H. Nonlinear dynamics and chaos with student solutions manual: With applications to physics, biology, chemistry, and engineering . CRC press, 2018

work page 2018
[43]

BFloat16: the secret to high performance on Cloud TPUs

Wang, S., and Kanwar, P. BFloat16: the secret to high performance on Cloud TPUs. Google Cloud Blog 4 (2019)

work page 2019
[44]

IEEE standard for floating-point arithmetic

Zuras, D., Cowlishaw, M., Aiken, A., Applegate, M., Bailey, D., Bass, S., Bhandarkar, D., Bhat, M., Bindel, D., Boldo, S., et al. IEEE standard for floating-point arithmetic. IEEE Std 754 , 2008 (2008), 1–70. 14 A Parameters For all benchmarks we have Tolerances The absolute tolerance is lower or equal than the relative tolerance. Explored parameters All ...

work page 2008

[1] [1]

D., Palmer, T., and Smolarkiewicz, P

Ackmann, J., Dueben, P. D., Palmer, T., and Smolarkiewicz, P. K.Mixed-precision for linear solvers in global geophysical flows. Journal of Advances in Modeling Earth Systems 14 , 9 (2022), e2022MS003148. Publisher: Wiley Online Library

work page 2022

[2] [2]

Approximation of large-scale dynamical systems: An overview

Antoulas, A. Approximation of large-scale dynamical systems: An overview. IFAC Proceedings Volumes 37, 11 (2004), 19–28

work page 2004

[3] [3]

Nonequilibrium dynamical mean-field theory and its applications

Aoki, H., Tsuji, N., Eckstein, M., Kollar, M., Oka, T., and Werner, P. Nonequilibrium dynamical mean-field theory and its applications. Reviews of Modern Physics 86 , 2 (2014), 779–837

work page 2014

[4] [4]

Acceler- ating scientific computations with mixed precision algorithms

Baboulin, M., Buttari, A., Dongarra, J., Kurzak, J., Langou, J., Langou, J., Luszczek, P., and Tomov, S. Acceler- ating scientific computations with mixed precision algorithms. Computer Physics Communications 180, 12 (2009), 2526–2533

work page 2009

[5] [5]

Baker, J., and Christofides, P. D. Finite-dimensional approximation and control of non-linear parabolic PDE systems. International Journal of Control 73 , 5 (2000), 439–456

work page 2000

[6] [6]

Bogacki, P., and Shampine, L. F. A 3 (2) pair of Runge-Kutta formulas. Applied Mathematics Letters 2, 4 (1989), 321–325. Publisher: Elsevier

work page 1989

[7] [7]

A particle system in interaction with a rapidly varying environment: Mean field limits and applications

Bordenave, C., McDonald, D., and Proutiere, A. A particle system in interaction with a rapidly varying environment: Mean field limits and applications. arXiv preprint math/0701363 (2007). 12

work page internal anchor Pith review Pith/arXiv arXiv 2007

[8] [8]

Mean-field limits beyond ordinary differential equations

Bortolussi, L., and Gast, N. Mean-field limits beyond ordinary differential equations. Formal Methods for the Quantita- tive Evaluation of Collective Adaptive Systems: 16th International School on Formal Methods for the Design of Computer, Communication, and Software Systems, SFM 2016, Bertinoro, Italy, June 20-24, 2016, Advanced Lectures 16 (2016), 61–82

work page 2016

[9] [9]

E., and Cerminara, M

Brogi, F., Bn`a, S., Boga, G., Amati, G., Ongaro, T. E., and Cerminara, M. On floating point precision in computational fluid dynamics using openfoam. Future Generation Computer Systems 152 (2024), 1–16

work page 2024

[10] [10]

Burnett, B., Gottlieb, S., and Grant, Z. J. Stability analysis and performance evaluation of mixed-precision Runge-Kutta methods, Dec. 2022

work page 2022

[11] [11]

Butcher, J. C. Numerical methods for ordinary differential equations . John Wiley & Sons, 2016

work page 2016

[12] [12]

Low-storage implicit/explicit Runge-Kutta schemes for the simulation of stiff high- dimensional ODE systems

Cavaglieri, D., and Bewley, T. Low-storage implicit/explicit Runge-Kutta schemes for the simulation of stiff high- dimensional ODE systems. Journal of Computational Physics 286 (2015), 172–193

work page 2015

[13] [13]

Rigorous floating- point mixed-precision tuning

Chiang, W.-F., Baranowski, M., Briggs, I., Solovyev, A., Gopalakrishnan, G., and Rakamari ´c, Z. Rigorous floating- point mixed-precision tuning. ACM SIGPLAN Notices 52 , 1 (2017), 300–315

work page 2017

[14] [14]

Nvidia A100 GPU: Performance & innovation for GPU computing

Choquette, J., and Gandhi, W. Nvidia A100 GPU: Performance & innovation for GPU computing. In 2020 IEEE Hot Chips 32 Symposium (HCS) (2020), IEEE Computer Society, pp. 1–43

work page 2020

[15] [15]

Croci, M., and de Souza, G. R. Mixed-precision explicit stabilized Runge-Kutta methods for single- and multi-scale differential equations. Journal of Computational Physics 464 (Sept. 2022), 111349. arXiv:2109.12153 [cs, math]

work page arXiv 2022

[16] [16]

Reduced order modeling

Dar, Z., Baiges, J., and Codina, R. Reduced order modeling. In Machine Learning in Modeling and Simulation: Methods and Applications. Springer, 2023, pp. 297–339

work page 2023

[17] [17]

Low-order classical Runge-Kutta formulas with stepsize control and their application to some heat transfer problems, vol

Fehlberg, E. Low-order classical Runge-Kutta formulas with stepsize control and their application to some heat transfer problems, vol. 315. National aeronautics and space administration, 1969

work page 1969

[18] [18]

Impulses and physiological states in theoretical models of nerve membrane

FitzHugh, R. Impulses and physiological states in theoretical models of nerve membrane. Biophysical journal 1 , 6 (1961), 445–466

work page 1961

[19] [19]

Goodwin, B. C. Oscillatory behavior in enzymatic control processes. Advances in enzyme regulation 3 (1965), 425–437

work page 1965

[20] [20]

Grant, Z. J. Perturbed Runge-Kutta methods for mixed precision applications, Dec. 2020. arXiv:2012.13055 [cs, math]

work page arXiv 2020

[21] [21]

Solving Ordinary Differential Equations I: Nonstiff Problems , second revised edition ed., vol

Hairer, E., Norsett, S., and Wanner, G. Solving Ordinary Differential Equations I: Nonstiff Problems , second revised edition ed., vol. 8 of Springer Series in Computational Mathematics . Springer, Jan. 1993

work page 1993

[22] [22]

Speeding up and reducing memory usage for scientific machine learning via mixed precision

Hayford, J., Goldman-Wetzler, J., Wang, E., and Lu, L. Speeding up and reducing memory usage for scientific machine learning via mixed precision. Computer Methods in Applied Mechanics and Engineering 428 (2024), 117093

work page 2024

[23] [23]

J., and Mary, T

Higham, N. J., and Mary, T. Mixed precision algorithms in numerical linear algebra. Acta Numerica 31 (May 2022), 347–414

work page 2022

[24] [24]

W., Smale, S., and Devaney, R

Hirsch, M. W., Smale, S., and Devaney, R. L. Differential equations, dynamical systems, and an introduction to chaos . Academic press, 2013

work page 2013

[25] [25]

Activations in low precision with high accuracy

Hubrecht, T., Desrentes, O., and de Dinechin, F. Activations in low precision with high accuracy. hal preprint hal- 04776745 (2024)

work page 2024

[26] [26]

C., Schulthess, T

Ichimura, T., Fujita, K., Yamaguchi, T., Naruse, A., Wells, J. C., Schulthess, T. C., Straatsma, T. P., Zimmer, C. J., Martinasso, M., Nakajima, K., et al. A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing. In SC18: Inte...

work page 2018

[27] [27]

Mixed-precision numerics in scientific applications: survey and perspectives

Kashi, A., Lu, H., Brewer, W., Rogers, D., Matheson, M., Shankar, M., and Wang, F. Mixed-precision numerics in scientific applications: survey and perspectives. arXiv preprint arXiv:2412.19322 (2024)

work page arXiv 2024

[28] [28]

A., Carpenter, M

Kennedy, C. A., Carpenter, M. H., and Lewis, R. M. Low-storage, explicit Runge-Kutta schemes for the compressible Navier-Stokes equations. Applied numerical mathematics 35 , 3 (2000), 177–219

work page 2000

[29] [29]

Chemical turbulence

Kuramoto, Y. Chemical turbulence. Springer, 1984

work page 1984

[30] [30]

Exascale deep learning for climate analytics

Kurth, T., Treichler, S., Romero, J., Mudigonda, M., Luehr, N., Phillips, E., Mahesh, A., Matheson, M., Deslippe, J., Fatica, M., et al. Exascale deep learning for climate analytics. In SC18: International conference for high performance computing, networking, storage and analysis (2018), IEEE, pp. 649–660

work page 2018

[31] [31]

W., and Walker, R

Le Grand, S., G ¨otz, A. W., and Walker, R. C. Spfp: Speed without compromise—a mixed precision model for gpu accelerated molecular dynamics simulations. Computer Physics Communications 184 , 2 (2013), 374–380

work page 2013

[32] [32]

J., Beran, P

Lucia, D. J., Beran, P. S., and Silva, W. A. Reduced-order modeling: New approaches for computational physics. Progress in aerospace sciences 40, 1-2 (2004), 51–117

work page 2004

[33] [33]

Mixed Precision Training

Micikevicius, P., Narang, S., Alben, J., Diamos, G., Elsen, E., Garcia, D., Ginsburg, B., Houston, M., Kuchaiev, O., Venkatesh, G., et al. Mixed precision training. arXiv preprint arXiv:1710.03740 (2017)

work page internal anchor Pith review Pith/arXiv arXiv 2017

[34] [34]

Murray, J. D. Mathematical Biology I: An introduction . Springer, 2002

work page 2002

[35] [35]

Agent-based modeling of microbial communities

Nagarajan, K., Ni, C., and Lu, T. Agent-based modeling of microbial communities. ACS synthetic biology 11 , 11 (2022), 3564–3574

work page 2022

[36] [36]

An active pulse transmission line simulating nerve axon

Nagumo, J., Arimoto, S., and Yoshizawa, S. An active pulse transmission line simulating nerve axon. Proceedings of the IRE 50, 10 (1962), 2061–2070

work page 1962

[37] [37]

Palmer, T. N. More reliable forecasts with less precise computations: A fast-track route to cloud-resolved weather and climate simulators? Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 372 , 2018 (June 2014), 20130391

work page 2018

[38] [38]

J., and Kollmannsberger, P

Paul, T. J., and Kollmannsberger, P. Biological network growth in complex environments: A computational framework. PLOS Computational Biology 16 , 11 (2020), e1008003

work page 2020

[39] [39]

E., Khargonekar, P., and Kurdahi, F

Rakka, M., Fouda, M. E., Khargonekar, P., and Kurdahi, F. Mixed-precision neural networks: A survey. arXiv preprint arXiv:2208.06064 (2022). 13

work page arXiv 2022

[40] [40]

Reduced-precision parametrization: Lessons from an intermediate- complexity atmospheric model

Saffin, L., Hatfield, S., D ¨uben, P., and Palmer, T. Reduced-precision parametrization: Lessons from an intermediate- complexity atmospheric model. Quarterly Journal of the Royal Meteorological Society 146 , 729 (2020), 1590–1607

work page 2020

[41] [41]

F., and Reichelt, M

Shampine, L. F., and Reichelt, M. W. The Matlab ODE suite. SIAM journal on scientific computing 18 , 1 (1997), 1–22. Publisher: SIAM

work page 1997

[42] [42]

Strogatz, S. H. Nonlinear dynamics and chaos with student solutions manual: With applications to physics, biology, chemistry, and engineering . CRC press, 2018

work page 2018

[43] [43]

BFloat16: the secret to high performance on Cloud TPUs

Wang, S., and Kanwar, P. BFloat16: the secret to high performance on Cloud TPUs. Google Cloud Blog 4 (2019)

work page 2019

[44] [44]

IEEE standard for floating-point arithmetic

Zuras, D., Cowlishaw, M., Aiken, A., Applegate, M., Bailey, D., Bass, S., Bhandarkar, D., Bhat, M., Bindel, D., Boldo, S., et al. IEEE standard for floating-point arithmetic. IEEE Std 754 , 2008 (2008), 1–70. 14 A Parameters For all benchmarks we have Tolerances The absolute tolerance is lower or equal than the relative tolerance. Explored parameters All ...

work page 2008