pith. sign in

arxiv: 2605.13088 · v1 · pith:B5UPK2T2new · submitted 2026-05-13 · 💻 cs.LG

Bayesian Nonparametric Mixed-Effect ODEs with Gaussian Processes

Pith reviewed 2026-05-14 19:44 UTC · model grok-4.3

classification 💻 cs.LG
keywords mixed-effect ODEsGaussian processesBayesian nonparametricdynamical systemsheterogeneous trajectoriesvector field decompositionKalman smoothing
0
0 comments X

The pith

A Bayesian nonparametric model decomposes each subject's ODE vector field into a shared population Gaussian process and subject-specific deviations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a mixed-effects approach to nonparametric ODE modeling that explicitly handles heterogeneity across subjects or units. It places independent Gaussian process priors on a common population vector field and on each subject's deviation from that field. Inference uses state-space representations of the trajectories together with virtual collocation observations, which replace repeated numerical integration of the ODEs during training. On controlled benchmarks that include oscillatory and biomedical systems, the resulting model recovers the population vector field more accurately and predicts individual trajectories better than strong single-system baselines.

Core claim

MEGPODE decomposes each subject's vector field into a shared population component and a subject-specific deviation, both endowed with Gaussian process priors, and performs efficient Bayesian inference by combining state-space GP trajectory priors with virtual collocation observations to obtain Kalman-smoothing updates and closed-form vector-field regressions without repeated ODE solves per subject.

What carries the argument

The mixed-effect decomposition of each subject's vector field into a shared population Gaussian process and a subject-specific deviation Gaussian process, combined with state-space trajectory priors and virtual collocation observations for closed-form posterior updates.

If this is right

  • Population-level vector fields can be recovered more accurately when subject heterogeneity is modeled explicitly rather than ignored.
  • Subject-specific trajectories can be predicted with lower error than under single-system nonparametric ODE models.
  • Uncertainty quantification remains available for both the shared population dynamics and the individual deviations.
  • The method applies directly to pharmacometrics, systems biology, and epidemiology without requiring a parametric form for the vector field.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same state-space and virtual-observation construction could be applied to other nonparametric priors on vector fields if analogous efficient inference schemes exist.
  • Real patient or experimental time-series data with known heterogeneity would provide a direct test of whether the benchmark gains translate to practice.
  • Allowing the shared population process to be non-stationary or to depend on covariates would extend the model to time-varying or covariate-driven population dynamics.

Load-bearing premise

That subject vector fields can be usefully decomposed into a shared population component and a subject-specific deviation, both well captured by Gaussian process priors, and that virtual collocation observations suffice for accurate posterior inference without repeated ODE solves.

What would settle it

On the controlled heterogeneous ODE benchmarks, if population-field recovery error or subject-level trajectory prediction error is not lower than that of strong baselines, the claimed performance advantage would not hold.

Figures

Figures reproduced from arXiv: 2605.13088 by Harri L\"ahdesm\"aki, Julien Martinelli, Maksim Sinelnikov, M\'elanie Prague, Quentin Clairon.

Figure 1
Figure 1. Figure 1: summarizes the method, [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Misspecified FHN setting. Population fields and forecasts when the true dynamics include a smooth residual term. The semi-mechanistic version of MEGPODE best recovers the corrected field and forecast dynamics. plus-subject decomposition; and MEGPODE-M, our semi-mechanistic variant with an FHN mean and GP residual (Appendix C.3). The mechanistic fit is constrained by the misspecified FHN family and accumula… view at source ↗
Figure 3
Figure 3. Figure 3: Ablations for MEGPODE (forecast setting). Top: hyperparameter ablations. Each panel varies a single hyperparameter around the default configuration, while keeping the rest of the training and evaluation pipeline fixed; the blue tick marks the default value. Mean ±0.5 std relative change with respect to the default setting shown, averaged over 10 seeds and the benchmark systems used in the ablation study. B… view at source ↗
read the original abstract

Dynamical modelling is central to many scientific domains, including pharmacometrics, systems biology, physiology, and epidemiology. In these settings, heterogeneity is often intrinsic: different subjects or units follow related but distinct continuous-time dynamics. Classical nonlinear mixed-effects Ordinary Differential Equation (ODE) models address this by combining population-level structure with subject-specific effects, but they rely on a parametric vector field and are therefore vulnerable to structural misspecification and unmodelled mechanisms. This motivates nonparametric approaches that can retain principled uncertainty quantification, yet existing nonparametric ODE methods typically assume a single shared dynamical system rather than an explicit mixed-effect hierarchy over subject-specific dynamics. We propose MEGPODE, a Bayesian nonparametric mixed-effect ODE model in which each subject's vector field is decomposed into a shared population component and a subject-specific deviation, both endowed with Gaussian process (GP) priors. To avoid repeated ODE solves per subject during training, we combine state-space GP trajectory priors with virtual collocation observations, yielding Kalman-smoothing trajectory updates and closed-form regressions for the vector fields. Across controlled heterogeneous ODE benchmarks spanning oscillatory, biomedical systems, MEGPODE improves population-field recovery and subject-level trajectory prediction relative to strong baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces MEGPODE, a Bayesian nonparametric mixed-effect ODE model in which each subject's vector field decomposes into a shared population component and subject-specific deviation, both endowed with Gaussian process priors. Inference avoids repeated ODE solves by combining state-space GP trajectory priors with virtual collocation observations, yielding Kalman-smoothing trajectory updates and closed-form vector-field regressions. Experiments on controlled heterogeneous ODE benchmarks (oscillatory and biomedical) report improved population-field recovery and subject-level trajectory prediction relative to strong baselines.

Significance. If the central claims hold, the work would supply a computationally tractable nonparametric extension of mixed-effects ODE modeling that retains principled uncertainty quantification, addressing a clear gap between classical parametric nonlinear mixed-effects models and single-system nonparametric ODE methods. The closed-form Kalman-smoothing updates constitute a genuine efficiency gain that could enable broader adoption in pharmacometrics and systems biology.

major comments (2)
  1. [Methods (virtual collocation and Kalman smoothing)] The virtual collocation approximation that enforces the ODE only at chosen discrete points is load-bearing for the claim of accurate posterior inference without repeated solves. For nonlinear vector fields and heterogeneous subject deviations, the approximation error depends on collocation density and placement; no quantitative bound or sensitivity analysis is provided to show that this surrogate does not systematically bias population-component recovery.
  2. [Experiments and Results] Benchmark results claim consistent improvements in population-field recovery and subject-level prediction, yet no error bars, standard deviations across random seeds, or explicit data-exclusion rules are reported. This omission prevents assessment of whether the reported gains are statistically reliable or sensitive to experimental choices.
minor comments (2)
  1. Notation for the state-space GP prior, virtual observations, and the precise form of the mixed-effect decomposition should be introduced with explicit equations on first use to improve readability.
  2. [Abstract] The abstract refers to 'oscillatory, biomedical systems' without naming the concrete benchmark ODEs; listing them (e.g., Lotka-Volterra variants, pharmacokinetic models) would aid reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which highlight important aspects of our methods and experimental reporting. We address each major comment below and indicate planned revisions to the manuscript.

read point-by-point responses
  1. Referee: [Methods (virtual collocation and Kalman smoothing)] The virtual collocation approximation that enforces the ODE only at chosen discrete points is load-bearing for the claim of accurate posterior inference without repeated solves. For nonlinear vector fields and heterogeneous subject deviations, the approximation error depends on collocation density and placement; no quantitative bound or sensitivity analysis is provided to show that this surrogate does not systematically bias population-component recovery.

    Authors: We agree that the virtual collocation scheme is central to the efficiency claim and that its approximation quality for nonlinear heterogeneous dynamics merits further scrutiny. While our current experiments show stable recovery across the tested benchmarks, we do not provide a theoretical error bound. In the revised manuscript we will add a dedicated sensitivity study that varies collocation density and placement (including random and uniform grids) and reports the resulting changes in population-field recovery error and subject-level prediction metrics. revision: yes

  2. Referee: [Experiments and Results] Benchmark results claim consistent improvements in population-field recovery and subject-level prediction, yet no error bars, standard deviations across random seeds, or explicit data-exclusion rules are reported. This omission prevents assessment of whether the reported gains are statistically reliable or sensitive to experimental choices.

    Authors: We acknowledge that the absence of variability measures and explicit data-handling protocols limits the ability to judge statistical reliability. In the revision we will report all metrics with standard deviations computed over multiple random seeds, include error bars in the figures, and add a clear subsection describing the data-generation process, any exclusion criteria, and the precise train/test splits used for each benchmark. revision: yes

Circularity Check

0 steps flagged

No circularity: derivation uses external GP/Kalman methods and reports empirical gains

full rationale

The paper's core construction decomposes each subject's vector field into a shared population GP component plus subject-specific GP deviation, then applies standard state-space GP trajectory priors together with virtual collocation observations to obtain Kalman-smoothing updates and closed-form vector-field regressions. These building blocks (GP priors, state-space representation, Kalman smoothing, collocation) are standard techniques whose properties are independent of the present work. The reported improvements in population-field recovery and subject-level prediction are measured on external heterogeneous ODE benchmarks against separate baselines; they do not reduce to any quantity defined by the model's own fitted parameters or by a self-citation chain. No step matches the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based on abstract only; no explicit free parameters, axioms, or invented entities are stated beyond standard GP and state-space assumptions.

pith-pipeline@v0.9.0 · 5521 in / 1160 out tokens · 53014 ms · 2026-05-14T19:44:47.057371+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages

  1. [1]

    Prediction of long- term humoral response induced by the two-dose heterologous ad26.zebov, mva-bn-filo vaccine against ebola.npj Vaccines, 2023

    Marie Alexandre, Mélanie Prague, Chelsea McLean, Viki Bockstal, Macaya Douoguih, Rodolphe Thiébaut, Thierry Effelterre, Laura Solforosi, and Anna Dari. Prediction of long- term humoral response induced by the two-dose heterologous ad26.zebov, mva-bn-filo vaccine against ebola.npj Vaccines, 2023

  2. [2]

    An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation

    Jonas Arruda, Yannik Schälte, Clemens Peiter, Olga Teplytska, Ulrich Jaehde, and Jan Hase- nauer. An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation. InProceedings of the 41st International Conference on Machine Learning, 2024

  3. [3]

    Springer, 2006

    Christopher M Bishop and Nasser M Nasrabadi.Pattern recognition and machine learning. Springer, 2006

  4. [4]

    Cambridge University Press, 2004

    Stephen Boyd and Lieven Vandenberghe.Convex Optimization. Cambridge University Press, 2004

  5. [5]

    Dominic Stefan Bräm, Bernhard Steiert, Marc Pfister, Britta Steffens, and Gilbert Koch. Low- dimensional neural ordinary differential equations accounting for inter-individual variability implemented in monolix and nonmem.CPT: Pharmacometrics & Systems Pharmacology, 2025

  6. [6]

    Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, and David Duvenaud. Neural ordinary differential equations. InAdvances in Neural Information Processing Systems, 2018

  7. [7]

    Deep mixed effect model using gaussian processes: A personalized and reliable prediction for healthcare.Proceedings of the AAAI Conference on Artificial Intelligence, 2020

    Ingyo Chung, Saehoon Kim, Juho Lee, Kwang Joon Kim, Sung Ju Hwang, and Eunho Yang. Deep mixed effect model using gaussian processes: A personalized and reliable prediction for healthcare.Proceedings of the AAAI Conference on Artificial Intelligence, 2020

  8. [8]

    Modeling the kinet- ics of the neutralizing antibody response against sars-cov-2 variants after several administrations of bnt162b2.PLoS Computational Biology, 2023

    Quentin Clairon, Mélanie Prague, Delphine Planas, Timothée Bruel, Laurent Hocqueloux, Thierry Prazuck, Olivier Schwartz, Rodolphe Thiébaut, and Jérémie Guedj. Modeling the kinet- ics of the neutralizing antibody response against sars-cov-2 variants after several administrations of bnt162b2.PLoS Computational Biology, 2023

  9. [9]

    Estimation for dynamical systems using a population-based kalman filter–applications in computational biology.MathematicS In Action, 2022

    Annabelle Collin, Mélanie Prague, and Philippe Moireau. Estimation for dynamical systems using a population-based kalman filter–applications in computational biology.MathematicS In Action, 2022

  10. [10]

    Parameter estimation in nonlinear mixed effect models using saemix, an r implementation of the saem algorithm.Journal of Statistical Software, 2017

    Emmanuelle Comets, Audrey Lavenu, and Marc Lavielle. Parameter estimation in nonlinear mixed effect models using saemix, an r implementation of the saem algorithm.Journal of Statistical Software, 2017

  11. [11]

    Ode parameter inference using adaptive gradient matching with gaussian processes

    Frank Dondelinger, Dirk Husmeier, Simon Rogers, and Maurizio Filippone. Ode parameter inference using adaptive gradient matching with gaussian processes. InArtificial intelligence and statistics, 2013

  12. [12]

    Discovering physics laws of dynamical systems via invariant function learning

    Shurui Gui, Xiner Li, and Shuiwang Ji. Discovering physics laws of dynamical systems via invariant function learning. InProceedings of the 42nd International Conference on Machine Learning, 2025

  13. [13]

    Physics-informed variational state- space Gaussian processes

    Oliver Hamelijnck, Arno Solin, and Theodoros Damoulas. Physics-informed variational state- space Gaussian processes. InAdvances in Neural Information Processing Systems, 2024

  14. [14]

    Variational multiple shooting for bayesian odes with gaussian processes

    Pashupati Hegde, Ça˘gatay Yıldız, Harri Lähdesmäki, Samuel Kaski, and Markus Heinonen. Variational multiple shooting for bayesian odes with gaussian processes. InUncertainty in Artificial Intelligence, 2022

  15. [15]

    Learning unknown ODE models with gaussian processes

    Markus Heinonen, Cagatay Yildiz, Henrik Mannerstrom, Jukka Intosalmi, and Harri Lahdes- maki. Learning unknown ODE models with gaussian processes. InProceedings of the 35th International Conference on Machine Learning, 2018

  16. [16]

    Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 2020

    Patrick Kidger, James Morrill, James Foster, and Terry Lyons. Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 2020

  17. [17]

    Generalizing to new physical systems via context-informed dynamics model

    Matthieu Kirchmeyer, Yuan Yin, Jérémie Donà, Nicolas Baskiotis, Alain Rakotomamonjy, and Patrick Gallinari. Generalizing to new physical systems via context-informed dynamics model. InInternational conference on machine learning, 2022. 11

  18. [18]

    CRC press, 2014

    Marc Lavielle.Mixed Effects Models for the Population Approach: Models, Tasks, Methods and Tools. CRC press, 2014

  19. [19]

    MAGMA: inference and prediction using multi-task Gaussian processes with common mean.Machine Learning, 2022

    Arthur Leroy, Pierre Latouche, Benjamin Guedj, and Servane Gey. MAGMA: inference and prediction using multi-task Gaussian processes with common mean.Machine Learning, 2022

  20. [20]

    Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations.arXiv preprint arXiv:2601.17400, 2026

    Zhe Li, Mélanie Prague, Rodolphe Thiébaut, and Quentin Clairon. Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations.arXiv preprint arXiv:2601.17400, 2026

  21. [21]

    Lindstrom and Douglas M

    Mary J. Lindstrom and Douglas M. Bates. Nonlinear mixed effects models for repeated measures data.Biometrics, 1990

  22. [22]

    Autoip: A united framework to integrate physics into gaussian processes

    Da Long, Zheng Wang, Aditi Krishnapriyan, Robert Kirby, Shandian Zhe, and Michael Mahoney. Autoip: A united framework to integrate physics into gaussian processes. InInternational Conference on Machine Learning, 2022

  23. [23]

    Constraining the dynamics of deep probabilistic models

    Marco Lorenzi and Maurizio Filippone. Constraining the dynamics of deep probabilistic models. InProceedings of the 35th International Conference on Machine Learning, 2018

  24. [24]

    Modeling formalisms in systems biology.AMB express, 2011

    Daniel Machado, Rafael S Costa, Miguel Rocha, Eugénio C Ferreira, Bruce Tidor, and Isabel Rocha. Modeling formalisms in systems biology.AMB express, 2011

  25. [25]

    Position: Biology is the challenge physics-informed ML needs to evolve

    Julien Martinelli. Position: Biology is the challenge physics-informed ML needs to evolve. In The Thirty-Ninth Annual Conference on Neural Information Processing Systems Position Paper Track, 2025

  26. [26]

    Metaphysica: Improving OOD robustness in physics-informed machine learning

    S Chandra Mouli, Muhammad Alam, and Bruno Ribeiro. Metaphysica: Improving OOD robustness in physics-informed machine learning. InThe Twelfth International Conference on Learning Representations, 2024

  27. [27]

    A variational approximation for analyzing the dynamics of panel data

    Jurijs Nazarovs, Rudrasis Chakraborty, Songwong Tasneeyapant, Sathya Ravi, and Vikas Singh. A variational approximation for analyzing the dynamics of panel data. InProceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

  28. [28]

    Amortized in-context mixed effect transformer models: A zero-shot approach for pharmacokinetics

    Cesar Ojeda, Niklas Hartung, Wilhelm Huisinga, and Ramses J Sanchez. Amortized in-context mixed effect transformer models: A zero-shot approach for pharmacokinetics. InThe 29th International Conference on Artificial Intelligence and Statistics, 2026

  29. [29]

    Random features for large-scale kernel machines

    Ali Rahimi and Benjamin Recht. Random features for large-scale kernel machines. InAdvances in Neural Information Processing Systems, 2007

  30. [30]

    Carl Edward Rasmussen and Christopher K. I. Williams.Gaussian Processes for Machine Learning. MIT Press, 2006

  31. [31]

    Yulia Rubanova, Ricky T. Q. Chen, and David Duvenaud. Latent ODEs for irregularly-sampled time series. InAdvances in Neural Information Processing Systems, 2019

  32. [32]

    Cambridge University Press, 2013

    Simo Särkkä.Bayesian Filtering and Smoothing. Cambridge University Press, 2013

  33. [33]

    Cambridge University Press, 2019

    Simo Särkkä and Arno Solin.Applied Stochastic Differential Equations. Cambridge University Press, 2019

  34. [34]

    Variational grey-box dynamics matching

    Gurjeet Sangra Singh, Frantzeska Lavda, Giangiacomo Mercatali, and Alexandros Kalousis. Variational grey-box dynamics matching. InProceedings of The 29th International Conference on Artificial Intelligence and Statistics, 2026

  35. [35]

    Hilbert space methods for reduced-rank Gaussian process regression.Statistics and Computing, 2020

    Arno Solin and Simo Särkkä. Hilbert space methods for reduced-rank Gaussian process regression.Statistics and Computing, 2020

  36. [36]

    Deep grey-box modeling with adaptive data-driven models toward trustworthy estimation of theory-driven models

    Naoya Takeishi and Alexandros Kalousis. Deep grey-box modeling with adaptive data-driven models toward trustworthy estimation of theory-driven models. InProceedings of The 26th International Conference on Artificial Intelligence and Statistics, 2023

  37. [37]

    American Mathematical Soc., 2012

    Gerald Teschl.Ordinary differential equations and dynamical systems. American Mathematical Soc., 2012

  38. [38]

    Probabilistic solutions to ordinary differential equations as nonlinear bayesian filtering: a new perspective.Statistics and Computing, 2019

    Filip Tronarp, Hans Kersting, Simo Särkkä, and Philipp Hennig. Probabilistic solutions to ordinary differential equations as nonlinear bayesian filtering: a new perspective.Statistics and Computing, 2019. 12

  39. [39]

    Estimating mixed-effects differential equation models.Statistics and Computing, 2014

    Liangliang Wang, Jiguo Cao, James O Ramsay, DM Burger, CJL Laporte, and Jürgen K Rockstroh. Estimating mixed-effects differential equation models.Statistics and Computing, 2014

  40. [40]

    Odin: Ode-informed regression for parameter and state inference in time- continuous dynamical systems

    Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, and Stefan Bauer. Odin: Ode-informed regression for parameter and state inference in time- continuous dynamical systems. InProceedings of the AAAI Conference on Artificial Intelligence, 2020

  41. [41]

    Shihao Yang, Samuel W. K. Wong, and S. C. Kou. Inference of dynamic systems from noisy and sparse data via manifold-constrained gaussian processes.Proceedings of the National Academy of Sciences, 2021

  42. [42]

    Doubly mixed-effects gaussian process regression

    Jun Ho Yoon, Daniel P Jeong, and Seyoung Kim. Doubly mixed-effects gaussian process regression. InInternational Conference on Artificial Intelligence and Statistics, 2022

  43. [43]

    Model-based smoothing with integrated wiener processes and overlapping splines.Journal of Computational and Graphical Statistics, 2024

    Ziang Zhang, Alex Stringer, Patrick Brown, and Jamie Stafford. Model-based smoothing with integrated wiener processes and overlapping splines.Journal of Computational and Graphical Statistics, 2024

  44. [44]

    q(xi,0) Ni−1Y n=1 q(si,n) # . The resulting population ELBO is LGPODE-pop = MX i=1

    Bob Junyi Zou, Matthew E Levine, Dessi P. Zaharieva, Ramesh Johari, and Emily Fox. Hybrid2 neural ODE causal modeling and an application to glycemic response. InProceedings of the 41st International Conference on Machine Learning, 2024. 13 Appendix A Supplementary figures 15 B Baselines and hyperparameters 18 B.1 Concurrent baselines . . . . . . . . . . ....