Bayesian Nonparametric Mixed-Effect ODEs with Gaussian Processes

Harri L\"ahdesm\"aki; Julien Martinelli; Maksim Sinelnikov; M\'elanie Prague; Quentin Clairon

arxiv: 2605.13088 · v1 · pith:B5UPK2T2new · submitted 2026-05-13 · 💻 cs.LG

Bayesian Nonparametric Mixed-Effect ODEs with Gaussian Processes

Julien Martinelli , Maksim Sinelnikov , Harri L\"ahdesm\"aki , Quentin Clairon , M\'elanie Prague This is my paper

Pith reviewed 2026-05-14 19:44 UTC · model grok-4.3

classification 💻 cs.LG

keywords mixed-effect ODEsGaussian processesBayesian nonparametricdynamical systemsheterogeneous trajectoriesvector field decompositionKalman smoothing

0 comments

The pith

A Bayesian nonparametric model decomposes each subject's ODE vector field into a shared population Gaussian process and subject-specific deviations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a mixed-effects approach to nonparametric ODE modeling that explicitly handles heterogeneity across subjects or units. It places independent Gaussian process priors on a common population vector field and on each subject's deviation from that field. Inference uses state-space representations of the trajectories together with virtual collocation observations, which replace repeated numerical integration of the ODEs during training. On controlled benchmarks that include oscillatory and biomedical systems, the resulting model recovers the population vector field more accurately and predicts individual trajectories better than strong single-system baselines.

Core claim

MEGPODE decomposes each subject's vector field into a shared population component and a subject-specific deviation, both endowed with Gaussian process priors, and performs efficient Bayesian inference by combining state-space GP trajectory priors with virtual collocation observations to obtain Kalman-smoothing updates and closed-form vector-field regressions without repeated ODE solves per subject.

What carries the argument

The mixed-effect decomposition of each subject's vector field into a shared population Gaussian process and a subject-specific deviation Gaussian process, combined with state-space trajectory priors and virtual collocation observations for closed-form posterior updates.

If this is right

Population-level vector fields can be recovered more accurately when subject heterogeneity is modeled explicitly rather than ignored.
Subject-specific trajectories can be predicted with lower error than under single-system nonparametric ODE models.
Uncertainty quantification remains available for both the shared population dynamics and the individual deviations.
The method applies directly to pharmacometrics, systems biology, and epidemiology without requiring a parametric form for the vector field.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same state-space and virtual-observation construction could be applied to other nonparametric priors on vector fields if analogous efficient inference schemes exist.
Real patient or experimental time-series data with known heterogeneity would provide a direct test of whether the benchmark gains translate to practice.
Allowing the shared population process to be non-stationary or to depend on covariates would extend the model to time-varying or covariate-driven population dynamics.

Load-bearing premise

That subject vector fields can be usefully decomposed into a shared population component and a subject-specific deviation, both well captured by Gaussian process priors, and that virtual collocation observations suffice for accurate posterior inference without repeated ODE solves.

What would settle it

On the controlled heterogeneous ODE benchmarks, if population-field recovery error or subject-level trajectory prediction error is not lower than that of strong baselines, the claimed performance advantage would not hold.

Figures

Figures reproduced from arXiv: 2605.13088 by Harri L\"ahdesm\"aki, Julien Martinelli, Maksim Sinelnikov, M\'elanie Prague, Quentin Clairon.

**Figure 2.** Figure 2: Misspecified FHN setting. Population fields and forecasts when the true dynamics include a smooth residual term. The semi-mechanistic version of MEGPODE best recovers the corrected field and forecast dynamics. plus-subject decomposition; and MEGPODE-M, our semi-mechanistic variant with an FHN mean and GP residual (Appendix C.3). The mechanistic fit is constrained by the misspecified FHN family and accumula… view at source ↗

**Figure 3.** Figure 3: Ablations for MEGPODE (forecast setting). Top: hyperparameter ablations. Each panel varies a single hyperparameter around the default configuration, while keeping the rest of the training and evaluation pipeline fixed; the blue tick marks the default value. Mean ±0.5 std relative change with respect to the default setting shown, averaged over 10 seeds and the benchmark systems used in the ablation study. B… view at source ↗

read the original abstract

Dynamical modelling is central to many scientific domains, including pharmacometrics, systems biology, physiology, and epidemiology. In these settings, heterogeneity is often intrinsic: different subjects or units follow related but distinct continuous-time dynamics. Classical nonlinear mixed-effects Ordinary Differential Equation (ODE) models address this by combining population-level structure with subject-specific effects, but they rely on a parametric vector field and are therefore vulnerable to structural misspecification and unmodelled mechanisms. This motivates nonparametric approaches that can retain principled uncertainty quantification, yet existing nonparametric ODE methods typically assume a single shared dynamical system rather than an explicit mixed-effect hierarchy over subject-specific dynamics. We propose MEGPODE, a Bayesian nonparametric mixed-effect ODE model in which each subject's vector field is decomposed into a shared population component and a subject-specific deviation, both endowed with Gaussian process (GP) priors. To avoid repeated ODE solves per subject during training, we combine state-space GP trajectory priors with virtual collocation observations, yielding Kalman-smoothing trajectory updates and closed-form regressions for the vector fields. Across controlled heterogeneous ODE benchmarks spanning oscillatory, biomedical systems, MEGPODE improves population-field recovery and subject-level trajectory prediction relative to strong baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MEGPODE adds a mixed-effects GP layer to nonparametric ODE modeling and uses state-space collocation to keep inference tractable, with reported gains on heterogeneous benchmarks.

read the letter

The main point is that this paper builds a Bayesian nonparametric mixed-effects model for ODEs. Each subject's vector field splits into a shared population GP and a subject-specific deviation GP, then they fit it with Kalman smoothing on state-space trajectory priors plus virtual collocation points instead of repeated ODE solves. That combination is new relative to earlier single-system nonparametric ODE work and the benchmarks on oscillatory and biomedical examples show better population-field recovery and subject predictions than the baselines they compare against. The method keeps proper uncertainty and avoids parametric assumptions on the dynamics, which is the practical payoff for pharmacometrics or systems biology settings with real heterogeneity. The state-space route is a clean way to get closed-form updates for the GPs. The soft spot is the virtual collocation step itself. It enforces the ODE only at chosen points, so for nonlinear vector fields and sizable subject deviations the approximation error could leak into the population estimate. The stress-test concern about possible bias is reasonable and the paper would be stronger with explicit checks on collocation density and placement. The experiments look controlled but the abstract-level claims would benefit from more visible error bars and sensitivity results. This is for readers who already work with continuous-time models and need to handle individual variation without locking in a parametric form. A serious referee should see it because the modeling idea is coherent and the inference trick is reproducible even if the approximation needs scrutiny. I would send it to review.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces MEGPODE, a Bayesian nonparametric mixed-effect ODE model in which each subject's vector field decomposes into a shared population component and subject-specific deviation, both endowed with Gaussian process priors. Inference avoids repeated ODE solves by combining state-space GP trajectory priors with virtual collocation observations, yielding Kalman-smoothing trajectory updates and closed-form vector-field regressions. Experiments on controlled heterogeneous ODE benchmarks (oscillatory and biomedical) report improved population-field recovery and subject-level trajectory prediction relative to strong baselines.

Significance. If the central claims hold, the work would supply a computationally tractable nonparametric extension of mixed-effects ODE modeling that retains principled uncertainty quantification, addressing a clear gap between classical parametric nonlinear mixed-effects models and single-system nonparametric ODE methods. The closed-form Kalman-smoothing updates constitute a genuine efficiency gain that could enable broader adoption in pharmacometrics and systems biology.

major comments (2)

[Methods (virtual collocation and Kalman smoothing)] The virtual collocation approximation that enforces the ODE only at chosen discrete points is load-bearing for the claim of accurate posterior inference without repeated solves. For nonlinear vector fields and heterogeneous subject deviations, the approximation error depends on collocation density and placement; no quantitative bound or sensitivity analysis is provided to show that this surrogate does not systematically bias population-component recovery.
[Experiments and Results] Benchmark results claim consistent improvements in population-field recovery and subject-level prediction, yet no error bars, standard deviations across random seeds, or explicit data-exclusion rules are reported. This omission prevents assessment of whether the reported gains are statistically reliable or sensitive to experimental choices.

minor comments (2)

Notation for the state-space GP prior, virtual observations, and the precise form of the mixed-effect decomposition should be introduced with explicit equations on first use to improve readability.
[Abstract] The abstract refers to 'oscillatory, biomedical systems' without naming the concrete benchmark ODEs; listing them (e.g., Lotka-Volterra variants, pharmacokinetic models) would aid reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which highlight important aspects of our methods and experimental reporting. We address each major comment below and indicate planned revisions to the manuscript.

read point-by-point responses

Referee: [Methods (virtual collocation and Kalman smoothing)] The virtual collocation approximation that enforces the ODE only at chosen discrete points is load-bearing for the claim of accurate posterior inference without repeated solves. For nonlinear vector fields and heterogeneous subject deviations, the approximation error depends on collocation density and placement; no quantitative bound or sensitivity analysis is provided to show that this surrogate does not systematically bias population-component recovery.

Authors: We agree that the virtual collocation scheme is central to the efficiency claim and that its approximation quality for nonlinear heterogeneous dynamics merits further scrutiny. While our current experiments show stable recovery across the tested benchmarks, we do not provide a theoretical error bound. In the revised manuscript we will add a dedicated sensitivity study that varies collocation density and placement (including random and uniform grids) and reports the resulting changes in population-field recovery error and subject-level prediction metrics. revision: yes
Referee: [Experiments and Results] Benchmark results claim consistent improvements in population-field recovery and subject-level prediction, yet no error bars, standard deviations across random seeds, or explicit data-exclusion rules are reported. This omission prevents assessment of whether the reported gains are statistically reliable or sensitive to experimental choices.

Authors: We acknowledge that the absence of variability measures and explicit data-handling protocols limits the ability to judge statistical reliability. In the revision we will report all metrics with standard deviations computed over multiple random seeds, include error bars in the figures, and add a clear subsection describing the data-generation process, any exclusion criteria, and the precise train/test splits used for each benchmark. revision: yes

Circularity Check

0 steps flagged

No circularity: derivation uses external GP/Kalman methods and reports empirical gains

full rationale

The paper's core construction decomposes each subject's vector field into a shared population GP component plus subject-specific GP deviation, then applies standard state-space GP trajectory priors together with virtual collocation observations to obtain Kalman-smoothing updates and closed-form vector-field regressions. These building blocks (GP priors, state-space representation, Kalman smoothing, collocation) are standard techniques whose properties are independent of the present work. The reported improvements in population-field recovery and subject-level prediction are measured on external heterogeneous ODE benchmarks against separate baselines; they do not reduce to any quantity defined by the model's own fitted parameters or by a self-citation chain. No step matches the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based on abstract only; no explicit free parameters, axioms, or invented entities are stated beyond standard GP and state-space assumptions.

pith-pipeline@v0.9.0 · 5521 in / 1160 out tokens · 53014 ms · 2026-05-14T19:44:47.057371+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages

[1]

Prediction of long- term humoral response induced by the two-dose heterologous ad26.zebov, mva-bn-filo vaccine against ebola.npj Vaccines, 2023

Marie Alexandre, Mélanie Prague, Chelsea McLean, Viki Bockstal, Macaya Douoguih, Rodolphe Thiébaut, Thierry Effelterre, Laura Solforosi, and Anna Dari. Prediction of long- term humoral response induced by the two-dose heterologous ad26.zebov, mva-bn-filo vaccine against ebola.npj Vaccines, 2023

work page 2023
[2]

An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation

Jonas Arruda, Yannik Schälte, Clemens Peiter, Olga Teplytska, Ulrich Jaehde, and Jan Hase- nauer. An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation. InProceedings of the 41st International Conference on Machine Learning, 2024

work page 2024
[3]

Springer, 2006

Christopher M Bishop and Nasser M Nasrabadi.Pattern recognition and machine learning. Springer, 2006

work page 2006
[4]

Cambridge University Press, 2004

Stephen Boyd and Lieven Vandenberghe.Convex Optimization. Cambridge University Press, 2004

work page 2004
[5]

Dominic Stefan Bräm, Bernhard Steiert, Marc Pfister, Britta Steffens, and Gilbert Koch. Low- dimensional neural ordinary differential equations accounting for inter-individual variability implemented in monolix and nonmem.CPT: Pharmacometrics & Systems Pharmacology, 2025

work page 2025
[6]

Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, and David Duvenaud. Neural ordinary differential equations. InAdvances in Neural Information Processing Systems, 2018

work page 2018
[7]

Deep mixed effect model using gaussian processes: A personalized and reliable prediction for healthcare.Proceedings of the AAAI Conference on Artificial Intelligence, 2020

Ingyo Chung, Saehoon Kim, Juho Lee, Kwang Joon Kim, Sung Ju Hwang, and Eunho Yang. Deep mixed effect model using gaussian processes: A personalized and reliable prediction for healthcare.Proceedings of the AAAI Conference on Artificial Intelligence, 2020

work page 2020
[8]

Modeling the kinet- ics of the neutralizing antibody response against sars-cov-2 variants after several administrations of bnt162b2.PLoS Computational Biology, 2023

Quentin Clairon, Mélanie Prague, Delphine Planas, Timothée Bruel, Laurent Hocqueloux, Thierry Prazuck, Olivier Schwartz, Rodolphe Thiébaut, and Jérémie Guedj. Modeling the kinet- ics of the neutralizing antibody response against sars-cov-2 variants after several administrations of bnt162b2.PLoS Computational Biology, 2023

work page 2023
[9]

Estimation for dynamical systems using a population-based kalman filter–applications in computational biology.MathematicS In Action, 2022

Annabelle Collin, Mélanie Prague, and Philippe Moireau. Estimation for dynamical systems using a population-based kalman filter–applications in computational biology.MathematicS In Action, 2022

work page 2022
[10]

Parameter estimation in nonlinear mixed effect models using saemix, an r implementation of the saem algorithm.Journal of Statistical Software, 2017

Emmanuelle Comets, Audrey Lavenu, and Marc Lavielle. Parameter estimation in nonlinear mixed effect models using saemix, an r implementation of the saem algorithm.Journal of Statistical Software, 2017

work page 2017
[11]

Ode parameter inference using adaptive gradient matching with gaussian processes

Frank Dondelinger, Dirk Husmeier, Simon Rogers, and Maurizio Filippone. Ode parameter inference using adaptive gradient matching with gaussian processes. InArtificial intelligence and statistics, 2013

work page 2013
[12]

Discovering physics laws of dynamical systems via invariant function learning

Shurui Gui, Xiner Li, and Shuiwang Ji. Discovering physics laws of dynamical systems via invariant function learning. InProceedings of the 42nd International Conference on Machine Learning, 2025

work page 2025
[13]

Physics-informed variational state- space Gaussian processes

Oliver Hamelijnck, Arno Solin, and Theodoros Damoulas. Physics-informed variational state- space Gaussian processes. InAdvances in Neural Information Processing Systems, 2024

work page 2024
[14]

Variational multiple shooting for bayesian odes with gaussian processes

Pashupati Hegde, Ça˘gatay Yıldız, Harri Lähdesmäki, Samuel Kaski, and Markus Heinonen. Variational multiple shooting for bayesian odes with gaussian processes. InUncertainty in Artificial Intelligence, 2022

work page 2022
[15]

Learning unknown ODE models with gaussian processes

Markus Heinonen, Cagatay Yildiz, Henrik Mannerstrom, Jukka Intosalmi, and Harri Lahdes- maki. Learning unknown ODE models with gaussian processes. InProceedings of the 35th International Conference on Machine Learning, 2018

work page 2018
[16]

Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 2020

Patrick Kidger, James Morrill, James Foster, and Terry Lyons. Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 2020

work page 2020
[17]

Generalizing to new physical systems via context-informed dynamics model

Matthieu Kirchmeyer, Yuan Yin, Jérémie Donà, Nicolas Baskiotis, Alain Rakotomamonjy, and Patrick Gallinari. Generalizing to new physical systems via context-informed dynamics model. InInternational conference on machine learning, 2022. 11

work page 2022
[18]

CRC press, 2014

Marc Lavielle.Mixed Effects Models for the Population Approach: Models, Tasks, Methods and Tools. CRC press, 2014

work page 2014
[19]

MAGMA: inference and prediction using multi-task Gaussian processes with common mean.Machine Learning, 2022

Arthur Leroy, Pierre Latouche, Benjamin Guedj, and Servane Gey. MAGMA: inference and prediction using multi-task Gaussian processes with common mean.Machine Learning, 2022

work page 2022
[20]

Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations.arXiv preprint arXiv:2601.17400, 2026

Zhe Li, Mélanie Prague, Rodolphe Thiébaut, and Quentin Clairon. Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations.arXiv preprint arXiv:2601.17400, 2026

work page arXiv 2026
[21]

Lindstrom and Douglas M

Mary J. Lindstrom and Douglas M. Bates. Nonlinear mixed effects models for repeated measures data.Biometrics, 1990

work page 1990
[22]

Autoip: A united framework to integrate physics into gaussian processes

Da Long, Zheng Wang, Aditi Krishnapriyan, Robert Kirby, Shandian Zhe, and Michael Mahoney. Autoip: A united framework to integrate physics into gaussian processes. InInternational Conference on Machine Learning, 2022

work page 2022
[23]

Constraining the dynamics of deep probabilistic models

Marco Lorenzi and Maurizio Filippone. Constraining the dynamics of deep probabilistic models. InProceedings of the 35th International Conference on Machine Learning, 2018

work page 2018
[24]

Modeling formalisms in systems biology.AMB express, 2011

Daniel Machado, Rafael S Costa, Miguel Rocha, Eugénio C Ferreira, Bruce Tidor, and Isabel Rocha. Modeling formalisms in systems biology.AMB express, 2011

work page 2011
[25]

Position: Biology is the challenge physics-informed ML needs to evolve

Julien Martinelli. Position: Biology is the challenge physics-informed ML needs to evolve. In The Thirty-Ninth Annual Conference on Neural Information Processing Systems Position Paper Track, 2025

work page 2025
[26]

Metaphysica: Improving OOD robustness in physics-informed machine learning

S Chandra Mouli, Muhammad Alam, and Bruno Ribeiro. Metaphysica: Improving OOD robustness in physics-informed machine learning. InThe Twelfth International Conference on Learning Representations, 2024

work page 2024
[27]

A variational approximation for analyzing the dynamics of panel data

Jurijs Nazarovs, Rudrasis Chakraborty, Songwong Tasneeyapant, Sathya Ravi, and Vikas Singh. A variational approximation for analyzing the dynamics of panel data. InProceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

work page 2021
[28]

Amortized in-context mixed effect transformer models: A zero-shot approach for pharmacokinetics

Cesar Ojeda, Niklas Hartung, Wilhelm Huisinga, and Ramses J Sanchez. Amortized in-context mixed effect transformer models: A zero-shot approach for pharmacokinetics. InThe 29th International Conference on Artificial Intelligence and Statistics, 2026

work page 2026
[29]

Random features for large-scale kernel machines

Ali Rahimi and Benjamin Recht. Random features for large-scale kernel machines. InAdvances in Neural Information Processing Systems, 2007

work page 2007
[30]

Carl Edward Rasmussen and Christopher K. I. Williams.Gaussian Processes for Machine Learning. MIT Press, 2006

work page 2006
[31]

Yulia Rubanova, Ricky T. Q. Chen, and David Duvenaud. Latent ODEs for irregularly-sampled time series. InAdvances in Neural Information Processing Systems, 2019

work page 2019
[32]

Cambridge University Press, 2013

Simo Särkkä.Bayesian Filtering and Smoothing. Cambridge University Press, 2013

work page 2013
[33]

Cambridge University Press, 2019

Simo Särkkä and Arno Solin.Applied Stochastic Differential Equations. Cambridge University Press, 2019

work page 2019
[34]

Variational grey-box dynamics matching

Gurjeet Sangra Singh, Frantzeska Lavda, Giangiacomo Mercatali, and Alexandros Kalousis. Variational grey-box dynamics matching. InProceedings of The 29th International Conference on Artificial Intelligence and Statistics, 2026

work page 2026
[35]

Hilbert space methods for reduced-rank Gaussian process regression.Statistics and Computing, 2020

Arno Solin and Simo Särkkä. Hilbert space methods for reduced-rank Gaussian process regression.Statistics and Computing, 2020

work page 2020
[36]

Deep grey-box modeling with adaptive data-driven models toward trustworthy estimation of theory-driven models

Naoya Takeishi and Alexandros Kalousis. Deep grey-box modeling with adaptive data-driven models toward trustworthy estimation of theory-driven models. InProceedings of The 26th International Conference on Artificial Intelligence and Statistics, 2023

work page 2023
[37]

American Mathematical Soc., 2012

Gerald Teschl.Ordinary differential equations and dynamical systems. American Mathematical Soc., 2012

work page 2012
[38]

Probabilistic solutions to ordinary differential equations as nonlinear bayesian filtering: a new perspective.Statistics and Computing, 2019

Filip Tronarp, Hans Kersting, Simo Särkkä, and Philipp Hennig. Probabilistic solutions to ordinary differential equations as nonlinear bayesian filtering: a new perspective.Statistics and Computing, 2019. 12

work page 2019
[39]

Estimating mixed-effects differential equation models.Statistics and Computing, 2014

Liangliang Wang, Jiguo Cao, James O Ramsay, DM Burger, CJL Laporte, and Jürgen K Rockstroh. Estimating mixed-effects differential equation models.Statistics and Computing, 2014

work page 2014
[40]

Odin: Ode-informed regression for parameter and state inference in time- continuous dynamical systems

Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, and Stefan Bauer. Odin: Ode-informed regression for parameter and state inference in time- continuous dynamical systems. InProceedings of the AAAI Conference on Artificial Intelligence, 2020

work page 2020
[41]

Shihao Yang, Samuel W. K. Wong, and S. C. Kou. Inference of dynamic systems from noisy and sparse data via manifold-constrained gaussian processes.Proceedings of the National Academy of Sciences, 2021

work page 2021
[42]

Doubly mixed-effects gaussian process regression

Jun Ho Yoon, Daniel P Jeong, and Seyoung Kim. Doubly mixed-effects gaussian process regression. InInternational Conference on Artificial Intelligence and Statistics, 2022

work page 2022
[43]

Model-based smoothing with integrated wiener processes and overlapping splines.Journal of Computational and Graphical Statistics, 2024

Ziang Zhang, Alex Stringer, Patrick Brown, and Jamie Stafford. Model-based smoothing with integrated wiener processes and overlapping splines.Journal of Computational and Graphical Statistics, 2024

work page 2024
[44]

q(xi,0) Ni−1Y n=1 q(si,n) # . The resulting population ELBO is LGPODE-pop = MX i=1

Bob Junyi Zou, Matthew E Levine, Dessi P. Zaharieva, Ramesh Johari, and Emily Fox. Hybrid2 neural ODE causal modeling and an application to glycemic response. InProceedings of the 41st International Conference on Machine Learning, 2024. 13 Appendix A Supplementary figures 15 B Baselines and hyperparameters 18 B.1 Concurrent baselines . . . . . . . . . . ....

work page 2024

[1] [1]

Prediction of long- term humoral response induced by the two-dose heterologous ad26.zebov, mva-bn-filo vaccine against ebola.npj Vaccines, 2023

Marie Alexandre, Mélanie Prague, Chelsea McLean, Viki Bockstal, Macaya Douoguih, Rodolphe Thiébaut, Thierry Effelterre, Laura Solforosi, and Anna Dari. Prediction of long- term humoral response induced by the two-dose heterologous ad26.zebov, mva-bn-filo vaccine against ebola.npj Vaccines, 2023

work page 2023

[2] [2]

An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation

Jonas Arruda, Yannik Schälte, Clemens Peiter, Olga Teplytska, Ulrich Jaehde, and Jan Hase- nauer. An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation. InProceedings of the 41st International Conference on Machine Learning, 2024

work page 2024

[3] [3]

Springer, 2006

Christopher M Bishop and Nasser M Nasrabadi.Pattern recognition and machine learning. Springer, 2006

work page 2006

[4] [4]

Cambridge University Press, 2004

Stephen Boyd and Lieven Vandenberghe.Convex Optimization. Cambridge University Press, 2004

work page 2004

[5] [5]

Dominic Stefan Bräm, Bernhard Steiert, Marc Pfister, Britta Steffens, and Gilbert Koch. Low- dimensional neural ordinary differential equations accounting for inter-individual variability implemented in monolix and nonmem.CPT: Pharmacometrics & Systems Pharmacology, 2025

work page 2025

[6] [6]

Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, and David Duvenaud. Neural ordinary differential equations. InAdvances in Neural Information Processing Systems, 2018

work page 2018

[7] [7]

Deep mixed effect model using gaussian processes: A personalized and reliable prediction for healthcare.Proceedings of the AAAI Conference on Artificial Intelligence, 2020

Ingyo Chung, Saehoon Kim, Juho Lee, Kwang Joon Kim, Sung Ju Hwang, and Eunho Yang. Deep mixed effect model using gaussian processes: A personalized and reliable prediction for healthcare.Proceedings of the AAAI Conference on Artificial Intelligence, 2020

work page 2020

[8] [8]

Modeling the kinet- ics of the neutralizing antibody response against sars-cov-2 variants after several administrations of bnt162b2.PLoS Computational Biology, 2023

Quentin Clairon, Mélanie Prague, Delphine Planas, Timothée Bruel, Laurent Hocqueloux, Thierry Prazuck, Olivier Schwartz, Rodolphe Thiébaut, and Jérémie Guedj. Modeling the kinet- ics of the neutralizing antibody response against sars-cov-2 variants after several administrations of bnt162b2.PLoS Computational Biology, 2023

work page 2023

[9] [9]

Estimation for dynamical systems using a population-based kalman filter–applications in computational biology.MathematicS In Action, 2022

Annabelle Collin, Mélanie Prague, and Philippe Moireau. Estimation for dynamical systems using a population-based kalman filter–applications in computational biology.MathematicS In Action, 2022

work page 2022

[10] [10]

Parameter estimation in nonlinear mixed effect models using saemix, an r implementation of the saem algorithm.Journal of Statistical Software, 2017

Emmanuelle Comets, Audrey Lavenu, and Marc Lavielle. Parameter estimation in nonlinear mixed effect models using saemix, an r implementation of the saem algorithm.Journal of Statistical Software, 2017

work page 2017

[11] [11]

Ode parameter inference using adaptive gradient matching with gaussian processes

Frank Dondelinger, Dirk Husmeier, Simon Rogers, and Maurizio Filippone. Ode parameter inference using adaptive gradient matching with gaussian processes. InArtificial intelligence and statistics, 2013

work page 2013

[12] [12]

Discovering physics laws of dynamical systems via invariant function learning

Shurui Gui, Xiner Li, and Shuiwang Ji. Discovering physics laws of dynamical systems via invariant function learning. InProceedings of the 42nd International Conference on Machine Learning, 2025

work page 2025

[13] [13]

Physics-informed variational state- space Gaussian processes

Oliver Hamelijnck, Arno Solin, and Theodoros Damoulas. Physics-informed variational state- space Gaussian processes. InAdvances in Neural Information Processing Systems, 2024

work page 2024

[14] [14]

Variational multiple shooting for bayesian odes with gaussian processes

Pashupati Hegde, Ça˘gatay Yıldız, Harri Lähdesmäki, Samuel Kaski, and Markus Heinonen. Variational multiple shooting for bayesian odes with gaussian processes. InUncertainty in Artificial Intelligence, 2022

work page 2022

[15] [15]

Learning unknown ODE models with gaussian processes

Markus Heinonen, Cagatay Yildiz, Henrik Mannerstrom, Jukka Intosalmi, and Harri Lahdes- maki. Learning unknown ODE models with gaussian processes. InProceedings of the 35th International Conference on Machine Learning, 2018

work page 2018

[16] [16]

Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 2020

Patrick Kidger, James Morrill, James Foster, and Terry Lyons. Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 2020

work page 2020

[17] [17]

Generalizing to new physical systems via context-informed dynamics model

Matthieu Kirchmeyer, Yuan Yin, Jérémie Donà, Nicolas Baskiotis, Alain Rakotomamonjy, and Patrick Gallinari. Generalizing to new physical systems via context-informed dynamics model. InInternational conference on machine learning, 2022. 11

work page 2022

[18] [18]

CRC press, 2014

Marc Lavielle.Mixed Effects Models for the Population Approach: Models, Tasks, Methods and Tools. CRC press, 2014

work page 2014

[19] [19]

MAGMA: inference and prediction using multi-task Gaussian processes with common mean.Machine Learning, 2022

Arthur Leroy, Pierre Latouche, Benjamin Guedj, and Servane Gey. MAGMA: inference and prediction using multi-task Gaussian processes with common mean.Machine Learning, 2022

work page 2022

[20] [20]

Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations.arXiv preprint arXiv:2601.17400, 2026

Zhe Li, Mélanie Prague, Rodolphe Thiébaut, and Quentin Clairon. Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations.arXiv preprint arXiv:2601.17400, 2026

work page arXiv 2026

[21] [21]

Lindstrom and Douglas M

Mary J. Lindstrom and Douglas M. Bates. Nonlinear mixed effects models for repeated measures data.Biometrics, 1990

work page 1990

[22] [22]

Autoip: A united framework to integrate physics into gaussian processes

Da Long, Zheng Wang, Aditi Krishnapriyan, Robert Kirby, Shandian Zhe, and Michael Mahoney. Autoip: A united framework to integrate physics into gaussian processes. InInternational Conference on Machine Learning, 2022

work page 2022

[23] [23]

Constraining the dynamics of deep probabilistic models

Marco Lorenzi and Maurizio Filippone. Constraining the dynamics of deep probabilistic models. InProceedings of the 35th International Conference on Machine Learning, 2018

work page 2018

[24] [24]

Modeling formalisms in systems biology.AMB express, 2011

Daniel Machado, Rafael S Costa, Miguel Rocha, Eugénio C Ferreira, Bruce Tidor, and Isabel Rocha. Modeling formalisms in systems biology.AMB express, 2011

work page 2011

[25] [25]

Position: Biology is the challenge physics-informed ML needs to evolve

Julien Martinelli. Position: Biology is the challenge physics-informed ML needs to evolve. In The Thirty-Ninth Annual Conference on Neural Information Processing Systems Position Paper Track, 2025

work page 2025

[26] [26]

Metaphysica: Improving OOD robustness in physics-informed machine learning

S Chandra Mouli, Muhammad Alam, and Bruno Ribeiro. Metaphysica: Improving OOD robustness in physics-informed machine learning. InThe Twelfth International Conference on Learning Representations, 2024

work page 2024

[27] [27]

A variational approximation for analyzing the dynamics of panel data

Jurijs Nazarovs, Rudrasis Chakraborty, Songwong Tasneeyapant, Sathya Ravi, and Vikas Singh. A variational approximation for analyzing the dynamics of panel data. InProceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

work page 2021

[28] [28]

Amortized in-context mixed effect transformer models: A zero-shot approach for pharmacokinetics

Cesar Ojeda, Niklas Hartung, Wilhelm Huisinga, and Ramses J Sanchez. Amortized in-context mixed effect transformer models: A zero-shot approach for pharmacokinetics. InThe 29th International Conference on Artificial Intelligence and Statistics, 2026

work page 2026

[29] [29]

Random features for large-scale kernel machines

Ali Rahimi and Benjamin Recht. Random features for large-scale kernel machines. InAdvances in Neural Information Processing Systems, 2007

work page 2007

[30] [30]

Carl Edward Rasmussen and Christopher K. I. Williams.Gaussian Processes for Machine Learning. MIT Press, 2006

work page 2006

[31] [31]

Yulia Rubanova, Ricky T. Q. Chen, and David Duvenaud. Latent ODEs for irregularly-sampled time series. InAdvances in Neural Information Processing Systems, 2019

work page 2019

[32] [32]

Cambridge University Press, 2013

Simo Särkkä.Bayesian Filtering and Smoothing. Cambridge University Press, 2013

work page 2013

[33] [33]

Cambridge University Press, 2019

Simo Särkkä and Arno Solin.Applied Stochastic Differential Equations. Cambridge University Press, 2019

work page 2019

[34] [34]

Variational grey-box dynamics matching

Gurjeet Sangra Singh, Frantzeska Lavda, Giangiacomo Mercatali, and Alexandros Kalousis. Variational grey-box dynamics matching. InProceedings of The 29th International Conference on Artificial Intelligence and Statistics, 2026

work page 2026

[35] [35]

Hilbert space methods for reduced-rank Gaussian process regression.Statistics and Computing, 2020

Arno Solin and Simo Särkkä. Hilbert space methods for reduced-rank Gaussian process regression.Statistics and Computing, 2020

work page 2020

[36] [36]

Deep grey-box modeling with adaptive data-driven models toward trustworthy estimation of theory-driven models

Naoya Takeishi and Alexandros Kalousis. Deep grey-box modeling with adaptive data-driven models toward trustworthy estimation of theory-driven models. InProceedings of The 26th International Conference on Artificial Intelligence and Statistics, 2023

work page 2023

[37] [37]

American Mathematical Soc., 2012

Gerald Teschl.Ordinary differential equations and dynamical systems. American Mathematical Soc., 2012

work page 2012

[38] [38]

Probabilistic solutions to ordinary differential equations as nonlinear bayesian filtering: a new perspective.Statistics and Computing, 2019

Filip Tronarp, Hans Kersting, Simo Särkkä, and Philipp Hennig. Probabilistic solutions to ordinary differential equations as nonlinear bayesian filtering: a new perspective.Statistics and Computing, 2019. 12

work page 2019

[39] [39]

Estimating mixed-effects differential equation models.Statistics and Computing, 2014

Liangliang Wang, Jiguo Cao, James O Ramsay, DM Burger, CJL Laporte, and Jürgen K Rockstroh. Estimating mixed-effects differential equation models.Statistics and Computing, 2014

work page 2014

[40] [40]

Odin: Ode-informed regression for parameter and state inference in time- continuous dynamical systems

Philippe Wenk, Gabriele Abbati, Michael A Osborne, Bernhard Schölkopf, Andreas Krause, and Stefan Bauer. Odin: Ode-informed regression for parameter and state inference in time- continuous dynamical systems. InProceedings of the AAAI Conference on Artificial Intelligence, 2020

work page 2020

[41] [41]

Shihao Yang, Samuel W. K. Wong, and S. C. Kou. Inference of dynamic systems from noisy and sparse data via manifold-constrained gaussian processes.Proceedings of the National Academy of Sciences, 2021

work page 2021

[42] [42]

Doubly mixed-effects gaussian process regression

Jun Ho Yoon, Daniel P Jeong, and Seyoung Kim. Doubly mixed-effects gaussian process regression. InInternational Conference on Artificial Intelligence and Statistics, 2022

work page 2022

[43] [43]

Model-based smoothing with integrated wiener processes and overlapping splines.Journal of Computational and Graphical Statistics, 2024

Ziang Zhang, Alex Stringer, Patrick Brown, and Jamie Stafford. Model-based smoothing with integrated wiener processes and overlapping splines.Journal of Computational and Graphical Statistics, 2024

work page 2024

[44] [44]

q(xi,0) Ni−1Y n=1 q(si,n) # . The resulting population ELBO is LGPODE-pop = MX i=1

Bob Junyi Zou, Matthew E Levine, Dessi P. Zaharieva, Ramesh Johari, and Emily Fox. Hybrid2 neural ODE causal modeling and an application to glycemic response. InProceedings of the 41st International Conference on Machine Learning, 2024. 13 Appendix A Supplementary figures 15 B Baselines and hyperparameters 18 B.1 Concurrent baselines . . . . . . . . . . ....

work page 2024