A machine learning framework for uncovering stochastic nonlinear dynamics from noisy data
Pith reviewed 2026-05-10 19:52 UTC · model grok-4.3
The pith
A hybrid method recovers the exact symbolic form of stochastic differential equations from noisy data while quantifying parameter uncertainty.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that a hybrid symbolic regression and probabilistic machine learning framework recovers the symbolic deterministic dynamics of stochastic differential equations while simultaneously inferring the structure and uncertainty of the noise from the same noisy time series, without requiring prior assumptions on the functional forms of either part.
What carries the argument
Hybrid framework pairing deep symbolic regression for deterministic equation discovery with Gaussian-process maximum likelihood estimation for separate noise and parameter uncertainty modeling.
Load-bearing premise
The deterministic dynamics and the noise can be modeled separately from the same observations without any assumed functional forms for either, and the symbolic regression step will still recover the correct equations despite the noise.
What would settle it
Apply the method to synthetic trajectories generated from a known stochastic van der Pol oscillator with controlled noise amplitude and check whether the recovered symbolic equation matches the true deterministic part while the noise model correctly isolates the added stochastic term.
Figures
read the original abstract
Modeling real-world systems requires accounting for noise - whether it arises from unpredictable fluctuations in financial markets, irregular rhythms in biological systems, or environmental variability in ecosystems. While the behavior of such systems can often be described by stochastic differential equations, a central challenge is understanding how noise influences the inference of system parameters and dynamics from data. Traditional symbolic regression methods can uncover governing equations but typically ignore uncertainty. Conversely, Gaussian processes provide principled uncertainty quantification but offer little insight into the underlying dynamics. In this work, we bridge this gap with a hybrid symbolic regression-probabilistic machine learning framework that recovers the symbolic form of the governing equations while simultaneously inferring uncertainty in the system parameters. The framework combines deep symbolic regression with Gaussian process-based maximum likelihood estimation to separately model the deterministic dynamics and the noise structure, without requiring prior assumptions about their functional forms. We verify the approach on numerical benchmarks, including harmonic, Duffing, and van der Pol oscillators, and validate it on an experimental system of coupled biological oscillators exhibiting synchronization, where the algorithm successfully identifies both the symbolic and stochastic components. The framework is data-efficient, requiring as few as 100-1000 data points, and robust to noise - demonstrating its broad potential in domains where uncertainty is intrinsic and both the structure and variability of dynamical systems must be understood.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a hybrid framework that combines deep symbolic regression (DSR) with Gaussian process maximum likelihood estimation (GP-MLE) to recover the symbolic form of governing stochastic differential equations from noisy time-series data while simultaneously inferring uncertainty in the system parameters. The approach is presented as requiring no prior assumptions on the functional forms of either the deterministic drift or the noise structure. Validation is reported on numerical benchmarks (harmonic, Duffing, and van der Pol oscillators) and on experimental data from coupled biological oscillators, with claims of data efficiency (100–1000 points) and robustness to noise.
Significance. If the separation of deterministic dynamics from noise structure can be achieved reliably, the work would usefully bridge symbolic regression methods (which typically ignore uncertainty) and probabilistic approaches (which typically lack interpretable structure). The experimental validation on biological synchronization data and the reported data efficiency are concrete strengths that would support applicability in domains where both equation discovery and uncertainty quantification matter. The hybrid construction itself is a reasonable engineering response to the limitations of each component method taken alone.
major comments (2)
- [Abstract / framework description] Abstract and framework description: The repeated claim that the method proceeds 'without requiring prior assumptions about their functional forms' is load-bearing for the stated novelty. Deep symbolic regression searches over a discrete, fixed library of operators and terminals, while GP-MLE conditions on a chosen kernel family; both choices constitute functional priors. The manuscript must clarify the scope of this claim, discuss the generality of the chosen library and kernels, and address failure modes when the true drift or noise process lies outside those families. Without such discussion the hybrid method reduces to a library-based procedure augmented with uncertainty quantification rather than a prior-free discovery procedure.
- [Methods (framework description)] Methods section on component separation: The central modeling assumption—that deterministic dynamics (via DSR) and noise structure (via GP-MLE) can be separately recovered from the same noisy observations without one contaminating the other—requires explicit justification and algorithmic detail. It is unclear how the two sub-problems are coupled or decoupled during optimization and whether noise can be misattributed to the drift term (or vice versa). A precise description of the joint or alternating inference procedure, together with any convergence or identifiability arguments, is needed to support the separation claim.
minor comments (2)
- [Abstract] The abstract is lengthy and contains several overlapping sentences; condensing it would improve readability while preserving the core claims.
- [Numerical experiments] Quantitative recovery metrics (e.g., equation recovery rate, parameter error with uncertainty intervals, comparison against pure DSR or pure GP baselines) should be reported more prominently for the numerical benchmarks to allow direct assessment of performance.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments. These have highlighted important points regarding the framing of our claims and the clarity of the methodological description. We address each major comment below and commit to revisions that will strengthen the manuscript without altering its core contributions.
read point-by-point responses
-
Referee: [Abstract / framework description] Abstract and framework description: The repeated claim that the method proceeds 'without requiring prior assumptions about their functional forms' is load-bearing for the stated novelty. Deep symbolic regression searches over a discrete, fixed library of operators and terminals, while GP-MLE conditions on a chosen kernel family; both choices constitute functional priors. The manuscript must clarify the scope of this claim, discuss the generality of the chosen library and kernels, and address failure modes when the true drift or noise process lies outside those families. Without such discussion the hybrid method reduces to a library-based procedure augmented with uncertainty quantification rather than a prior-free discovery procedure.
Authors: We agree that the phrasing 'without requiring prior assumptions about their functional forms' is imprecise and risks overstating the method's generality. The approach does rely on a fixed operator library for DSR and a kernel family for GP-MLE. In the revised manuscript we will update the abstract, introduction, and framework description to state that the method avoids strong parametric assumptions on the specific forms of the drift and diffusion terms, while still depending on the choice of a general library and kernel. We will explicitly specify the libraries and kernels used in the experiments, discuss their breadth for common classes of stochastic nonlinear systems, and add a dedicated limitations subsection that examines failure modes (including misidentification) when the true functions lie outside these families. These changes will better contextualize the hybrid method's novelty. revision: yes
-
Referee: [Methods (framework description)] Methods section on component separation: The central modeling assumption—that deterministic dynamics (via DSR) and noise structure (via GP-MLE) can be separately recovered from the same noisy observations without one contaminating the other—requires explicit justification and algorithmic detail. It is unclear how the two sub-problems are coupled or decoupled during optimization and whether noise can be misattributed to the drift term (or vice versa). A precise description of the joint or alternating inference procedure, together with any convergence or identifiability arguments, is needed to support the separation claim.
Authors: We acknowledge that the current Methods section provides insufficient detail on the separation procedure and its robustness. The revised manuscript will expand this section with a precise algorithmic description of the inference workflow, clarifying the sequence or alternation between DSR and GP-MLE steps and how they are decoupled in practice. We will add discussion of potential noise-drift misattribution, supported by additional analysis of the benchmark results, and include a brief treatment of identifiability under the modeling assumptions. While full theoretical convergence guarantees may remain limited, we will strengthen the empirical justification and note this as an area for future analysis. revision: yes
Circularity Check
No circularity; hybrid framework derivation remains independent of its inputs
full rationale
The paper describes a hybrid approach that combines deep symbolic regression for deterministic dynamics with Gaussian process MLE for noise structure. No step in the abstract or described framework reduces by construction to a self-definition, a fitted parameter renamed as prediction, or a load-bearing self-citation chain. The central separation of deterministic and stochastic components is presented as an empirical procedure whose correctness is checked against external numerical benchmarks (harmonic, Duffing, van der Pol) and independent experimental oscillator data rather than being tautological with the method's own library or kernel choices. The 'no prior assumptions' phrasing is an overstatement given the fixed operator library and kernel family, but this is a claim-correctness issue, not a circular reduction of the derivation itself.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Data is generated from a stochastic differential equation whose deterministic and noise components can be modeled separately
Forward citations
Cited by 1 Pith paper
-
Experimental Quantification of Nonlinear Mode Coupling in Nanomechanical Resonators using Multi-tone Excitation
A multi-tone excitation technique with sideband analysis quantifies ten pairwise nonlinear coupling coefficients across five modes in tensioned nanostrings, producing experimental reduced-order models that match finit...
Reference graph
Works this paper leans on
-
[1]
The butterfly effect.World Scientific Series on Nonlinear Science Series A, 39:91–94, 2000
Edward Lorenz. The butterfly effect.World Scientific Series on Nonlinear Science Series A, 39:91–94, 2000
work page 2000
-
[2]
Huaiyu Tian, Yonghong Liu, Yidan Li, Chieh-Hsi Wu, Bin Chen, Moritz UG Kraemer, Bingying Li, Jun Cai, Bo Xu, Qiqi Yang, et al. An investigation of transmission control measures during the first 50 days of the covid-19 epidemic in china.Science, 368(6491):638–642, 2020
work page 2020
-
[3]
Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. Discovering governing equations from data by sparse identification of nonlinear dynamical systems.Proceedings of the National Academy of Sciences of the United States of America, 113:3932–3937, 4 2016
work page 2016
-
[4]
Niall M Mangan, Travis Askham, Steven L Brunton, J Nathan Kutz, and Joshua L Proctor. Model selection for hybrid dynamical systems via sparse regression.Proceedings of the Royal Society A, 475(2223):20180534, 2019. 21 Learning Stochastic Nonlinear Dynamics from Noisy DataA Preprint
work page 2019
-
[5]
Urban Fasel, J Nathan Kutz, Bingni W Brunton, and Steven L Brunton. Ensemble-sindy: Robust sparse model discovery in the low-data, high-noise limit, with active learning and control.Proceedings of the Royal Society A, 478(2260):20210904, 2022
work page 2022
-
[6]
gplearn: Genetic programming in python, with a scikit-learn inspired api
Trevor Stephens. gplearn: Genetic programming in python, with a scikit-learn inspired api. InProceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation, 2023
work page 2023
-
[7]
Marco Virgolin, Tanja Alderliesten, and Peter Bosman. Linear scaling with and within semantic backpropagation-based genetic programming for symbolic regression. pages 1084–1092, 07 2019
work page 2019
-
[8]
M. Virgolin, T. Alderliesten, C. Witteveen, and P. A. N. Bosman. Improving model-based genetic programming for symbolic regression of small expressions.Evolutionary Computation, 29(2):211–237, 2021
work page 2021
-
[9]
Operon c++: An efficient genetic programming framework for symbolic regression
Bogdan Burlacu, Gabriel Kronberger, and Michael Kommenda. Operon c++: An efficient genetic programming framework for symbolic regression. InProceedings of the 2020 Genetic and Evolution- ary Computation Conference Companion, GECCO ’20, page 1562–1570, New York, NY, USA, 2020. Association for Computing Machinery
work page 2020
-
[10]
Brenden K. Petersen, Mikel Landajuela, T. Nathan Mundhenk, Claudio P. Santiago, Soo K. Kim, and Joanne T. Kim. Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients, 2021
work page 2021
-
[11]
Mikel Landajuela, Chak Shing Lee, Jiachen Yang, Ruben Glatt, Claudio P Santiago, Ignacio Aravena, Terrell Mundhenk, Garrett Mulcahy, and Brenden K Petersen. A unified framework for deep symbolic regression.Advances in Neural Information Processing Systems, 35:33985–33998, 2022
work page 2022
-
[12]
Odeformer: Symbolic regression of dynamical systems with transformers, 2023
Stéphane d’Ascoli, Sören Becker, Alexander Mathis, Philippe Schwaller, and Niki Kilbertus. Odeformer: Symbolic regression of dynamical systems with transformers, 2023
work page 2023
-
[13]
William La Cava, Patryk Orzechowski, Bogdan Burlacu, Fabrício Olivetti de França, Marco Virgolin, Ying Jin, Michael Kommenda, and Jason H. Moore. Contemporary symbolic regression methods and their relative performance, 2021
work page 2021
-
[14]
Xiaolin Tang, Kai Yang, Hong Wang, Jiahang Wu, Yechen Qin, Wenhao Yu, and Dongpu Cao. Prediction- uncertainty-aware decision-making for autonomous vehicles.IEEE Transactions on Intelligent Vehicles, 7(4):849–862, 2022
work page 2022
-
[15]
Bessa, Urs Staufer, and Farbod Alijani
Abhilash Chandrashekar, Pierpaolo Belardinelli, Miguel A. Bessa, Urs Staufer, and Farbod Alijani. Quantifying nanoscale forces using machine learning in dynamic atomic force microscopy.Nanoscale Adv., 4:2134–2143, 2022
work page 2022
-
[16]
Learning in financial markets.Annu
Lubos Pastor and Pietro Veronesi. Learning in financial markets.Annu. Rev. Financ. Econ., 1(1):361–381, 2009
work page 2009
-
[17]
Bullard, Marc Gély, Thomas Alava, Eric Colinet, Akshay K
Marc Sansa, Eric Sage, Elizabeth C. Bullard, Marc Gély, Thomas Alava, Eric Colinet, Akshay K. Naik, Luis Guillermo Villanueva, Laurent Duraffourg, Michael L. Roukes, Guillaume Jourdan, and Sébastien Hentz. Frequency fluctuations in silicon nanoresonators.Nature Nanotechnology, 11(6):552–558, February 2016
work page 2016
-
[18]
A. N. Cleland and M. L. Roukes. Noise processes in nanomechanical resonators.Journal of Applied Physics, 92(5):2758–2769, 09 2002
work page 2002
-
[19]
Xianfeng Chen, Satya K. Ammu, Kunal Masania, Peter G. Steeneken, and Farbod Alijani. Diamagnetic composites for high-q levitating resonators.Advanced Science, 9(32):2203619, 2022
work page 2022
-
[20]
Domínguez, Eduardo Gil-Santos, Montserrat Calleja, and Javier Tamayo
Oscar Malvar, Jose Ruz, Priscila Kosaka, Carmen M. Domínguez, Eduardo Gil-Santos, Montserrat Calleja, and Javier Tamayo. Mass and stiffness spectrometry of nanoparticles and whole intact bacteria by multimode nanomechanical resonators.Nature Communications, 7:13452, 11 2016
work page 2016
-
[21]
Springer, Berlin, Heidelberg, 6th edition, 2003
Bernt Øksendal.Stochastic Differential Equations: An Introduction with Applications. Springer, Berlin, Heidelberg, 6th edition, 2003
work page 2003
-
[22]
A bayesian framework for discovering interpretable lagrangian of dynamical systems from data, 2023
Tapas Tripura and Souvik Chakraborty. A bayesian framework for discovering interpretable lagrangian of dynamical systems from data, 2023
work page 2023
-
[23]
Data-driven discovery of stochastic differential equations.Engineering, 17:244–252, 2022
Yasen Wang, Huazhen Fang, Junyang Jin, Guijun Ma, Xin He, Xing Dai, Zuogong Yue, Cheng Cheng, Hai-Tao Zhang, Donglin Pu, Dongrui Wu, Ye Yuan, Jorge Gonçalves, Jürgen Kurths, and Han Ding. Data-driven discovery of stochastic differential equations.Engineering, 17:244–252, 2022
work page 2022
-
[24]
Yunfei Huang, Youssef Mabrouk, Gerhard Gompper, and Benedikt Sabass. Sparse inference and active learning of stochastic differential equations from data.Scientific Reports, 12(1):21691, 2022. 22 Learning Stochastic Nonlinear Dynamics from Noisy DataA Preprint
work page 2022
-
[25]
Sparse learning of stochastic dynamical equations.The Journal of Chemical Physics, 148(24), mar 2018
Lorenzo Boninsegna, Feliks Nüske, and Cecilia Clementi. Sparse learning of stochastic dynamical equations.The Journal of Chemical Physics, 148(24), mar 2018
work page 2018
-
[26]
Rudolf Friedrich, Joachim Peinke, Muhammad Sahimi, and Mohammad Reza Rahimi Tabar. Approaching complexity by stochastic methods: From biological systems to turbulence.Physics Reports, 506:87–162, 2011
work page 2011
-
[27]
Extracting model equations from experimental data.Physics Letters A, 271(3):217–222, 2000
Rudolf Friedrich, Silke Siegert, Joachim Peinke, Marcus Siefert, Michael Lindemann, Jan Raethjen, Güntner Deuschl, Gerhard Pfister, et al. Extracting model equations from experimental data.Physics Letters A, 271(3):217–222, 2000
work page 2000
-
[28]
Mozes Jacobs, Bingni W. Brunton, Steven L. Brunton, J. Nathan Kutz, and Ryan V. Raut. Hypersindy: Deep generative modeling of nonlinear stochastic governing equations, 2023
work page 2023
-
[29]
Carl Edward Rasmussen and Christopher K. I. Williams.Gaussian Processes for Machine Learning. The MIT Press, Cambridge, MA, USA, 2006
work page 2006
-
[30]
García, Abraham Otero, Paulo Félix, Jesús Presedo, and David G
Constantino A. García, Abraham Otero, Paulo Félix, Jesús Presedo, and David G. Márquez. Nonpara- metric estimation of stochastic differential equations with sparse gaussian processes.Physical Review E, 96(2), August 2017
work page 2017
-
[31]
Philipp Batz, Andreas Ruttor, and Manfred Opper. Approximate bayes learning of stochastic differential equations.Physical Review E, 98(2), August 2018
work page 2018
-
[32]
P.W. Goldberg, C.K.I. Williams, and C.M. Bishop. Regression with input-dependent noise: A gaussian process treatment. InAdvances in Neural Information Processing Systems, volume 10, pages 493–499, 1998
work page 1998
-
[33]
K. Kersting, C. Plagemann, P. Pfaff, and W. Burgard. Most likely heteroscedastic gaussian process regression. InProceedings of the 24th International Conference on Machine Learning, pages 393–400. ACM, 2007
work page 2007
-
[34]
Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, and David Duvenaud. Neural ordinary differential equations.Advances in Neural Information Processing Systems, 31, 2018
work page 2018
-
[35]
Xuechen Li, Ting-Kam Leonard Wong, Ricky T. Q. Chen, and David Duvenaud. Scalable gradients for stochastic differential equations.Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 108:3870–3882, 2020
work page 2020
-
[36]
Patrick Kidger, James Foster, Xuechen Li, and Terry J. Lyons. Neural SDEs as infinite-dimensional GANs.Proceedings of the 38th International Conference on Machine Learning, 139:5453–5463, 2021
work page 2021
-
[37]
Kloeden and Eckhard Platen.Numerical Solution of Stochastic Differential Equations
Peter E. Kloeden and Eckhard Platen.Numerical Solution of Stochastic Differential Equations. Applica- tions of Mathematics. Springer, Berlin, New York, 1999
work page 1999
-
[38]
William Bialek, Andrea Cavagna, Irene Giardina, Thierry Mora, Oliver Pohl, Edmondo Silvestri, Massimiliano Viale, and Aleksandra M. Walczak. Social interactions dominate speed control in poising natural flocks near criticality.Proceedings of the National Academy of Sciences of the United States of America, 111(20), 2014
work page 2014
-
[39]
J. Buck and E. Buck. Synchronous fireflies.Scientific American, 234(5), 1976
work page 1976
-
[40]
Aleksandre Japaridze, Victor Struijk, Kushal Swamy, Ireneusz Rosłoń, Oriel Shoshani, Cees Dekker, and Farbod Alijani. Synchronization of e. coli bacteria moving in coupled microwells.Small, 21(3):2407832, 2025
work page 2025
-
[41]
J Elgeti, R G Winkler, and G Gompper. Physics of microswimmers—single particle motion and collective behavior: a review.Reports on Progress in Physics, 78(5):056601, apr 2015
work page 2015
-
[42]
Data-driven discovery of stochastic dynamical equations of collective motion
Arshed Nabeel, Vivek Jadhav, Danny Raj M, Clément Sire, Guy Theraulaz, Ramón Escobedo, Srikanth K Iyer, and Vishwesha Guttal. Data-driven discovery of stochastic dynamical equations of collective motion. Physical Biology, 20(5):056003, July 2023
work page 2023
-
[43]
Mikel Landajuela, Brenden K. Petersen, Soo K. Kim, Claudio P. Santiago, Ruben Glatt, T. Nathan Mundhenk, Jacob F. Pettit, and Daniel M. Faissol. Improving exploration in policy gradient search: Application to symbolic optimization, 2021
work page 2021
-
[44]
Halim Doss. Liens entre équations différentielles stochastiques et ordinaires.Annales de l’IHP Probabilités et statistiques, 13(2):99–125, 1977
work page 1977
-
[45]
Héctor J. Sussmann. On the gap between deterministic and stochastic ordinary differential equations. The Annals of Probability, 6(1):19–41, 1978. 23 Learning Stochastic Nonlinear Dynamics from Noisy DataA Preprint Linear oscillator with additive noise Drift:µ=−x+ 0.4 cos(Ωt)−0.2 ˙x Diffusion:σ = σa (additive noise;σa = 0.01×pwhere p is the noise level in ...
work page 1978
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.