LLM-Guided ODE Discovery and Parameter Inference from Small-Cohort Aggregate Data

Cristina Has; Hanning Yang; Lennart Purucker; Meropi Karakioulaki; Moritz Hess; Tim Litwin

arxiv: 2607.00733 · v1 · pith:RLPFDSZ7new · submitted 2026-07-01 · 💻 cs.LG · cs.AI

Pith reviewed 2026-07-02 16:12 UTC · model grok-4.3

keywords: ODE discovery · LLM agents · parameter inference · rare diseases · aggregate data · mechanistic modeling · population statistics

The pith

An LLM-guided agent discovers consistent ODE structures and refines parameter distributions from population summary statistics alone.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces AgentODE to build mechanistic ODE models when individual patient data is scarce, noisy, or private, as happens in rare diseases. An LLM suggests candidate equation structures while a tool-using agent repeatedly checks and adjusts parameter distributions against group-level averages through a diagnosis-update process. On benchmarks and on RDEB data with 231 observations from 46 patients, the method produces structures that remain functionally consistent, and summary-statistic reasoning yields more mechanistically plausible models than baselines given individual records, even when the latter achieve better numerical fit.

Core claim

AgentODE recovers functionally consistent ODE structures across all settings, and experiments on RDEB demonstrate that, in sparse and noisy data settings, reasoning from summary statistics promotes mechanistically principled structure discovery, whereas baselines with individual-level data access recover implausible structures despite better predictive performance.

What carries the argument

AgentODE, an end-to-end framework in which an LLM proposes candidate ODE structures and a tool-augmented inference agent iteratively refines parameter distributions through a diagnosis-update loop operating solely on population-level summary statistics.
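
As a rough illustration of what a diagnosis-update loop over summary statistics could look like, the sketch below refines a log-normal distribution over a single decay rate until simulated cohort means and standard deviations move toward target values. It is not the AgentODE implementation: the toy model x(t) = exp(-k t), the function names, and the random local search that stands in for the LLM agent's update decisions are all assumptions made for illustration.

```python
import numpy as np

def simulate_summary(mu, sigma, times, rng, n=500):
    """Draw n decay rates k ~ LogNormal(mu, sigma) and summarize x(t) = exp(-k t)."""
    k = rng.lognormal(mean=mu, sigma=sigma, size=n)
    traj = np.exp(-k[:, None] * times[None, :])
    # Only cohort-level moments leave this function, never individual curves.
    return traj.mean(axis=0), traj.std(axis=0, ddof=1)

def discrepancy(target, sim):
    """Sum of squared differences over the mean and SD vectors."""
    return float(sum(np.sum((a - b) ** 2) for a, b in zip(target, sim)))

def refine(target, times, n_iter=60, seed=0):
    rng = np.random.default_rng(seed)
    mu, sigma = 0.0, 1.0  # deliberately poor starting distribution
    best = discrepancy(target, simulate_summary(mu, sigma, times, rng))
    history = [best]
    for _ in range(n_iter):
        # Propose a nearby distribution (hypothetical stand-in for the agent).
        cand_mu = mu + rng.normal(0.0, 0.2)
        cand_sigma = abs(sigma + rng.normal(0.0, 0.1)) + 1e-3
        # Diagnosis: compare simulated and target summaries.
        sim = simulate_summary(cand_mu, cand_sigma, times, rng)
        score = discrepancy(target, sim)
        # Update: keep the proposal only if the summaries match better.
        if score < best:
            mu, sigma, best = cand_mu, cand_sigma, score
        history.append(best)
    return mu, sigma, history

times = np.linspace(0.0, 4.0, 5)
target = simulate_summary(np.log(0.5), 0.3, times, np.random.default_rng(42), n=5000)
mu, sigma, history = refine(target, times)
```

Because a proposal is accepted only when it lowers the discrepancy, `history` is non-increasing by construction. The agent-driven loop described in the paper carries no such guarantee.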

If this is right

Mechanistic ODE modeling becomes possible for rare diseases under privacy constraints that block individual records.
Structure discovery can favor functional consistency over predictive accuracy when data are sparse and noisy.
Parameters can be treated as distributions to capture heterogeneity using only group averages.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could extend to other domains that publish only aggregate statistics, such as public health or ecology.
Systematic tests on additional synthetic systems with known dynamics would quantify how much information summary statistics retain for structure recovery.
The diagnosis-update loop might be adapted to other model classes beyond ODEs when only summary data are available.

Load-bearing premise

An LLM can generate functionally consistent ODE candidates and the inference agent can iteratively refine parameter distributions accurately from aggregate statistics without any individual-level data.

What would settle it

Apply the full AgentODE pipeline to synthetic data generated from a known ground-truth ODE, supply only the resulting population summary statistics, and check whether the recovered structure is functionally equivalent to the known true equations.
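
The data side of such a test can be set up in a few lines. The sketch below assumes a logistic growth law as a hypothetical ground truth (the paper's own benchmarks are not reproduced here), simulates a heterogeneous cohort, and reduces it to per-time-point means and standard deviations. The cohort size of 46 mirrors the RDEB cohort only for flavor; all names and parameter values are illustrative.

```python
import numpy as np

def logistic_solution(t, r, K, x0):
    """Closed-form solution of dx/dt = r * x * (1 - x / K)."""
    return K / (1.0 + (K / x0 - 1.0) * np.exp(-r * t))

def simulate_cohort(n_patients, times, rng, noise_sd=0.05):
    # Hypothetical ground truth: the growth rate r varies across patients
    # (log-normal); carrying capacity K and initial value x0 are fixed.
    r = rng.lognormal(mean=np.log(0.8), sigma=0.3, size=n_patients)
    clean = logistic_solution(times[None, :], r[:, None], K=10.0, x0=1.0)
    # Additive Gaussian measurement noise on every observation.
    return clean + rng.normal(0.0, noise_sd, size=clean.shape)

def summarize(cohort):
    # Only these aggregates would be handed to the discovery pipeline.
    return {"mean": cohort.mean(axis=0), "sd": cohort.std(axis=0, ddof=1)}

rng = np.random.default_rng(0)
times = np.linspace(0.0, 10.0, 6)
cohort = simulate_cohort(46, times, rng)
summary = summarize(cohort)
```

A recovered structure could then be compared with the known logistic law, for example by checking that it also saturates at a finite carrying capacity.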

Figures

Figures reproduced from arXiv: 2607.00733 by Cristina Has, Hanning Yang, Lennart Purucker, Meropi Karakioulaki, Moritz Hess, Tim Litwin.

**Figure 1.** Overview of AgentODE. Inputs consist of a problem specification and empirical summaries. An LLM proposes candidate ODE structures. For each structure, the agent performs an initial inference to obtain starting parameter distributions, from which synthetic data are generated, evaluated using logSL, and used to produce comparative visual summaries against empirical data. The agent then refines the parameter…

**Figure 2.** Ablation study comparing AgentODE, AgentODE w/o iterative refinement, and LLM-based…

**Figure 3.** Diagnosis-update iterations for an ODE structure on polymer. Left: logSL score across iterations, with the best iteration marked. Right: Spearman correlation heatmaps and mean trajectories ± 95% CI comparing empirical and synthetic data at iteration 1 and the best iteration.

**Figure 4.** Recovered ODE structures for the Apoptosis benchmark using AgentODE, LLM-SR…

**Figure 5.** Recovered ODE structures for the Polymer DA Cross-linking benchmark using AgentODE, …

**Figure 6.** Recovered ODE structures for the PKPD-Immune benchmark using AgentODE, LLM-SR…

**Figure 7.** Causal diagrams for the RDEB ODE structures from (…

**Figure 8.** Parameter distribution comparisons for the AgentODE-discovered RDEB ODE structure.

**Figure 9.** RDEB first-difference Spearman correlation heatmaps for empirical and synthetic trajectories…
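
The Figure 1 and Figure 3 captions name a logSL score without defining it in the material visible here. If it denotes a log synthetic likelihood in the sense used in simulation-based inference (an assumption, not something the captions confirm), a minimal version fits a Gaussian to summary vectors from repeated simulations and evaluates the observed summary under it. The function name and the ridge term below are illustrative choices.

```python
import numpy as np

def log_synthetic_likelihood(s_obs, s_sim, ridge=1e-6):
    """Gaussian synthetic log-likelihood of an observed summary vector.

    s_obs: (d,) observed summary statistics.
    s_sim: (n_rep, d) summaries computed from n_rep simulated datasets.
    """
    mu = s_sim.mean(axis=0)
    # The small ridge keeps the covariance invertible (an assumption here).
    cov = np.cov(s_sim, rowvar=False) + ridge * np.eye(s_sim.shape[1])
    diff = s_obs - mu
    _, logdet = np.linalg.slogdet(cov)
    quad = diff @ np.linalg.solve(cov, diff)
    d = s_obs.size
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + quad)

rng = np.random.default_rng(1)
s_obs = np.array([1.0, 2.0, 0.5])
# Simulated summaries centered on the observation versus shifted away from it.
good = rng.normal(loc=s_obs, scale=0.2, size=(200, 3))
bad = rng.normal(loc=s_obs + 1.0, scale=0.2, size=(200, 3))
score_good = log_synthetic_likelihood(s_obs, good)
score_bad = log_synthetic_likelihood(s_obs, bad)
```

A higher score means the simulated summaries sit closer to the empirical ones, which is the direction in which a refinement loop would want the score to move.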

Original abstract

Mechanistic modeling via ordinary differential equations (ODEs) provides interpretable descriptions of complex dynamics and enables inference of underlying mechanisms, which is particularly valuable in clinical settings. However, in rare diseases, both the structure and parameters of the model are typically unknown, while individual-level data is scarce, noisy, heterogeneous, and subject to privacy constraints. In such settings, population-level summary statistics provide a practical privacy-preserving data representation, while capturing heterogeneity further requires modeling parameters as distributions rather than fixed values. Yet no existing method jointly discovers ODE structure and refines parameter distributions solely from summary statistics. We present AgentODE, an end-to-end framework that addresses this gap. An LLM proposes candidate ODE structures, while a tool-augmented inference agent iteratively refines parameter distributions through a diagnosis-update loop, operating on population-level summary statistics alone. We evaluate AgentODE on three benchmark problems across different fields and two clinical datasets, including the rare disease recessive dystrophic epidermolysis bullosa (RDEB), with only 231 observations across 46 patients. AgentODE recovers functionally consistent ODE structures across all settings, and experiments on RDEB demonstrate that, in sparse and noisy data settings, reasoning from summary statistics promotes mechanistically principled structure discovery, whereas baselines with individual-level data access recover implausible structures despite better predictive performance. AgentODE opens new possibilities for mechanistic modeling of rare diseases directly from population-level summary statistics, where data scarcity and privacy constraints have traditionally limited such analyses.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note · private letter to a colleague

AgentODE's LLM-plus-agent pipeline for ODE structure discovery and parameter distribution fitting from summary statistics alone is a genuine technical step forward for rare-disease modeling, but the abstract gives almost no concrete validation numbers or consistency checks.

The central thing to know is that the authors built a system where an LLM proposes candidate ODEs and a tool-using agent then iteratively updates parameter distributions using only population-level summaries. They test it on standard benchmarks plus a real RDEB dataset with 231 observations from 46 patients.

What is actually new is the joint handling of structure discovery and distributional parameter inference from aggregates; the abstract states no prior method does both from summaries. The framing around privacy constraints and heterogeneity in rare diseases is also handled directly rather than as an afterthought.

The work is useful in that it shows a practical path for mechanistic modeling when individual records cannot be shared. Treating parameters as distributions instead of point values fits the clinical setting.

The soft spots are in the validation. The abstract claims recovery of functionally consistent structures and that summary-statistic reasoning yields more plausible models than individual-level baselines, yet it supplies no definition of functional consistency, no error quantification, and no tables or figures showing how that consistency was scored. Without those details the surprising baseline result cannot be assessed. The diagnosis-update loop is described at a high level but the mechanics of how the agent avoids fitting noise or producing tautological structures remain unclear from what is visible.

This paper is for researchers working on scientific machine learning or systems biology who need to work with constrained clinical data. A reader looking for concrete methods to adapt would find the high-level architecture suggestive but would still need the full methods and results sections to judge reproducibility.

It deserves peer review. The problem is real, the approach is distinct from existing ODE discovery tools, and the RDEB application is a concrete test case; referees can check the missing validation steps and the quantitative claims.

Referee Report

2 major / 2 minor

Summary. The paper introduces AgentODE, an end-to-end framework for discovering ODE structures and inferring parameter distributions from small-cohort aggregate (population-level summary) data. An LLM proposes candidate ODE structures while a tool-augmented inference agent iteratively refines parameter distributions via a diagnosis-update loop using only summary statistics. The approach is evaluated on three benchmark problems from different fields plus two clinical datasets, including RDEB (231 observations across 46 patients). The central claims are that AgentODE recovers functionally consistent ODE structures across settings and that, on RDEB data, reasoning from summary statistics yields more mechanistically principled structures than baselines given individual-level data access (despite the latter having better predictive performance).

Significance. If the empirical claims hold, the work addresses a genuine gap in mechanistic modeling under privacy constraints and data scarcity typical of rare diseases. Enabling ODE structure discovery and distributional parameter inference directly from aggregate statistics could open analyses that are currently infeasible. The reported contrast between summary-statistic and individual-level regimes on RDEB is potentially important if the plausibility metric and experimental controls are robust.

major comments (2)

[Abstract and §4] Abstract and §4 (RDEB experiments): the claim that baselines recover 'implausible structures' while AgentODE recovers 'mechanistically principled' ones is load-bearing for the main contribution, yet the manuscript provides no explicit, reproducible criterion or quantitative score for mechanistic plausibility versus predictive performance. Without this, the comparative conclusion cannot be evaluated.
[§3] §3 (AgentODE framework): the description of the diagnosis-update loop operating solely on population-level summary statistics is central to the novelty, but the text does not specify the exact form of the summary statistics used, the distance or likelihood function inside the update step, or how the LLM-proposed structures are validated for functional consistency before parameter inference begins.

minor comments (2)

[Abstract] The abstract states recovery of 'functionally consistent' structures on all benchmarks but does not define the term or report the quantitative metric used; this should be stated explicitly in the main text with a reference to the relevant table or figure.
The table or figure reporting benchmark results should include both predictive error and the functional-consistency metric side by side for all methods to allow direct comparison.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify key aspects of the presentation. We respond to each major comment below and indicate where revisions will be made.

Referee: [Abstract and §4] Abstract and §4 (RDEB experiments): the claim that baselines recover 'implausible structures' while AgentODE recovers 'mechanistically principled' ones is load-bearing for the main contribution, yet the manuscript provides no explicit, reproducible criterion or quantitative score for mechanistic plausibility versus predictive performance. Without this, the comparative conclusion cannot be evaluated.

Authors: We agree that the distinction between mechanistically principled and implausible structures would benefit from an explicit, reproducible criterion. The original assessment drew on qualitative expert review of biological consistency for the RDEB case. In the revision we will add a dedicated subsection in §4 that defines a scoring rubric (e.g., alignment with known disease pathways, sign consistency of inferred rates, and parameter-range feasibility) together with inter-rater reliability statistics. This will make the comparison with predictive performance metrics fully evaluable.

Revision: yes

Referee: [§3] §3 (AgentODE framework): the description of the diagnosis-update loop operating solely on population-level summary statistics is central to the novelty, but the text does not specify the exact form of the summary statistics used, the distance or likelihood function inside the update step, or how the LLM-proposed structures are validated for functional consistency before parameter inference begins.

Authors: We accept that the current wording in §3 leaves these implementation details underspecified. The summary statistics are the cohort means and standard deviations of each observed variable at the recorded time points. The update step minimizes a weighted Euclidean discrepancy between these observed moments and the corresponding moments obtained by integrating the candidate ODE. Functional consistency is checked by attempting forward integration over the observation window and rejecting any structure that produces numerical divergence or trajectories whose sign pattern contradicts the data. The revised §3 will include explicit formulas, the precise discrepancy measure, and pseudocode for the validation step.

Revision: yes
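
The two ingredients named in this simulated reply can be sketched as follows. This is one possible reading, not the paper's code: the weights, the blow-up threshold, the fixed-step RK4 integrator, and the interpretation of "sign pattern" as the direction of net change per variable are all assumptions, and every function name is hypothetical.

```python
import numpy as np

def rk4_integrate(rhs, x0, times, params, substeps=20):
    """Fixed-step RK4; returns the state at each requested time point."""
    x = np.asarray(x0, dtype=float)
    out = [x.copy()]
    for t0, t1 in zip(times[:-1], times[1:]):
        h = (t1 - t0) / substeps
        t = t0
        for _ in range(substeps):
            k1 = rhs(t, x, params)
            k2 = rhs(t + h / 2, x + h / 2 * k1, params)
            k3 = rhs(t + h / 2, x + h / 2 * k2, params)
            k4 = rhs(t + h, x + h * k3, params)
            x = x + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
            t += h
        out.append(x.copy())
    return np.array(out)

def weighted_moment_discrepancy(obs_mean, obs_sd, sim_mean, sim_sd,
                                w_mean=1.0, w_sd=1.0):
    """Weighted Euclidean distance between observed and simulated moments."""
    return np.sqrt(w_mean * np.sum((obs_mean - sim_mean) ** 2)
                   + w_sd * np.sum((obs_sd - sim_sd) ** 2))

def is_functionally_consistent(rhs, x0, times, params, obs_mean, blowup=1e6):
    """Reject structures that diverge or trend against the observed means."""
    with np.errstate(over="ignore", invalid="ignore"):
        traj = rk4_integrate(rhs, x0, times, params)
    if not np.all(np.isfinite(traj)) or np.max(np.abs(traj)) > blowup:
        return False  # numerical divergence
    # Assumed reading of "sign pattern": direction of net change per variable.
    sim_trend = np.sign(traj[-1] - traj[0])
    obs_trend = np.sign(obs_mean[-1] - obs_mean[0])
    return bool(np.all(sim_trend == obs_trend))

times = np.linspace(0.0, 5.0, 6)
obs_mean = (2.0 * np.exp(-0.5 * times))[:, None]  # one decaying variable
decay = lambda t, x, p: -p["k"] * x
growth = lambda t, x, p: p["k"] * x
blowup_rhs = lambda t, x, p: x * x  # finite-time blow-up for x0 = 1

ok_decay = is_functionally_consistent(decay, [2.0], times, {"k": 0.5}, obs_mean)
ok_growth = is_functionally_consistent(growth, [2.0], times, {"k": 0.5}, obs_mean)
ok_blowup = is_functionally_consistent(blowup_rhs, [1.0], times, {}, obs_mean)
```

In this example only the decay structure passes. The growth structure trends the wrong way and the quadratic structure diverges, so both would be rejected before any parameter refinement.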

Circularity Check

0 steps flagged

No circularity: framework claims rest on external LLM/agent behavior and empirical evaluation, not self-referential definitions or fits

The paper presents AgentODE as an LLM-proposed ODE structure generator plus a tool-augmented agent that iteratively updates parameter distributions from population summary statistics. No equations, derivations, or parameter-fitting steps are described that reduce the claimed outputs (functionally consistent structures, mechanistically principled discovery) to the inputs by construction. The central claims are evaluated on benchmark problems and the RDEB clinical dataset via reported experimental outcomes rather than by re-deriving fitted quantities. No self-citation load-bearing steps, uniqueness theorems, or ansatz smuggling are indicated in the provided material. This matches the default expectation of a non-circular methods paper whose validity hinges on external reproducibility of the LLM/agent loop and the reported metrics.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; the framework relies on LLM proposals and agent-based inference whose internal assumptions are not detailed.


2019

[38] [38]

International conference on machine learning , pages=

Automatic posterior transformation for likelihood-free inference , author=. International conference on machine learning , pages=. 2019 , organization=

2019

[39] [39]

Scientific data , volume=

MIMIC-IV, a freely accessible electronic health record dataset , author=. Scientific data , volume=. 2023 , publisher=

2023

[40] [40]

British Journal of Dermatology , volume=

Natural history of growth and anaemia in children with epidermolysis bullosa: a retrospective cohort study , author=. British Journal of Dermatology , volume=. 2020 , publisher=

2020

[41] [41]

British Journal of Dermatology , pages=

Systemic inflammation in recessive dystrophic epidermolysis bullosa: a five-year longitudinal study , author=. British Journal of Dermatology , pages=. 2026 , publisher=

2026

[42] [42]

Nature Reviews Disease Primers , volume=

Epidermolysis bullosa , author=. Nature Reviews Disease Primers , volume=. 2020 , publisher=

2020

[43] [43]

LLM-SRBench : A new benchmark for scientific equation discovery with large language models, 2025

Llm-srbench: A new benchmark for scientific equation discovery with large language models , author=. arXiv preprint arXiv:2504.10415 , year=

work page arXiv

[44] [44]

Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl

Interpretable machine learning for science with PySR and SymbolicRegression. jl , author=. arXiv preprint arXiv:2305.01582 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[45] [45]

arXiv preprint arXiv:2004.08424 , year=

Pysindy: a python package for the sparse identification of nonlinear dynamics from data , author=. arXiv preprint arXiv:2004.08424 , year=

work page arXiv 2004

[46] [46]

Adam: A Method for Stochastic Optimization

Adam: A method for stochastic optimization , author=. arXiv preprint arXiv:1412.6980 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[47] [47]

Meta-Harness: End-to-End Optimization of Model Harnesses

Meta-Harness: End-to-End Optimization of Model Harnesses , author=. arXiv preprint arXiv:2603.28052 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[48] [48]

Science , volume=

Agentic AI and the next intelligence explosion , author=. Science , volume=. 2026 , publisher=

2026

[49] [49]

Bioinformatics , volume=

Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood , author=. Bioinformatics , volume=. 2009 , publisher=

2009