pith. sign in

arxiv: 2606.27285 · v2 · pith:GPKUPKAJnew · submitted 2026-06-25 · 💻 cs.LG · cs.IT· math.CA· math.DS· math.IT

Recovering Governing Equations from Solution Data: Identifiability Bounds for Linear and Nonlinear ODEs

Pith reviewed 2026-06-29 04:38 UTC · model grok-4.3

classification 💻 cs.LG cs.ITmath.CAmath.DSmath.IT
keywords governing equationsODE identificationidentifiability boundsHausdorff distancesample complexityscientific machine learningmetric entropy
0
0 comments X

The pith

The Hausdorff distance on solution sets determines when governing ODEs can be uniquely identified from data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the Hausdorff distance on solution sets to compare differential equations by their worst-case separation over initial conditions. It establishes identifiability bounds for linear ODEs and nonlinear ones with Lipschitz or Hölder continuous vector fields, showing precisely when two equations can be distinguished from solution observations. The work then derives metric entropy estimates for these classes and provides sample complexity bounds for reliable recovery of the governing equation.

Core claim

We introduce the Hausdorff distance on solution sets as the natural metric for comparing differential equations, since it captures the worst-case separation between two equations over all admissible initial conditions. We establish identifiability bounds for governing ODEs ranging from linear to nonlinear classes with Lipschitz (Hölder)-continuous vector fields, characterizing when two distinct equations can be distinguished from solution data, and derive metric entropy estimates and sample complexity bounds.

What carries the argument

Hausdorff distance on solution sets, which measures the maximum separation between trajectories of two ODEs over all initial conditions.

If this is right

  • Positive Hausdorff distance between solution sets implies the equations are distinguishable given enough solution observations.
  • Metric entropy estimates bound the number of distinguishable ODEs in a class, directly controlling the sample size needed for identification.
  • Sample complexity bounds quantify the number of observed trajectories required to recover the governing equation with high probability.
  • The bounds hold uniformly for linear ODEs and for nonlinear ODEs whose vector fields satisfy Lipschitz or Hölder continuity.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same distance could be used to derive identifiability results for systems with time-varying or control inputs.
  • Practical recovery algorithms might use the derived entropy numbers to set stopping criteria or regularization strengths.
  • The minimax character of the Hausdorff metric suggests that worst-case initial-condition sampling is necessary for robust identification guarantees.

Load-bearing premise

The Hausdorff distance on solution sets is the natural metric for comparing differential equations because it captures the worst-case separation between two equations over all admissible initial conditions.

What would settle it

Observing two distinct ODEs whose solution sets have small Hausdorff distance yet yield data that allows reliable distinction under some sampling of initial conditions would contradict the metric's claimed suitability for the identification problem.

Figures

Figures reproduced from arXiv: 2606.27285 by Helmut B\"olcskei, Yang Pan.

Figure 1
Figure 1. Figure 1: Lipschitz functions f and ˜f We have [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Red points represent the randomly generated samples. Yellow and blue [PITH_FULL_IMAGE:figures/full_fig_p025_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Red points represent the randomly generated samples. Yellow and blue [PITH_FULL_IMAGE:figures/full_fig_p026_3.png] view at source ↗
read the original abstract

Learning governing equations from observed solution data is a fundamental challenge in scientific machine learning, yet the theoretical conditions under which a ground-truth ODE can be uniquely and stably identified from multiple solution observations remain largely undeveloped, and no quantitative analysis of the sample complexity of such learning tasks exists in the literature. To address this gap, we introduce the Hausdorff distance on solution sets as the natural metric for comparing differential equations, since it captures the worst-case separation between two equations over all admissible initial conditions and thus encodes the minimax structure of the identification problem. We establish identifiability bounds for governing ODEs across a wide class of structure equations--ranging from linear ODEs to nonlinear classes with Lipschitz (H\"older)-continuous vector fields--characterizing precisely when two distinct equations can be distinguished from solution data. Using this metric, we derive metric entropy estimates for the relevant ODE classes and analyze sample complexity bounds, quantifying how many solution observations are needed to reliably recover the governing equation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper introduces the Hausdorff distance on solution sets as the natural metric for comparing differential equations and uses it to establish identifiability bounds for governing ODEs ranging from linear systems to nonlinear classes with Lipschitz or Hölder-continuous vector fields. It characterizes when two distinct equations can be distinguished from solution data, derives metric entropy estimates for the relevant ODE classes, and analyzes sample complexity bounds quantifying the number of solution observations needed to recover the governing equation.

Significance. If the central constructions hold with the required restrictions, the work would provide the first quantitative identifiability and sample-complexity theory for ODE recovery from solution data, directly addressing a gap in scientific machine learning. The minimax framing via Hausdorff distance on solution sets is a coherent choice for the identification problem.

major comments (2)
  1. [Section 2 (definition of the metric)] The definition of the Hausdorff distance between solution sets (introduced to encode the minimax identification problem) does not specify a fixed finite time horizon T or a compact domain for admissible initial conditions. Without these, trajectories of distinct Lipschitz or Hölder fields can diverge arbitrarily far, rendering the Hausdorff distance infinite and the subsequent metric entropy and sample-complexity estimates vacuous. This is load-bearing for all claimed bounds.
  2. [Section 3 (nonlinear Hölder case)] For the Hölder-continuous case with exponent <1, uniqueness of solutions may fail, making the solution set multi-valued; the Hausdorff comparison then requires additional selection rules or Filippov regularization that are not addressed, undermining the identifiability claims for that class.
minor comments (1)
  1. Notation for the structure equation classes and the precise function spaces (e.g., C^{0,α} vs. Lip) should be stated explicitly at first use to avoid ambiguity in the entropy calculations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful and constructive report. The two major comments identify important technical points that must be made fully explicit. We address each below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Section 2 (definition of the metric)] The definition of the Hausdorff distance between solution sets (introduced to encode the minimax identification problem) does not specify a fixed finite time horizon T or a compact domain for admissible initial conditions. Without these, trajectories of distinct Lipschitz or Hölder fields can diverge arbitrarily far, rendering the Hausdorff distance infinite and the subsequent metric entropy and sample-complexity estimates vacuous. This is load-bearing for all claimed bounds.

    Authors: We agree that a fixed finite horizon T and a compact set of admissible initial conditions are required for the Hausdorff distance to be finite and for the subsequent entropy and sample-complexity results to be meaningful. While these restrictions are implicit in the problem formulation (standard for global existence and boundedness arguments in ODE theory), they are not stated explicitly in Section 2. We will add a precise statement of the standing assumptions on T and the initial-condition domain at the beginning of Section 2, together with a short remark confirming that all subsequent bounds are understood to hold under these restrictions. This change will be made in the revised manuscript. revision: yes

  2. Referee: [Section 3 (nonlinear Hölder case)] For the Hölder-continuous case with exponent <1, uniqueness of solutions may fail, making the solution set multi-valued; the Hausdorff comparison then requires additional selection rules or Filippov regularization that are not addressed, undermining the identifiability claims for that class.

    Authors: The referee correctly notes that for Hölder exponents α < 1 the classical Picard–Lindelöf theorem does not guarantee uniqueness, so the solution set may be multi-valued. The manuscript treats the solution set as the collection of all absolutely continuous curves satisfying the integral equation; the Hausdorff distance is then taken between these (possibly multi-valued) reachable sets. This construction remains well-defined without additional selection rules. Nevertheless, to avoid any ambiguity we will insert a clarifying paragraph in Section 3 that (i) recalls the possible non-uniqueness for α < 1, (ii) states that the Hausdorff metric is applied directly to the full reachable sets, and (iii) notes that the identifiability and entropy bounds continue to hold for the set-valued case. This revision will be included. revision: yes

Circularity Check

0 steps flagged

No circularity; metric definition followed by independent mathematical derivation

full rationale

The paper defines the Hausdorff distance on solution sets as its comparison metric for ODEs and then derives identifiability bounds, metric entropy estimates, and sample-complexity results from that definition across linear and nonlinear classes. No self-citations appear in the provided text, no parameters are fitted to data and then relabeled as predictions, and no step reduces by construction to its own inputs. The derivation chain is therefore self-contained: the metric is introduced explicitly, and subsequent bounds follow from standard functional-analytic arguments on the chosen function classes.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The paper relies on the assumption that the Hausdorff metric captures the minimax structure of the identification problem; no free parameters or invented entities are mentioned in the abstract.

axioms (1)
  • domain assumption Hausdorff distance on solution sets is the natural metric for comparing differential equations as it captures the worst-case separation over all admissible initial conditions
    Explicitly stated in the abstract as the natural metric encoding the minimax structure.

pith-pipeline@v0.9.1-grok · 5710 in / 1283 out tokens · 40060 ms · 2026-06-29T04:38:47.807716+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

36 extracted references · 11 canonical work pages · 3 internal anchors

  1. [1]

    Lecture Notes Mathematics of Information

    Helmut Bolcskei. “Lecture Notes Mathematics of Information”. In: (2020)

  2. [2]

    Discovering governing equations from data: Sparse identification of nonlinear dynamical systems

    Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. “Discovering Gov- erning Equations from Data: Sparse Identification of Nonlinear Dynamical Sys- tems”. In:Proceedings of the National Academy of Sciences113.15 (Apr. 2016), pp. 3932–3937.issn: 0027-8424, 1091-6490.doi:10.1073/pnas.1517384113. arXiv:1509.03580 [math]

  3. [3]

    Finite sample properties of system identi- fication methods

    Marco C Campi and Erik Weyer. “Finite sample properties of system identi- fication methods”. In:IEEE Transactions on Automatic Control47.8 (2002), pp. 1329–1334

  4. [4]

    Physics-informed learning of governing equations from scarce data

    Zhao Chen, Yang Liu, and Hao Sun. “Physics-informed learning of governing equations from scarce data”. In:Nature communications12.1 (2021), p. 6136

  5. [5]

    Identifiability of linear and nonlinear dynamical systems

    M Grewal and Keith Glover. “Identifiability of linear and nonlinear dynamical systems”. In:IEEE Transactions on automatic control21.6 (2003), pp. 833– 837

  6. [6]

    Martin Holler and Erion Morina.On Uniqueness in Structured Model Learning. Oct. 2024. arXiv:2410.22009

  7. [7]

    Metric entropy limits on recurrent neural network learning of linear dynamical systems

    Clemens Hutter, Recep G¨ ul, and Helmut B¨ olcskei. “Metric entropy limits on recurrent neural network learning of linear dynamical systems”. In:Applied and Computational Harmonic Analysis59 (2022), pp. 198–223

  8. [8]

    Springer, 2006

    Victor Isakov.Inverse problems for partial differential equations. Springer, 2006

  9. [9]

    Entropy per Unit Time as a Metric Invariant of Automor- phism

    AN Kolgomorov. “Entropy per Unit Time as a Metric Invariant of Automor- phism”. In:Doklady of Russian Academy of Sciences: Moscow, Russia124 (1959), pp. 754–755

  10. [10]

    Neural Operator: Learning Maps Between Function Spaces With Applications to PDEs

    Nikola Kovachki et al. “Neural Operator: Learning Maps Between Function Spaces With Applications to PDEs”. In:Journal of Machine Learning Re- search24.89 (2023), pp. 1–97.issn: 1533-7928

  11. [11]

    Neural operator: Learning maps between function spaces with applications to pdes

    Nikola Kovachki et al. “Neural operator: Learning maps between function spaces with applications to pdes”. In:Journal of Machine Learning Research 24.89 (2023), pp. 1–97

  12. [12]

    Fourier Neural Operator for Parametric Partial Differential Equations

    Zongyi Li et al. “Fourier neural operator for parametric partial differential equations”. In:arXiv preprint arXiv:2010.08895(2020)

  13. [13]

    System identification

    Lennart Ljung. “System identification”. In:Signal analysis and prediction. Springer, 1998, pp. 163–173

  14. [14]

    On global identifiability for arbitrary model parametrizations

    Lennart Ljung and Torkel Glad. “On global identifiability for arbitrary model parametrizations”. In:automatica30.2 (1994), pp. 265–276

  15. [15]

    PDE-Net: Learning PDEs from Data

    Zichao Long et al. “PDE-Net: Learning PDEs from Data”. In:Proceedings of the 35th International Conference on Machine Learning. PMLR, July 2018, pp. 3208–3216. 55

  16. [16]

    Extrapolation and learning equations

    Georg Martius and Christoph H Lampert. “Extrapolation and learning equa- tions”. In:arXiv preprint arXiv:1610.02995(2016)

  17. [17]

    EMS press Berlin, 2023

    Richard Nickl.Bayesian non-linear statistical inverse problems. EMS press Berlin, 2023

  18. [18]

    Non-asymptotic identification of lti sys- tems from a single trajectory

    Samet Oymak and Necmiye Ozay. “Non-asymptotic identification of lti sys- tems from a single trajectory”. In:2019 American control conference (ACC). IEEE. 2019, pp. 5655–5661

  19. [19]

    Baburao G Pachpatte.Integral and finite difference inequalities and applica- tions. Vol. 205. Elsevier, 2006

  20. [20]

    Metric-Entropy Limits on Nonlinear Dynamical System Learning

    Yang Pan, Clemens Hutter, and Helmut B¨ olcskei. “Metric-Entropy Limits on Nonlinear Dynamical System Learning”. In:arXiv preprint arXiv:2407.01250 (2024)

  21. [21]

    Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations

    Maziar Raissi, Paris Perdikaris, and George E Karniadakis. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations”. In:Journal of Computational physics378 (2019), pp. 686–707

  22. [22]

    Convolutional Neural Operators for Robust and Ac- curate Learning of PDEs

    Bogdan Raonic et al. “Convolutional Neural Operators for Robust and Ac- curate Learning of PDEs”. In:Advances in Neural Information Processing Systems36 (Dec. 2023), pp. 77187–77200

  23. [23]

    Convolutional neural operators for robust and accurate learning of PDEs

    Bogdan Raonic et al. “Convolutional neural operators for robust and accurate learning of PDEs”. In:Advances in Neural Information Processing Systems36 (2024)

  24. [24]

    ESPRIT-estimation of signal parameters via rotational invariance techniques

    Richard Roy and Thomas Kailath. “ESPRIT-estimation of signal parameters via rotational invariance techniques”. In:IEEE Transactions on acoustics, speech, and signal processing37.7 (2002), pp. 984–995

  25. [25]

    2021 The source of the major solar energetic particle events from super active region 11944

    Samuel H. Rudy et al. “Data-Driven Discovery of Partial Differential Equa- tions”. In:Science Advances3.4 (Apr. 2017), e1602614.doi:10.1126/sciadv. 1602614

  26. [26]

    Learning equa- tions for extrapolation and control

    Subham Sahoo, Christoph Lampert, and Georg Martius. “Learning equa- tions for extrapolation and control”. In:International Conference on Machine Learning. Pmlr. 2018, pp. 4442–4450

  27. [27]

    Warm, comforting recollection

    Philipp Scholl et al. “The Uniqueness Problem of Physical Law Learning”. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece: IEEE, June 2023, pp. 1– 5.isbn: 978-1-72816-327-7.doi:10.1109/ICASSP49357.2023.10095017

  28. [28]

    Philipp Scholl et al.Well-Definedness of Physical Law Learning: The Unique- ness Problem. Jan. 2023. arXiv:2210.08342 [math-ph]

  29. [29]

    On the notion of entropy of a dynamical system

    Yakov G Sinai. “On the notion of entropy of a dynamical system”. In:Doklady of Russian Academy of Sciences. Vol. 124. 3. 1959, pp. 768–771

  30. [30]

    Inverse problems: a Bayesian perspective

    Andrew M Stuart. “Inverse problems: a Bayesian perspective”. In:Acta nu- merica19 (2010), pp. 451–559. 56 [31]System identification - Wikipedia — en.wikipedia.org. Aug. 2025.url:https: //en.wikipedia.org/wiki/System_identification

  31. [31]

    Gorenflo, F

    V. M. Tikhomirov. “ε-Entropy andε-Capacity of Sets In Functional Spaces”. In:Selected Works of A. N. Kolmogorov: Volume III: Information Theory and the Theory of Algorithms. Ed. by A. N. Shiryayev. Dordrecht: Springer Netherlands, 1993, pp. 86–170.isbn: 978-94-017-2973-4.doi:10.1007/978- 94-017-2973-4_7

  32. [32]

    AI Feynman: A physics-inspired method for symbolic regression

    Silviu-Marian Udrescu and Max Tegmark. “AI Feynman: A physics-inspired method for symbolic regression”. In:Science advances6.16 (2020), eaay2631

  33. [33]

    Martin J Wainwright.High-dimensional statistics: A non-asymptotic view- point. Vol. 48. Cambridge university press, 2019

  34. [34]

    Wolfgang Walter.Ordinary Differential Equations. Vol. 182. Graduate Texts in Mathematics. New York, NY: Springer, 1998.doi:10.1007/978-1-4612- 0601-9

  35. [35]

    On the metric complexity of causal linear systems: Estimates ofε- entropy andε-dimension

    G Zames. “On the metric complexity of causal linear systems: Estimates ofε- entropy andε-dimension”. In:1977 IEEE Conference on Decision and Control including the 16th Symposium on Adaptive Processes and A Special Symposium on Fuzzy Set Theory and Applications. IEEE. 1977, pp. 807–810

  36. [36]

    Classes of ODE solutions: smoothness, cov- ering numbers, implications for noisy function fitting, and the curse of smooth- ness phenomenon

    Ying Zhu and Mozhgan Mirzaei. “Classes of ODE solutions: smoothness, cov- ering numbers, implications for noisy function fitting, and the curse of smooth- ness phenomenon”. In:arXiv preprint arXiv:2011.11371(2020). 57