Lagrangian Flow Matching: A Least-Action Framework for Principled Path Design
Pith reviewed 2026-05-21 08:06 UTC · model grok-4.3
The pith
Minimizing the action of a general Lagrangian yields simulation-free training objectives for designing probability paths in flow matching.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the dynamic problem of minimizing the action of a general Lagrangian subject to the continuity equation and prescribed endpoints admits an equivalent static optimal transport formulation. This equivalence produces a family of simulation-free training objectives. It recovers optimal transport-based flow matching as the kinetic special case and the trigonometric variance-preserving diffusion path as the harmonic-oscillator case. More general Lagrangians lead to new probability paths and velocity fields that induce meaningful changes in the learned dynamics.
What carries the argument
The least-action principle, in which the probability path and associated velocity field are found by minimizing the integral of the Lagrangian along trajectories subject to the continuity equation for the probability density.
If this is right
- OT-based flow matching emerges when the Lagrangian reduces to kinetic energy alone.
- The trigonometric variance-preserving diffusion paths emerge for the harmonic oscillator Lagrangian.
- General Lagrangians produce new families of curved or otherwise modified probability paths.
- These new paths lead to different velocity fields that remain competitive in performance with standard conditional flow matching.
- The equivalence allows simulation-free training for any such Lagrangian choice.
Where Pith is reading between the lines
- Applying this framework to Lagrangians with additional physical terms could incorporate conservation principles into generative model training.
- Testing these paths on datasets with specific geometric structures might reveal advantages over straight-line transports.
- The mechanical interpretation could help explain why certain path choices work better for particular data distributions.
- Extensions might connect this to variational principles in related generative modeling approaches.
Load-bearing premise
The dynamic optimization of the Lagrangian action under the continuity equation and fixed endpoints is equivalent to a static optimal transport problem between the initial and final distributions.
What would settle it
Deriving the velocity field for a Lagrangian that includes a non-kinetic potential term and checking whether the resulting paths satisfy the continuity equation while differing from straight-line trajectories would test the claimed equivalence; mismatch in either property would falsify it.
Figures
read the original abstract
Flow matching trains a neural velocity field by regression against a target velocity associated with a prescribed probability path connecting a simple initial distribution to the data distribution. A central design choice is the path itself. Existing constructions, including rectified and optimal-transport-based paths, transport samples along straight lines between coupled endpoints and thus cover only a narrow class of dynamics. We observe that this corresponds to the simplest case of the least-action principle in classical mechanics, in which the kinetic Lagrangian yields free-particle straight-line trajectories. Building on this observation, we propose Lagrangian flow matching, a physics-based framework in which the probability path and velocity field are determined by minimizing the action of a general Lagrangian subject to the continuity equation and the prescribed endpoints. We show that this dynamic problem admits an equivalent static optimal transport (OT) formulation, yielding a family of simulation-free training objectives that recover OT-based flow matching as the kinetic special case and the trigonometric variance-preserving diffusion path as the harmonic-oscillator case. More general Lagrangians give rise to new probability paths and velocity fields, and numerical experiments show that they induce meaningful changes in the learned dynamics while remaining competitive with existing conditional flow matching models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces Lagrangian Flow Matching, framing the design of probability paths in flow matching as minimization of the action of a general Lagrangian subject to the continuity equation and fixed marginals. It claims this dynamic problem is equivalent to a static optimal transport formulation, producing a family of simulation-free training objectives. Special cases recover OT-based flow matching (kinetic Lagrangian) and trigonometric variance-preserving diffusion paths (harmonic oscillator Lagrangian). Numerical experiments indicate that more general Lagrangians induce meaningful changes in learned dynamics while remaining competitive with existing conditional flow matching models.
Significance. If the claimed equivalence holds with simulation-free objectives for arbitrary Lagrangians, the framework would supply a principled, physics-motivated method for constructing probability paths beyond straight-line or fixed diffusion schedules. Recovering known methods as special cases strengthens the contribution, and the potential for new, data-adapted dynamics could improve generative modeling performance.
major comments (2)
- [Abstract] Abstract: The central claim that the dynamic least-action problem admits an equivalent static OT formulation yielding simulation-free objectives for general Lagrangians is load-bearing but unsupported by derivation. For the kinetic case the target velocity is (x1 − x0); for the harmonic oscillator it is trigonometric. For a generic Lagrangian the Euler-Lagrange equation produces a two-point boundary-value problem whose solution generally lacks closed form, so evaluating the regression target requires numerical integration and the objective ceases to be simulation-free.
- [Equivalence to static OT] The manuscript asserts equivalence to a static OT problem but supplies only the high-level statement without the explicit reduction, the resulting regression target, or verification that the continuity-equation constraint is preserved under the static formulation. A concrete derivation or worked example for a non-quadratic Lagrangian is required to substantiate the family of simulation-free objectives.
minor comments (2)
- [Abstract] The abstract refers to 'numerical experiments' showing competitive results and meaningful changes in dynamics, yet no datasets, baselines, metrics, or specific Lagrangians tested are mentioned.
- Notation for the general Lagrangian L and the action functional should be introduced with an explicit equation before the equivalence claim is stated.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive feedback on our manuscript. We address the major comments point by point below, clarifying the equivalence and committing to strengthen the presentation with explicit derivations.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim that the dynamic least-action problem admits an equivalent static OT formulation yielding simulation-free objectives for general Lagrangians is load-bearing but unsupported by derivation. For the kinetic case the target velocity is (x1 − x0); for the harmonic oscillator it is trigonometric. For a generic Lagrangian the Euler-Lagrange equation produces a two-point boundary-value problem whose solution generally lacks closed form, so evaluating the regression target requires numerical integration and the objective ceases to be simulation-free.
Authors: We agree that the abstract statement is concise and that the supporting derivation merits expansion. The equivalence is obtained by showing that the action-minimizing velocity field satisfying the continuity equation and fixed marginals coincides with the optimal transport map for the cost functional induced by the Lagrangian; the resulting regression target is then the velocity along the associated geodesic. For the kinetic and harmonic cases this velocity admits the closed forms noted by the referee. For general Lagrangians the two-point boundary-value problem may indeed lack a closed form, in which case the target is obtained by solving the Euler-Lagrange ODE once per training pair (a fixed, offline cost that does not involve simulating the generative dynamics). We will add a self-contained derivation of the static reduction together with a worked non-quadratic example in the revised manuscript to make this distinction explicit. revision: yes
-
Referee: [Equivalence to static OT] The manuscript asserts equivalence to a static OT problem but supplies only the high-level statement without the explicit reduction, the resulting regression target, or verification that the continuity-equation constraint is preserved under the static formulation. A concrete derivation or worked example for a non-quadratic Lagrangian is required to substantiate the family of simulation-free objectives.
Authors: We acknowledge that the main text presents the equivalence at a high level. The reduction proceeds by substituting the continuity-equation constraint into the action integral, yielding a static variational problem whose Euler-Lagrange conditions recover the same velocity field; because the optimal map automatically satisfies the prescribed marginals, the continuity equation remains satisfied by construction. We will insert the full step-by-step reduction (including verification of the constraint) and provide an explicit non-quadratic Lagrangian example with its corresponding regression target in the revision. revision: yes
Circularity Check
No circularity: derivation proceeds from least-action principle to static OT equivalence without reducing to inputs or self-citations
full rationale
The paper starts from the classical least-action principle applied to a general Lagrangian subject to the continuity equation and fixed marginals, then proves an equivalence to a static optimal transport problem whose solution supplies the velocity targets. Special cases (kinetic Lagrangian recovering OT flow matching; harmonic oscillator recovering trigonometric paths) follow directly as mathematical reductions rather than being presupposed. No step defines a quantity in terms of itself, renames a fitted result as a prediction, or relies on a load-bearing self-citation whose content is unverified outside the present work. The framework is therefore self-contained against external benchmarks in classical mechanics and optimal transport.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math The evolution of the probability density obeys the continuity equation.
- domain assumption Endpoints consist of a simple initial distribution and the data distribution.
Lean theorems connected to this paper
-
IndisputableMonolith/CostcostAlphaLog_fourth_deriv_at_zero / dAlembert_cosh_solution_aczel echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
A canonical non-kinetic example is Lω(x, v) = ½‖v‖² - ½ω²‖x‖² ... Euler–Lagrange equation γ̈ + ω²γ = 0 has unique solution γωx0,x1(t) = sin(ω(1-t))/sinω x0 + sin(ωt)/sinω x1
-
IndisputableMonolith/Foundation/BranchSelectionbranch_selection echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
We show that this dynamic problem admits an equivalent static optimal transport (OT) formulation, yielding a family of simulation-free training objectives
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
M. S. Albergo, N. M. Boffi, and E. Vanden-Eijnden. Stochastic interpolants: A unifying framework for flows and diffusions.Journal of Machine Learning Research, 26(209):1–80, 2025
work page 2025
-
[2]
M. S. Albergo and E. Vanden-Eijnden. Building normalizing flows with stochastic interpolants. InThe Eleventh International Conference on Learning Representations (ICLR), 2023
work page 2023
-
[3]
L. Ambrosio, N. Gigli, and G. Savaré.Gradient Flows: In Metric Spaces and in the Space of Probability Measures. Birkhäuser, Basel, 2005
work page 2005
-
[4]
V . I. Arnold, K. V ogtmann, and A. Weinstein.Mathematical Methods of Classical Mechanics, volume 60 ofGraduate Texts in Mathematics. Springer, New York, 2 edition, 1989
work page 1989
-
[5]
M. Balcerak, T. Amiranashvili, A. Terpin, S. Shit, L. Bogensperger, S. Kaltenbach, P. Koumout- sakos, and B. Menze. Energy matching: Unifying flow matching and energy-based models for generative modeling.arXiv preprint arXiv:2504.10612, 2025
-
[6]
J.-D. Benamou and Y . Brenier. A computational fluid mechanics solution to the Monge– Kantorovich mass transfer problem.Numerische Mathematik, 84(3):375–393, 2000
work page 2000
-
[7]
P. Bernard and B. Buffoni. Optimal mass transportation and Mather theory.Journal of the European Mathematical Society, 9(1):85–121, 2007
work page 2007
-
[8]
D. Burkhardt, J. Bloom, R. Cannoodt, M. D. Luecken, S. Krishnaswamy, C. Lance, A. O. Pisco, and F. J. Theis. Multimodal single-cell integration across time, individuals, and batches. In NeurIPS 2022 Competition Track, 2022
work page 2022
-
[9]
R. T. Q. Chen, Y . Rubanova, J. Bettencourt, and D. K. Duvenaud. Neural ordinary differential equations. InAdvances in Neural Information Processing Systems, volume 31, 2018
work page 2018
-
[10]
Y . Chen, T. T. Georgiou, and M. Pavon. On the relation between optimal transport and Schrödinger bridges: A stochastic control viewpoint.Journal of Optimization Theory and Applications, 169(2):671–691, 2016
work page 2016
-
[11]
V . De Bortoli, J. Thornton, J. Heng, and A. Doucet. Diffusion Schrödinger bridge with applications to score-based generative modeling. InAdvances in Neural Information Processing Systems, volume 34, pages 17695–17709, 2021
work page 2021
- [12]
-
[13]
W. Grathwohl, R. T. Q. Chen, J. Bettencourt, and D. Duvenaud. Scalable reversible genera- tive models with free-form continuous dynamics. InInternational Conference on Learning Representations (ICLR), 2019
work page 2019
-
[14]
J. Ho, A. Jain, and P. Abbeel. Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems, volume 33, pages 6840–6851, 2020. 10
work page 2020
-
[15]
T. Koshizuka and I. Sato. Neural Lagrangian Schrödinger bridge: Diffusion modeling for population dynamics.arXiv preprint arXiv:2204.04853, 2022
-
[16]
H. W. Kuhn. The Hungarian method for the assignment problem.Naval Research Logistics Quarterly, 2(1–2):83–97, 1955
work page 1955
- [17]
-
[18]
C. Léonard. A survey of the Schrödinger problem and some of its connections with optimal transport.Discrete and Continuous Dynamical Systems, 34(4):1533–1574, 2014
work page 2014
- [19]
-
[20]
X. Liu, C. Gong, and Q. Liu. Flow straight and fast: Learning to generate and transfer data with rectified flow. InThe Eleventh International Conference on Learning Representations (ICLR), 2023
work page 2023
-
[21]
K. R. Moon, D. van Dijk, Z. Wang, S. Gigante, D. B. Burkhardt, W. S. Chen, K. Yim, A. van den Elzen, M. J. Hirn, R. R. Coifman, et al. Visualizing structure and transitions in high-dimensional biological data.Nature Biotechnology, 37(12):1482–1492, 2019
work page 2019
-
[22]
K. Neklyudov, R. Brekelmans, D. Severo, and A. Makhzani. Action matching: Learning stochastic dynamics from samples. InProceedings of the 40th International Conference on Machine Learning (ICML), volume 202 ofProceedings of Machine Learning Research, pages 25858–25889. PMLR, 2023
work page 2023
-
[23]
K. Neklyudov, R. Brekelmans, A. Tong, L. Atanackovic, Q. Liu, and A. Makhzani. A com- putational framework for solving Wasserstein Lagrangian flows. InProceedings of the 41st International Conference on Machine Learning (ICML), 2024
work page 2024
-
[24]
A.-A. Pooladian, H. Ben-Hamu, C. Domingo-Enrich, B. Amos, Y . Lipman, and R. T. Q. Chen. Multisample flow matching: Straightening flows with minibatch couplings. InProceedings of the 40th International Conference on Machine Learning (ICML), volume 202 ofProceedings of Machine Learning Research, pages 28100–28127. PMLR, 2023
work page 2023
-
[25]
A.-A. Pooladian, C. Domingo-Enrich, R. T. Q. Chen, and B. Amos. Neural optimal transport with Lagrangian costs. InProceedings of the 40th Conference on Uncertainty in Artificial Intelligence (UAI), 2024
work page 2024
-
[26]
J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli. Deep unsupervised learning using nonequilibrium thermodynamics. InProceedings of the 32nd International Conference on Machine Learning (ICML), volume 37 ofProceedings of Machine Learning Research, pages 2256–2265. PMLR, 2015
work page 2015
-
[27]
Y . Song, C. Durkan, I. Murray, and S. Ermon. Maximum likelihood training of score-based diffusion models. InAdvances in Neural Information Processing Systems, volume 34, pages 1415–1428, 2021
work page 2021
-
[28]
Y . Song, J. Sohl-Dickstein, D. P. Kingma, A. Kumar, S. Ermon, and B. Poole. Score-based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations (ICLR), 2021
work page 2021
-
[29]
A. Tong, K. Fatras, N. Malkin, G. Huguet, Y . Zhang, J. Rector-Brooks, G. Wolf, and Y . Bengio. Improving and generalizing flow-based generative models with minibatch optimal transport. Transactions on Machine Learning Research, 2024. Expert Certification
work page 2024
-
[30]
Villani.Optimal Transport: Old and New, volume 338 ofGrundlehren der mathematischen Wissenschaften
C. Villani.Optimal Transport: Old and New, volume 338 ofGrundlehren der mathematischen Wissenschaften. Springer, Berlin, Heidelberg, 2009. 11 A Related Work Lagrangian flow matching sits at the intersection of two lines of work: flow- and diffusion-based generative models built on prescribed probability paths, and variational or action-based approaches to...
work page 2009
-
[31]
unifies these constructions through a stochastic differential equation, with sampling carried out by a reverse-time SDE or its associated probability-flow ODE. The probability path is implicitly prescribed by the forward noising schedule, and standard choices such as the variance-preserving and variance-exploding processes induce specific families of marg...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.