Learning Neural Maximal Lyapunov Functions on $\mathsf{SO}(n)$

Adeel Akhtar; Matthieu Barreau

arxiv: 2606.19669 · v1 · pith:EQSWAHZOnew · submitted 2026-06-18 · 🧮 math.OC · cs.SY· eess.SY

Learning Neural Maximal Lyapunov Functions on mathsf{SO}(n)

Adeel Akhtar , Matthieu Barreau This is my paper

Pith reviewed 2026-06-26 16:45 UTC · model grok-4.3

classification 🧮 math.OC cs.SYeess.SY

keywords neural Lyapunov functionsspecial orthogonal grouplogarithmic mapZubov equationregion of attractionLie groupsdynamical systemsstability analysis

0 comments

The pith

A neural architecture based on the logarithmic map learns maximal Lyapunov functions for dynamical systems on SO(n).

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to extend classical Lyapunov analysis, which works in flat Euclidean space, to systems whose states live on the curved manifold SO(n) of rotation matrices. It does so by defining a neural network whose outputs are guaranteed to approximate any continuous function on SO(n) when built around the logarithmic map, then casts the search for the largest possible region of attraction as a Zubov-type partial differential equation on that manifold. The practical step is the derivation of closed-form, computable expressions for the derivative of the log map; these expressions turn the learning task into a two-phase gradient-based algorithm that first finds a rough certificate and then refines it. If the construction holds, control engineers obtain stability certificates for rotational dynamics without having to flatten the geometry or restrict the domain artificially.

Core claim

The central claim is that a neural Lyapunov function constructed from the logarithmic map on SO(n) possesses universal approximation properties, that the maximal region of attraction can be recovered by solving a Zubov-type equation on the group, and that explicit, numerically tractable formulas for the derivative of the log map make gradient training feasible through a two-phase procedure that balances speed and accuracy.

What carries the argument

Neural Lyapunov architecture built on the logarithmic map from SO(n) to its Lie algebra, together with the explicit derivative formulas that enable back-propagation.

If this is right

Stability certificates become available for any dynamical system whose state evolves on SO(n) without first embedding the group into a larger Euclidean space.
The same architecture and derivative formulas can be reused for any right-invariant vector field on the group once the training data are generated.
The two-phase algorithm separates an initial coarse search from a fine-tuning stage, reducing the risk of getting stuck in poor local minima of the loss.
Empirical validation on a low-dimensional nonlinear example shows that the learned function indeed certifies a region larger than what a hand-crafted quadratic candidate would provide.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same log-map construction might be adapted to other compact Lie groups once analogous derivative formulas are obtained.
Because the architecture is defined intrinsically on the manifold, it could be combined with manifold-aware integrators to produce end-to-end certifiably stable controllers.
If the approximation guarantees extend to the learned region boundary, the method supplies a practical way to compute the domain of attraction without solving the Hamilton-Jacobi-Bellman equation directly.

Load-bearing premise

The derived formulas for the derivative of the logarithmic map are both mathematically correct and numerically stable enough to support reliable gradient descent during training.

What would settle it

A concrete low-dimensional SO(n) system for which the two-phase algorithm produces a function whose time derivative along trajectories is positive inside the claimed region of attraction.

read the original abstract

Establishing stability guarantees for dynamical systems on Lie groups is a fundamental challenge, as classical Lyapunov methods developed for Euclidean spaces do not directly transfer to curved geometries. In this paper, we propose a framework for learning maximal Lyapunov functions for systems evolving on the special orthogonal group $\mathsf{SO}(n)$. Theoretically, we introduce a neural Lyapunov architecture based on the logarithmic map with proven approximation capabilities, and we formulate the learning problem via a Zubov-type characterization of the maximal region of attraction. A key technical contribution is the derivation of explicit, numerically tractable formulas for the derivative of the logarithmic map, enabling training through a two-phase algorithm that balances computational efficiency and accuracy. Empirically, we validate the approach on a low-dimensional nonlinear system.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a concrete neural architecture for maximal Lyapunov functions on SO(n) via the log map and Zubov, but its value rests on unverified derivative formulas that need checking.

read the letter

The core contribution here is a neural Lyapunov function built around the logarithmic map on SO(n), paired with a Zubov-style characterization of the maximal region of attraction and a two-phase training procedure. They also supply explicit derivative formulas for the log map to make backpropagation feasible. This is a direct attempt to stay on the manifold instead of working in local charts, which matters for attitude control and rigid-body systems.

The approach is straightforward and the empirical test on a low-dimensional nonlinear system shows the training can run. The use of the log map plus Zubov is a reasonable way to target the largest possible region of attraction, and the two-phase split for efficiency versus accuracy is a practical touch.

The load-bearing piece is the claimed explicit, numerically tractable derivative of the logarithmic map. If those formulas contain an error in the adjoint action or series handling, the whole training pipeline collapses. The abstract presents them as solved, but without the derivation steps it is impossible to confirm they are free of singularities or stable under gradient descent. That is the main uncertainty.

The rest of the argument follows standard Lie-group tools and prior Zubov results, so there is no obvious circularity. The approximation claim is stated but not expanded in the abstract.

This is for people already working on nonlinear control on Lie groups or neural certificates for robotics and aerospace. A reader who needs a ready-to-implement method on SO(n) could get something usable if the derivatives check out. It is worth sending to peer review so domain experts can verify the derivative formulas and the training stability; the idea is specific enough that referees can give a clear yes or no on the math.

Referee Report

1 major / 1 minor

Summary. The paper proposes a neural architecture for learning maximal Lyapunov functions on the Lie group SO(n), constructed via the logarithmic map and equipped with proven approximation properties. The learning problem is cast as a Zubov-type characterization of the maximal region of attraction, and the central technical step is the derivation of explicit, closed-form expressions for the derivative of the matrix logarithm that enable gradient-based optimization through a two-phase training procedure. The framework is illustrated on a low-dimensional nonlinear example.

Significance. If the derivative formulas are verifiably correct and free of hidden singularities, the work supplies a concrete, trainable Lyapunov function class on a non-Euclidean manifold together with a region-of-attraction guarantee; this would constitute a useful bridge between Lie-group geometry and neural Lyapunov methods.

major comments (1)

[Derivative formulas (abstract and the section presenting the explicit expressions)] The load-bearing claim is the explicit derivative of the logarithmic map (stated in the abstract as the key technical contribution enabling the two-phase algorithm). The manuscript must exhibit the full derivation (including any use of the adjoint representation or series expansion of d log) so that it can be checked against the known differential of the matrix logarithm on so(n); without this verification the training procedure rests on an unconfirmed calculation.

minor comments (1)

[Empirical validation] The empirical section reports results only on a low-dimensional system; a second, higher-dimensional example would help demonstrate scalability of the two-phase procedure.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their thoughtful review and for highlighting the importance of verifying the derivative formulas. We address the single major comment below and will incorporate the requested changes in the revised manuscript.

read point-by-point responses

Referee: [Derivative formulas (abstract and the section presenting the explicit expressions)] The load-bearing claim is the explicit derivative of the logarithmic map (stated in the abstract as the key technical contribution enabling the two-phase algorithm). The manuscript must exhibit the full derivation (including any use of the adjoint representation or series expansion of d log) so that it can be checked against the known differential of the matrix logarithm on so(n); without this verification the training procedure rests on an unconfirmed calculation.

Authors: We agree that the full derivation must be exhibited for independent verification. In the revised manuscript we will expand the section on the logarithmic map derivative to include a complete, self-contained derivation that explicitly invokes the adjoint representation of SO(n) and the series expansion of d log, allowing direct comparison with the standard differential-geometric formula on so(n). This addition will be placed immediately before the description of the two-phase training algorithm. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation relies on independent Lie-group calculus and external Zubov theory.

full rationale

The paper introduces a neural Lyapunov architecture on SO(n) using the logarithmic map and derives explicit derivative formulas to enable a two-phase training algorithm, all framed via a Zubov-type maximal region of attraction characterization drawn from prior literature. No step reduces by construction to a fitted input renamed as prediction, a self-definitional loop, or a load-bearing self-citation chain; the derivative formulas are presented as a fresh explicit computation rather than an ansatz imported from the authors' prior work or a renaming of a known empirical pattern. The central claims therefore remain mathematically independent of the paper's own outputs.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Review limited to abstract; no explicit free parameters, invented entities, or ad-hoc axioms listed beyond standard Lie group and control theory background.

axioms (2)

standard math Properties of the logarithmic map on SO(n) allow construction of a neural Lyapunov function with approximation guarantees
Basis for the neural architecture stated in the abstract.
domain assumption Zubov-type characterization correctly identifies the maximal region of attraction for the learning objective
Used to formulate the learning problem in the abstract.

pith-pipeline@v0.9.1-grok · 5652 in / 1291 out tokens · 43419 ms · 2026-06-26T16:45:59.324963+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

24 extracted references · 3 canonical work pages · 1 internal anchor

[1]

J. E. Marsden and T. S. Ratiu,Introduction to Mechanics and Symmetry, ser. A Basic Exposition of Classical Mechanical Systems. Springer- Verlag, 1999

1999
[2]

A unified framework for consensus and synchronization on lie groups admitting a bi-invariant metric,

R. S. Chandrasekaran, R. N. Banavar, and A. D. Mahindrakar, “A unified framework for consensus and synchronization on lie groups admitting a bi-invariant metric,”IEEE Transactions on Automatic Control, vol. 70, no. 11, pp. 7718–7724, 2025

2025
[3]

The Algebra of Grand Unified Theories

J. C. Baez and J. Huerta, “The algebra of grand unified theories,” 2010. [Online]. Available: https://arxiv.org/abs/0904.1556

work page internal anchor Pith review Pith/arXiv arXiv 2010
[4]

H. K. Khalil,Nonlinear Systems, ser. Pearson Education. Prentice Hall, 2002

2002
[5]

S. Boyd, L. El Ghaoui, E. Feron, and V . Balakrishnan,Linear matrix inequalities in system and control theory. SIAM, 1994

1994
[6]

V . I. Zubov,Methods of AM Lyapunov and their application. US Atomic Energy Commission, 1961, vol. 4439

1961
[7]

Maximal Lyapunov functions and domains of attraction for autonomous nonlinear systems,

A. Vannelli and M. Vidyasagar, “Maximal Lyapunov functions and domains of attraction for autonomous nonlinear systems,”Automatica, vol. 21, no. 1, pp. 69–80, 1985

1985
[8]

Converse Lyapunov functions and converging inner approximations to maximal regions of attraction of nonlinear systems,

M. Jones and M. M. Peet, “Converse Lyapunov functions and converging inner approximations to maximal regions of attraction of nonlinear systems,” inIEEE Conference on Decision and Control (CDC), 2021

2021
[9]

Convex computation of the region of attrac- tion of polynomial control systems,

D. Henrion and M. Korda, “Convex computation of the region of attrac- tion of polynomial control systems,”IEEE Transactions on Automatic Control, vol. 59, no. 2, pp. 297–312, 2013

2013
[10]

Region of attraction estimation using invariant sets and rational Lyapunov functions,

G. Valmorbida and J. Anderson, “Region of attraction estimation using invariant sets and rational Lyapunov functions,”Automatica, 2017

2017
[11]

Physics-informed machine learning,

G. E. Karniadakis, I. G. Kevrekidis, L. Lu, P. Perdikaris, S. Wang, and L. Yang, “Physics-informed machine learning,”Nature Reviews Physics, 2021

2021
[12]

Lyapunov-net: A deep neural network architecture for lyapunov function approximation,

N. Gaby, F. Zhang, and X. Ye, “Lyapunov-net: A deep neural network architecture for lyapunov function approximation,” in2022 IEEE 61st Conference on Decision and Control (CDC). IEEE, 2022

2022
[13]

Physics-informed neural network Lyapunov functions: PDE characterization, learning, and verification,

J. Liu, Y . Meng, M. Fitzsimmons, and R. Zhou, “Physics-informed neural network Lyapunov functions: PDE characterization, learning, and verification,”Automatica, vol. 175, p. 112193, 2025

2025
[14]

(un)supervised learning of maximal lyapunov functions,

M. Barreau and N. Bastianello, “(un)supervised learning of maximal lyapunov functions,”arXiv preprint arXiv:2408.17246, 2024

work page arXiv 2024
[15]

Bullo and A

F. Bullo and A. D. Lewis,Geometric Control of Mechanical Systems, ser. Texts in Applied Mathematics. New York-Heidelberg-Berlin: Springer Verlag, 2004

2004
[16]

Approximation capabilities of multilayer feedforward net- works,

K. Hornik, “Approximation capabilities of multilayer feedforward net- works,”Neural networks, vol. 4, no. 2, pp. 251–257, 1991

1991
[17]

Computing exponentials of skew-symmetric matrices and logarithms of orthogonal matrices,

J. Gallier and D. Xu, “Computing exponentials of skew-symmetric matrices and logarithms of orthogonal matrices,”International Journal of Robotics and Automation, vol. 18, no. 1, pp. 10–20, 2003

2003
[18]

Dependence on initial conditions and parameters,

T. C. Sideris, “Dependence on initial conditions and parameters,” in Ordinary Differential Equations and Dynamical Systems, ser. Atlantis Studies in Differential Equations. The Netherlands: Atlantis Press (Zeger Karssen), 2013, vol. 2, pp. 89–94

2013
[19]

Automatic differentiation in machine learning: a survey,

A. G. Baydin, B. A. Pearlmutter, A. A. Radul, and J. M. Siskind, “Automatic differentiation in machine learning: a survey,”Journal of machine learning research, vol. 18, no. 153, pp. 1–43, 2018

2018
[20]

Gallier and J

J. Gallier and J. Quaintance,Differential Geometry and Lie Groups. Springer International Publishing, 2020

2020
[21]

N. J. Higham,Functions of Matrices: Theory and Computation. SIAM, 2008

2008
[22]

A control perspective on training pinns,

M. Barreau and H. Shen, “A control perspective on training pinns,”arXiv preprint arXiv:2501.18582, 2025

work page arXiv 2025
[23]

Hybrid geometric controllers for fully- actuated left-invariant systems on matrix lie groups,

A. Akhtar and R. G. Sanfelice, “Hybrid geometric controllers for fully- actuated left-invariant systems on matrix lie groups,” in2022 IEEE 61st Conference on Decision and Control (CDC), 2022

2022
[24]

TensorFlow: a system for Large-Scale machine learning,

M. Abadiet al., “TensorFlow: a system for Large-Scale machine learning,” in12th USENIX symposium on operating systems design and implementation (OSDI 16), 2016, pp. 265–283

2016

[1] [1]

J. E. Marsden and T. S. Ratiu,Introduction to Mechanics and Symmetry, ser. A Basic Exposition of Classical Mechanical Systems. Springer- Verlag, 1999

1999

[2] [2]

A unified framework for consensus and synchronization on lie groups admitting a bi-invariant metric,

R. S. Chandrasekaran, R. N. Banavar, and A. D. Mahindrakar, “A unified framework for consensus and synchronization on lie groups admitting a bi-invariant metric,”IEEE Transactions on Automatic Control, vol. 70, no. 11, pp. 7718–7724, 2025

2025

[3] [3]

The Algebra of Grand Unified Theories

J. C. Baez and J. Huerta, “The algebra of grand unified theories,” 2010. [Online]. Available: https://arxiv.org/abs/0904.1556

work page internal anchor Pith review Pith/arXiv arXiv 2010

[4] [4]

H. K. Khalil,Nonlinear Systems, ser. Pearson Education. Prentice Hall, 2002

2002

[5] [5]

S. Boyd, L. El Ghaoui, E. Feron, and V . Balakrishnan,Linear matrix inequalities in system and control theory. SIAM, 1994

1994

[6] [6]

V . I. Zubov,Methods of AM Lyapunov and their application. US Atomic Energy Commission, 1961, vol. 4439

1961

[7] [7]

Maximal Lyapunov functions and domains of attraction for autonomous nonlinear systems,

A. Vannelli and M. Vidyasagar, “Maximal Lyapunov functions and domains of attraction for autonomous nonlinear systems,”Automatica, vol. 21, no. 1, pp. 69–80, 1985

1985

[8] [8]

Converse Lyapunov functions and converging inner approximations to maximal regions of attraction of nonlinear systems,

M. Jones and M. M. Peet, “Converse Lyapunov functions and converging inner approximations to maximal regions of attraction of nonlinear systems,” inIEEE Conference on Decision and Control (CDC), 2021

2021

[9] [9]

Convex computation of the region of attrac- tion of polynomial control systems,

D. Henrion and M. Korda, “Convex computation of the region of attrac- tion of polynomial control systems,”IEEE Transactions on Automatic Control, vol. 59, no. 2, pp. 297–312, 2013

2013

[10] [10]

Region of attraction estimation using invariant sets and rational Lyapunov functions,

G. Valmorbida and J. Anderson, “Region of attraction estimation using invariant sets and rational Lyapunov functions,”Automatica, 2017

2017

[11] [11]

Physics-informed machine learning,

G. E. Karniadakis, I. G. Kevrekidis, L. Lu, P. Perdikaris, S. Wang, and L. Yang, “Physics-informed machine learning,”Nature Reviews Physics, 2021

2021

[12] [12]

Lyapunov-net: A deep neural network architecture for lyapunov function approximation,

N. Gaby, F. Zhang, and X. Ye, “Lyapunov-net: A deep neural network architecture for lyapunov function approximation,” in2022 IEEE 61st Conference on Decision and Control (CDC). IEEE, 2022

2022

[13] [13]

Physics-informed neural network Lyapunov functions: PDE characterization, learning, and verification,

J. Liu, Y . Meng, M. Fitzsimmons, and R. Zhou, “Physics-informed neural network Lyapunov functions: PDE characterization, learning, and verification,”Automatica, vol. 175, p. 112193, 2025

2025

[14] [14]

(un)supervised learning of maximal lyapunov functions,

M. Barreau and N. Bastianello, “(un)supervised learning of maximal lyapunov functions,”arXiv preprint arXiv:2408.17246, 2024

work page arXiv 2024

[15] [15]

Bullo and A

F. Bullo and A. D. Lewis,Geometric Control of Mechanical Systems, ser. Texts in Applied Mathematics. New York-Heidelberg-Berlin: Springer Verlag, 2004

2004

[16] [16]

Approximation capabilities of multilayer feedforward net- works,

K. Hornik, “Approximation capabilities of multilayer feedforward net- works,”Neural networks, vol. 4, no. 2, pp. 251–257, 1991

1991

[17] [17]

Computing exponentials of skew-symmetric matrices and logarithms of orthogonal matrices,

J. Gallier and D. Xu, “Computing exponentials of skew-symmetric matrices and logarithms of orthogonal matrices,”International Journal of Robotics and Automation, vol. 18, no. 1, pp. 10–20, 2003

2003

[18] [18]

Dependence on initial conditions and parameters,

T. C. Sideris, “Dependence on initial conditions and parameters,” in Ordinary Differential Equations and Dynamical Systems, ser. Atlantis Studies in Differential Equations. The Netherlands: Atlantis Press (Zeger Karssen), 2013, vol. 2, pp. 89–94

2013

[19] [19]

Automatic differentiation in machine learning: a survey,

A. G. Baydin, B. A. Pearlmutter, A. A. Radul, and J. M. Siskind, “Automatic differentiation in machine learning: a survey,”Journal of machine learning research, vol. 18, no. 153, pp. 1–43, 2018

2018

[20] [20]

Gallier and J

J. Gallier and J. Quaintance,Differential Geometry and Lie Groups. Springer International Publishing, 2020

2020

[21] [21]

N. J. Higham,Functions of Matrices: Theory and Computation. SIAM, 2008

2008

[22] [22]

A control perspective on training pinns,

M. Barreau and H. Shen, “A control perspective on training pinns,”arXiv preprint arXiv:2501.18582, 2025

work page arXiv 2025

[23] [23]

Hybrid geometric controllers for fully- actuated left-invariant systems on matrix lie groups,

A. Akhtar and R. G. Sanfelice, “Hybrid geometric controllers for fully- actuated left-invariant systems on matrix lie groups,” in2022 IEEE 61st Conference on Decision and Control (CDC), 2022

2022

[24] [24]

TensorFlow: a system for Large-Scale machine learning,

M. Abadiet al., “TensorFlow: a system for Large-Scale machine learning,” in12th USENIX symposium on operating systems design and implementation (OSDI 16), 2016, pp. 265–283

2016