Recognition: 2 theorem links · Lean Theorem
Neural-NPV Control: Learning Parameter-Dependent Controllers and Lyapunov Functions with Neural Networks
Pith reviewed 2026-05-15 09:21 UTC · model grok-4.3
The pith
Neural networks jointly synthesize parameter-dependent controllers and Lyapunov functions for nonlinear systems with varying dynamics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We propose Neural-NPV, a two-stage learning-based framework that leverages neural networks to jointly synthesize a PD controller and a PD Lyapunov function for an NPV system under input constraints. In the first stage, we utilize a computationally cheap, gradient-based counterexample-guided procedure to synthesize an approximately valid PD Lyapunov function and a PD controller. In the second stage, a level-set guided refinement is then conducted to obtain a valid Lyapunov function and controller while maximizing the robust region of attraction.
What carries the argument
The Neural-NPV two-stage framework, in which neural networks first produce approximate parameter-dependent controller-Lyapunov pairs via counterexample-guided gradients and then refine them through level-set optimization to enforce validity and enlarge the robust region of attraction.
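As a rough illustration of the first stage, the counterexample-guided gradient loop can be sketched on a toy scalar NPV system. The dynamics, controller form, and hinge loss below are illustrative assumptions, not the paper's implementation, and for brevity only the controller gains are trained while the Lyapunov candidate stays a fixed quadratic.

```python
import numpy as np

# Toy scalar NPV system x_dot = theta * x + u with scheduling parameter
# theta in [0.5, 1.5]. Candidate Lyapunov function V(x) = p * x^2 and
# parameter-dependent controller u = -(k0 + k1 * theta) * x. All of this
# is an assumed stand-in for the paper's neural parameterizations.

def vdot(p, k0, k1, x, theta):
    u = -(k0 + k1 * theta) * x          # parameter-dependent controller
    return 2 * p * x * (theta * x + u)  # dV/dt along the closed loop

def find_counterexamples(p, k0, k1, n=512, seed=0):
    """Sample the state-parameter box; return points where Vdot >= 0."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(-1.0, 1.0, n)
    theta = rng.uniform(0.5, 1.5, n)
    bad = (vdot(p, k0, k1, x, theta) >= 0) & (np.abs(x) > 1e-6)
    return x[bad], theta[bad]

def train(iters=200, lr=0.1):
    """Counterexample-guided loop: gradient descent on a hinge loss max(0, Vdot)."""
    p, k0, k1 = 1.0, 0.0, 0.0
    for i in range(iters):
        xs, thetas = find_counterexamples(p, k0, k1, seed=i)
        if xs.size == 0:
            break  # no violations found at this sampling density
        # d(Vdot)/dk0 = -2*p*x^2 and d(Vdot)/dk1 = -2*p*theta*x^2, so a
        # descent step on the violating samples raises both gains.
        k0 += lr * np.mean(2 * p * xs ** 2)
        k1 += lr * np.mean(2 * p * thetas * xs ** 2)
    return p, k0, k1

p, k0, k1 = train()
```

Note that the loop only drives Vdot negative on sampled points, which is exactly the "approximately valid" caveat the review discusses: the second stage is what is supposed to close the gap between sampled and rigorous validity.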
If this is right
- Applies to control-affine and non-affine NPV systems with multiple scheduling parameters.
- Produces larger robust regions of attraction than sum-of-squares under input constraints.
- Scales to systems such as quadrotors with three parameters where sum-of-squares becomes intractable.
- Replaces manual polynomial basis selection with automatic neural approximation followed by rigorous refinement.
Where Pith is reading between the lines
- The method could be tested on hardware-in-the-loop quadrotor flights to check whether the computed regions translate to real disturbances.
- Similar two-stage refinement might improve neural Lyapunov approaches for systems whose parameters vary continuously rather than in discrete modes.
- Combining the learned functions with online adaptation could allow real-time retuning when the scheduling parameters drift outside the training distribution.
Load-bearing premise
The level-set guided refinement stage can always convert approximate neural outputs into rigorously valid Lyapunov functions and controllers that achieve the largest possible robust region of attraction for general NPV systems.
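A level-set refinement of the kind this premise describes can be sketched as a bisection on the certified level of a fixed candidate. The dynamics, certificate, and grid check below are illustrative assumptions, and a sampled check is of course weaker than the rigorous validity the premise asserts.

```python
import numpy as np

# Bisection on the level c so that the sublevel set {x : V(x) <= c} of a
# fixed quadratic certificate V(x) = x^T P x passes a grid check of
# Vdot < 0. System, P, and grid density are illustrative assumptions.

P = np.eye(2)

def f(x):
    """Toy nonlinear closed-loop dynamics, locally stable near the origin."""
    return np.array([-x[0] + x[1] ** 2, -x[1] + x[0] * x[1]])

def level_is_certified(c, n=61):
    """Grid check that Vdot = 2 x^T P f(x) < 0 on {0 < V <= c}."""
    r = np.sqrt(c)
    xs = np.linspace(-r, r, n)
    for x1 in xs:
        for x2 in xs:
            x = np.array([x1, x2])
            v = x @ P @ x
            if v < 1e-9 or v > c:
                continue  # skip the origin and points outside the level set
            if 2 * (x @ P @ f(x)) >= 0:
                return False
    return True

def refine_level(lo=0.0, hi=4.0, iters=30):
    """Largest certified level found by bisection (up to grid resolution)."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if level_is_certified(mid):
            lo = mid
        else:
            hi = mid
    return lo

c_star = refine_level()
```

For this toy system the true stability boundary of the unit certificate sits near c = 27/16, and the bisection converges to roughly that value; the grid resolution, not the optimizer, sets how much the certified level can overshoot the true one, which mirrors the premise's weak point.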
What would settle it
Apply the full pipeline to an NPV system whose largest robust region of attraction is already known from sum-of-squares methods; if the neural result yields a strictly smaller verified region, the maximality claim fails.
Original abstract
Nonlinear parameter-varying (NPV) systems are a class of nonlinear systems whose dynamics explicitly depend on time-varying external parameters, making them suitable for modeling real-world systems with dynamics variations. Traditional synthesis methods for NPV systems, such as sum-of-squares (SOS) optimization, are only applicable to control-affine systems, face scalability challenges and often lead to conservative results due to structural restrictions. To address these limitations, we propose Neural-NPV, a two-stage learning-based framework that leverages neural networks to jointly synthesize a PD controller and a PD Lyapunov function for an NPV system under input constraints. In the first stage, we utilize a computationally cheap, gradient-based counterexample-guided procedure to synthesize an approximately valid PD Lyapunov function and a PD controller. In the second stage, a level-set guided refinement is then conducted to obtain a valid Lyapunov function and controller while maximizing the robust region of attraction (R-ROA). We demonstrate the advantages of Neural-NPV in terms of applicability, performance, and scalability compared to SOS-based methods through numerical experiments involving an simple inverted pendulum with one scheduling parameter and a quadrotor system with three scheduling parameters.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Neural-NPV, a two-stage neural-network framework for jointly synthesizing parameter-dependent controllers and Lyapunov functions for nonlinear parameter-varying (NPV) systems subject to input constraints. Stage 1 performs gradient-based counterexample-guided learning to obtain approximate PD Lyapunov functions and controllers; Stage 2 applies level-set guided refinement to enforce validity while maximizing the robust region of attraction (R-ROA). Advantages over sum-of-squares methods are claimed in applicability, performance, and scalability, and are illustrated on an inverted pendulum (one scheduling parameter) and a quadrotor (three scheduling parameters).
Significance. If the refinement procedure can be shown to produce rigorously valid PD Lyapunov functions and controllers for general NPV systems, the work would supply a scalable, learning-based route to controller synthesis that avoids the structural restrictions and computational cost of SOS optimization on non-polynomial or high-dimensional parameter dependence.
major comments (2)
- [Section 3.2] Section 3.2 (level-set guided refinement): the manuscript asserts that this stage converts the approximate neural outputs into a rigorously valid PD Lyapunov function and controller that maximizes the R-ROA, yet supplies neither a convergence proof, a termination guarantee, nor a post-refinement certification (e.g., exact SOS verification on the final neural form) that holds for arbitrary nonlinear parameter dependence. Without such a guarantee, residual regions where V̇ > 0 may remain for some parameter trajectories, undermining the central claim of rigorous validity.
- [Section 4] Section 4 (numerical experiments): the reported examples demonstrate feasibility but do not include quantitative metrics (e.g., measured R-ROA volume, wall-clock time, or failure rate of the refinement stage) or direct head-to-head comparisons against SOS baselines on identical problem instances, so the claimed scalability and performance advantages cannot be assessed from the presented data.
minor comments (2)
- [Abstract] Abstract: 'an simple inverted pendulum' should read 'a simple inverted pendulum'.
- [Section 2] Notation for the robust region of attraction (R-ROA) is introduced without an explicit mathematical definition; a formal set-theoretic statement would improve clarity.
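For reference, a set-theoretic definition of the kind the referee asks for might read as follows; the notation is an assumption, not taken from the paper.

```latex
% For the closed loop \dot{x} = f(x, \theta(t), k(x, \theta(t))) with
% scheduling signals taking values in a compact set \Theta:
\mathcal{R} \;=\; \bigl\{\, x_0 \in \mathbb{R}^n \;:\; \forall\, \theta(\cdot),\;
  \theta(t) \in \Theta \ \forall t \ge 0 \;\Longrightarrow\;
  \lim_{t \to \infty} x\bigl(t; x_0, \theta(\cdot)\bigr) = 0 \,\bigr\}
```

Any sublevel set {x : max over θ ∈ Θ of V(x, θ) ≤ c} on which the Lyapunov inequalities are certified for every θ ∈ Θ is then an inner approximation of this set.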
Simulated Author's Rebuttal
We thank the referee for the constructive feedback, which helps clarify the scope of our theoretical claims and the need for stronger empirical support. We address each major comment below, indicating planned revisions to the manuscript.
Point-by-point responses
- Referee: [Section 3.2] Section 3.2 (level-set guided refinement): the manuscript asserts that this stage converts the approximate neural outputs into a rigorously valid PD Lyapunov function and controller that maximizes the R-ROA, yet supplies neither a convergence proof, a termination guarantee, nor a post-refinement certification (e.g., exact SOS verification on the final neural form) that holds for arbitrary nonlinear parameter dependence. Without such a guarantee, residual regions where V̇ > 0 may remain for some parameter trajectories, undermining the central claim of rigorous validity.
Authors: We agree that the level-set guided refinement lacks a general convergence proof or termination guarantee that would hold for arbitrary nonlinear parameter dependence. The procedure relies on iterative gradient updates over sampled level sets to enforce Lyapunov conditions and maximize the R-ROA, but it is fundamentally a numerical optimization heuristic whose success depends on sampling density and the optimization landscape. In the revised manuscript we will (i) explicitly state that rigorous validity is guaranteed only up to the chosen sampling resolution and numerical tolerances, (ii) add a limitations paragraph discussing the absence of a general proof, and (iii) report empirical success rates of the refinement stage on the presented examples. We will also note that, when the underlying dynamics admit a polynomial approximation, post-refinement SOS verification could be performed, but this is not applicable to the general NPV setting considered here. revision: partial
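The "validity up to sampling resolution" caveat in this response can be made concrete with a grid check of the Lyapunov decrease condition. The closed-loop system, grid density, and tolerance below are illustrative assumptions.

```python
import numpy as np

# Grid-based check that V(x) = x^T P x decreases along x_dot = A(theta) x
# inside a sublevel set, up to a numerical tolerance. The closed-loop
# matrix, tolerance, and grid are illustrative assumptions.

P = np.array([[2.0, 0.0], [0.0, 1.0]])

def a_cl(theta):
    """Closed-loop matrix of a toy NPV system; stable for theta in [0, 1]."""
    return np.array([[-1.0 - theta, 0.5], [-0.5, -2.0 + theta]])

def check_lyapunov(level=1.0, n_grid=41, n_theta=11, tol=1e-9):
    """Fraction of checked grid points violating Vdot <= -tol inside {V <= level}."""
    xs = np.linspace(-1.5, 1.5, n_grid)
    violations, total = 0, 0
    for theta in np.linspace(0.0, 1.0, n_theta):
        A = a_cl(theta)
        for x1 in xs:
            for x2 in xs:
                x = np.array([x1, x2])
                v = x @ P @ x
                if v < 1e-9 or v > level:
                    continue  # skip the origin and points outside the level set
                total += 1
                if 2 * (x @ P @ (A @ x)) > -tol:
                    violations += 1
    return violations / total

rate = check_lyapunov()
```

A zero violation rate on such a grid is evidence, not proof: the referee's point is precisely that violations finer than the grid spacing, or along unsampled parameter trajectories, remain possible.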
- Referee: [Section 4] Section 4 (numerical experiments): the reported examples demonstrate feasibility but do not include quantitative metrics (e.g., measured R-ROA volume, wall-clock time, or failure rate of the refinement stage) or direct head-to-head comparisons against SOS baselines on identical problem instances, so the claimed scalability and performance advantages cannot be assessed from the presented data.
Authors: We accept that the current experimental section is primarily qualitative. In the revised manuscript we will augment Section 4 with quantitative metrics: estimated R-ROA volumes obtained via Monte Carlo sampling over the joint state-parameter domain, average wall-clock times for both stages across repeated runs, and the refinement-stage success rate (fraction of trials in which the final networks satisfy the Lyapunov inequalities within a prescribed tolerance). For the inverted-pendulum example we will add a direct comparison against an SOS baseline on identical dynamics, reporting R-ROA volume and total synthesis time. For the quadrotor we will retain the feasibility demonstration while noting that SOS is not directly applicable due to non-polynomial terms and three-dimensional parameter dependence; we will include runtime scaling plots versus number of parameters to support the scalability claim. revision: yes
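The proposed Monte Carlo estimate of R-ROA volume can be sketched as follows; the Lyapunov function, level, and bounding box are illustrative stand-ins for the learned parameter-dependent certificate.

```python
import numpy as np

# Estimate the volume of the sublevel set {x : V(x) <= c} by uniform
# sampling over a bounding box. V and c are assumed stand-ins for the
# certificate produced by the refinement stage.

def v(x):
    """Toy Lyapunov function V(x) = 2*x1^2 + x2^2 (an assumed stand-in)."""
    return 2 * x[:, 0] ** 2 + x[:, 1] ** 2

def mc_volume(level=1.0, box=1.5, n=200_000, seed=0):
    rng = np.random.default_rng(seed)
    pts = rng.uniform(-box, box, size=(n, 2))
    frac = np.mean(v(pts) <= level)   # fraction of box samples inside the set
    return frac * (2 * box) ** 2      # scale by the box area

vol = mc_volume()
# The ellipse 2*x1^2 + x2^2 <= 1 has exact area pi / sqrt(2), so the
# estimate should land close to that value.
```

The standard error of such an estimate shrinks as 1/sqrt(n), so reporting the sample count alongside the estimated volumes, as the authors propose, is what makes the metric comparable across methods.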
- Unresolved after rebuttal: absence of a general convergence proof or post-refinement certification for the level-set guided refinement that holds for arbitrary nonlinear parameter dependence.
Circularity Check
No significant circularity in Neural-NPV derivation
Rationale
The paper presents Neural-NPV as a two-stage neural learning procedure: a gradient-based counterexample-guided stage for approximate PD Lyapunov/controller synthesis, followed by level-set guided refinement to enforce validity and maximize the R-ROA. No equations, claims, or steps in the abstract or the described framework reduce the output to a fitted quantity defined by the inputs themselves, and none rely on self-citation chains to carry load-bearing uniqueness or ansatz claims. The method is framed as an independent, scalable alternative to SOS, without self-definitional loops or renaming of known results as novel predictions.
Axiom & Free-Parameter Ledger
axioms (1)
- Domain assumption: NPV systems admit parameter-dependent Lyapunov functions and controllers that can be approximated by neural networks.
Lean theorems connected to this paper
- IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel (unclear)
  Relation between the paper passage and the cited Recognition theorem is unclear.
  Paper passage: "We parameterize our Lyapunov function ... V(x, θ) = x^T (ϕ_NN(x, θ) ϕ_NN^T(x, θ) + εI) x"
- IndisputableMonolith/Foundation/RealityFromDistinction.lean · reality_from_one_distinction (unclear)
  Relation between the paper passage and the cited Recognition theorem is unclear.
  Paper passage: "a level-set guided refinement is then conducted to obtain a valid Lyapunov function and controller while maximizing the robust region of attraction"
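The parameterization quoted in the first passage, V(x, θ) = x^T(ϕ_NN ϕ_NN^T + εI)x, is positive definite for any network output, since ϕϕ^T is positive semidefinite and εI lifts every eigenvalue by ε. A minimal numerical check, with a random stand-in network whose shapes and weights are illustrative assumptions:

```python
import numpy as np

# V(x, theta) = x^T (phi phi^T + eps*I) x is positive definite for ANY
# feature map phi: phi phi^T is positive semidefinite, and eps*I shifts
# all eigenvalues up by eps. The two-layer "network" below is a random
# stand-in, not the paper's architecture.

rng = np.random.default_rng(0)
W1 = rng.standard_normal((8, 3))   # input: 2 states + 1 scheduling parameter
W2 = rng.standard_normal((2, 8))

def phi_nn(x, theta):
    """Stand-in feature map phi(x, theta) in R^2."""
    return W2 @ np.tanh(W1 @ np.append(x, theta))

def v(x, theta, eps=1e-3):
    p = phi_nn(x, theta)
    m = np.outer(p, p) + eps * np.eye(2)  # eigenvalues >= eps > 0
    return x @ m @ x

# Evaluate V at random nonzero states and parameters; since
# V >= eps * |x|^2, every value is strictly positive.
samples = rng.standard_normal((100, 3))
min_val = min(v(s[:2], s[2]) for s in samples if np.linalg.norm(s[:2]) > 1e-8)
```

This explains why the parameterization sidesteps the hardest part of neural Lyapunov learning, positivity, and leaves only the decrease condition to the training loop.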
What do these tags mean?
- matches: The paper's claim is directly supported by a theorem in the formal canon.
- supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses: The paper appears to rely on the theorem as machinery.
- contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
- unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1] W. J. Rugh and J. S. Shamma, “Research on gain scheduling,” Automatica, vol. 36, no. 10, pp. 1401–1425, 2000.
- [2] J. Mohammadpour and C. W. Scherer, Control of Linear Parameter Varying Systems with Applications. Springer Science & Business Media, 2012.
- [3] R. Fu, J. Zeng, and Z. Duan, “H∞ mixed stabilization of nonlinear parameter-varying systems,” International Journal of Robust and Nonlinear Control, vol. 28, no. 17, pp. 5232–5246, 2018.
- [4] L. Lu, R. Fu, J. Zeng, and Z. Duan, “On the domain of attraction and local stabilization of nonlinear parameter-varying systems,” Int. J. Robust Nonlinear Control, vol. 30, no. 1, pp. 17–32, 2020.
- [5] P. Zhao, “Parameter-dependent control Lyapunov functions for stabilizing nonlinear parameter-varying systems,” IEEE Control Systems Letters, vol. 9, pp. 360–365, 2025.
- [6] J. S. Shamma and M. Athans, “Gain scheduling: Potential hazards and possible remedies,” IEEE Control Systems Magazine, vol. 12, no. 3, pp. 101–107, 1992.
- [7] P. A. Parrilo, Structured Semidefinite Programs and Semialgebraic Geometry Methods in Robustness and Optimization. PhD thesis, Massachusetts Institute of Technology, 2000.
- [8] A. A. Ahmadi and A. Majumdar, “DSOS and SDSOS optimization: more tractable alternatives to sum of squares and semidefinite optimization,” SIAM Journal on Applied Algebra and Geometry, vol. 3, no. 2, pp. 193–230, 2019.
- [9] S. Prajna, A. Papachristodoulou, and F. Wu, “Nonlinear control synthesis by sum of squares optimization: A Lyapunov-based approach,” in 5th Asian Control Conference, vol. 1, pp. 157–165, IEEE, 2004.
- [10] C. Dawson, S. Gao, and C. Fan, “Safe control with learned certificates: A survey of neural Lyapunov, barrier, and contraction methods for robotics and control,” IEEE Transactions on Robotics, vol. 39, no. 3, pp. 1749–1767, 2023.
- [11] S. M. Richards, F. Berkenkamp, and A. Krause, “The Lyapunov neural network: Adaptive stability certification for safe learning of dynamical systems,” in Conference on Robot Learning, pp. 466–476, PMLR, 2018.
- [12] Y.-C. Chang, N. Roohi, and S. Gao, “Neural Lyapunov control,” Advances in Neural Information Processing Systems, vol. 32, 2019.
- [13] D. Sun, S. Jha, and C. Fan, “Learning certified control using contraction metric,” in Conference on Robot Learning, 2020.
- [14] H. Tsukamoto and S.-J. Chung, “Neural contraction metrics for robust estimation and control: A convex optimization approach,” IEEE Control Systems Letters, vol. 5, no. 1, pp. 211–216, 2020.
- [15] A. Robey, H. Hu, L. Lindemann, H. Zhang, D. V. Dimarogonas, S. Tu, and N. Matni, “Learning control barrier functions from expert demonstrations,” in Proc. CDC, pp. 3717–3724, 2020.
- [17] H.-J. Ma and G.-H. Yang, “Fault-tolerant control synthesis for a class of nonlinear systems: Sum of squares optimization approach,” Int. J. Robust Nonlinear Control, vol. 19, no. 5, pp. 591–610, 2009.
- [18] D. Zhao and J. Wang, “An improved nonlinear H∞ synthesis for parameter-dependent polynomial nonlinear systems using SOS programming,” in American Control Conference, pp. 796–801, 2009.
- [19] S. Saat, D. Huang, S. K. Nguang, and A. Hamidon, “Nonlinear state feedback control for a class of polynomial nonlinear discrete-time systems with norm-bounded uncertainties: An integrator approach,” Journal of the Franklin Institute, vol. 350, no. 7, pp. 1739–1752, 2013.
- [20] C. Dawson, Z. Qin, S. Gao, and C. Fan, “Safe nonlinear control using robust neural Lyapunov-barrier functions,” in Conference on Robot Learning, pp. 1724–1735, PMLR, 2022.
- [21] P. Zhao, R. Ghabcheloo, Y. Cheng, H. Abdi, and N. Hovakimyan, “Convex synthesis of control barrier functions under input constraints,” IEEE Control Systems Letters, vol. 7, pp. 3102–3107, 2023.
- [22] H. K. Khalil, Nonlinear Systems. Englewood Cliffs, NJ: Prentice Hall, 2002.
- [23] L. Yang, H. Dai, Z. Shi, C.-J. Hsieh, R. Tedrake, and H. Zhang, “Lyapunov-stable neural control for state and output feedback: A novel formulation for efficient synthesis and verification,” arXiv preprint arXiv:2404.07956, 2024.
- [24] A. Abate, C. David, P. Kesseli, D. Kroening, and E. Polgreen, “Counterexample guided inductive synthesis modulo theories,” in International Conference on Computer Aided Verification, pp. 270–288, Springer, 2018.
- [25] Z. Wu, S. Cheng, P. Zhao, A. Gahlawat, K. A. Ackerman, A. Lakshmanan, C. Yang, J. Yu, and N. Hovakimyan, “L1Quad: L1 adaptive augmentation of geometric control for agile quadrotors with performance guarantees,” IEEE Transactions on Control Systems Technology, vol. 33, no. 2, pp. 597–612, 2025.
- [26] M. Chen, S. Xiong, and Q. Wu, “Tracking flight control of quadrotor based on disturbance observer,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 51, no. 3, pp. 1414–1423, 2019.