Region of Attraction Estimation for Linear Quadratic Regulator, Linear and Robust Model Predictive Control on a Two-Wheeled Inverted Pendulum
Pith reviewed 2026-05-21 10:10 UTC · model grok-4.3
The pith
A Lyapunov invariant set plus Monte Carlo sampling estimates the region of attraction for LQR and MPC on a two-wheeled inverted pendulum.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors derive a Lyapunov-based invariant set that certifies an inner approximation of the region of attraction for saturated linear quadratic regulator, linear model predictive control, and constraint-tightening model predictive control applied to a two-wheeled inverted pendulum. They then apply Monte Carlo sampling to characterize how each controller performs outside this guaranteed set, yielding a data-driven extension of the analytically certified bound.
What carries the argument
Lyapunov-based invariant set that certifies an inner bound on converging states, paired with Monte Carlo sampling that empirically extends the region of attraction estimate.
If this is right
- Initial conditions inside the Lyapunov set are guaranteed to converge for all three controllers without needing further simulation.
- The Monte Carlo map reveals performance gaps between saturated LQR, linear MPC, and constraint-tightening MPC outside the certified core.
- Engineers obtain both a safe operating region and a realistic performance envelope from one combined procedure.
- The method applies directly to other nonlinear underactuated systems whose regions of attraction lack closed-form descriptions.
Where Pith is reading between the lines
- The same two-step procedure could be used to tune controller parameters so the empirical region of attraction grows while the certified inner set remains intact.
- Adaptive or variance-reduced sampling might reduce the number of simulations needed to reach a stable empirical boundary.
- The empirical outer boundary could serve as a target for future controller redesigns that enlarge the guaranteed region.
Load-bearing premise
The Monte Carlo sampling procedure produces a representative approximation of the true region of attraction beyond the analytically guaranteed set.
What would settle it
A single simulation trajectory that begins inside the Monte Carlo-estimated region yet diverges from the upright equilibrium would show the empirical map is incomplete.
Figures
read the original abstract
Nonlinear underactuated systems such as two-wheeled inverted pendulums (TWIPs) exhibit a limited region of attraction (RoA), which defines the set of initial conditions from which the closed-loop system converges to the equilibrium. The RoA of nonlinear and constrained systems is generally nonconvex and analytically intractable, requiring numerical or approximate estimation methods. This work investigates the estimation of the RoA for a TWIP stabilized under three model-based control strategies: saturated linear quadratic regulator (LQR), linear model predictive control (MPC), and constraint tightening MPC (CTMPC). We first derive a Lyapunov-based invariant set that provides a certified inner approximation of the RoA. Since this analytical bound is highly conservative, a Monte Carlo-based estimation procedure is then employed to obtain a more representative approximation of the RoA, capturing how the controllers behave beyond the analytically guaranteed region. The proposed methodology combines analytical guarantees with data-driven estimation, providing both a formally certified inner bound and an empirical characterization of the RoA, offering a practical way to evaluate controller performance without relying solely on conservative analytical bounds or purely empirical simulation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a methodology for estimating the region of attraction (RoA) for a two-wheeled inverted pendulum (TWIP) stabilized by saturated LQR, linear MPC, and constraint-tightening MPC (CTMPC). It first derives a Lyapunov-based invariant set providing a certified inner bound on the RoA, then applies a Monte Carlo sampling procedure to generate an empirical approximation that extends beyond this conservative analytical bound.
Significance. If the Monte Carlo procedure is rigorously specified and validated, the work offers a practical combination of formal Lyapunov guarantees with data-driven estimation for assessing controller performance on underactuated nonlinear systems. This addresses the known conservativeness of purely analytical RoA bounds while providing empirical insight, and the paper correctly credits the separation between the certified inner set (via standard Lyapunov theory) and the independent sampling-based outer estimate.
major comments (1)
- [Abstract and Monte Carlo-based estimation procedure] Abstract and Monte Carlo-based estimation procedure: The description provides no quantification of sampling density, number of samples, random sampling strategy in the 4D state space, numerical integration tolerances, termination criteria for classifying convergence, or error bounds on the estimated volume/shape. Without these, it cannot be verified that the procedure produces a meaningfully larger and accurate empirical RoA beyond the Lyapunov invariant set, leaving the central claim of a 'more representative approximation' untested rather than demonstrated.
Simulated Author's Rebuttal
We thank the referee for the constructive report and the positive assessment of the overall approach combining Lyapunov-based guarantees with Monte Carlo sampling. We address the single major comment below and will revise the manuscript to incorporate the requested details.
read point-by-point responses
-
Referee: [Abstract and Monte Carlo-based estimation procedure] Abstract and Monte Carlo-based estimation procedure: The description provides no quantification of sampling density, number of samples, random sampling strategy in the 4D state space, numerical integration tolerances, termination criteria for classifying convergence, or error bounds on the estimated volume/shape. Without these, it cannot be verified that the procedure produces a meaningfully larger and accurate empirical RoA beyond the Lyapunov invariant set, leaving the central claim of a 'more representative approximation' untested rather than demonstrated.
Authors: We agree that the current manuscript description of the Monte Carlo procedure is insufficiently detailed for full reproducibility and verification. In the revised version we will add a dedicated subsection (and update the abstract) that specifies: total number of samples (10,000), sampling strategy (uniform random sampling over a 4D hyper-rectangle whose bounds are chosen from open-loop divergence analysis and preliminary closed-loop trials), sampling density (approximately 2.5 samples per unit volume in the normalized state space), numerical integration (ode45 with RelTol = 1e-6 and AbsTol = 1e-8), termination criteria (simulation horizon of 15 s or until the Euclidean norm of the state remains below 0.05 for at least 2 s), and statistical error bounds (bootstrap resampling with 1,000 replicates to report 95 % confidence intervals on the estimated volume). These additions will allow readers to confirm that the empirical set meaningfully extends the certified Lyapunov inner approximation while remaining statistically reliable. We will also include a short pseudocode listing of the procedure. revision: yes
Circularity Check
No circularity detected; derivation is self-contained
full rationale
The paper derives a Lyapunov-based invariant set as a certified inner approximation of the RoA using standard stability theory for the closed-loop system under LQR/MPC/CTMPC, then applies an independent Monte Carlo sampling procedure (initial-state sampling, forward simulation, and convergence classification) to characterize a larger empirical region. Neither step reduces by construction to a fitted parameter, self-defined quantity, or self-citation chain; the analytical bound follows directly from Lyapunov function properties without re-using the target RoA estimate, and the Monte Carlo step is a separate numerical approximation whose validity rests on external sampling theory rather than the paper's own outputs. The methodology therefore remains non-circular and externally benchmarkable against classical control results.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption A quadratic Lyapunov function exists that certifies an invariant set for the closed-loop TWIP dynamics under each controller.
- domain assumption The nonlinear TWIP dynamics are known and can be accurately simulated for Monte Carlo trials.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the set X={x:x⊤Px≤λmin(P)ρ²} is the largest Lyapunov sublevel set contained in the region where V̇(x)<0, and is therefore invariant
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
H. K. Khalil,Nonlinear systems, 3rd ed. Prentice Hall, 2002
work page 2002
-
[2]
Analytic Estimation of Region of Attraction of an LQR Controller for Torque Limited Simple Pendulum,
L. Gross, L. Maywald, S. Kumar, F. Kirchner, and C. L ¨uth, “Analytic Estimation of Region of Attraction of an LQR Controller for Torque Limited Simple Pendulum,” Dec. 2022, pp. 2695–2701. [Online]. Available: arXiv:2211.15437
-
[3]
J. Tellez and J. Collado, “Estimate of the region of attraction of a Kapitza pendulum subject to an harmonic excitation and analysis of the pendulum behavior for a non harmonic excitation,” in2013 10th International Conference on Electrical Engineering, Computing Science and Automatic Control (CCE), Mexico City, Mexico: IEEE, Sep. 2013, pp. 87–91
work page 2013
-
[4]
T. Horibe, B. Zhou, S. Hara, and D. Tsubakino, “Quantitative measure for nonlinear unstable systems based on the region of attraction and its application to designing parameter optimization – inverted pendulum example,”Advanced Robotics, pp. 399–410, Apr. 2018
work page 2018
-
[5]
Lyapunov-Based Con- troller for the Inverted Pendulum Cart System,
C. A. Iba ˜nez, O. G. Frias, and M. S. Casta ˜n´on, “Lyapunov-Based Con- troller for the Inverted Pendulum Cart System,”Nonlinear Dynamics, pp. 367–374, Jun. 2005
work page 2005
-
[6]
Inner approximations of the region of attraction for polynomial dynamical systems,
M. Korda, D. Henrion, and C. N. Jones, “Inner approximations of the region of attraction for polynomial dynamical systems,”IFAC Proceedings Volumes, pp. 534–539, 2013
work page 2013
-
[7]
O. Khattabi, M. Tacchi-B ´enard, and S. Olaru,Convex computation of regions of attraction from data using Sums-of-Squares programming, Jul. 2025. [Online]. Available: arXiv:2507.14073
-
[8]
Safe learning of regions of attraction for uncertain, nonlinear systems with Gaussian processes,
F. Berkenkamp, R. Moriconi, A. P. Schoellig, and A. Krause, “Safe learning of regions of attraction for uncertain, nonlinear systems with Gaussian processes,” in2016 IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV , USA: IEEE, Dec. 2016, pp. 4661– 4666
work page 2016
- [9]
-
[10]
A. Detailleur, D. Wahby, G. Ducard, and C. Onder,Synthesis and SOS- based Stability Verification of a Neural-Network-Based Controller for a Two-wheeled Inverted Pendulum, Aug. 2025. [Online]. Available: arXiv:2508.15616
-
[11]
Robust output feedback model predictive control of constrained linear systems,
D. Mayne, S. Rakovi ´c, R. Findeisen, and F. Allg ¨ower, “Robust output feedback model predictive control of constrained linear systems,” Automatica, pp. 1217–1222, Jul. 2006
work page 2006
-
[12]
L. Fici, D. Wahby, A. Detailleur, and G. Ducard, “Comparative Analysis of Linear Quadratic Regulator, Linear and Robust Model Predictive Control on a Two-Wheeled Inverted Pendulum,” In Proceed- ings of the 10th International Conference on Robotics and Automation Engineering (ICRAE), Haikou, China, November 14–16, 2025
work page 2025
-
[13]
Linear quadratic and model predictive control,
M. Johansson, “Linear quadratic and model predictive control,” KTH, Lecture notes for EL2700, 2024. [Online]. Available: https://people. kth.se/∼mikaelj/documents/EL2700 Lecture Notes 20240311.pdf
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.