A bi-level optimization combining outer signaling strategy search with inner game-tree LQR solves approximate Nash equilibria for the informed player in LQ differential games at ~10 Hz.
Differen- tiable mpc for end-to-end planning and control
4 Pith papers cite this work. Polarity classification is still indexing.
years
2026 4verdicts
UNVERDICTED 4representative citing papers
MVAdapt conditions end-to-end autonomous driving policies on explicit vehicle physics to achieve better zero-shot transfer and few-shot calibration across different vehicles in CARLA simulation.
Equilibrium Matching decoder substitution in π₀ improves RoboTwin success from 40.4% to 50.2% across 19 tasks and reaches 87.0% on LIBERO-10.
Combining gradient-based policy optimization with recursive system identification in differentiable MPC ensures convergence to an optimal controller under model uncertainty.
citing papers explorer
-
Fast Strategy Solving for the Informed Player in Two-Player Zero-Sum Linear-Quadratic Differential Games with One-Sided Information
A bi-level optimization combining outer signaling strategy search with inner game-tree LQR solves approximate Nash equilibria for the informed player in LQ differential games at ~10 Hz.
-
MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving
MVAdapt conditions end-to-end autonomous driving policies on explicit vehicle physics to achieve better zero-shot transfer and few-shot calibration across different vehicles in CARLA simulation.
-
$\pi_0$-EqM: Equilibrium Matching for Closed-Loop Vision-Language-Action Control
Equilibrium Matching decoder substitution in π₀ improves RoboTwin success from 40.4% to 50.2% across 19 tasks and reaches 87.0% on LIBERO-10.
-
Policy Optimization with Differentiable MPC: Convergence Analysis under Uncertainty
Combining gradient-based policy optimization with recursive system identification in differentiable MPC ensures convergence to an optimal controller under model uncertainty.