YANN-RL initializes RL actor and critic networks with explicit multi-parametric linear MPC solutions via YANNs to start from linear optimal control performance and then learn nonlinear policies through online interaction.
AC4MPC: Actor-critic reinforcement learning for nonlinear model predictive control,
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
A learned policy rollout is refined by one Newton step via Riccati recursion, yielding quadratic reduction in suboptimality for nonlinear MPC, shown on quadcopter trajectory tracking.
YANN-RL is tested on three PC-Gym chemical process case studies, showing reduced training time and near-NMPC performance compared to PPO, SAC, DDPG, and TD3.
citing papers explorer
-
Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)
YANN-RL initializes RL actor and critic networks with explicit multi-parametric linear MPC solutions via YANNs to start from linear optimal control performance and then learn nonlinear policies through online interaction.
-
Rollout Then Optimize: A One-Step Newton Refinement of Learned Policies for Nonlinear Model Predictive Control
A learned policy rollout is refined by one Newton step via Riccati recursion, yielding quadratic reduction in suboptimality for nonlinear MPC, shown on quadcopter trajectory tracking.
-
Reinforcement Learning-based Control via Y-wise Affine Neural Networks: Comparative Case Studies for Chemical Processes
YANN-RL is tested on three PC-Gym chemical process case studies, showing reduced training time and near-NMPC performance compared to PPO, SAC, DDPG, and TD3.