An adaptive RL-MPC framework uses RL to inform MPPI sampling and aggregates MPPI samples for value estimation, delivering up to 72% higher success rates and 2.1x faster convergence on tasks like race driving and Lunar Lander with obstacles.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
verdicts
UNVERDICTED 3representative citing papers
SAT-RTS introduces a pipeline that abstracts high-dimensional RTS sequences into discrete tactical labels and hierarchical visualizations to improve interpretability of AI micromanagement.
MA-DHRL-OM decomposes overlay multicast routing into hierarchical stages with multi-agent RL to improve delay, bandwidth use, and stability over prior methods.
citing papers explorer
-
Learning to Plan, Planning to Learn: Adaptive Hierarchical RL-MPC for Sample-Efficient Decision Making
An adaptive RL-MPC framework uses RL to inform MPPI sampling and aggregates MPPI samples for value estimation, delivering up to 72% higher success rates and 2.1x faster convergence on tasks like race driving and Lunar Lander with obstacles.