MTA-RL predicts 3D driving affordances from multi-modal sensors with a transformer and uses them as the observation space for an RL policy, yielding better route completion and generalization than baselines in CARLA urban scenarios.
Vlm-rl: A unified vision language models and reinforcement learning framework for safe autonomous driving
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
Reflective Self-Adaptation combines failure-reflective reinforcement learning with success-guided imitation learning to enable faster and more reliable task adaptation for pre-trained Vision-Language-Action models.
citing papers explorer
-
MTA-RL: Robust Urban Driving via Multi-modal Transformer-based 3D Affordances and Reinforcement Learning
MTA-RL predicts 3D driving affordances from multi-modal sensors with a transformer and uses them as the observation space for an RL policy, yielding better route completion and generalization than baselines in CARLA urban scenarios.
-
Reflection-Based Task Adaptation for Self-Improving VLA
Reflective Self-Adaptation combines failure-reflective reinforcement learning with success-guided imitation learning to enable faster and more reliable task adaptation for pre-trained Vision-Language-Action models.