Introduces CARLA-Air simulator for air-ground VLA evaluation and shows that current aerial VLA models track ground partners but fail to achieve stable cooperative behavior under text-based interfaces.
TranSimHub: A unified air-ground simulation platform for multi-modal perception and decision-making
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3representative citing papers
ReasonLight uses multimodal foundation models to refine RL-proposed traffic signal phases based on camera images and sensor data, enabling zero-shot adaptation to unseen events such as emergency vehicle priority.
CARLA-Air unifies CARLA urban driving and AirSim drone flight into one high-fidelity simulation with preserved APIs for air-ground embodied AI research.
citing papers explorer
-
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence
CARLA-Air unifies CARLA urban driving and AirSim drone flight into one high-fidelity simulation with preserved APIs for air-ground embodied AI research.