arXiv preprint arXiv:2603.14363 , year=

AerialVLA: A Vision-Language-Action Model for UAV Navigation via Minimalist End-to-End Control , author= · arXiv 2603.14363

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

LiteVLA-H: Dual-Rate Vision-Language-Action Inference for Onboard Aerial Guidance and Semantic Perception

cs.CV · 2026-04-27 · unverdicted · novelty 5.0 · 2 refs

LiteVLA-H delivers 19.74 Hz action tokens and 6 Hz semantic outputs on Jetson Orin via dual-rate scheduling and mixed fine-tuning, outperforming recent VLA baselines in edge action rate while preserving descriptive competence.

The Unified Autonomy Stack: Toward a Blueprint for Generalizable Robot Autonomy

cs.RO · 2026-05-12 · accept · novelty 4.0

An open-sourced Unified Autonomy Stack fuses LiDAR, radar, vision and inertial data with sampling-based planning and control barrier functions to deliver resilient autonomy on aerial and ground robots in challenging real-world settings.

citing papers explorer

Showing 2 of 2 citing papers.

LiteVLA-H: Dual-Rate Vision-Language-Action Inference for Onboard Aerial Guidance and Semantic Perception cs.CV · 2026-04-27 · unverdicted · none · ref 12 · 2 links
LiteVLA-H delivers 19.74 Hz action tokens and 6 Hz semantic outputs on Jetson Orin via dual-rate scheduling and mixed fine-tuning, outperforming recent VLA baselines in edge action rate while preserving descriptive competence.
The Unified Autonomy Stack: Toward a Blueprint for Generalizable Robot Autonomy cs.RO · 2026-05-12 · accept · none · ref 273
An open-sourced Unified Autonomy Stack fuses LiDAR, radar, vision and inertial data with sampling-based planning and control barrier functions to deliver resilient autonomy on aerial and ground robots in challenging real-world settings.

arXiv preprint arXiv:2603.14363 , year=

fields

years

verdicts

representative citing papers

citing papers explorer