A zero-shot unified agent for VLN-CE, ObjectNav, EQA and Aerial-VLN on wheeled, quadruped, humanoid and UAV platforms that translates language and vision inputs into actions via MLLMs plus TDM and SCB mechanisms, matching trained foundation models on multiple benchmarks.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.RO 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
DynFly adds a B-spline and flow-matching trajectory layer with UAV-specific dynamic losses to existing UAV-VLN systems, yielding 4.69 NDTW and 4.51 m NE gains on the OpenUAV unseen split.
citing papers explorer
-
Uni-LaViRA: Language-Vision-Robot Actions Translation for Unified Embodied Navigation
A zero-shot unified agent for VLN-CE, ObjectNav, EQA and Aerial-VLN on wheeled, quadruped, humanoid and UAV platforms that translates language and vision inputs into actions via MLLMs plus TDM and SCB mechanisms, matching trained foundation models on multiple benchmarks.
-
DynFly: Dynamic-Aware Continuous Trajectory Generation for UAV Vision-Language Navigation in Urban Environments
DynFly adds a B-spline and flow-matching trajectory layer with UAV-specific dynamic losses to existing UAV-VLN systems, yielding 4.69 NDTW and 4.51 m NE gains on the OpenUAV unseen split.