UAV-ground visual tracking: A unified dataset and collaborative learning approach. IEEE Transactions on Circuits and Systems for Video Technology, 34(5):3619–3632

Dengdi Sun, Leilei Cheng, Song Chen, Chenglong Li, Yun Xiao, Bin Luo · 2023

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it


representative citing papers

UAV-Track VLA: Embodied Aerial Tracking via Vision-Language-Action Models

cs.CV · 2026-04-02 · conditional · novelty 6.0

UAV-Track VLA modifies the π0.5 VLA architecture with temporal compression and dual-branch decoding to reach 61.76% success and 269.65 average frames in long-distance pedestrian tracking on a new 890K-frame UAV dataset, while cutting inference latency by 33.4%.

citing papers explorer

Showing 1 of 1 citing paper.

UAV-Track VLA: Embodied Aerial Tracking via Vision-Language-Action Models

cs.CV · 2026-04-02 · conditional · none · ref 17
