RefAerial is a new benchmark dataset for text-based object detection in aerial imagery, accompanied by an SCS model that handles scale diversity better than prior ground-image methods.
Ref-nms: Breaking proposal bottlenecks in two-stage referring expression grounding
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2verdicts
UNVERDICTED 2representative citing papers
ThinkDeeper introduces a world-model-based reasoning step that predicts future spatial states to improve multimodal visual grounding for autonomous vehicles, achieving top results on Talk2Car and other benchmarks.
citing papers explorer
-
RefAerial: A Benchmark and Approach for Referring Detection in Aerial Images
RefAerial is a new benchmark dataset for text-based object detection in aerial imagery, accompanied by an SCS model that handles scale diversity better than prior ground-image methods.
-
Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
ThinkDeeper introduces a world-model-based reasoning step that predicts future spatial states to improve multimodal visual grounding for autonomous vehicles, achieving top results on Talk2Car and other benchmarks.