Dota: A large-scale dataset for object detection in aerial images

Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang · 2018

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

GeoMMBench and GeoMMAgent: Toward Expert-Level Multimodal Intelligence in Geoscience and Remote Sensing

cs.CV · 2026-04-10 · unverdicted · novelty 7.0

GeoMMBench reveals deficiencies in current multimodal LLMs for geoscience tasks while GeoMMAgent demonstrates that tool-integrated agents achieve significantly higher performance.

Getting the Numbers Right$\unicode{x2014}$Modelling Multi-Class Object Counting in Dense and Varied Scenes

cs.CV · 2025-10-02 · unverdicted · novelty 7.0

A Twins-SVT vision transformer backbone with multiscale CNN decoder and Category Focus Module auxiliary task reduces MAE by 33-64% on VisDrone and iSAID multi-class counting benchmarks versus prior density estimators.

citing papers explorer

Showing 2 of 2 citing papers.

GeoMMBench and GeoMMAgent: Toward Expert-Level Multimodal Intelligence in Geoscience and Remote Sensing cs.CV · 2026-04-10 · unverdicted · none · ref 57
GeoMMBench reveals deficiencies in current multimodal LLMs for geoscience tasks while GeoMMAgent demonstrates that tool-integrated agents achieve significantly higher performance.
Getting the Numbers Right$\unicode{x2014}$Modelling Multi-Class Object Counting in Dense and Varied Scenes cs.CV · 2025-10-02 · unverdicted · none · ref 30
A Twins-SVT vision transformer backbone with multiscale CNN decoder and Category Focus Module auxiliary task reduces MAE by 33-64% on VisDrone and iSAID multi-class counting benchmarks versus prior density estimators.

Dota: A large-scale dataset for object detection in aerial images

fields

years

verdicts

representative citing papers

citing papers explorer