xView: Objects in Context in Overhead Imagery

Darius Lam , Richard Kuzma , Kevin McGee , Samuel Dooley , Michael Laielli , Matthew Klaric , Yaroslav Bulatov , Brendan McCord

Authors on Pith no claims yet

classification 💻 cs.CV

keywords imagerydetectiondatasetsobjectoverheadxviewdatasetobjects

0 comments

read the original abstract

We introduce a new large-scale dataset for the advancement of object detection techniques and overhead object detection research. This satellite imagery dataset enables research progress pertaining to four key computer vision frontiers. We utilize a novel process for geospatial category detection and bounding box annotation with three stages of quality control. Our data is collected from WorldView-3 satellites at 0.3m ground sample distance, providing higher resolution imagery than most public satellite imagery datasets. We compare xView to other object detection datasets in both natural and overhead imagery domains and then provide a baseline analysis using the Single Shot MultiBox Detector. xView is one of the largest and most diverse publicly available object-detection datasets to date, with over 1 million objects across 60 classes in over 1,400 km^2 of imagery.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

UHR-Micro: Diagnosing and Mitigating the Resolution Illusion in Earth Observation VLMs
cs.CV 2026-05 unverdicted novelty 7.0

VLMs show a resolution illusion on UHR Earth observation imagery where higher resolution does not improve micro-target perception; UHR-Micro benchmark and MAP-Agent address this via evidence-centered active inspection.
EO-Gym: A Multimodal, Interactive Environment for Earth Observation Agents
cs.AI 2026-05 unverdicted novelty 7.0

EO-Gym supplies an executable multimodal environment and 9k-trajectory benchmark that turns Earth Observation into a tool-using, multi-step reasoning task, revealing that current VLMs struggle on temporal and cross-se...
Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection
cs.CV 2026-04 unverdicted novelty 7.0

Noise2Map repurposes diffusion model denoising into a direct predictor for semantic segmentation and change detection tasks in remote sensing, achieving top average ranks on benchmark datasets.
Adaptive Slicing-Assisted Hyper Inference for Enhanced Small Object Detection in High-Resolution Imagery
cs.CV 2026-04 unverdicted novelty 7.0

ASAHI adaptively slices high-res images into 6 or 12 patches, adds slicing-assisted fine-tuning, and uses Cluster-DIoU-NMS to hit 56.8% mAP on VisDrone2019 and 22.7% on xView while running 20-25% faster than fixed sli...
Generalized Small Object Detection:A Point-Prompted Paradigm and Benchmark
cs.CV 2026-04 unverdicted novelty 7.0

TinySet-9M dataset and DEAL point-prompted framework deliver 31.4% relative AP75 gain over supervised baselines for small object detection with one click at inference and generalization to unseen categories.
HMR-Net: Hierarchical Modular Routing for Cross-Domain Object Detection in Aerial Images
cs.CV 2026-04 unverdicted novelty 6.0

HMR-Net introduces hierarchical routing with global dataset-level and local scene-level modularity plus conditional experts to improve cross-domain aerial object detection and enable novel category recognition without...