pith. the verified trust layer for science. sign in

arxiv: 1802.07856 · v1 · pith:XICCY2WAnew · submitted 2018-02-22 · 💻 cs.CV

xView: Objects in Context in Overhead Imagery

classification 💻 cs.CV
keywords imagerydetectiondatasetsobjectoverheadxviewdatasetobjects
0
0 comments X p. Extension
Add this Pith Number to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{XICCY2WA}

Prints a linked pith:XICCY2WA badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We introduce a new large-scale dataset for the advancement of object detection techniques and overhead object detection research. This satellite imagery dataset enables research progress pertaining to four key computer vision frontiers. We utilize a novel process for geospatial category detection and bounding box annotation with three stages of quality control. Our data is collected from WorldView-3 satellites at 0.3m ground sample distance, providing higher resolution imagery than most public satellite imagery datasets. We compare xView to other object detection datasets in both natural and overhead imagery domains and then provide a baseline analysis using the Single Shot MultiBox Detector. xView is one of the largest and most diverse publicly available object-detection datasets to date, with over 1 million objects across 60 classes in over 1,400 km^2 of imagery.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. UHR-Micro: Diagnosing and Mitigating the Resolution Illusion in Earth Observation VLMs

    cs.CV 2026-05 unverdicted novelty 7.0

    VLMs show a resolution illusion on UHR Earth observation imagery where higher resolution does not improve micro-target perception; UHR-Micro benchmark and MAP-Agent address this via evidence-centered active inspection.

  2. EO-Gym: A Multimodal, Interactive Environment for Earth Observation Agents

    cs.AI 2026-05 unverdicted novelty 7.0

    EO-Gym supplies an executable multimodal environment and 9k-trajectory benchmark that turns Earth Observation into a tool-using, multi-step reasoning task, revealing that current VLMs struggle on temporal and cross-se...

  3. Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection

    cs.CV 2026-04 unverdicted novelty 7.0

    Noise2Map repurposes diffusion model denoising into a direct predictor for semantic segmentation and change detection tasks in remote sensing, achieving top average ranks on benchmark datasets.

  4. Adaptive Slicing-Assisted Hyper Inference for Enhanced Small Object Detection in High-Resolution Imagery

    cs.CV 2026-04 unverdicted novelty 7.0

    ASAHI adaptively slices high-res images into 6 or 12 patches, adds slicing-assisted fine-tuning, and uses Cluster-DIoU-NMS to hit 56.8% mAP on VisDrone2019 and 22.7% on xView while running 20-25% faster than fixed sli...

  5. Generalized Small Object Detection:A Point-Prompted Paradigm and Benchmark

    cs.CV 2026-04 unverdicted novelty 7.0

    TinySet-9M dataset and DEAL point-prompted framework deliver 31.4% relative AP75 gain over supervised baselines for small object detection with one click at inference and generalization to unseen categories.

  6. Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems

    cs.CV 2026-01 unverdicted novelty 7.0

    The paper delivers the first comprehensive review and unified taxonomy of agentic AI in remote sensing, covering single-agent copilots, multi-agent systems, planning mechanisms, benchmarks, and a roadmap while noting ...

  7. MMLANDMARKS: a Cross-View Instance-Level Benchmark for Geo-Spatial Understanding

    cs.CV 2025-12 conditional novelty 7.0

    MMLandmarks supplies 197k aerial and 329k ground images plus text and GPS for 18,557 landmarks to benchmark multimodal geo-spatial understanding.

  8. HMR-Net: Hierarchical Modular Routing for Cross-Domain Object Detection in Aerial Images

    cs.CV 2026-04 unverdicted novelty 6.0

    HMR-Net introduces hierarchical routing with global dataset-level and local scene-level modularity plus conditional experts to improve cross-domain aerial object detection and enable novel category recognition without...