Vrsbench: A versatile vision-language benchmark dataset for remote sensing image understanding

Vrsbench: A versatile vision-language benchmark dataset for remote sensing image understanding , author= · 2024 · arXiv 2406.12384

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

RSRCC: A Remote Sensing Regional Change Comprehension Benchmark Constructed via Retrieval-Augmented Best-of-N Ranking

cs.CV · 2026-04-22 · unverdicted · novelty 7.0

RSRCC is a new 126k-question benchmark for fine-grained remote sensing change question-answering, constructed via a hierarchical semi-supervised pipeline with retrieval-augmented Best-of-N ranking.

Visual Reasoning Agent: Robust Vision Systems in Remote Sensing via Inference-Time Scaling

cs.CV · 2025-09-19 · unverdicted · novelty 5.0

VRA is a training-free agentic framework that orchestrates off-the-shelf LVLMs with a reasoning model via iterative verification and refinement, raising accuracy on remote sensing VQA from 52.8% to 78.8% and delivering up to 40.67% gains on hard question types.

UniReason-Med: A Shared Grounded Reasoning Interface for 2D-to-3D Transfer in Medical VQA

cs.CV · 2026-06-10 · unverdicted · novelty 4.0

UniReason-Med introduces a unified framework for 2D and 3D medical VQA with shared grounded reasoning, trained on a 220K dataset, claiming that joint 2D+3D supervision improves 3D performance over 3D-only training.

citing papers explorer

Showing 3 of 3 citing papers after filters.

RSRCC: A Remote Sensing Regional Change Comprehension Benchmark Constructed via Retrieval-Augmented Best-of-N Ranking cs.CV · 2026-04-22 · unverdicted · none · ref 20
RSRCC is a new 126k-question benchmark for fine-grained remote sensing change question-answering, constructed via a hierarchical semi-supervised pipeline with retrieval-augmented Best-of-N ranking.
Visual Reasoning Agent: Robust Vision Systems in Remote Sensing via Inference-Time Scaling cs.CV · 2025-09-19 · unverdicted · none · ref 8
VRA is a training-free agentic framework that orchestrates off-the-shelf LVLMs with a reasoning model via iterative verification and refinement, raising accuracy on remote sensing VQA from 52.8% to 78.8% and delivering up to 40.67% gains on hard question types.
UniReason-Med: A Shared Grounded Reasoning Interface for 2D-to-3D Transfer in Medical VQA cs.CV · 2026-06-10 · unverdicted · none · ref 68
UniReason-Med introduces a unified framework for 2D and 3D medical VQA with shared grounded reasoning, trained on a 220K dataset, claiming that joint 2D+3D supervision improves 3D performance over 3D-only training.

Vrsbench: A versatile vision-language benchmark dataset for remote sensing image understanding

fields

years

verdicts

representative citing papers

citing papers explorer