arXiv preprint arXiv:2501.14548 (2025)

Shui, Z · 2025 · arXiv 2501.14548

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Lost in Volume: The CT-SpatialVQA Benchmark for Evaluating Semantic-Spatial Understanding of 3D Medical Vision-Language Models

cs.CV · 2026-05-09 · unverdicted · novelty 7.0

CT-SpatialVQA benchmark shows 3D medical VLMs achieve only 34% average accuracy on semantic-spatial reasoning tasks in CT volumes, often below random chance.

JANUS: Anatomy-Conditioned Gating for Robust CT Triage Under Distribution Shift

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

JANUS conditions Vision Transformer embeddings on macro-radiomic priors via anatomically guided gating, reaching macro-AUROC 0.88 on an internal test set of 5082 cases and 0.87 on an external set of 2000 cases while improving calibration and reducing high-confidence false positives under domainshift

CA-GCL: Cross-Anatomy Global-Local Contrastive Learning for Robust 3D Medical Image Understanding

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

CA-GCL adds global contrastive separation and clinical text augmentation to fine-grained vision-language pretraining, reducing textual embedding collapse and prompt variance in 3D medical image tasks.

Enhancing Fine-Grained Spatial Grounding in 3D CT Report Generation via Discriminative Guidance

cs.CV · 2026-04-12 · unverdicted · novelty 6.0

DCP-PD improves macro F1 scores on CT report generation benchmarks and introduces a hierarchical location-aware evaluation protocol that reveals ongoing challenges in pathology spatial grounding.

citing papers explorer

Showing 4 of 4 citing papers.

Lost in Volume: The CT-SpatialVQA Benchmark for Evaluating Semantic-Spatial Understanding of 3D Medical Vision-Language Models cs.CV · 2026-05-09 · unverdicted · none · ref 16
CT-SpatialVQA benchmark shows 3D medical VLMs achieve only 34% average accuracy on semantic-spatial reasoning tasks in CT volumes, often below random chance.
JANUS: Anatomy-Conditioned Gating for Robust CT Triage Under Distribution Shift cs.CV · 2026-05-13 · unverdicted · none · ref 19
JANUS conditions Vision Transformer embeddings on macro-radiomic priors via anatomically guided gating, reaching macro-AUROC 0.88 on an internal test set of 5082 cases and 0.87 on an external set of 2000 cases while improving calibration and reducing high-confidence false positives under domainshift
CA-GCL: Cross-Anatomy Global-Local Contrastive Learning for Robust 3D Medical Image Understanding cs.CV · 2026-05-13 · unverdicted · none · ref 18
CA-GCL adds global contrastive separation and clinical text augmentation to fine-grained vision-language pretraining, reducing textual embedding collapse and prompt variance in 3D medical image tasks.
Enhancing Fine-Grained Spatial Grounding in 3D CT Report Generation via Discriminative Guidance cs.CV · 2026-04-12 · unverdicted · none · ref 37
DCP-PD improves macro F1 scores on CT report generation benchmarks and introduces a hierarchical location-aware evaluation protocol that reveals ongoing challenges in pathology spatial grounding.

arXiv preprint arXiv:2501.14548 (2025)

fields

years

verdicts

representative citing papers

citing papers explorer