Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 2, dataset 1

fields: cs.CV 4

years: 2026 4

representative citing papers

RadThinking: A Dataset for Longitudinal Clinical Reasoning in Radiology

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

RadThinking releases a large longitudinal CT VQA dataset stratified into foundation perception questions, single-rule reasoning questions, and compositional multi-step chains grounded in clinical reporting standards for cancer screening.

Beyond Masks: The Case for Medical Image Parsing

cs.CV · 2026-05-12 · unverdicted · novelty 5.0

Medical image parsing is proposed as the central output for the field instead of masks, with an audit showing that none of eleven representative systems produces a well-formed parse containing attributes, relationships, and closure.

citing papers explorer

  • CXR-ContraBench: Benchmarking Negated-Option Attraction in Medical VLMs
    cs.CV · 2026-05-07 · conditional · none · ref 5

    Medical VLMs frequently select negated options that contradict visible chest X-ray findings, achieving only ~30% accuracy on direct presence probes, but a post-hoc consistency verifier raises accuracy above 95%.

  • Beyond Masks: The Case for Medical Image Parsing
    cs.CV · 2026-05-12 · unverdicted · none · ref 10

    Medical image parsing is proposed as the central output for the field instead of masks, with an audit showing that none of eleven representative systems produces a well-formed parse containing attributes, relationships, and closure.