When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains.arXiv preprint arXiv:2603.01301, 2026

Ahmadreza Jeddi, Kimia Shaban, Negin Baghbanzadeh, Natasha Sharan, Abhishek Moturu, Elham Dolatabadi, Babak Taati · 2026 · arXiv 2603.01301

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

AVIS: Adaptive Test-Time Scaling for Vision-Language Models

cs.CV · 2026-06-10 · unverdicted · novelty 6.0

AVIS is an adaptive policy that jointly scales visual context via key-based token pruning and reasoning via difficulty-predicted self-consistency to improve the accuracy-compute curve on image and video tasks.

PathoSage: Towards Multi-Source Evidence Adjudication in Pathology via Experience-Aware Agentic Workflow

cs.AI · 2026-05-18 · unverdicted · novelty 5.0

PathoSage is a three-stage framework using Structured Evidence Deliberation and a Beta-Bernoulli experience system to improve patch-level pathology reasoning by mitigating hallucinations and tool conflicts.

citing papers explorer

Showing 2 of 2 citing papers after filters.

AVIS: Adaptive Test-Time Scaling for Vision-Language Models cs.CV · 2026-06-10 · unverdicted · none · ref 22
AVIS is an adaptive policy that jointly scales visual context via key-based token pruning and reasoning via difficulty-predicted self-consistency to improve the accuracy-compute curve on image and video tasks.
PathoSage: Towards Multi-Source Evidence Adjudication in Pathology via Experience-Aware Agentic Workflow cs.AI · 2026-05-18 · unverdicted · none · ref 67
PathoSage is a three-stage framework using Structured Evidence Deliberation and a Beta-Bernoulli experience system to improve patch-level pathology reasoning by mitigating hallucinations and tool conflicts.

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains.arXiv preprint arXiv:2603.01301, 2026

fields

years

verdicts

representative citing papers

citing papers explorer