arXiv preprint arXiv:2506.13793 (2025)

Med-refl: Medical reasoning enhancement via self-corrected fine-grained reflection · 2025 · arXiv 2506.13793

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine

cs.CV · 2026-03-02 · conditional · novelty 6.0

Chain-of-thought underperforms direct answering in medical VQA due to a perception bottleneck, but ROI cues and textual grounding interventions can improve results and reverse the gap.

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

cs.CV · 2026-06-30 · unverdicted · novelty 5.0

MRPO is a step-aware RL method that penalizes early reasoning errors exponentially more when the final answer is incorrect, reducing early-stage failures from 64% to 13% and outperforming baselines including larger models on medical VQA tasks.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine cs.CV · 2026-03-02 · conditional · none · ref 24
Chain-of-thought underperforms direct answering in medical VQA due to a perception bottleneck, but ROI cues and textual grounding interventions can improve results and reverse the gap.
Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning cs.CV · 2026-06-30 · unverdicted · none · ref 4
MRPO is a step-aware RL method that penalizes early reasoning errors exponentially more when the final answer is incorrect, reducing early-stage failures from 64% to 13% and outperforming baselines including larger models on medical VQA tasks.

arXiv preprint arXiv:2506.13793 (2025)

fields

years

verdicts

representative citing papers

citing papers explorer