Title resolution pending

with rank 8, alpha 32, dropout 0 · 2021 · arXiv 5540.5516

1 Pith paper cites this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

cs.CV · 2026-06-30 · unverdicted · novelty 5.0

MRPO is a step-aware RL method that penalizes early reasoning errors exponentially more when the final answer is incorrect, reducing early-stage failures from 64% to 13% and outperforming baselines including larger models on medical VQA tasks.

citing papers explorer


Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning · cs.CV · 2026-06-30 · unverdicted · none · ref 7
