Title resolution pending

**Replace references to “description”, “caption”, “rationale”** with wording that references **“the image

1 Pith paper cites this work. Polarity classification is still being indexed.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

cs.CL · 2025-04-10 · unverdicted · novelty 6.0

SFT induces pseudo-reasoning paths that undermine RL in LVLMs, while RL with GRPO and mixed perception-cognition rewards on the new VLAA-Thinking dataset produces more genuine reasoning and top leaderboard performance.
