Empowering small VLMs to think with dynamic memorization and exploration. arXiv preprint arXiv:2506.23061, 2025

Jiazhen Liu, Yuchuan Deng, Long Chen · 2025 · arXiv 2506.23061

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models

cs.CV · 2025-10-12 · unverdicted · novelty 6.0

ViSurf unifies SFT and RLVR for LVLMs in one training stage by injecting ground-truth labels into rollouts and applying novel reward controls, outperforming standalone and two-stage baselines on diverse benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models

cs.CV · 2025-10-12 · unverdicted · none · ref 19

ViSurf unifies SFT and RLVR for LVLMs in one training stage by injecting ground-truth labels into rollouts and applying novel reward controls, outperforming standalone and two-stage baselines on diverse benchmarks.
