pith. sign in

arXiv preprint arXiv:2505.10320

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 3 2025 4

roles

background 1

polarities

background 1

representative citing papers

Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs

cs.LG · 2025-08-27 · conditional · novelty 6.0

GSR jointly trains LLMs to generate candidate solutions and refine a superior final answer from them, achieving state-of-the-art performance on five mathematical benchmarks while transferring across model scales.

VRPRM: Process Reward Modeling via Visual Reasoning

cs.LG · 2025-08-05 · unverdicted · novelty 5.0

VRPRM combines visual reasoning with a two-stage SFT-plus-RL strategy to deliver higher-quality process reward modeling using far less annotated data than prior non-thinking PRMs.

citing papers explorer

Showing 7 of 7 citing papers.