InProceed- ings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Pa- pers), pages 9426–9439, Bangkok, Thailand

Math-shepherd: Verify, reinforce LLMs step-by-step without human annotations · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

ReProbe: Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models

cs.AI · 2025-11-09 · unverdicted · novelty 6.0

Lightweight probes on LLM internal states verify reasoning steps as effectively as process reward models up to 810 times larger across math, planning, and QA tasks.

citing papers explorer

Showing 1 of 1 citing paper.

ReProbe: Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models cs.AI · 2025-11-09 · unverdicted · none · ref 3
Lightweight probes on LLM internal states verify reasoning steps as effectively as process reward models up to 810 times larger across math, planning, and QA tasks.

InProceed- ings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Pa- pers), pages 9426–9439, Bangkok, Thailand

fields

years

verdicts

representative citing papers

citing papers explorer