In Advances in Neural Information Processing Systems

Solving quantitative reasoning problems with language models

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Learning a Continue-Thinking Token for Enhanced Test-Time Scaling

cs.CL · 2025-06-12 · unverdicted · novelty 7.0

A learned continue-thinking token, trained via RL on its embedding alone, improves math benchmark accuracy more than fixed-token budget forcing in a frozen language model.

citing papers explorer

Showing 1 of 1 citing paper.

Learning a Continue-Thinking Token for Enhanced Test-Time Scaling cs.CL · 2025-06-12 · unverdicted · none · ref 1
A learned continue-thinking token, trained via RL on its embedding alone, improves math benchmark accuracy more than fixed-token budget forcing in a frozen language model.

In Advances in Neural Information Processing Systems

fields

years

verdicts

representative citing papers

citing papers explorer