Answer convergence as a signal for early stopping in reasoning.arXiv preprint arXiv:2506.02536

Xin Liu, Lu Wang · 2025 · arXiv 2506.02536

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Conformal Thinking: Risk Control for Reasoning on a Compute Budget

cs.AI · 2026-02-03 · unverdicted · novelty 6.0

Conformal risk control with upper and lower thresholds lets LLMs adaptively stop reasoning while guaranteeing a maximum error rate and minimizing token use.

Entropy After </Think> for reasoning model early exiting

cs.LG · 2025-09-30 · unverdicted · novelty 6.0

Entropy After </Think> (EAT) enables early exiting in reasoning LLMs by tracking entropy stabilization after a </think> token, cutting token use 12-22% on MATH500 and AIME2025 with no accuracy loss.

Efficient Test-Time Scaling via Temporal Reasoning Aggregation

cs.AI · 2026-04-19 · unverdicted · novelty 5.0

TRACE aggregates answer consistency and confidence trajectory over multiple reasoning steps to decide when to halt inference, reducing token usage by 25-30% while keeping accuracy within 1-2% of full reasoning.

When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning

cs.CL · 2026-04-08 · unverdicted · novelty 5.0

DTSR enables large reasoning models to dynamically assess chain-of-thought sufficiency via reflection signals and a sufficiency check, reducing reasoning length by 28.9-34.9% with minimal performance loss on Qwen3 models.

citing papers explorer

Showing 4 of 4 citing papers.

Conformal Thinking: Risk Control for Reasoning on a Compute Budget cs.AI · 2026-02-03 · unverdicted · none · ref 5
Conformal risk control with upper and lower thresholds lets LLMs adaptively stop reasoning while guaranteeing a maximum error rate and minimizing token use.
Entropy After </Think> for reasoning model early exiting cs.LG · 2025-09-30 · unverdicted · none · ref 10
Entropy After </Think> (EAT) enables early exiting in reasoning LLMs by tracking entropy stabilization after a </think> token, cutting token use 12-22% on MATH500 and AIME2025 with no accuracy loss.
Efficient Test-Time Scaling via Temporal Reasoning Aggregation cs.AI · 2026-04-19 · unverdicted · none · ref 62
TRACE aggregates answer consistency and confidence trajectory over multiple reasoning steps to decide when to halt inference, reducing token usage by 25-30% while keeping accuracy within 1-2% of full reasoning.
When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning cs.CL · 2026-04-08 · unverdicted · none · ref 3
DTSR enables large reasoning models to dynamically assess chain-of-thought sufficiency via reflection signals and a sufficiency check, reducing reasoning length by 28.9-34.9% with minimal performance loss on Qwen3 models.

Answer convergence as a signal for early stopping in reasoning.arXiv preprint arXiv:2506.02536

fields

years

verdicts

representative citing papers

citing papers explorer