In high-precision geometry (AIME 2024), smaller models likeDeepSeek-R1-Distill-Qwen-1.5Bof- ten falter when facing complex arithmetic

Overcoming Arithmetic Hesitation (Figure 7) · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.AI · 2026-04-16 · unverdicted · novelty 6.0

LLM reasoning failures cluster at early entropy-spike transitions; the GUARD inference-time framework redirects them for more reliable results.

Showing 1 of 1 citing paper.

Dissecting Failure Dynamics in Large Language Model Reasoning cs.AI · 2026-04-16 · unverdicted · none · ref 4
LLM reasoning failures cluster at early entropy-spike transitions; the GUARD inference-time framework redirects them for more reliable results.