This process yields 1,892 valid cases and 8,102 SFT training entries

Multi-turn SFT entries are constructed so that each round contains a reasoning step in <think> · 2025

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

cs.AI · 2026-05-14 · unverdicted · novelty 6.0 · 2 refs

InsightReplay improves long CoT reasoning by extracting critical insights from the trace and replaying them near the active frontier, delivering +1.65 average accuracy gain across 24 model-benchmark settings.

citing papers explorer

Showing 1 of 1 citing paper.

Stateful Reasoning via Insight Replay cs.AI · 2026-05-14 · unverdicted · none · ref 37 · 2 links

