Emily Pronin, Daniel Y Lin, and Lee Ross

Charilaos Pipis, Shivam Garg, Vasilis Kontonis, Vaishnavi Shrivastava, Akshay Krishnamurthy, Dimitris Papailiopoulos · 2025 · arXiv 2512.12895

7 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Know When to Stop: Segment-Level Credit Assignment for Reducing Overthinking

cs.CL · 2026-07-01 · unverdicted · novelty 7.0

DASH assigns segment-level credit in reasoning traces using drift toward ground-truth answers, yielding 50.8% accuracy on AIME25 versus 45.4% for GRPO while reducing overthinking behaviors.

Quantized Reasoning Models Think They Need to Think Longer, but They Do Not

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

Post-training quantization increases overthinking errors in reasoning models; a logit penalty on curated overthinking markers reduces CoT length 12-23% without accuracy loss.

How Compliant Are GitHub Actions Workflows? A Checklist-Based Study with LLM-Assisted Auditing

cs.SE · 2026-05-03 · accept · novelty 6.0

GitHub Actions workflows achieve only 28% overall compliance with best practices, with LLMs enabling an 81% reduction in verification effort via hybrid adjudication but still requiring expert oversight for security judgments.

Escaping Mode Collapse in LLM Generation via Geometric Regulation

cs.CL · 2026-05-01 · unverdicted · novelty 6.0 · 2 refs

Reinforced Mode Regulation (RMR) applies low-rank damping to the Transformer value cache to prevent geometric collapse and enable stable autoregressive generation at entropy rates as low as 0.8 nats/step.

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

cs.AI · 2026-04-08 · unverdicted · novelty 6.0

Reasoning SFT generalizes cross-domain conditionally on sufficient optimization, high-quality long-CoT data, and strong base models, while degrading safety.

Agents of Chaos

cs.AI · 2026-02-23 · unverdicted · novelty 6.0

An exploratory red-teaming study documents eleven cases of security, privacy, and governance failures in autonomous language-model agents with tool access and persistent memory.

Search for Truth from Reasoning: A Dynamic Representation Editing Framework for Steering LLM Trajectories

cs.AI · 2026-06-26
