Thought manipulation: External thought can be efficient for large reasoning models

When hindsight is not 20/20: Testing limits on reflective thinking in large language models · 2024 · arXiv 2504.13626

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Auditing Data Membership in Reinforcement Learning With Verifiable Rewards

cs.CR · 2025-11-18 · unverdicted · novelty 6.0

DIBA detects membership of prompts in RLVR training by measuring reward success changes and policy behavioral drift between pre- and post-RLVR model checkpoints.

Revisiting Anthropomorphic Reflection Markers in Large Language Model Reasoning

cs.CL · 2026-05-27 · unverdicted · novelty 5.0

Suppressing anthropomorphic reflection markers via prompt and token interventions preserves or improves LLM reasoning performance on four benchmarks while models continue marker-free verification.

citing papers explorer

Showing 2 of 2 citing papers.

Auditing Data Membership in Reinforcement Learning With Verifiable Rewards cs.CR · 2025-11-18 · unverdicted · none · ref 22
DIBA detects membership of prompts in RLVR training by measuring reward success changes and policy behavioral drift between pre- and post-RLVR model checkpoints.
Revisiting Anthropomorphic Reflection Markers in Large Language Model Reasoning cs.CL · 2026-05-27 · unverdicted · none · ref 2
Suppressing anthropomorphic reflection markers via prompt and token interventions preserves or improves LLM reasoning performance on four benchmarks while models continue marker-free verification.

Thought manipulation: External thought can be efficient for large reasoning models

fields

years

verdicts

representative citing papers

citing papers explorer