Zebraarena: A diagnostic simulation environment for studying reasoning-action coupling in tool-augmented llms.arXiv preprint arXiv:2603.18614,

Wanjia Zhao, Ludwig Schmidt, James Zou, Vidhisha Balachandran, Lingjiao Chen · arXiv 2603.18614

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning

cs.CL · 2026-05-19 · unverdicted · novelty 7.0

CopT reverses CoT by eliciting a draft answer first then using continuous-embedding contrastive verification and on-policy thinking to reflect and correct, yielding up to 23% higher accuracy and 57% fewer tokens without training.

citing papers explorer

Showing 1 of 1 citing paper.

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning cs.CL · 2026-05-19 · unverdicted · none · ref 43
CopT reverses CoT by eliciting a draft answer first then using continuous-embedding contrastive verification and on-policy thinking to reflect and correct, yielding up to 23% higher accuracy and 57% fewer tokens without training.

Zebraarena: A diagnostic simulation environment for studying reasoning-action coupling in tool-augmented llms.arXiv preprint arXiv:2603.18614,

fields

years

verdicts

representative citing papers

citing papers explorer