The paper defines the task of generating reasoning trajectories for Socratic debugging of student code, releases an annotated dataset, and shows LLMs can produce up to 91% correct trajectories and 98.7% valid conversation turns per LLM-as-judge evaluation.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Reasoning Trajectories for Socratic Debugging of Student Code: From Misconceptions to Contradictions and Updated Beliefs
The paper defines the task of generating reasoning trajectories for Socratic debugging of student code, releases an annotated dataset, and shows LLMs can produce up to 91% correct trajectories and 98.7% valid conversation turns per LLM-as-judge evaluation.