arxiv: 2604.17338 · 2 revisions
Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?