pith. sign in

Title resolution pending

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.SE 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

Investigating Test Overfitting on SWE-bench

cs.SE · 2025-11-20 · unverdicted · novelty 7.0

The first empirical study of test overfitting shows that auto-generated tests from issues can lead to code that passes observed tests but misses important cases or breaks functionality in SWE-bench issue resolution.

citing papers explorer

Showing 1 of 1 citing paper.

  • Investigating Test Overfitting on SWE-bench cs.SE · 2025-11-20 · unverdicted · none · ref 6

    The first empirical study of test overfitting shows that auto-generated tests from issues can lead to code that passes observed tests but misses important cases or breaks functionality in SWE-bench issue resolution.