Within each repository, tasks are analyzed in ascending order of total failure count, starting from tasks with fewer failed attempts and progressing to more diﬃcult ones

Difficulty ordering

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Characterizing the Failure Modes of LLMs in Resolving Real-World GitHub Issues

cs.SE · 2026-05-12 · unverdicted · novelty 6.0

LLMs fail most often during strategy formulation and logic synthesis when fixing GitHub issues, but succeed relatively well at localizing faults, according to a taxonomy derived from 243 manual failure cases.

citing papers explorer

Showing 1 of 1 citing paper.

Characterizing the Failure Modes of LLMs in Resolving Real-World GitHub Issues cs.SE · 2026-05-12 · unverdicted · none · ref 2
LLMs fail most often during strategy formulation and logic synthesis when fixing GitHub issues, but succeed relatively well at localizing faults, according to a taxonomy derived from 243 manual failure cases.

Within each repository, tasks are analyzed in ascending order of total failure count, starting from tasks with fewer failed attempts and progressing to more diﬃcult ones

fields

years

verdicts

representative citing papers

citing papers explorer