pith. sign in

hub Canonical reference

Demystifying llm-based software engineering agents

Canonical reference. 82% of citing Pith papers cite this work as background.

32 Pith papers citing it
38 external citations · Crossref
Background 82% of classified citations

hub tools

citation-role summary

background 9 baseline 1 method 1

citation-polarity summary

years

2026 28 2025 4

clear filters

representative citing papers

SmellBench: Evaluating LLM Agents on Architectural Code Smell Repair

cs.SE · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

SmellBench is the first benchmark showing LLM agents resolve 47.7% of architectural code smells while accurately spotting false positives, but aggressive repairs often introduce new smells and degrade overall quality.

Investigating Test Overfitting on SWE-bench

cs.SE · 2025-11-20 · unverdicted · novelty 7.0

The first empirical study of test overfitting shows that auto-generated tests from issues can lead to code that passes observed tests but misses important cases or breaks functionality in SWE-bench issue resolution.

FeatX: Editing Software by Editing Features for Repository-Level Code Evolution

cs.SE · 2026-06-30 · unverdicted · novelty 6.0

FeatX extracts epic-feature hierarchies with code mappings from repositories and applies feature edits via a three-stage Evolution Agent, reporting 42.6% relative F1 gain in function-level localization and lower cognitive load versus vanilla ChatGPT in a user study and 38-commit replay.

FuzzAgent: Multi-Agent System for Evolutionary Library Fuzzing

cs.SE · 2026-05-14 · conditional · novelty 6.0

FuzzAgent deploys specialized agents that collaborate on harness generation, execution, and crash triage to evolve fuzzing campaigns, delivering 45-191% more branch coverage than four baselines on 20 C/C++ libraries and surfacing 102 real bugs.

Reproduction Test Generation for Java SWE Issues

cs.SE · 2026-05-05 · unverdicted · novelty 6.0 · 2 refs

Introduces the first benchmark for Java reproduction test generation from repository issues and adapts a prior Python tool to produce high performance on it.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.