Test smells in LLM-generated unit tests,

· 2024 · arXiv 2410.10628

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Humanizing Automatically Generated Unit Test Suites with LLM-Based Refactoring

cs.SE · 2026-06-26 · unverdicted · novelty 6.0 · 2 refs

TestHumanizer uses LLMs as refactoring layers on EvoSuite suites to reach 88-98% compilation rates and better readability on 350 classes from Defects4J and SF110 while preserving coverage.

LLM vs. Human Unit Tests: Fault Detection on Real Python Bugs

cs.SE · 2026-06-07 · unverdicted · novelty 5.0

LLM-generated unit tests with retrieval-augmented context detect faults in 69% of real Python bugs versus 17.2% for general-purpose human-written tests, with similar coverage levels.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Humanizing Automatically Generated Unit Test Suites with LLM-Based Refactoring cs.SE · 2026-06-26 · unverdicted · none · ref 43 · 2 links
TestHumanizer uses LLMs as refactoring layers on EvoSuite suites to reach 88-98% compilation rates and better readability on 350 classes from Defects4J and SF110 while preserving coverage.
LLM vs. Human Unit Tests: Fault Detection on Real Python Bugs cs.SE · 2026-06-07 · unverdicted · none · ref 8
LLM-generated unit tests with retrieval-augmented context detect faults in 69% of real Python bugs versus 17.2% for general-purpose human-written tests, with similar coverage levels.

Test smells in LLM-generated unit tests,

fields

years

verdicts

representative citing papers

citing papers explorer