Yate: The role of test repair in LLM-based unit test generation

2025 · arXiv 2507.18316

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Inferring Code Correctness from Specification

cs.SE · 2026-05-28 · unverdicted · novelty 6.0

TRAILS infers code correctness by aggregating LLM judgments on input-output pairs from category-partitioned specification tests, improving MCC by up to 39% over zero-shot CoT on LiveCodeBench and CoCoClaNeL.

PR-Aware Automated Unit Test Generation: Challenges and Opportunities

cs.SE · 2026-05-24 · unverdicted · novelty 5.0

EvoSuite produced at least one fail-to-pass test for 36% of PRs versus 13% for GPT-4o, but both tools generated no meaningful change-capturing tests for 64% of the PRs evaluated.

citing papers explorer

Inferring Code Correctness from Specification · cs.SE · 2026-05-28 · unverdicted · none · ref 20
PR-Aware Automated Unit Test Generation: Challenges and Opportunities · cs.SE · 2026-05-24 · unverdicted · none · ref 47