pith. sign in

Yate: The role of test repair in llm-based unit test generation,

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.SE 2

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

Inferring Code Correctness from Specification

cs.SE · 2026-05-28 · unverdicted · novelty 6.0

TRAILS infers code correctness by aggregating LLM judgments on input-output pairs from category-partitioned specification tests, improving MCC by up to 39% over Zero-Shot COT on LiveCodeBench and CoCoClaNeL.

citing papers explorer

Showing 2 of 2 citing papers.

  • Inferring Code Correctness from Specification cs.SE · 2026-05-28 · unverdicted · none · ref 20

    TRAILS infers code correctness by aggregating LLM judgments on input-output pairs from category-partitioned specification tests, improving MCC by up to 39% over Zero-Shot COT on LiveCodeBench and CoCoClaNeL.

  • PR-Aware Automated Unit Test Generation: Challenges and Opportunities cs.SE · 2026-05-24 · unverdicted · none · ref 47

    EvoSuite produced at least one fail-to-pass test for 36% of PRs versus 13% for GPT-4o, but both tools generated no meaningful change-capturing tests for 64% of the PRs evaluated.