DiffCodeGen clusters code candidates by behavioral similarity from fuzzing-synthesized inputs and selects the largest cluster's medoid, matching or exceeding prior test-time scaling methods with far less token and time cost.
UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5representative citing papers
PickleFuzzer applies differential testing with a custom grammar to uncover 14 previously unknown discrepancies across Python pickle implementations, four of which bypass security scanners.
A systematic mapping study of 248 papers introduces a taxonomy of synergistic effects, inter-analysis workflows, and mapping functions to catalog patterns in combined program analysis techniques.
WarpL uses mutation to find and isolate suboptimal instruction sequences causing performance issues in WebAssembly runtimes by comparing machine code of original and non-problematic mutant programs.
Babbling Suppression stops LLM code generation upon test passage to reduce token output and energy consumption by up to 65% across Python and Java benchmarks.
citing papers explorer
-
PickleFuzzer: A Case Study in Fuzzing for Discrepancies Between Python Pickle Implementations
PickleFuzzer applies differential testing with a custom grammar to uncover 14 previously unknown discrepancies across Python pickle implementations, four of which bypass security scanners.