Generating executable oracles to check conformance of client code to requirements of

Jiang, Shan, Zhu, Chenguang, Khurshid, Sarfraz , journal=

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Sketch-and-Verify: Structured Inference-Time Scaling via Program Sketching

cs.LG · 2026-05-09 · conditional · novelty 7.0

Sketch-and-Verify improves small-LLM code generation on HumanEval+ by factorizing search into K algorithmic sketches and M fillings each, outperforming flat sampling by up to 32 percentage points at matched budget while remaining cheaper than upgrading model tier.

Semantic Voting: Execution-Grounded Consensus for LLM Code Generation

cs.SE · 2026-05-09 · unverdicted · novelty 6.0

Execution-based selectors for LLM code candidates outperform textual voting by large margins across configurations, with input generation quality mattering more than the specific aggregation rule.

citing papers explorer

Showing 2 of 2 citing papers.

Sketch-and-Verify: Structured Inference-Time Scaling via Program Sketching cs.LG · 2026-05-09 · conditional · none · ref 30
Sketch-and-Verify improves small-LLM code generation on HumanEval+ by factorizing search into K algorithmic sketches and M fillings each, outperforming flat sampling by up to 32 percentage points at matched budget while remaining cheaper than upgrading model tier.
Semantic Voting: Execution-Grounded Consensus for LLM Code Generation cs.SE · 2026-05-09 · unverdicted · none · ref 10
Execution-based selectors for LLM code candidates outperform textual voting by large margins across configurations, with input generation quality mattering more than the specific aggregation rule.

Generating executable oracles to check conformance of client code to requirements of

fields

years

verdicts

representative citing papers

citing papers explorer