Majority Voting for Code Generation

2026 · cs.LG · arXiv 2604.15618

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

We investigate Functional Majority Voting (FMV), a method based on functional consensus for code generation with Large Language Models, which identifies a representative solution from multiple generations using their runtime execution signatures on test inputs. We find that FMV is an effective test-time inference strategy, substantially boosting performance on LiveCodeBench without a large compute overhead. Furthermore, we extend the utility of functional consensus and apply it as an aggregation strategy for label-free Test-Time Reinforcement Learning. We demonstrate that this increases pass@1 on holdout tasks, but find no evidence of self-improvement beyond the base model's performance ceiling.
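The abstract does not spell out the selection procedure, so the following is only a minimal sketch of the general idea of voting on execution signatures, not the paper's implementation. It assumes each candidate is a Python callable taking one argument, that a signature is the tuple of outputs (or exception types) across the test inputs, and that ties go to the first cluster encountered. The names `execution_signature` and `functional_majority_vote` are hypothetical.

```python
from collections import Counter
from typing import Any, Callable, Sequence, Tuple


def execution_signature(program: Callable[[Any], Any],
                        test_inputs: Sequence[Any]) -> tuple:
    """Run one candidate on every test input and record the observed behavior."""
    signature = []
    for x in test_inputs:
        try:
            # repr() keeps the signature hashable even for list or dict outputs.
            signature.append(("ok", repr(program(x))))
        except Exception as exc:
            # Assumption: a crash counts as behavior and is part of the signature.
            signature.append(("error", type(exc).__name__))
    return tuple(signature)


def functional_majority_vote(candidates: Sequence[Callable[[Any], Any]],
                             test_inputs: Sequence[Any]) -> Tuple[int, int]:
    """Return (index of a representative candidate, size of its cluster)."""
    if not candidates:
        raise ValueError("need at least one candidate")
    signatures = [execution_signature(c, test_inputs) for c in candidates]
    # Candidates with identical signatures fall into the same cluster.
    winning_signature, votes = Counter(signatures).most_common(1)[0]
    # Assumption: the first member of the largest cluster is the representative.
    return signatures.index(winning_signature), votes


# Four hypothetical generations for "double the input"; one is wrong.
candidates = [
    lambda x: x * 2,
    lambda x: x + x,
    lambda x: x ** 2,
    lambda x: 2 * x,
]
test_inputs = [0, 1, 2, 3]
chosen_index, cluster_size = functional_majority_vote(candidates, test_inputs)
```

Here three of the four candidates agree on every test input, so the first of them is chosen with a cluster size of 3. No ground-truth outputs are needed, only inputs, which is what makes this kind of consensus usable as a label-free signal.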

representative citing papers

Underspecification does not imply Incoherence: The Risks of Semantic Collapse in Coding Models

cs.SE · 2026-07-02 · unverdicted · novelty 6.0 · 2 refs

Coding LLMs exhibit detrimental semantic collapse on underspecified prompts by producing consistent but incorrect code rather than incoherent variations, affecting 3-32% of tasks across MBPP, HumanEval, and LiveCodeBench.

