SolidCoder bridges the mental-reality gap in LLM code generation via forced edge-case awareness and concrete sandbox execution, reaching 95.7% pass@1 on HumanEval, 77.0% on CodeContests, and 26.7% on APPS.
Plan Simulation Prompt: You are a programmer tasked with verifying a plan to solve a given problem using the **Python3** programming language
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.SE 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
SolidCoder: Bridging the Mental-Reality Gap in LLM Code Generation through Concrete Execution
SolidCoder bridges the mental-reality gap in LLM code generation via forced edge-case awareness and concrete sandbox execution, reaching 95.7% pass@1 on HumanEval, 77.0% on CodeContests, and 26.7% on APPS.