SolidCoder bridges the mental-reality gap in LLM code generation via forced edge-case awareness and concrete sandbox execution, reaching 95.7% pass@1 on HumanEval, 77.0% on CodeContests, and 26.7% on APPS.
The solution should be efficient in terms of time and space complexity to handle large inputs.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.SE (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
- SolidCoder: Bridging the Mental-Reality Gap in LLM Code Generation through Concrete Execution