pith:DMR7KHEB
FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale
An automated system evolves closed-ended competitive programming tasks into open-ended coding problems and uses the resulting data to train stronger LLM coders.
arxiv:2605.14445 v1 · 2026-05-14 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{DMR7KHEBBJSJOQL6E7ISWMVEK4}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Training on our synthesized data yields substantial gains over the base models: Qwen3.5-9B improves by +8.82 in score on FrontierCS and by +306.36 in Elo-rating-based performance on ALE-bench; Qwen3.5-27B improves by +12.12 and +309.12, respectively.
The quantitative idea divergence metric reliably selects problems that elicit genuinely diverse solution approaches from different solvers, and the automatically generated test cases and verifiers are sufficiently robust to support training.
FrontierSmith automates synthesis of open-ended coding problems from closed-ended seeds and shows measurable gains on two open-ended LLM coding benchmarks.
Receipt and verification
| First computed | 2026-05-17T23:39:06.965948Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
1b23f51c810a6497417e27d12b32a45714c44fd7a3a0067bb6c494ee1ee46572
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/DMR7KHEBBJSJOQL6E7ISWMVEK4 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 1b23f51c810a6497417e27d12b32a45714c44fd7a3a0067bb6c494ee1ee46572
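The same check can be sketched in Python for cases where the canonical record has already been saved locally. This is a minimal sketch that mirrors the serialization settings in the one-liner above (sorted keys, compact separators, non-ASCII characters left unescaped, UTF-8 bytes). The function names `canonical_sha256` and `matches_expected` and the toy records are hypothetical and are not part of any Pith tooling.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    """Hash a record the way the shell one-liner does.

    Assumption: the canonical form is JSON with sorted keys, compact
    separators, and non-ASCII characters left unescaped, encoded as UTF-8.
    """
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()


def matches_expected(record: dict, expected_hex: str) -> bool:
    """Compare a record's canonical hash with a published hex digest."""
    return canonical_sha256(record) == expected_hex.strip().lower()


# Toy records (hypothetical): same content, different key order.
toy_a = {"b": 1, "a": {"y": 2, "x": 3}}
toy_b = {"a": {"x": 3, "y": 2}, "b": 1}

# Key order does not affect the canonical hash, including in nested objects.
print(canonical_sha256(toy_a) == canonical_sha256(toy_b))
```

To check a real record, load the canonical record JSON with `json.loads` and pass the result to `matches_expected` together with the canonical hash listed on the page.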
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "90cea9fd00b568405153a47deeda6843b2d03473e5db4c23554a02208472e9fb",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-14T06:39:42Z",
    "title_canon_sha256": "b21fd42d8f615d8f6a2477476abf9cf330659f87a65c72cf4be33b55f6263dfa"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.14445",
    "kind": "arxiv",
    "version": 1
  }
}