pith:LPM2GGKO
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
Multi-SWE-bench supplies 1632 expert-curated issue-resolving tasks across seven languages to test LLMs beyond Python-only benchmarks.
arxiv:2504.02605 v1 · 2025-04-03 · cs.SE · cs.AI · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{LPM2GGKOMAOVP3AIND2NNJ45OD}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
we introduce a multilingual issue-resolving benchmark, called Multi-SWE-bench, covering Java, TypeScript, JavaScript, Go, Rust, C, and C++. It includes a total of 1,632 high-quality instances, which were carefully annotated from 2,456 candidates by 68 expert annotators, ensuring that the benchmark can provide an accurate and reliable evaluation.
The 68 expert annotators' curation from 2,456 candidates to 1,632 instances produces an unbiased, high-quality, and representative set that accurately reflects real-world issue-resolving difficulty across languages.
Multi-SWE-bench provides 1,632 high-quality issue-resolving instances across Java, TypeScript, JavaScript, Go, Rust, C, and C++ for evaluating LLMs on codebase modifications.
References
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:48.785976Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
5bd9a3194e601d57ec0868f4d6a79d70dc64d6e781232b82ed790948529fe591
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/LPM2GGKOMAOVP3AIND2NNJ45OD \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 5bd9a3194e601d57ec0868f4d6a79d70dc64d6e781232b82ed790948529fe591
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "808b6ee12521cfa65d940dbff573db470cef0b046828badf3561d86e29929d47",
"cross_cats_sorted": [
"cs.AI",
"cs.CL"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.SE",
"submitted_at": "2025-04-03T14:06:17Z",
"title_canon_sha256": "786fcc79ffc89a2a8b47161e0f7428763a97880976273b83de4674713cf59455"
},
"schema_version": "1.0",
"source": {
"id": "2504.02605",
"kind": "arxiv",
"version": 1
}
}