pith:RTJFRTJV
RTL-BenchMT: Dynamic Maintenance of RTL Generation Benchmark Through Agent-Assisted Analysis and Revision
An agentic framework automatically identifies flawed RTL benchmark cases and detects overfitting to produce a refined suite.
arxiv:2605.15537 v1 · 2026-05-15 · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{RTJFRTJVNTUIQRGSYKVKHLYTSH}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
With the assistance of RTL-BenchMT, we conduct a thorough, in-depth analysis of flawed and overfitting cases and produce a refined benchmark suite that will be open-sourced to the community.
Underlying assumption: AI agents can reliably and accurately detect flawed benchmark cases and overfitting instances in RTL generation tasks without introducing new errors or requiring substantial human validation.
RTL-BenchMT is an agent-assisted framework for dynamically maintaining RTL generation benchmarks by fixing flaws and reducing overfitting in LLM-based EDA applications.
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-20T00:01:04.131587Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
8cd258cd356ce88844d2c2aaa3af1391ebdf2c556c7903a4099e6ad791cca20a
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/RTJFRTJVNTUIQRGSYKVKHLYTSH \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 8cd258cd356ce88844d2c2aaa3af1391ebdf2c556c7903a4099e6ad791cca20a
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "77448166df1ba81182aa49d40ee0d1fb855f31b15510469b6de6dfd93c09400b",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AI",
"submitted_at": "2026-05-15T02:17:46Z",
"title_canon_sha256": "1f71be7c1cf100268d7be3729717ec7370b7cae987c2cd84a54e2bb44f5ee70c"
},
"schema_version": "1.0",
"source": {
"id": "2605.15537",
"kind": "arxiv",
"version": 1
}
}
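The verification one-liner above can be unpacked into a small script. The following is a minimal sketch, assuming the canonical hash is the SHA-256 of the record serialized exactly as in that one-liner (keys sorted, compact separators, non-ASCII kept as UTF-8). The names `RECORD_JSON`, `canonical_bytes`, and `canonical_hash` are illustrative and not part of any Pith tooling. The record is pasted inline rather than fetched, so the sketch runs offline.

```python
import hashlib
import json

# Canonical record copied from the "Canonical record JSON" section of this page.
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "77448166df1ba81182aa49d40ee0d1fb855f31b15510469b6de6dfd93c09400b",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2026-05-15T02:17:46Z",
    "title_canon_sha256": "1f71be7c1cf100268d7be3729717ec7370b7cae987c2cd84a54e2bb44f5ee70c"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.15537",
    "kind": "arxiv",
    "version": 1
  }
}
"""


def canonical_bytes(record: dict) -> bytes:
    """Serialize the record the same way the one-liner does:
    sorted keys, no whitespace between tokens, UTF-8 output."""
    text = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return text.encode()


def canonical_hash(record: dict) -> str:
    """Hex SHA-256 digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


record = json.loads(RECORD_JSON)
digest = canonical_hash(record)
print(digest)
```

Compare the printed digest with the value listed under "Canonical hash". Because keys are sorted and whitespace is stripped before hashing, the indentation and key order of the pasted JSON do not affect the result; only the keys and values do.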