pith:QEP33ICZ
BoLT: A Benchmark to Democratize Black-box Optimization Research for Expensive LLM Tasks
BoLT supplies lightweight surrogate models from thousands of real LLM runs so black-box optimization researchers can test methods on realistic expensive tasks without prohibitive costs.
arxiv:2605.17000 v1 · 2026-05-16 · cs.LG · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{QEP33ICZFE3RFOJFEIZWY4ZEHN}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
BoLT is the first LLM-centric benchmark that democratizes LLM research for the BBO community by releasing lightweight surrogate models fitted to the results of thousands of real LLM experiments, covering multi-fidelity, multi-objective, heteroscedastic noise, and high-dimensional search spaces; selected BO methods consistently outperform others across tasks.
The surrogate models fitted to the real LLM experiment data accurately reproduce the optimization landscapes, noise characteristics, and relative performance ordering of methods that would be observed on the actual expensive LLM tasks.
BoLT is a benchmark of surrogate models fitted to real LLM experiment data that enables evaluation of Bayesian and black-box optimization methods on multi-fidelity, multi-objective, high-dimensional LLM tasks.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:03:35.329787Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
811fbda059293712b92522336c73243b4134627b6d7f43eb1144ba1b6a8cd345
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/QEP33ICZFE3RFOJFEIZWY4ZEHN \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 811fbda059293712b92522336c73243b4134627b6d7f43eb1144ba1b6a8cd345
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "a964e91e3a00451ef5cc7b438a185025a4363acea902531dfb9abfe24cf9a797",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-05-16T13:53:44Z",
"title_canon_sha256": "f3a2c3e37183d47e3fe991d4b3d6ba918a7df437d722b49400b7be09599d07a8"
},
"schema_version": "1.0",
"source": {
"id": "2605.17000",
"kind": "arxiv",
"version": 1
}
}