pith:ZYBOZ7AD
Test-Time Learning with an Evolving Library
Large language models improve on complex reasoning by building and evolving a shared library of skills extracted from their own inference trajectories without any parameter updates or external supervision.
arxiv:2605.14477 v1 · 2026-05-14 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ZYBOZ7ADA2AP6FTVJDMND5NFE7}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Across challenging benchmarks in mathematical reasoning, code generation, and multi-turn agentic environments, EvoLib improves substantially over the top test-time scaling and learning methods without ground-truth feedback.
Modular skills and reflective insights automatically extracted from the model's own inference trajectories can be weighted and consolidated into increasingly general and reusable abstractions that deliver long-term value without any external supervision or ground-truth signals.
EvoLib enables LLMs to accumulate, reuse, and evolve knowledge abstractions from inference trajectories at test time, yielding substantial gains on math reasoning, code generation, and agentic benchmarks without parameter updates or supervision.
Receipt and verification
| First computed | 2026-05-17T23:39:06.589040Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
ce02ecfc030680ff167548d8d1f5a527d8c51f850c3d2a2d3540572e1bc4288d
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ZYBOZ7ADA2AP6FTVJDMND5NFE7 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: ce02ecfc030680ff167548d8d1f5a527d8c51f850c3d2a2d3540572e1bc4288d
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "41bb27f962974400196d990a6a47b421f1e629498ca694ae56ecd50b45d58c8a",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-14T07:18:12Z",
    "title_canon_sha256": "c3cbb1f626339a3780999d67594f72d96efa060e1da9d5f0f43ec8ac47f0aba2"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.14477",
    "kind": "arxiv",
    "version": 1
  }
}
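
Offline check (sketch)
The verification one-liner canonicalizes the record (sorted keys, compact separators, no ASCII escaping, UTF-8) and hashes it with SHA-256. The Python sketch below applies the same steps to the record as printed on this page, without a network call. The names canonical_bytes, canonical_hash, RECORD_JSON and EXPECTED are illustrative and not part of any Pith tooling. Whether the digest matches depends on the printed record being exactly what the server hashes, so treat a mismatch as a prompt to re-run the curl command rather than as proof of tampering.

```python
import hashlib
import json

# Record copied from the "Canonical record JSON" section of this page.
# Whitespace and key order here do not matter: both are normalized below.
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "41bb27f962974400196d990a6a47b421f1e629498ca694ae56ecd50b45d58c8a",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-14T07:18:12Z",
    "title_canon_sha256": "c3cbb1f626339a3780999d67594f72d96efa060e1da9d5f0f43ec8ac47f0aba2"
  },
  "schema_version": "1.0",
  "source": {"id": "2605.14477", "kind": "arxiv", "version": 1}
}
"""

# Hash listed under "Canonical hash" on this page.
EXPECTED = "ce02ecfc030680ff167548d8d1f5a527d8c51f850c3d2a2d3540572e1bc4288d"


def canonical_bytes(record):
    """Serialize like the page's one-liner: sorted keys, compact separators, UTF-8."""
    text = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return text.encode("utf-8")


def canonical_hash(record):
    """Return the hex SHA-256 digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


if __name__ == "__main__":
    digest = canonical_hash(json.loads(RECORD_JSON))
    print(digest)
    print("match" if digest == EXPECTED else "mismatch")
```

Because keys are sorted before hashing, reordering the fields of the record or re-indenting it does not change the digest; only a change to a key or value does.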