pith:VPSR4ZIC
An Explanation of In-context Learning as Implicit Bayesian Inference
Large language models perform in-context learning by implicitly inferring latent concepts that explain coherence in both pretraining data and prompt examples.
arxiv:2111.02080 v6 · 2021-11-03 · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{VPSR4ZICWER5DNM3ZWX4JTRFRJ}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We prove when this occurs despite a distribution mismatch between prompts and pretraining data in a setting where the pretraining distribution is a mixture of HMMs.
That real-world pretraining corpora exhibit long-range coherence driven by latent document-level concepts that can be adequately captured by a mixture-of-HMMs generative process, and that this mechanism dominates the in-context behavior of large-scale models trained on messy web data.
In-context learning emerges as implicit Bayesian inference of latent concepts when pretraining data has long-range coherence, proven for mixture-of-HMM distributions and replicated on the synthetic GINC dataset.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:46.404101Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
abe51e6502b123d1b59bcdafc4ce258a748628095a8fba3c262f049ff320956a
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/VPSR4ZICWER5DNM3ZWX4JTRFRJ \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: abe51e6502b123d1b59bcdafc4ce258a748628095a8fba3c262f049ff320956a
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "ece93fb738dd7f046125c81a57fa34c43199dcc6730005f0994bac4c271d5308",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2021-11-03T09:12:33Z",
"title_canon_sha256": "d11121fff8b4527ebb3daba0fca9697ace21e7716128ef726203c70bc0afe3a2"
},
"schema_version": "1.0",
"source": {
"id": "2111.02080",
"kind": "arxiv",
"version": 6
}
}