pith:DJHSBN54
Measuring and Narrowing the Compositionality Gap in Language Models
Larger language models improve single-fact recall faster than they improve the ability to compose multiple facts into answers.
arxiv:2210.03350 v3 · 2022-10-07 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{DJHSBN54HU2VV2B7SIQRIVL5FE}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
In the GPT-3 family of models, as model size increases we show that the single-hop question answering performance improves faster than the multi-hop performance does, therefore the compositionality gap does not decrease.
That the multi-hop questions are built from facts unlikely to have been observed together during pretraining, so that correct answers to the full question must come from composition rather than direct memorization of the combined fact.
Larger language models improve faster at single facts than at composing them, but self-ask prompting reduces the compositionality gap by forcing explicit intermediate questions.
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:13.479942Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
1a4f20b7bc3d355ae83f922114557d291eb3fffea673138ac709f19448d57925
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/DJHSBN54HU2VV2B7SIQRIVL5FE \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 1a4f20b7bc3d355ae83f922114557d291eb3fffea673138ac709f19448d57925
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "b6f926ce0d2d9a6fb0798b7b877f894c9f7ab05f8cabc5a4854025fc477756e1",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2022-10-07T06:50:23Z",
"title_canon_sha256": "b51c2e7d513e3bc51f957d92064bdf2b30f25b753209fbaa8ff4a6266f98bb0d"
},
"schema_version": "1.0",
"source": {
"id": "2210.03350",
"kind": "arxiv",
"version": 3
}
}