pith:6OZJYRIW
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Recursive clustering and summarization builds a tree that improves retrieval-augmented reasoning over long documents.
arxiv:2401.18059 v1 · 2024-01-31 · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{6OZJYRIWZULFKVUIJ26HJP7DQK}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
by coupling RAPTOR retrieval with the use of GPT-4, we can improve the best performance on the QuALITY benchmark by 20% in absolute accuracy.
The recursive clustering and summarization process effectively captures and preserves all relevant information from the original document without significant loss or distortion.
RAPTOR introduces a tree-organized retrieval method using recursive abstractive summaries, achieving a 20% absolute accuracy improvement on the QuALITY benchmark when paired with GPT-4.
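The recursive build described in the claims above can be illustrated with a toy sketch. Everything in it is a stand-in chosen for illustration, not the paper's method: `toy_summarize` simply truncates concatenated text where a language-model summary would go, and nodes are grouped by fixed-size adjacent windows (`group_size`) where a real clustering step would go. The names `Node`, `toy_summarize`, and `build_tree` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    text: str
    level: int
    children: list["Node"] = field(default_factory=list)


def toy_summarize(texts, max_words=12):
    # Stand-in for an abstractive summarizer (assumption): keep the first words.
    words = " ".join(texts).split()
    return " ".join(words[:max_words])


def build_tree(chunks, group_size=2):
    """Return the tree as a list of layers, leaves first, root last."""
    if not chunks:
        raise ValueError("need at least one chunk")
    if group_size < 2:
        raise ValueError("group_size must be >= 2 so each layer shrinks")
    layer = [Node(text=c, level=0) for c in chunks]
    layers = [layer]
    while len(layer) > 1:
        parents = []
        # Stand-in for clustering (assumption): group adjacent nodes.
        for i in range(0, len(layer), group_size):
            group = layer[i:i + group_size]
            parents.append(
                Node(
                    text=toy_summarize([n.text for n in group]),
                    level=group[0].level + 1,
                    children=group,
                )
            )
        layer = parents
        layers.append(layer)
    return layers


chunks = [f"chunk {i} text" for i in range(5)]
layers = build_tree(chunks)
root = layers[-1][0]
```

Each pass replaces a layer with a shorter layer of summaries, so the loop ends at a single root. A retriever could then search nodes from any layer, which is the sense in which the retrieval is tree-organized.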
Receipt and verification
| First computed | 2026-05-17T23:38:52.495670Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
f3b29c4516cd165556884ebc74bfe382b5fb70fce695763d78937257b83660d2
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/6OZJYRIWZULFKVUIJ26HJP7DQK \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: f3b29c4516cd165556884ebc74bfe382b5fb70fce695763d78937257b83660d2
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "4b4e8795849e2575548f7f664081a1408d6d307b2d61c069211c535628b89293",
    "cross_cats_sorted": [
      "cs.LG"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2024-01-31T18:30:21Z",
    "title_canon_sha256": "a7ff4b20c4ee1b6df25b8a18f737d4ad1efe8efe0ae5bf44c02ac57cc8869243"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2401.18059",
    "kind": "arxiv",
    "version": 1
  }
}
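The hashing step of the verification command can also be run offline against the record printed above. The sketch below uses the same serialization settings as the one-liner (sorted keys, compact separators, `ensure_ascii=False`, UTF-8 bytes); the helper name `canonical_sha256` is hypothetical, and the record literal is copied from this page rather than fetched. The resulting digest is meant to be compared by hand with the canonical hash listed above.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    # Same canonical form as the shell one-liner: sorted keys, no whitespace.
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


record = {
    "metadata": {
        "abstract_canon_sha256": "4b4e8795849e2575548f7f664081a1408d6d307b2d61c069211c535628b89293",
        "cross_cats_sorted": ["cs.LG"],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.CL",
        "submitted_at": "2024-01-31T18:30:21Z",
        "title_canon_sha256": "a7ff4b20c4ee1b6df25b8a18f737d4ad1efe8efe0ae5bf44c02ac57cc8869243",
    },
    "schema_version": "1.0",
    "source": {"id": "2401.18059", "kind": "arxiv", "version": 1},
}

digest = canonical_sha256(record)
print(digest)  # compare with the canonical hash shown on this page
```

Because keys are sorted before hashing, the digest does not depend on the order or indentation in which the record is written out.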