pith:42BTHFBO
UnWeaving the knots of GraphRAG -- turns out VectorRAG is almost enough
Entity decomposition of documents into cross-chunk links simplifies GraphRAG to near VectorRAG performance while preserving source fidelity.
arxiv:2603.29875 v3 · 2026-02-06 · cs.IR · cs.AI · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{42BTHFBO6733UOG5GGERTRYEDT}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
entity-based decomposition yields a more distilled representation of original information, and additionally serves to reduce noise in the indexing, and generation process.
That an LLM can reliably extract entities across chunks without introducing systematic errors or hallucinations that would then propagate into retrieval.
UnWeaver disentangles documents into entities via LLM to retrieve original chunks, yielding a simpler alternative to GraphRAG that still reduces noise and preserves source fidelity.
Cited by
Receipt and verification
| First computed | 2026-06-09T02:07:25.722438Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
e68333942ef7f7ba38dd318919c7041cc5300b9c6c5189388e97c7cf73a1d617
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/42BTHFBO6733UOG5GGERTRYEDT \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: e68333942ef7f7ba38dd318919c7041cc5300b9c6c5189388e97c7cf73a1d617
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "2c37fcac6caa56b6177af9c68f1d365c245853792da85a68903d754a17a7dec1",
"cross_cats_sorted": [
"cs.AI",
"cs.CL"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.IR",
"submitted_at": "2026-02-06T11:37:10Z",
"title_canon_sha256": "ac3dff5b864ed7943b3b8ce6fc1fde67b916e92266c42eda96eea0b7fa6115d4"
},
"schema_version": "1.0",
"source": {
"id": "2603.29875",
"kind": "arxiv",
"version": 3
}
}