pith. sign in
Pith Number

pith:42BTHFBO

pith:2026:42BTHFBO6733UOG5GGERTRYEDT
not attested not anchored not stored refs pending

UnWeaving the knots of GraphRAG -- turns out VectorRAG is almost enough

Adam Kozakiewicz, Mateusz Czy\.znikiewicz, Mateusz Gali\'nski, Micha{\l} Godziszewski, Micha{\l} Karpowicz, Ryszard Tuora, Tomasz Zi\k{e}tkiewicz

Entity decomposition of documents into cross-chunk links simplifies GraphRAG to near VectorRAG performance while preserving source fidelity.

arxiv:2603.29875 v3 · 2026-02-06 · cs.IR · cs.AI · cs.CL

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{42BTHFBO6733UOG5GGERTRYEDT}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

entity-based decomposition yields a more distilled representation of original information, and additionally serves to reduce noise in the indexing, and generation process.

C2weakest assumption

That an LLM can reliably extract entities across chunks without introducing systematic errors or hallucinations that would then propagate into retrieval.

C3one line summary

UnWeaver disentangles documents into entities via LLM to retrieve original chunks, yielding a simpler alternative to GraphRAG that still reduces noise and preserves source fidelity.

Cited by

1 paper in Pith

Receipt and verification
First computed 2026-06-09T02:07:25.722438Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

e68333942ef7f7ba38dd318919c7041cc5300b9c6c5189388e97c7cf73a1d617

Aliases

arxiv: 2603.29875 · arxiv_version: 2603.29875v3 · doi: 10.48550/arxiv.2603.29875 · pith_short_12: 42BTHFBO6733 · pith_short_16: 42BTHFBO6733UOG5 · pith_short_8: 42BTHFBO
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/42BTHFBO6733UOG5GGERTRYEDT \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: e68333942ef7f7ba38dd318919c7041cc5300b9c6c5189388e97c7cf73a1d617
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "2c37fcac6caa56b6177af9c68f1d365c245853792da85a68903d754a17a7dec1",
    "cross_cats_sorted": [
      "cs.AI",
      "cs.CL"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.IR",
    "submitted_at": "2026-02-06T11:37:10Z",
    "title_canon_sha256": "ac3dff5b864ed7943b3b8ce6fc1fde67b916e92266c42eda96eea0b7fa6115d4"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2603.29875",
    "kind": "arxiv",
    "version": 3
  }
}