pith. sign in
Pith Number

pith:LOVGWGVE

pith:2026:LOVGWGVECYQITTLES7U7SADJD3
not attested not anchored not stored refs resolved

Granite Embedding Multilingual R2 Models

Aashka Trivedi, Abraham Daniels, Bhavani Iyer, Jaydeep Sen, Ken Barker, Luis Lastras, Madison Lee, Martin Franz, Meet Doshi, Parul Awasthy, Radu Florian, Riyaz Bhat, Rudra Murthy, Todd Ward, Vignesh P, Vishwajeet Kumar, Yulong Li, Yushu Yang

Granite R2 multilingual embedding models achieve state-of-the-art retrieval across more than 200 languages and code.

arxiv:2605.13521 v1 · 2026-05-13 · cs.IR

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{LOVGWGVECYQITTLES7U7SADJD3}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

state-of-the-art overall performance across multilingual and cross-lingual text search, code retrieval, long-document search, and reasoning retrieval datasets

C2weakest assumption

The reported benchmarks accurately reflect generalization to real enterprise data without overfitting or benchmark-specific tuning, given that training details and exact evaluation protocols are not provided in the abstract.

C3one line summary

Granite Embedding Multilingual R2 releases 311M and 97M parameter bi-encoder models that achieve state-of-the-art retrieval performance on multilingual text, code, long-document, and reasoning datasets.

References

25 extracted · 25 resolved · 8 Pith anchors

[1] Phi-4 Technical Report · arXiv:2412.08905
[2] jina-embeddings-v5-text: Task-Targeted Embedding Distillation 2008 · arXiv:2602.15547
[3] 3 David Arthur and Sergei Vassilvitskii 2008
[4] Mmteb: Massive multilingual text embedding benchmark
[5] Mmteb: Massive multilingual text embedding benchmark 2021 · doi:10.48550/arxiv.2502.13595
Receipt and verification
First computed 2026-05-18T02:44:24.381883Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

5baa6b1aa4162089cd6497e9f900691ed0ed048efe296841434fd1ba852547dc

Aliases

arxiv: 2605.13521 · arxiv_version: 2605.13521v1 · doi: 10.48550/arxiv.2605.13521 · pith_short_12: LOVGWGVECYQI · pith_short_16: LOVGWGVECYQITTLE · pith_short_8: LOVGWGVE
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/LOVGWGVECYQITTLES7U7SADJD3 \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 5baa6b1aa4162089cd6497e9f900691ed0ed048efe296841434fd1ba852547dc
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "0e0dcc0e01adfddb4cbc529f49c8067f6d27be93d7c93868b6cde0f7ff76bde4",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.IR",
    "submitted_at": "2026-05-13T13:37:45Z",
    "title_canon_sha256": "b63f54c0e51d9b9a4fba99bdfb61cdd12a3c3c7d88e5c2a9a200b1658cd322f7"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.13521",
    "kind": "arxiv",
    "version": 1
  }
}