pith. sign in
Pith Number

pith:TXAUUGXT

pith:2026:TXAUUGXTYJOAVNWP6DGYIWDHTZ
not attested not anchored not stored refs resolved

PaliBench: A Multi-Reference Blueprint for Classical Language Translation Benchmarks

M\'at\'e Metzger, Nadnapang Phophichit

PaliBench shows how to build multi-reference benchmarks for classical language translation from existing scholarly translations without treating any one as definitive.

arxiv:2605.16881 v1 · 2026-05-16 · cs.CL

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{TXAUUGXTYJOAVNWP6DGYIWDHTZ}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

The broader contribution is methodological: PaliBench shows how existing scholarly translations can be transformed into evaluation infrastructure for interpretive textual traditions without treating any single translation as definitive.

C2weakest assumption

The assumption that LLM-assisted alignment of independently segmented translations, automated verification, passage-level quality filtering, and deduplication produce a reliable, unbiased multi-reference dataset that faithfully captures interpretive variation across the three human translations.

C3one line summary

PaliBench is a multi-reference benchmark for Pali-to-English translation built from 1,700 aligned passages of the Sutta Pitaka using three scholarly translations to evaluate LLMs with complementary metrics.

References

38 extracted · 38 resolved · 0 Pith anchors

[1] Frequently Asked Questions About Access to Insight , year =
[2] Nature , volume = 2022
[3] Bamman, David and Burns, Patrick J. , title =. 2020 , note = 2020
[4] Bodhi, Bhikkhu , title =
[5] The Middle Length Discourses of the Buddha: A Translation of the Majjhima Nik

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-05-20T00:03:28.006165Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

9dc14a1af3c25c0ab6cff0cd8458679e7bbc13a27ce1709e4e12dea71aee4c90

Aliases

arxiv: 2605.16881 · arxiv_version: 2605.16881v1 · doi: 10.48550/arxiv.2605.16881 · pith_short_12: TXAUUGXTYJOA · pith_short_16: TXAUUGXTYJOAVNWP · pith_short_8: TXAUUGXT
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/TXAUUGXTYJOAVNWP6DGYIWDHTZ \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 9dc14a1af3c25c0ab6cff0cd8458679e7bbc13a27ce1709e4e12dea71aee4c90
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "0795e511740c72ea4e04afd40cc13e830b9c9ccf021d99dd85931f7131336435",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-05-16T08:43:01Z",
    "title_canon_sha256": "3e8846d19b2b787459b172fa9418d9d894a88af412677d7d4baaa904fd1ae6e1"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.16881",
    "kind": "arxiv",
    "version": 1
  }
}