Pith Number

pith:ET75EIP3

pith:2026:ET75EIP3GNUMCAOH3UHD6TYXVD
Status: not attested · not anchored · not stored · refs pending

Pretraining Language Models on Historical Text

Freda Shi, Junchi Yu, Niclas Griesshaber, Philip Torr, Xiaoxi Luo, Yao Lu, Yixuan Wang, Zachary Shinnick

arxiv:2606.02991 v1 · 2026-06-02 · cs.CL · cs.AI

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ET75EIP3GNUMCAOH3UHD6TYXVD}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open
4 Citations open
5 Replications open
Portable graph bundle live
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.
Receipt and verification
First computed: 2026-06-03T01:05:28.694576Z
Builder: pith-number-builder-2026-05-17-v1
Signature: Pith Ed25519 (pith-v1-2026-05)
Schema: pith-number/v1.0

Canonical hash

24ffd221fb3368c101c7dd0e3f4f17a8c9fa884a22378344e73018acd646609b

Aliases

arxiv: 2606.02991 · arxiv_version: 2606.02991v1 · doi: 10.48550/arxiv.2606.02991 · pith_short_12: ET75EIP3GNUM · pith_short_16: ET75EIP3GNUMCAOH · pith_short_8: ET75EIP3
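The record does not state how the identifier is derived. For this record, though, the 26-character Pith Number coincides with the first 26 characters of the RFC 4648 Base32 encoding of the canonical hash, and the short aliases are its 8-, 12-, and 16-character prefixes. The sketch below illustrates that observation; the helper name `pith_id_from_hash` is hypothetical, and the Base32 derivation is an inference from this one record, not a documented rule.

```python
import base64

def pith_id_from_hash(hex_digest: str, length: int = 26) -> str:
    """Hypothetical helper: Base32-encode a SHA-256 hex digest and truncate it."""
    raw = bytes.fromhex(hex_digest)
    return base64.b32encode(raw).decode("ascii")[:length]

# Canonical hash copied from this record.
canonical_hash = "24ffd221fb3368c101c7dd0e3f4f17a8c9fa884a22378344e73018acd646609b"
pith_id = pith_id_from_hash(canonical_hash)

print(pith_id)  # ET75EIP3GNUMCAOH3UHD6TYXVD for this record
for n in (8, 12, 16):
    # Short aliases are plain prefixes of the full identifier.
    print(f"pith_short_{n}: {pith_id[:n]}")
```

Treat this as a consistency check on a single record, not as a specification of how Pith Numbers are assigned.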
Agent API
Verify this Pith Number yourself
# Fetch the record, re-serialize canonical_record with sorted keys and compact separators, then SHA-256 it.
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ET75EIP3GNUMCAOH3UHD6TYXVD \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 24ffd221fb3368c101c7dd0e3f4f17a8c9fa884a22378344e73018acd646609b
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "6e70dbb297999322c837c45399be4e67b79a617c4b426e6910e558a65b34be95",
    "cross_cats_sorted": [
      "cs.AI"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-06-02T00:59:06Z",
    "title_canon_sha256": "abb6d070159cf9695a6468e7abb84cb68b0b330d07224963fcfae08a6980fb2b"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2606.02991",
    "kind": "arxiv",
    "version": 1
  }
}
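The same check can be run offline against the canonical record as printed here. This is a minimal sketch, assuming the recipe in the verification command (sorted keys, compact separators, no ASCII escaping, UTF-8 encoding, then SHA-256) is the complete canonicalization; `canonical_sha256` is a hypothetical helper name, not part of any Pith API.

```python
import hashlib
import json

def canonical_sha256(record: dict) -> str:
    """Hypothetical helper: serialize deterministically, then hash with SHA-256."""
    canonical = json.dumps(
        record,
        sort_keys=True,          # key order must not affect the digest
        separators=(",", ":"),   # compact form, no incidental whitespace
        ensure_ascii=False,      # keep non-ASCII characters as UTF-8
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

# Canonical record copied from this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "6e70dbb297999322c837c45399be4e67b79a617c4b426e6910e558a65b34be95",
        "cross_cats_sorted": ["cs.AI"],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.CL",
        "submitted_at": "2026-06-02T00:59:06Z",
        "title_canon_sha256": "abb6d070159cf9695a6468e7abb84cb68b0b330d07224963fcfae08a6980fb2b",
    },
    "schema_version": "1.0",
    "source": {"id": "2606.02991", "kind": "arxiv", "version": 1},
}

digest = canonical_sha256(record)
print(digest)
```

Compare the printed digest with the canonical hash listed on this record. Because keys are sorted before hashing, the order in which the fields are typed does not change the result.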