Pith Number

pith:2ZJ4HTA7

pith:2026:2ZJ4HTA7GG74SYLRH6UQBGXPVW
not attested · not anchored · not stored · refs resolved

Automatic Construction of a Legal Citation Graph from 100 Million Ukrainian Court Decisions: Large-Scale Extraction, Topological Analysis, and Ontology-Driven Clustering

Volodymyr Ovcharov

A citation graph from 100 million Ukrainian court decisions encodes legal domain boundaries without supervision and predicts future legislative importance with near-perfect accuracy.

arxiv:2605.15362 v1 · 2026-05-14 · cs.CL · cs.DL · cs.IR

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{2ZJ4HTA7GG74SYLRH6UQBGXPVW}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1 strongest claim

Half a billion citation edges extracted from 100.7 million Ukrainian court decisions reveal that judicial citation structure encodes legal domain boundaries without supervision and predicts future legislative importance with near-perfect accuracy.

C2 weakest assumption

Regex patterns applied to full-text decisions accurately identify all six types of citation links at scale, with the 200-decision validation sample being representative of the full 99.5 million documents.

C3 one line summary

A citation graph built from the complete Ukrainian court registry recovers legal domain boundaries via community detection and predicts legislative importance with AUC 0.9984.
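
For readers unfamiliar with the metric quoted in C3: AUC is the probability that a randomly chosen positive item receives a higher score than a randomly chosen negative one, with ties counted as half. The sketch below computes it by direct pairwise comparison. The function name `auc` and the toy scores and labels are illustrative assumptions, not data or code from the paper.

```python
def auc(scores, labels):
    """Pairwise (Mann-Whitney) AUC: fraction of positive/negative pairs
    in which the positive item has the higher score; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up example (hypothetical values, not results from the paper).
toy_scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
toy_labels = [1, 1, 1, 0, 0, 0]
print(auc(toy_scores, toy_labels))
```

In the toy data, 8 of the 9 positive/negative pairs are ordered correctly, so the value is 8/9. A value such as 0.9984 means almost every such pair is ranked in the right order.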

References

22 extracted · 22 resolved · 1 Pith anchor

[1] Fast Unfolding of Communities in Large Networks 2008 · doi:10.1088/1742-5468/2008/10/p10008
[2] Bommarito, Daniel Martin Katz, and Eric M 2018
[3] LEGAL-BERT: The muppets straight out of law school 2020 · doi:10.18653/v1/2020.findings-emnlp.261
[4] Power-law distributions in empirical data 2009 · doi:10.1137/070710111
[5] Measuring law over time: A network analytical framework with an application to statutes and regulations in the United States and Germany. Frontiers in Physics, 9:658463

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-05-20T00:00:54.460168Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

d653c3cc1f31bfc961713fa9009aefadba8ea5f7eb0498644bd9ab923d5d1dff

Aliases

arxiv: 2605.15362 · arxiv_version: 2605.15362v1 · doi: 10.48550/arxiv.2605.15362 · pith_short_12: 2ZJ4HTA7GG74 · pith_short_16: 2ZJ4HTA7GG74SYLR · pith_short_8: 2ZJ4HTA7
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/2ZJ4HTA7GG74SYLRH6UQBGXPVW \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d653c3cc1f31bfc961713fa9009aefadba8ea5f7eb0498644bd9ab923d5d1dff
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "0ebcc47cbe9315bea6ae427e74a25f4447536571620d4195715310544d4c8bb6",
    "cross_cats_sorted": [
      "cs.DL",
      "cs.IR"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-05-14T19:42:20Z",
    "title_canon_sha256": "14998f497ccc87b9050ed3bcebb619066285fb658ac20ebbd43714a39c0b0bfc"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.15362",
    "kind": "arxiv",
    "version": 1
  }
}
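
The same check can be sketched offline in Python, using the serialization settings from the shell pipeline above (sorted keys, compact separators, `ensure_ascii=False`, UTF-8 bytes, SHA-256). The names `canonical_sha256`, `RECORD`, and `EXPECTED` are illustrative. The sketch assumes the JSON printed on this page is byte-for-byte what the API returns under `.canonical_record`; if that assumption does not hold, the digest will differ.

```python
import hashlib
import json


def canonical_sha256(record):
    """Serialize with sorted keys and compact separators, then hash the
    UTF-8 bytes, mirroring the python3 one-liner in the shell command."""
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()


# Canonical record copied from this page (assumed identical to the API output).
RECORD = {
    "metadata": {
        "abstract_canon_sha256": "0ebcc47cbe9315bea6ae427e74a25f4447536571620d4195715310544d4c8bb6",
        "cross_cats_sorted": ["cs.DL", "cs.IR"],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.CL",
        "submitted_at": "2026-05-14T19:42:20Z",
        "title_canon_sha256": "14998f497ccc87b9050ed3bcebb619066285fb658ac20ebbd43714a39c0b0bfc",
    },
    "schema_version": "1.0",
    "source": {"id": "2605.15362", "kind": "arxiv", "version": 1},
}

# Canonical hash as listed on this page.
EXPECTED = "d653c3cc1f31bfc961713fa9009aefadba8ea5f7eb0498644bd9ab923d5d1dff"

digest = canonical_sha256(RECORD)
print(digest)
print("matches listed canonical hash:", digest == EXPECTED)
```

Because keys are sorted before hashing, the digest does not depend on the key order in which the record was stored or transmitted, which is what lets independent mirrors arrive at the same value.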