pith:2ZJ4HTA7
Automatic Construction of a Legal Citation Graph from 100 Million Ukrainian Court Decisions: Large-Scale Extraction, Topological Analysis, and Ontology-Driven Clustering
A citation graph from 100 million Ukrainian court decisions encodes legal domain boundaries without supervision and predicts future legislative importance with near-perfect accuracy.
arxiv:2605.15362 v1 · 2026-05-14 · cs.CL · cs.DL · cs.IR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{2ZJ4HTA7GG74SYLRH6UQBGXPVW}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Half a billion citation edges extracted from 100.7 million Ukrainian court decisions reveal that judicial citation structure encodes legal domain boundaries without supervision and predicts future legislative importance with near-perfect accuracy.
Regex patterns applied to full-text decisions accurately identify all six types of citation links at scale, with the 200-decision validation sample being representative of the full 99.5 million documents.
A citation graph built from the complete Ukrainian court registry recovers legal domain boundaries via community detection and predicts legislative importance with AUC 0.9984.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:00:54.460168Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
d653c3cc1f31bfc961713fa9009aefadba8ea5f7eb0498644bd9ab923d5d1dff
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/2ZJ4HTA7GG74SYLRH6UQBGXPVW \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d653c3cc1f31bfc961713fa9009aefadba8ea5f7eb0498644bd9ab923d5d1dff
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "0ebcc47cbe9315bea6ae427e74a25f4447536571620d4195715310544d4c8bb6",
"cross_cats_sorted": [
"cs.DL",
"cs.IR"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-14T19:42:20Z",
"title_canon_sha256": "14998f497ccc87b9050ed3bcebb619066285fb658ac20ebbd43714a39c0b0bfc"
},
"schema_version": "1.0",
"source": {
"id": "2605.15362",
"kind": "arxiv",
"version": 1
}
}