pith. sign in
Pith Number

pith:QPRGIAAY

pith:2026:QPRGIAAYAGUAWZNL5VQNJM3VYH
not attested not anchored not stored refs pending

A Grammar of Machine Learning Workflows: Rejecting Data Leakage at Call Time

Simon Roth

A grammar of eight typed primitives, a directed acyclic graph, and four hard constraints makes the most damaging forms of data leakage structurally unrepresentable in machine learning workflows.

arxiv:2603.10742 v4 · 2026-03-11 · cs.LG

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{QPRGIAAYAGUAWZNL5VQNJM3VYH}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

This paper presents a grammar - eight typed primitives, a directed acyclic graph, and four hard constraints - that makes the most damaging leakage types structurally unrepresentable.

C2weakest assumption

That the four hard constraints and the terminal assessment gate are sufficient to block all damaging leakage types while remaining practical for real workflows and enforceable at call time.

C3one line summary

A grammar of eight typed primitives, a DAG, and four constraints makes the most damaging forms of data leakage in ML workflows structurally unrepresentable.

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-06-02T01:03:45.350814Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

83e264001801a80b65abed60d4b375c1cd5b0469bb303b0798d7288b45836581

Aliases

arxiv: 2603.10742 · arxiv_version: 2603.10742v4 · doi: 10.48550/arxiv.2603.10742 · pith_short_12: QPRGIAAYAGUA · pith_short_16: QPRGIAAYAGUAWZNL · pith_short_8: QPRGIAAY
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/QPRGIAAYAGUAWZNL5VQNJM3VYH \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 83e264001801a80b65abed60d4b375c1cd5b0469bb303b0798d7288b45836581
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "f7b2a51f1b279dc6b76ff9f6899fb610e8fe4fad69eb9d6bc777adf8ea73ad2b",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-03-11T13:15:33Z",
    "title_canon_sha256": "48f12eded636e225f1591a7db43b33bdf4752a6e951ed8bd35919e7e0c07d05a"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2603.10742",
    "kind": "arxiv",
    "version": 4
  }
}