pith:DIYGYPBP
On the Expressive Power of Contextual Relations in Transformers
Transformers can approximate any contextual relation by treating it as a probability distribution or coupling.
arxiv:2603.25860 v3 · 2026-03-26 · stat.ML · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{DIYGYPBPELPZL2USZDUPXAUCO2}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
we establish a universal approximation theorem for contextual systems using standard Softmax Attention and alternately Sinkhorn normalization. These results show that Transformer architectures can approximate arbitrary contextual relations rules, and that the choice of normalization determines how these relations are represented.
Contextual relations can be fully and faithfully modeled as probabilistic objects, either as conditional distributions or as joint distributions (couplings), and that this modeling captures what Transformers actually compute.
Transformers using softmax or Sinkhorn attention can universally approximate any contextual relation modeled as a probabilistic coupling or conditional distribution.
Cited by
Receipt and verification
| First computed | 2026-05-20T00:03:09.568965Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
1a306c3c2f22df95ea92c8e8fb828276946d0edc0c8babaa8eca278235aae91a
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/DIYGYPBPELPZL2USZDUPXAUCO2 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 1a306c3c2f22df95ea92c8e8fb828276946d0edc0c8babaa8eca278235aae91a
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "d2465bc01ba689734b1490f1615a780cba58c267db80c2714de7180db49df244",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "stat.ML",
"submitted_at": "2026-03-26T19:30:36Z",
"title_canon_sha256": "329b9ea9b336ce35e190fd72b669845d80e143f20af353d534cb05f38058f467"
},
"schema_version": "1.0",
"source": {
"id": "2603.25860",
"kind": "arxiv",
"version": 3
}
}