pith:O5FGSQF5
The Linear Representation Hypothesis and the Geometry of Large Language Models
High-level concepts in large language models are linear directions under a causal inner product built from counterfactual pairs.
arxiv:2311.03658 v2 · 2023-11-07 · cs.CL · cs.AI · cs.LG · stat.ML
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{O5FGSQF52JYQJG6M4EJUGWR4RU}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Using this causal inner product, we show how to unify all notions of linear representation. In particular, this allows the construction of probes and steering vectors using counterfactual pairs.
The assumption that the identified non-Euclidean inner product respects language structure in the precise sense required to unify probing and steering, and that counterfactual pairs can be reliably constructed or approximated in the model.
Linear representations of high-level concepts in LLMs are formalized via counterfactuals in input and output spaces, unified under a causal inner product that enables consistent probing and steering.
References
Cited by
Receipt and verification
| First computed | 2026-05-20T00:00:14.503329Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
774a6940bdd271049bcce113435a3c8d3c31947ca6c396b22ef91cc32f9ea2f9
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/O5FGSQF52JYQJG6M4EJUGWR4RU \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 774a6940bdd271049bcce113435a3c8d3c31947ca6c396b22ef91cc32f9ea2f9
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "52fc5acc1032b265edd952df1098bf1b816a082f7237490d8038ab7862d67fac",
"cross_cats_sorted": [
"cs.AI",
"cs.LG",
"stat.ML"
],
"license": "http://creativecommons.org/licenses/by-nc-sa/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2023-11-07T01:59:11Z",
"title_canon_sha256": "4f38a4423afec1c2192b83aca612444daf27dedf0b6ae368025085f67b69be7f"
},
"schema_version": "1.0",
"source": {
"id": "2311.03658",
"kind": "arxiv",
"version": 2
}
}