Beyond Hooking Onto the World: Referential Profiles and the Numerical Structure of LLM Grounding
Pith reviewed 2026-06-26 14:42 UTC · model grok-4.3
The pith
LLMs realize reference as distributed, profile-based numerical structures in vector weights and activations rather than fixed links to objects.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
LLMs do not acquire reference through perception or embodiment but parameterize inherited linguistic relations inside a finite vector system; referential profiles are distributed and recovered through context-sensitive computation, with weights, activations, attention-mediated hidden states, softmax-trained contrasts, and inner-product alignments serving as the mathematical sites at which these relations become stable and causally active.
What carries the argument
Profile-based reference realized as numerically structured, distributed, and superposable patterns recovered through context-sensitive vector computation.
Load-bearing premise
Mechanistic interpretability findings supply indirect support for the existence of numerically structured referential profiles inside LLMs.
What would settle it
A controlled test in which LLMs fail to maintain stable, context-sensitive referential distinctions across extended discourse when only vector-internal mechanisms are available and no additional human-like perceptual or embodied inputs are supplied.
read the original abstract
This paper revisits the grounding problem for large language models in light of recent vector-grounding accounts. I accept the shift from classical symbol grounding to vector grounding, but argue that the current debate remains incomplete in two respects. First, reference is often treated too thinly, as if it were a fixed link between an isolated expression and an object. I argue instead that reference is profile-based, context-sensitive, discourse-level, affectively shaped, and norm-governed. Even in the human case, reference is publicly stabilized through patterns of use, correction, distinction, inference, and continuation rather than through identical private representations. Second, vector grounding requires an account of numerical realization. LLMs do not acquire reference through human perception, memory, intention, embodiment, or understanding. Rather, through optimization, they parameterize linguistic traces of human world-directed practice. In a finite vector system, referential profiles must be distributed, may be superposed, and are recovered through context-sensitive computation. Weights, activations, attention-mediated hidden states, softmax-trained contrasts, and inner-product alignments are the mathematical sites at which inherited linguistic relations become stable and causally active. Mechanistic interpretability findings, including entity-like features, knowledge neurons, and emotion-related activation directions, provide indirect support for this view. They do not show that LLMs possess human reference. They support a more limited thesis: LLMs may possess derivative, language-mediated, profile-based, and numerically structured forms of reference.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper revisits the grounding problem for large language models, accepting vector-grounding accounts but arguing that reference should be reconceived as profile-based, context-sensitive, discourse-level, affectively shaped, and norm-governed rather than a thin fixed link. It claims that LLMs realize such referential profiles numerically through optimization over linguistic traces, with weights, activations, attention, and inner-product alignments serving as the mathematical sites of stabilization; mechanistic interpretability results (entity-like features, knowledge neurons, emotion-related directions) are invoked only as indirect, non-deductive support for the limited thesis that LLMs may possess derivative, language-mediated, profile-based, and numerically structured forms of reference.
Significance. If the limited modal thesis holds, the paper supplies a philosophically richer framework for interpreting reference in finite vector systems, emphasizing distributed superposition and context-sensitive recovery over isolated hooks. This could usefully inform how existing interpretability observations are read, without claiming human-like reference or advancing new empirical predictions. The absence of formal derivations, new data, or falsifiable tests keeps the contribution conceptual and interpretive rather than transformative for the core cs.CL literature.
minor comments (2)
- [Abstract] Abstract, final paragraph: the claim that interpretability findings 'provide indirect support' for numerically structured profiles would be clearer if the manuscript explicitly mapped at least one cited phenomenon (e.g., knowledge neurons) to a concrete aspect of profile-based reference rather than leaving the link suggestive.
- [Abstract] The repeated contrast between 'human perception, memory, intention, embodiment' and LLM optimization is effective, but a short parenthetical example of how a specific linguistic trace (e.g., a discourse continuation pattern) becomes numerically active would help readers see the numerical-realization step.
Simulated Author's Rebuttal
We thank the referee for the accurate summary of the manuscript and for recommending minor revision. The referee correctly identifies the paper as a conceptual and interpretive contribution that accepts vector-grounding accounts while advancing a profile-based reconception of reference and a numerical-realization thesis, supported only indirectly by existing mechanistic interpretability results. We agree that the work supplies no formal derivations, new data, or falsifiable predictions and is not positioned as transformative for core empirical cs.CL research.
Circularity Check
No significant circularity identified
full rationale
The paper advances a modal philosophical proposal that LLMs may possess derivative, language-mediated, profile-based, and numerically structured forms of reference. It accepts vector grounding as given and treats mechanistic interpretability observations only as indirect, non-deductive support rather than as premises from which the framework is derived. No equations, fitted parameters, predictions, self-citations, or uniqueness theorems appear in the provided text. The central claim does not reduce to its inputs by construction and remains consistent with multiple readings of the cited findings.
Axiom & Free-Parameter Ledger
axioms (3)
- domain assumption Reference is profile-based, context-sensitive, discourse-level, affectively shaped, and norm-governed rather than a fixed link between expression and object.
- domain assumption LLMs acquire reference by parameterizing linguistic traces of human world-directed practice through optimization rather than perception or embodiment.
- domain assumption Mechanistic interpretability findings supply indirect support for distributed, superposed referential profiles realized in weights, activations, and attention.
Reference graph
Works this paper leans on
-
[1]
Bender EM, Koller A (2020) Climbing towards NLU : On meaning, form, and understanding in the age of data. Association for Computational Linguistics, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp 5185--5198, doi:10.18653/v1/2020.acl-main.463, ://aclanthology.org/2020.acl-main.463
-
[2]
Bender EM, Gebru T, McMillan-Major A, et al (2021) On the dangers of stochastic parrots: Can language models be too big? doi:10.1145/3442188.3445922
-
[3]
Midwest Studies In Philosophy 4(1):73--121
Burge T (1979) Individualism and the mental. Midwest Studies In Philosophy 4(1):73--121. doi:10.1111/j.1475-4975.1979.tb00374.x, ://onlinelibrary.wiley.com/doi/abs/10.1111/j.1475-4975.1979.tb00374.x
-
[4]
Philosophy and the Mind Sciences 7(1)
Coelho Mollo D, Millière R (2026) The vector grounding problem. Philosophy and the Mind Sciences 7(1). doi:10.33735/phimisci.2026.12307, ://philosophymindscience.org/index.php/phimisci/article/view/12307
-
[5]
Dai D, Dong L, Hao Y, et al (2022) Knowledge neurons in pretrained transformers. Association for Computational Linguistics, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 8493--8502, doi:10.18653/v1/2022.acl-long.581, ://aclanthology.org/2022.acl-long.581
-
[6]
Physica D: Nonlinear Phenomena 42(1--3):335--346
Harnad S (1990) The symbol grounding problem. Physica D: Nonlinear Phenomena 42(1--3):335--346. doi:10.1016/0167-2789(90)90087-6
-
[7]
Koch S (2025) Babbling stochastic parrots? a K ripkean argument for reference in large language models. Philosophy of AI 1:19--33. doi:10.18716/ojs/phai/2025.2325
-
[8]
Harvard University Press, Cambridge, MA
Kripke SA (1980) Naming and Necessity. Harvard University Press, Cambridge, MA
1980
-
[9]
doi:10.1162/coli_a_00522, ://aclanthology.org/2024.cl-3.12/
Mandelkern M, Linzen T (2024) Do language models' words refer? Computational Linguistics 50:1191--1200. doi:10.1162/coli_a_00522, ://aclanthology.org/2024.cl-3.12/
-
[10]
Communications of the ACM 19(3):113--126
Newell A, Simon HA (1976) Computer science as empirical inquiry: Symbols and search. Communications of the ACM 19(3):113--126. doi:10.1145/360018.360022
-
[11]
Putnam H (1975) The Meaning of `Meaning', Cambridge University Press, pp 215--271
1975
-
[12]
Behavioral and Brain Sciences 3(3):417--424
Searle JR (1980) Minds, brains, and programs. Behavioral and Brain Sciences 3(3):417--424. doi:10.1017/S0140525X00005756
-
[13]
Sofroniew N, Templeton A, Jermyn A, et al (2026) Emotion concepts and their function in a large language model. ://arxiv.org/abs/2604.07729
Pith/arXiv arXiv 2026
-
[14]
Transformer Circuits Thread ://transformer-circuits.pub/2024/scaling-monosemanticity/
Templeton A, Conerly T, Marcus J, et al (2024) Scaling monosemanticity: Extracting interpretable features from claude 3 sonnet. Transformer Circuits Thread ://transformer-circuits.pub/2024/scaling-monosemanticity/
2024
-
[15]
Blackwell, Oxford
Wittgenstein L (1953) Philosophical Investigations. Blackwell, Oxford
1953
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.