Taming the Centaur(s) with LAPITHS: a framework for a theoretically grounded interpretation of AI performances
Pith reviewed 2026-05-07 07:21 UTC · model grok-4.3
The pith
A new framework shows that high AI task performance does not justify claims of human-like cognition in models like CENTAUR.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors develop LAPITHS to counteract behavioristic tendencies that equate high task performance in transformer models with human-like underlying computation. They apply two assessments: the Minimal Cognitive Grid to estimate cognitive plausibility of artificial systems, and behavioral comparisons showing that equivalent outputs can be produced by systems that do not meet structural constraints associated with cognition. This shows that claims advanced for models such as CENTAUR, proposed as artificial unified models of cognition, are not theoretically or empirically justified and that their outputs do not provide independent explanatory insight into human cognition.
What carries the argument
LAPITHS, the framework that pairs a theoretically motivated scoring of cognitive plausibility with behavioral reproduction tests using non-cognitive systems to evaluate claims of human-likeness.
If this is right
- Cognitive claims for AI systems must include validation against structural criteria rather than relying on task performance alone.
- Models proposed as unified accounts of cognition require explicit checks against established constraints from cognitive science.
- Equivalent results achieved by systems without cognitive plausibility weaken inferences that the original models operate via human-like mechanisms.
- Evaluations of transformer-based models should incorporate theoretical grounding to support assertions of cognitive abilities.
- Research emphasis should shift toward explanatory value for human cognition instead of behavioral matching in isolation.
Where Pith is reading between the lines
- The same method could be applied to other large language models to test which capabilities truly require cognitive architectures.
- It points toward the value of building AI systems that embed explicit cognitive constraints rather than depending on scale alone.
- Connections emerge to broader questions about what counts as sufficient evidence for attributing computation types across artificial and natural systems.
- Future benchmarks could integrate the plausibility criteria directly to produce more discriminating tests of model claims.
Load-bearing premise
The Minimal Cognitive Grid supplies a complete and non-circular measure of cognitive plausibility, and that reproducing similar task performance with systems lacking cognitive structural constraints is enough to invalidate claims of human-like computation.
What would settle it
An experiment in which a system that satisfies the Minimal Cognitive Grid criteria still fails to match the specific performance patterns reported for CENTAUR-like models on the same cognitive benchmarks, or in which no non-cognitive system can be constructed that matches those patterns.
Figures
read the original abstract
We introduce a framework called LAPITHS (Language model Analysis through Paradigm grounded Interpretations of Theses about Human likenesS) and use it to show that several major claims advanced by models such as CENTAUR, proposed as an artificial Unified Model of Cognition, are not theoretically or empirically justified. LAPITHS provides a principled reference point for counteracting the current behaviouristic tendency in AI research to interpret the human level performances of transformer based language models as evidence of human like underlying computation and, by extension, as signs of cognitive abilities. The novelty of LAPITHS lies in making explicit the arguments grounded in two quantitative assessments: (i) the Minimal Cognitive Grid, a theoretically motivated method for estimating the cognitive plausibility of artificial systems, and (ii) a behavioural comparison showing that results similar to those reported for CENTAUR like models can be reproduced by other systems that do not satisfy the structural constraints typically associated with cognitive plausibility, and whose outputs do not provide independent explanatory insight into human cognition.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the LAPITHS framework to provide a theoretically grounded interpretation of AI model performances. It claims that several major assertions about models such as CENTAUR (positioned as artificial Unified Models of Cognition) lack justification, because these models fail to satisfy the structural constraints of the Minimal Cognitive Grid and because comparable task results can be reproduced by non-cognitive baseline systems whose outputs offer no independent explanatory insight into human cognition.
Significance. If the central argument holds, LAPITHS supplies a principled counter to purely behavioristic readings of transformer performance by anchoring evaluation in explicit cognitive-science constraints rather than surface metrics alone. The manuscript earns credit for defining the Minimal Cognitive Grid via explicit criteria in §3 (derived from established constraints rather than post-hoc fitting) and for illustrating the framework with concrete behavioral reproductions against non-cognitive baselines in §4; these elements make the critique falsifiable and reproducible.
minor comments (3)
- [Abstract] Abstract: the summary of the two quantitative assessments is clear in outline but remains high-level; adding one concrete example of a reproduced metric or one explicit Grid criterion would make the contribution more immediately graspable without lengthening the abstract.
- [§4] §4: the behavioral comparison protocol is described at a high level; a short table or enumerated list of the exact non-cognitive baselines, the performance metrics matched, and the statistical criterion used to declare similarity would improve reproducibility and allow readers to assess the strength of the counter-examples directly.
- [§3] §3: while the Grid is presented as derived from established cognitive-science constraints, a brief paragraph situating its criteria relative to prior cognitive benchmarks (e.g., those used in human cognitive modeling) would clarify its novelty and reduce any appearance of circularity for readers unfamiliar with the authors’ earlier work.
Simulated Author's Rebuttal
We thank the referee for their positive and constructive review, as well as the recommendation for minor revision. The referee's summary accurately reflects the purpose of the LAPITHS framework in offering a theoretically motivated alternative to purely behavioral interpretations of high-performing AI models such as CENTAUR.
read point-by-point responses
-
Referee: The manuscript earns credit for defining the Minimal Cognitive Grid via explicit criteria in §3 (derived from established constraints rather than post-hoc fitting) and for illustrating the framework with concrete behavioral reproductions against non-cognitive baselines in §4; these elements make the critique falsifiable and reproducible.
Authors: We are pleased that the referee highlights these features. The criteria in §3 are drawn directly from prior cognitive-science literature on minimal requirements for cognitive plausibility, and the §4 comparisons use publicly available baselines to ensure reproducibility. We do not see a need for revision on this point. revision: no
-
Referee: If the central argument holds, LAPITHS supplies a principled counter to purely behavioristic readings of transformer performance by anchoring evaluation in explicit cognitive-science constraints rather than surface metrics alone.
Authors: We agree that this is the core intended contribution of the work. The framework is constructed precisely to shift evaluation away from surface performance toward structural constraints that have independent theoretical motivation in cognitive science. revision: no
Circularity Check
No significant circularity detected in LAPITHS derivation
full rationale
The paper defines LAPITHS explicitly via the Minimal Cognitive Grid in section 3, presented as derived from established cognitive science constraints rather than post-hoc fitting or self-referential inputs. Section 4 provides concrete behavioral reproductions with non-cognitive baselines that violate the grid's structural criteria while matching reported metrics. No equations or predictions reduce by construction to fitted parameters; the central claim (CENTAUR-style performance does not entail human-like computation) rests on independent application of the grid and external baselines. Any self-citations for the grid's origins are not load-bearing, as the criteria are stated explicitly and falsifiable against external benchmarks. The framework is self-contained against cognitive science literature without reducing to its own definitions.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Human cognition possesses identifiable structural constraints that can be listed in a grid and used to evaluate artificial systems.
- domain assumption Reproducing task performance with systems that lack those structural constraints is sufficient to show that the original model's performance does not indicate human-like computation.
invented entities (2)
-
LAPITHS framework
no independent evidence
-
Minimal Cognitive Grid
no independent evidence
Reference graph
Works this paper leans on
-
[1]
On the Opportunities and Risks of Foundation Models
Abbate, Francesco and Rocco Gaudenzi (2025). “Ungrounding intelligence: From biological minds to artificial cognition”. In:Philosophical Psychology, pp. 1–30. Akam, Thomas, Rui Costa, and Peter Dayan (2015). “Simple plans or sophisticated habits? State, transition and learning interactions in the two-step task”. In:PLoS computational biology11.12, e100464...
work page internal anchor Pith review arXiv 2025
-
[2]
The magical number 4 in short-term memory: A reconsideration of mental storage capacity
Springer Science & Business Media. Cowan, Nelson (2001). “The magical number 4 in short-term memory: A reconsideration of mental storage capacity”. In:Behavioral and brain sciences24.1, pp. 87–114. Crick, Francis (1989). “The recent excitement about neural networks”. In:Nature337.6203, pp. 129–132. Curtis, Clayton E and Mark D’Esposito (2003). “Persistent...
work page doi:10.18653/v1/2025.acl-long.1415.url:https://aclanthology.org/2025.acl-long.1415/ 2001
-
[3]
Structure mapping in analogy and similarity
1, pp. 59–79. 26 APREPRINT- MAY1, 2026 Gentner, Dedre and Arthur B Markman (1997). “Structure mapping in analogy and similarity.” In:American psychologist 52.1, p
2026
-
[4]
doi: 10.1038/s41586-025-09422-z
Gershman, Samuel J and Nathaniel D Daw (2017). “Reinforcement learning and episodic memory in humans and animals: an integrative framework”. In:Annual review of psychology68, pp. 101–128. Gluck, Kevin A. and John E. Laird, eds. (2018).Interactive Task Learning: Humans, Robots, and Agents Acquiring New Tasks Through Natural Interactions. Cambridge, MA: The...
-
[5]
LoRA: Low-Rank Adaptation of Large Language Models
Hick, William E (1952). “On the rate of gain of information”. In:Quarterly Journal of experimental psychology4.1, pp. 11–26. Holyoak, Keith J (2012). “Analogy and relational reasoning”. In:The Oxford handbook of thinking and reasoning, pp. 234–259. Hu, Edward J. et al. (2021).LoRA: Low-Rank Adaptation of Large Language Models. arXiv: 2106.09685 [cs.CL]. U...
work page internal anchor Pith review arXiv 1952
-
[6]
Text comprehension, memory, and learning
Kintsch, Walter (1994). “Text comprehension, memory, and learning.” In:American psychologist49.4, p
1994
-
[7]
Laird, John E, Christian Lebiere, and Paul S Rosenbloom (2017). “A standard model of the mind: Toward a common computational framework across artificial intelligence, cognitive science, neuroscience, and robotics”. In:Ai Magazine 38.4, pp. 13–26. Leivada, Evelina et al. (2025). “Large Language Model probabilities cannot distinguish between possible and im...
-
[8]
Rosenblueth, Arturo and Norbert Wiener (1945). “The role of models in science”. In:Philosophy of science12.4, pp. 316–321. Russell, Stuart Jonathan and Peter Norvig (2012).Artificial intelligence: A modern approach;[the intelligent agent book]. Prentice hall. Salis, Pietro and Matteo Da Pelo (forthcoming). “Chatting with LLMs: neither cognition nor commun...
-
[9]
Can robots make good models of biological behaviour?
Webb, Barbara (2001). “Can robots make good models of biological behaviour?” In:Behavioral and brain sciences 24.6, pp. 1033–1050. Wu, Charley M et al. (2018). “Generalization guides human exploration in vast decision spaces”. In:Nature human behaviour2.12, pp. 915–924. Wulff, Dirk U. and Rui Mata (2026).Addressing Longstanding Challenges in Cognitive Sci...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.