Naturalistic Computational Cognitive Science: Towards generalizable models and theories that capture the full range of natural behavior
Pith reviewed 2026-05-23 02:36 UTC · model grok-4.3
The pith
Cognitive science must shift to naturalistic experiments and AI-accommodating models to build theories that generalize to natural behavior.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Incorporating a broader range of naturalistic experimental paradigms, along with computational models that can accommodate them, is necessary to resolve some aspects of natural intelligence and to ensure that theories generalize beyond the lab.
What carries the argument
Naturalistic computational cognitive science, the integration of naturalistic experimental paradigms with AI-derived models capable of handling complex real-world data.
If this is right
- Cognitive theories will need to account for behaviors and processes that only appear under naturalistic conditions.
- AI models trained on naturalistic data can supply new hypotheses about the roots of cognitive and neural phenomena.
- Computational models can be built that solve the actual problems of natural cognition while providing reductive explanations of their operation.
- Methodological practices can be adjusted to support cumulative progress without sacrificing experimental control.
Where Pith is reading between the lines
- This approach could change how experiments are designed in areas like decision-making or perception by prioritizing tasks that mirror everyday environments.
- It raises the possibility that some current debates in cognitive science stem from the artificial constraints of lab tasks rather than fundamental disagreements about mechanisms.
- Testing whether AI models trained naturalistically better predict human behavior in uncontrolled settings would provide a direct check on the claim.
Load-bearing premise
Differences seen in some naturalistic settings reflect a general requirement to change methods across the field rather than isolated cases that existing frameworks can already address.
What would settle it
A demonstration that standard lab paradigms and models without naturalistic elements can fully predict and explain the key behaviors and processes observed in real-world cognitive tasks across multiple domains.
Figures
read the original abstract
How can cognitive science build generalizable theories that span the full scope of natural situations and behaviors? We argue that progress in Artificial Intelligence (AI) offers timely opportunities for cognitive science to embrace experiments with increasingly naturalistic stimuli, tasks, and behaviors; and computational models that can accommodate these changes. We first review a growing body of research spanning neuroscience, cognitive science, and AI that suggests that incorporating a broader range of naturalistic experimental paradigms, and models that accommodate them, may be necessary to resolve some aspects of natural intelligence and ensure that our theories generalize. We review cases from cognitive science and neuroscience where naturalistic paradigms elicit distinct behaviors or engage different processes. We then discuss recent progress in AI that shows that learning from naturalistic data yields qualitatively different patterns of behavior and generalization, and examine how these findings impact the conclusions we draw from cognitive modeling, and can help yield new hypotheses for the roots of cognitive and neural phenomena. We then suggest that integrating recent progress in AI and cognitive science will enable us to engage with more naturalistic phenomena without giving up experimental control or the pursuit of theoretically grounded understanding. We offer practical guidance on how methodological practices can contribute to cumulative progress in naturalistic computational cognitive science, and illustrate a path towards building computational models that solve the real problems of natural cognition, together with a reductive understanding of the processes and principles by which they do so.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is a perspective piece arguing that cognitive science should adopt more naturalistic experimental paradigms (stimuli, tasks, behaviors) and computational models capable of accommodating them in order to build generalizable theories spanning the full range of natural situations. It reviews cases from cognitive science and neuroscience where naturalistic paradigms produce distinct behaviors or engage different processes, draws parallels to AI results showing qualitatively different generalization from naturalistic data, and offers practical guidance for integrating these approaches while preserving experimental control and theoretical grounding.
Significance. If the argument holds, the perspective could encourage a productive shift toward ecologically valid yet controlled research, helping resolve discrepancies in understanding natural intelligence and fostering cumulative progress by bridging cognitive science with recent AI advances on naturalistic data. The manuscript's value lies in its synthesis of existing literature and appropriately hedged claims rather than new quantitative evidence or derivations.
Simulated Author's Rebuttal
We thank the referee for their positive and constructive review. We are pleased that the manuscript's synthesis of existing literature, appropriately hedged claims, and potential to foster integration between naturalistic paradigms and AI models were recognized, and we appreciate the recommendation to accept.
Circularity Check
No significant circularity
full rationale
The paper is a perspective/review article with no mathematical derivations, equations, fitted parameters, or formal models. Its arguments rest on reviews of external literature from cognitive science, neuroscience, and AI, with hedged claims about the potential value of naturalistic paradigms. No self-citations function as load-bearing uniqueness theorems or ansatzes, and no predictions reduce by construction to the paper's own inputs. The central thesis is interpretive advocacy rather than a deductive chain, making the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 1 Pith paper
-
Active inference as a unified model of collision avoidance behavior in human drivers
Active inference model unifies human collision avoidance by reproducing meta-analysis aggregates and simulator-specific effects on response timing, maneuver selection, and execution.
Reference graph
Works this paper leans on
-
[1]
Abel, D., Barreto, A., Van Roy, B., Precup, D., van Hasselt, H. P., and Singh, S. (2024). A definition of continual reinforcement learning. Advances in Neural Information Processing Systems,
work page 2024
-
[2]
Agarwal, R., Schwarzer, M., Castro, P. S., Courville, A. C., and Bellemare, M. (2021). Deep reinforcement learning at the edge of the statistical precipice. Advances in neural information processing systems , 34:29304–29320. Allen, K., Br¨ andle, F., Botvinick, M., Fan, J. E., Gershman, S. J., Gopnik, A., Griffiths, T. L., Hartshorne, J. K., Hauser, T. U....
work page 2021
-
[3]
Cao, R. and Yamins, D. (2024a). Explanatory models in neuroscience, part 1: Taking mech- anistic abstraction seriously. Cognitive Systems Research , page 101244. Cao, R. and Yamins, D. (2024b). Explanatory models in neuroscience, part 2: Functional intelligibility and the contravariance principle. Cognitive Systems Research , 85:101200. Carlson, R. W. and...
-
[4]
Chomsky, N. (1957). Syntactic structures . Mouton de Gruyter. Chomsky, N. (1965). Aspects of the Theory of Syntax . MIT press. Chomsky, N. (2014). The minimalist program . MIT press. Cichy, R. M. and Kaiser, D. (2019). Deep neural networks as scientific models. Trends in cognitive sciences , 23(4):305–317. Cisek, P. (1999). Beyond the computer metaphor: B...
-
[5]
Elman, J. L. (1996). Rethinking innateness: A connectionist perspective on development , volume
work page 1996
-
[6]
MIT press. Engelhard, I. M., van den Hout, M. A., and McNally, R. J. (2008). Memory consistency for traumatic events in dutch soldiers deployed to iraq. Memory, 16(1):3–9. Erev, I. and Greiner, B. (2015). The 1-800 critique, counter-examples, and the future of behavioral economics. Handbook of experimental economic methodology , pages 151–165. Fan, L., Wa...
-
[7]
Harvard university press Cambridge, MA. Fodor, J. A. and Pylyshyn, Z. W. (1988). Connectionism and cognitive architecture: A critical analysis. Cognition, 28(1-2):3–71. Francis, K. B., Terbeck, S., Briazu, R. A., Haines, A., Gummerum, M., Ganis, G., and Howard, I. S. (2017). Simulating moral actions: An investigation of personal force in virtual moral dil...
work page 1988
-
[8]
Freeman, W. J. (1995). Chaos in the brain: Possible roles in biological intelligence. Interna- tional Journal of Intelligent Systems , 10(1):71–88. Fu, S., Tamir, N., Sundaram, S., Chai, L., Zhang, R., Dekel, T., and Isola, P. (2024). Dream- sim: Learning new dimensions of human visual similarity using synthetic data. Advances in Neural Information Proces...
work page 1995
-
[9]
CoRR abs/1811.12231 (2018), http://arxiv.org/ abs/1811.12231
Futrell, R., Gibson, E., Tily, H. J., Blank, I., Vishnevetsky, A., Piantadosi, S. T., and Fe- dorenko, E. (2021). The natural stories corpus: a reading-time corpus of english texts containing rare syntactic constructions. Language Resources and Evaluation , 55:63–77. Futrell, R. and Mahowald, K. (2025). How linguistics learned to stop worrying and love th...
-
[10]
L., Chater, N., Kemp, C., Perfors, A., and Tenenbaum, J
41 Griffiths, T. L., Chater, N., Kemp, C., Perfors, A., and Tenenbaum, J. B. (2010). Probabilis- tic models of cognition: Exploring representations and inductive biases. Trends in cognitive sciences, 14(8):357–364. Grim, P., Rosenberger, R., Rosenfeld, A., Anderson, B., and Eason, R. E. (2013). How simulations fail. Synthese, 190:2367–2390. Grimm, C., Bar...
work page 2010
-
[11]
He, K., Fan, H., Wu, Y., Xie, S., and Girshick, R
Springer. He, K., Fan, H., Wu, Y., Xie, S., and Girshick, R. (2020). Momentum contrast for unsu- pervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages 9729–9738. He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the...
work page 2020
-
[12]
42 Henrich, J. (2016). The secret of our success: How culture is driving human evolution, domesticating our species, and making us smarter . princeton University press. Higgins, I., Chang, L., Langston, V., Hassabis, D., Summerfield, C., Tsao, D., and Botvinick, M. (2021). Unsupervised deep learning identifies semantic disentanglement in single infer- ote...
-
[13]
Krypotos, A.-M., Vervliet, B., and Engelhard, I. M. (2018). The validity of human avoidance paradigms. Behaviour Research and Therapy , 111:99–105. 44 Laird, J. E. (2019). The Soar cognitive architecture . MIT press. Lake, B. M. and Baroni, M. (2023). Human-like systematic generalization through a meta- learning neural network. Nature, 623(7985):115–121. ...
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[14]
Lee, I. (2014). Publish or perish: The myth and reality of academic publishing. Language teaching, 47(2):250–261. Lee, M. D. and Wagenmakers, E.-J. (2013). Bayesian cognitive modeling: A practical course . Cambridge university press. Levine, S., Finn, C., Darrell, T., and Abbeel, P. (2016). End-to-end training of deep visuo- motor policies. Journal of Mac...
-
[15]
Lyle, C., Rowland, M., and Dabney, W. (2022). Understanding and preventing capacity loss in reinforcement learning. arXiv preprint arXiv:2204.09560. Machado, M. C., Bellemare, M. G., Talvitie, E., Veness, J., Hausknecht, M., and Bowling, M. (2018). Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents. Journ...
-
[16]
McCoy, R. T. and Griffiths, T. L. (2025). Modeling rapid language learning by distilling bayesian priors into artificial neural networks. Nature Communications , 16(1):1–14. Melis, G., Dyer, C., and Blunsom, P. (2017). On the state of the art of evaluation in neural language models. arXiv preprint arXiv:1707.05589. Merel, J., Botvinick, M., and Wayne, G. ...
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[17]
Misra, K. and Mahowald, K. (2024). Language models learn rare phenomena from less rare phenomena: The case of the missing aanns. arXiv preprint arXiv:2403.19827. Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., Graves, A., Riedmiller, M., Fidjeland, A. K., Ostrovski, G., et al. (2015). Human-level control through deep rein...
-
[18]
Nanda, N., Chan, L., Lieberum, T., Smith, J., and Steinhardt, J. (2023). Progress measures for grokking via mechanistic interpretability. In The Eleventh International Conference on Learning Representations. Naselaris, T., Bassett, D. S., Fletcher, A. K., Kording, K., Kriegeskorte, N., Nienborg, H., Poldrack, R. A., Shohamy, D., and Kay, K. (2018). Cognit...
work page 2023
-
[19]
D., Conway, A., Cowan, N., Donkin, C., Farrell, S., Hitch, G
Oberauer, K., Lewandowsky, S., Awh, E., Brown, G. D., Conway, A., Cowan, N., Donkin, C., Farrell, S., Hitch, G. J., Hurlstone, M. J., et al. (2018). Benchmarks for models of short-term and working memory. Psychological bulletin , 144(9):885. Olshausen, B. A. and Field, D. J. (1996). Emergence of simple-cell receptive field properties by learning a sparse ...
-
[20]
Poldrack, R. A., Baker, C. I., Durnez, J., Gorgolewski, K. J., Matthews, P. M., Munaf` o, M. R., Nichols, T. E., Poline, J.-B., Vul, E., and Yarkoni, T. (2017). Scanning the horizon: towards transparent and reproducible neuroimaging research. Nature reviews neuroscience , 18(2):115–126. Prystawski, B., Li, M., and Goodman, N. (2024). Why think step by ste...
work page 2017
-
[21]
Puebla, G. and Bowers, J. S. (2022). Can deep convolutional neural networks support rela- tional reasoning in the same-different task? Journal of Vision , 22(10):11–11. 49 Quilty-Dunn, J., Porot, N., and Mandelbaum, E. (2023). The best game in town: The reemergence of the language-of-thought hypothesis across the cognitive sciences. Behav- ioral and Brain...
work page 2022
-
[22]
Rahnev, D. and Denison, R. N. (2018). Suboptimality in perceptual decision making. Be- havioral and brain sciences , 41:e223. Rahnev, D., Desender, K., Lee, A. L., Adler, W. T., Aguilar-Lleyda, D., Akdo˘ gan, B., Arbuzova, P., Atlas, L. Y., Balcı, F., Bang, J. W., et al. (2020). The confidence database. Nature human behaviour , 4(3):317–325. Recht, B. (20...
-
[23]
Yamins, D. L. and DiCarlo, J. J. (2016). Using goal-driven deep learning models to understand sensory cortex. Nature neuroscience , 19(3):356–365. Yamins, D. L., Hong, H., Cadieu, C. F., Solomon, E. A., Seibert, D., and DiCarlo, J. J. (2014). Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proceedings of the nat...
work page 2016
-
[24]
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., and Torralba, A. (2014). Object detectors emerge in deep scene cnns. arXiv preprint arXiv:1412.6856. 55
work page internal anchor Pith review Pith/arXiv arXiv 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.