Emergent social transmission of model-based representations without inference
Pith reviewed 2026-05-11 01:53 UTC · model grok-4.3
The pith
Simple observation of expert actions allows naive agents to acquire model-based representations without inferring mental states.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In simulations of a reconfigurable environment where agents search for rewards, a naive learner that either copies observed actions or boosts their values based on an expert's behavior develops internal representations that converge toward those of the expert. Model-based agents, which learn transition and reward models of the environment, show faster convergence and more expert-like models than model-free agents or solo learners, all without any mechanism for inferring the expert's mental states.
What carries the argument
Heuristic action selection or value boosting based on observed expert actions, which biases the learner's experience sampling to indirectly transmit model-based representations.
If this is right
- Model-based learners acquire expert-like internal models of the environment faster when exposed to social cues than when learning alone.
- Higher-level representations can be transmitted culturally through simple behavioral observation that exploits standard reinforcement learning updates.
- Mentalizing or belief inference is not required for the spread of flexible, model-based knowledge in these simulated settings.
- This form of transmission works across different environment configurations where rewards must be located.
Where Pith is reading between the lines
- The same bias mechanism could allow groups of agents to align their internal models over repeated interactions without explicit communication.
- Artificial agents designed with model-based reinforcement learning may naturally support cultural-like knowledge sharing when given access to observed behavior.
- Human social learning studies could test whether brief exposure to expert actions produces similar shifts in participants' causal models of a task.
- Extending the environment to include noisy observations or multiple experts might reveal how robust the convergence remains under more realistic conditions.
Load-bearing premise
The specific rules for how the learner selects actions or boosts values from observed expert behavior are enough to capture the key social learning processes without extra mechanisms.
What would settle it
Run the same simulations but disable the heuristic action selection and value boosting rules; if the learner's representations no longer converge toward the expert's, the transmission mechanism fails.
Figures
read the original abstract
How do people acquire rich, flexible knowledge about their environment from others despite limited cognitive capacity? Humans are often thought to rely on computationally costly mentalizing, such as inferring others' beliefs. In contrast, cultural evolution emphasizes that behavioral transmission can be supported by simple social cues. Using reinforcement learning simulations, we show how minimal social learning can indirectly transmit higher-level representations. We simulate a na\"ive agent searching for rewards in a reconfigurable environment, learning either alone or by observing an expert - crucially, without inferring mental states. Instead, the learner heuristically selects actions or boosts value representations based on observed actions. Our results demonstrate that these cues bias the learner's experience, causing its representation to converge toward the expert's. Model-based learners benefit most from social exposure, showing faster learning and more expert-like representations. These findings show how cultural transmission can arise from simple, non-mentalizing processes exploiting asocial learning mechanisms.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript uses reinforcement learning simulations to show that a naive agent can acquire model-based representations converging toward an expert's through minimal social cues—specifically, heuristic action selection or value boosting based solely on observed expert actions—without mental state inference. These cues bias the learner's experience in a reconfigurable reward environment, with model-based learners exhibiting faster learning and more expert-like representations than model-free ones.
Significance. If the implementation details confirm that the heuristics operate from observable actions alone, the work is significant as a computational demonstration that cultural transmission of higher-level, flexible knowledge can emerge from simple, non-mentalizing processes that exploit standard asocial RL mechanisms. It provides a concrete alternative to inference-heavy accounts of social learning and could inform both cultural evolution theory and the design of observational learning in AI agents.
major comments (2)
- [Methods] Methods section (value boosting and action selection rules): The paper must explicitly demonstrate that value boosting is computed exclusively from observable expert actions without supplying the learner with the expert's internal Q-values, transition model, or shared state representation. If the simulation provides any of these (as is common in standard model-based RL implementations), the reported convergence would be an artifact of the setup rather than emergent from raw behavioral cues, directly undermining the central claim of transmission 'without inference'.
- [Results] Results section (model-based learner advantages): The qualitative claim that model-based learners benefit most (faster learning, more expert-like representations) lacks reported statistical tests, effect sizes, or ablation controls varying the heuristic parameters. Without these, it is unclear whether the differential benefit is robust or specific to the chosen action-selection and boosting rules, weakening support for the broader conclusion about model-based representations.
minor comments (2)
- [Abstract] Abstract: The string 'naïve' is rendered with an encoding artifact ('naïve'); correct to standard spelling for clarity.
- [Methods] Methods: All simulation parameters (learning rates, environment reconfiguration schedule, reward magnitudes, number of trials) should be listed explicitly in the main text or a table to enable full reproducibility.
Simulated Author's Rebuttal
We thank the referee for their constructive comments, which have prompted us to strengthen the clarity and rigor of our manuscript. We address each major comment below and outline the revisions we will make.
read point-by-point responses
-
Referee: [Methods] Methods section (value boosting and action selection rules): The paper must explicitly demonstrate that value boosting is computed exclusively from observable expert actions without supplying the learner with the expert's internal Q-values, transition model, or shared state representation. If the simulation provides any of these (as is common in standard model-based RL implementations), the reported convergence would be an artifact of the setup rather than emergent from raw behavioral cues, directly undermining the central claim of transmission 'without inference'.
Authors: We agree that this distinction is crucial for the validity of our central claim. Our simulations are designed such that the learner only observes the expert's actions in each state, without any access to the expert's internal Q-values, transition model, or state representations. The value boosting mechanism simply increments the value estimate for the observed action by a fixed heuristic amount, and the action selection heuristic increases the probability of selecting the expert's action, both based purely on behavioral observation. No mental state inference or direct knowledge transfer occurs. To address the referee's concern, we will revise the Methods section to include explicit pseudocode and a statement confirming that all social cues derive exclusively from observable actions. This will eliminate any ambiguity regarding the implementation. revision: yes
-
Referee: [Results] Results section (model-based learner advantages): The qualitative claim that model-based learners benefit most (faster learning, more expert-like representations) lacks reported statistical tests, effect sizes, or ablation controls varying the heuristic parameters. Without these, it is unclear whether the differential benefit is robust or specific to the chosen action-selection and boosting rules, weakening support for the broader conclusion about model-based representations.
Authors: We acknowledge the value of quantitative support for our claims. While the current results show consistent patterns across multiple runs, we will enhance the Results section by adding appropriate statistical tests (such as independent t-tests comparing learning curves and representation similarity scores between conditions), reporting effect sizes, and including ablation analyses that vary the heuristic parameters (e.g., different boosting strengths and selection biases). These additions will demonstrate the robustness of the model-based advantage and will be presented in the main text and supplementary information. revision: yes
Circularity Check
No circularity: results emerge from forward simulation dynamics
full rationale
The paper reports outcomes from explicit reinforcement learning simulations of naive agents interacting with an environment and an expert's observable actions. Heuristic rules for action selection and value boosting are applied directly to generate experience; the convergence of model-based representations is an observed consequence of running these dynamics, not a quantity defined in terms of the target result or fitted to it. No equations, self-citations, or ansatzes are invoked that reduce the central claim to its own inputs by construction. The derivation chain is therefore self-contained against the simulation benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Heuristic action selection and value boosting based on observed expert actions are sufficient to bias experience toward expert-like representations.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We simulate a naïve agent searching for rewards in a reconfigurable environment, learning either alone or by observing an expert—crucially, without inferring mental states. Instead, the learner heuristically selects actions or boosts value representations based on observed actions.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Model-based (MB) learning is implemented using Dyna-Q ... maintains an internal belief B about the environment—equivalent to a world model
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
antonov2025exploring APACrefauthors Antonov, G. \ Dayan, P. APACrefauthors \ 2025 . Exploring replay Exploring replay . Nature Communications 16 1 1657
work page 2025
-
[2]
Baker, Rebecca Saxe, and Joshua B
Baker_Saxe_Tenenbaum_2009 APACrefauthors Baker, C L. , Saxe, R. \ Tenenbaum, J B. APACrefauthors \ 2009 . Action understanding as inverse planning Action understanding as inverse planning . Cognition 113 3 329–349 . APACrefDOI doi:10.1016/j.cognition.2009.07.005 APACrefDOI
-
[3]
ve, adult, captive chimpanzees do not socially learn how to make and use sharp stone tools Na \
bandini2023naive APACrefauthors Bandini, E. \ Tennie, C. APACrefauthors \ 2023 . Na \" ve, adult, captive chimpanzees do not socially learn how to make and use sharp stone tools Na \" ve, adult, captive chimpanzees do not socially learn how to make and use sharp stone tools . Scientific Reports 13 1 22733
work page 2023
-
[4]
Bellman1957 APACrefauthors Bellman, R. APACrefauthors \ 1957 . Dynamic Programming Dynamic programming \ ( 1 \ ). Princeton, NJ, USA Princeton University Press
work page 1957
-
[5]
boyd1988culture APACrefauthors Boyd, R. \ Richerson, P J. APACrefauthors \ 1988 . Culture and the evolutionary process Culture and the evolutionary process . University of Chicago press
work page 1988
-
[6]
byrne_learning_1998 APACrefauthors Byrne, R W. \ Russon, A E. APACrefauthors \ 1998 10 . Learning by imitation: A hierarchical approach Learning by imitation: A hierarchical approach . Behavioral and Brain Sciences 21 5 667--684 . APACrefDOI doi:10.1017/S0140525X98001745 APACrefDOI
-
[7]
call2005copying APACrefauthors Call, J. , Carpenter, M. \ Tomasello, M. APACrefauthors \ 2005 . Copying results and copying actions in the process of social learning: chimpanzees (Pan troglodytes) and human children (Homo sapiens) Copying results and copying actions in the process of social learning: chimpanzees (pan troglodytes) and human children (homo ...
work page 2005
-
[8]
collette2017neural APACrefauthors Collette, S. , Pauli, W M. , Bossaerts, P. \ O'Doherty, J. APACrefauthors \ 2017 . Neural computations underlying inverse reinforcement learning in the human brain Neural computations underlying inverse reinforcement learning in the human brain . Elife 6 e29718
work page 2017
-
[9]
daw2011model APACrefauthors Daw, N D. , Gershman, S J. , Seymour, B. , Dayan, P. \ Dolan, R J. APACrefauthors \ 2011 . Model-based influences on humans' choices and striatal prediction errors Model-based influences on humans' choices and striatal prediction errors . Neuron 69 6 1204--1215
work page 2011
- [10]
-
[11]
Galef1988-bc APACrefauthors Galef, B G., Jr. APACrefauthors \ 1988 . Imitation in animals: History, definition and interpretation of the data from the psychological laboratory Imitation in animals: History, definition and interpretation of the data from the psychological laboratory . T R. Zentall\ B G. Galef Jr\ ( ), Social learning: Psychological and Bio...
work page 1988
-
[12]
garg2022individual APACrefauthors Garg, K. , Kello, C T. \ Smaldino, P E. APACrefauthors \ 2022 . Individual exploration and selective social learning: balancing exploration--exploitation trade-offs in collective foraging Individual exploration and selective social learning: balancing exploration--exploitation trade-offs in collective foraging . Journal o...
work page 2022
-
[13]
ha2018recurrent APACrefauthors Ha, D. \ Schmidhuber, J. APACrefauthors \ 2018 . Recurrent world models facilitate policy evolution Recurrent world models facilitate policy evolution . Advances in neural information processing systems 31
work page 2018
-
[14]
hackel2019model APACrefauthors Hackel, L M. , Berg, J J. , Lindstr \"o m, B R. \ Amodio, D M. APACrefauthors \ 2019 . Model-based and model-free social cognition: investigating the role of habit in social attitude formation and choice Model-based and model-free social cognition: investigating the role of habit in social attitude formation and choice . Fro...
work page 2019
-
[15]
hackel2020reinforcement APACrefauthors Hackel, L M. , Mende-Siedlecki, P. \ Amodio, D M. APACrefauthors \ 2020 . Reinforcement learning in social interaction: The distinguishing role of trait inference Reinforcement learning in social interaction: The distinguishing role of trait inference . Journal of Experimental Social Psychology 88 103948
work page 2020
-
[16]
hafner2025mastering APACrefauthors Hafner, D. , Pasukonis, J. , Ba, J. \ Lillicrap, T. APACrefauthors \ 2025 . Mastering diverse control tasks through world models Mastering diverse control tasks through world models . Nature 1--7
work page 2025
-
[17]
hawkins2023flexible APACrefauthors Hawkins, R D. , Berdahl, A M. , Pentland, A S. , Tenenbaum, J B. , Goodman, N D. \ Krafft, P. APACrefauthors \ 2023 . Flexible social inference facilitates targeted social learning when rewards are not observable Flexible social inference facilitates targeted social learning when rewards are not observable . Nature Human...
work page 2023
-
[18]
heyes2018cognitive APACrefauthors Heyes, C. APACrefauthors \ 2018 . Cognitive gadgets: The cultural evolution of thinking Cognitive gadgets: The cultural evolution of thinking . Harvard University Press
work page 2018
-
[19]
horner2005causal APACrefauthors Horner, V. \ Whiten, A. APACrefauthors \ 2005 . Causal knowledge and imitation/emulation switching in chimpanzees (Pan troglodytes) and children (Homo sapiens) Causal knowledge and imitation/emulation switching in chimpanzees (pan troglodytes) and children (homo sapiens) . Animal cognition 8 3 164--181
work page 2005
-
[20]
Jara-Ettinger_2019 APACrefauthors Jara-Ettinger, J. APACrefauthors \ 2019 . Theory of mind as inverse reinforcement learning Theory of mind as inverse reinforcement learning . Current Opinion in Behavioral Sciences 29 105–110
work page 2019
-
[21]
jern2017people APACrefauthors Jern, A. , Lucas, C G. \ Kemp, C. APACrefauthors \ 2017 . People learn other people’s preferences through inverse decision-making People learn other people’s preferences through inverse decision-making . Cognition 168 46--64
work page 2017
-
[22]
jimenez2019prestige APACrefauthors Jim \'e nez, \'A V. \ Mesoudi, A. APACrefauthors \ 2019 . Prestige-biased social learning: Current evidence and outstanding questions Prestige-biased social learning: Current evidence and outstanding questions . Palgrave Communications 5 1
work page 2019
-
[23]
Laland2004-pq APACrefauthors Laland, K N. APACrefauthors \ 2004 02 . Social learning strategies Social learning strategies . Learning & Behavior 32 1 4--14 . APACrefDOI doi:10.3758/bf03196002 APACrefDOI
-
[24]
legare2015imitation APACrefauthors Legare, C H. \ Nielsen, M. APACrefauthors \ 2015 . Imitation and innovation: The dual engines of cultural learning Imitation and innovation: The dual engines of cultural learning . Trends in cognitive sciences 19 11 688--699
work page 2015
-
[25]
lehnert2020reward APACrefauthors Lehnert, L. , Littman, M L. \ Frank, M J. APACrefauthors \ 2020 . Reward-predictive representations generalize across tasks in reinforcement learning Reward-predictive representations generalize across tasks in reinforcement learning . PLoS computational biology 16 10 e1008317
work page 2020
-
[26]
lyons2007hidden APACrefauthors Lyons, D E. , Young, A G. \ Keil, F C. APACrefauthors \ 2007 . The hidden structure of overimitation The hidden structure of overimitation . Proceedings of the National Academy of Sciences 104 50 19751--19756
work page 2007
-
[27]
mantiuk2025curiosity APACrefauthors Mantiuk, F. , Zhou, H. \ Wu, C M. APACrefauthors \ 2025 . From Curiosity to Competence: How World Models Interact with the Dynamics of Exploration From curiosity to competence: How world models interact with the dynamics of exploration . A. Ruggeri, D. Barner, C. Walker \ N. Bramley\ ( ), Proceedings of the 47th Annual ...
-
[28]
mcelreath2008beyond APACrefauthors McElreath, R. , Bell, A V. , Efferson, C. , Lubell, M. , Richerson, P J. \ Waring, T. APACrefauthors \ 2008 . Beyond existence and aiming outside the laboratory: estimating frequency-dependent and pay-off-biased social learning strategies Beyond existence and aiming outside the laboratory: estimating frequency-dependent ...
work page 2008
-
[29]
mesoudi2016cultural APACrefauthors Mesoudi, A. APACrefauthors \ 2016 . Cultural evolution: a review of theory, findings and controversies Cultural evolution: a review of theory, findings and controversies . Evolutionary biology 43 4 481--497
work page 2016
-
[30]
miller2017dorsal APACrefauthors Miller, K J. , Botvinick, M M. \ Brody, C D. APACrefauthors \ 2017 . Dorsal hippocampus contributes to model-based planning Dorsal hippocampus contributes to model-based planning . Nature neuroscience 20 9 1269--1276
work page 2017
-
[31]
najar2020actions APACrefauthors Najar, A. , Bonnet, E. , Bahrami, B. \ Palminteri, S. APACrefauthors \ 2020 . The actions of others act as a pseudo-reward to drive imitation in the context of social reinforcement learning The actions of others act as a pseudo-reward to drive imitation in the context of social reinforcement learning . PLoS biology 18 12 e3001028
work page 2020
- [32]
-
[33]
olsson2020neural APACrefauthors Olsson, A. , Knapska, E. \ Lindstr \"o m, B. APACrefauthors \ 2020 . The neural and computational systems of social learning The neural and computational systems of social learning . Nature Reviews Neuroscience 21 4 197--212
work page 2020
-
[34]
o2015structure APACrefauthors O’Doherty, J P. , Lee, S W. \ McNamee, D. APACrefauthors \ 2015 . The structure of reinforcement-learning mechanisms in the human brain The structure of reinforcement-learning mechanisms in the human brain . Current Opinion in Behavioral Sciences 1 94--100
work page 2015
-
[35]
park2017integration APACrefauthors Park, S A. , Go \" ame, S. , O'Connor, D A. \ Dreher, J C. APACrefauthors \ 2017 . Integration of individual and social information for decision-making in groups of different sizes Integration of individual and social information for decision-making in groups of different sizes . PLoS Biology 15 6 e2001958
work page 2017
-
[36]
peters2008reinforcement APACrefauthors Peters, J. \ Schaal, S. APACrefauthors \ 2008 . Reinforcement learning of motor skills with policy gradients Reinforcement learning of motor skills with policy gradients . Neural networks 21 4 682--697
work page 2008
-
[37]
rendell2010copy APACrefauthors Rendell, L. , Boyd, R. , Cownden, D. , Enquist, M. , Eriksson, K. , Feldman, M W. Laland, K N. APACrefauthors \ 2010 . Why copy others? Insights from the social learning strategies tournament Why copy others? insights from the social learning strategies tournament . Science 328 5975 208--213
work page 2010
-
[38]
rendell2010rogers APACrefauthors Rendell, L. , Fogarty, L. \ Laland, K N. APACrefauthors \ 2010 . ROGERS’PARADOX RECAST AND RESOLVED: POPULATION STRUCTURE AND THE EVOLUTION OF SOCIAL LEARNING STRATEGIES Rogers’paradox recast and resolved: Population structure and the evolution of social learning strategies . Evolution 64 2 534--548
work page 2010
-
[39]
roberts2025environmental APACrefauthors Roberts-Gaal, X. , Bolic, M. \ Cushman, F A. APACrefauthors \ 2025 . Environmental variability shapes the representational format of cultural learning Environmental variability shapes the representational format of cultural learning . Proceedings of the National Academy of Sciences 122 28 e2505283122
work page 2025
-
[40]
Russell_1998 APACrefauthors Russell, S. APACrefauthors \ 1998 07 . Learning agents for uncertain environments Learning agents for uncertain environments . ( 101–103). Madison Wisconsin USA ACM . APACrefDOI doi:10.1145/279943.279964 APACrefDOI
-
[41]
sato2023state APACrefauthors Sato, Y. , Sakai, Y. \ Hirata, S. APACrefauthors \ 2023 . State-transition-free reinforcement learning in chimpanzees (Pan troglodytes) State-transition-free reinforcement learning in chimpanzees (pan troglodytes) . Learning & behavior 51 4 413--427
work page 2023
-
[42]
shteingart2014reinforcement APACrefauthors Shteingart, H. \ Loewenstein, Y. APACrefauthors \ 2014 . Reinforcement learning and human behavior Reinforcement learning and human behavior . Current opinion in neurobiology 25 93--98
work page 2014
-
[43]
Spence1937-gr APACrefauthors Spence, K W. APACrefauthors \ 1937 12 . Experimental studies of learning and the higher mental processes in infra-human primates Experimental studies of learning and the higher mental processes in infra-human primates . Psychological bulletin 34 10 806--850 . APACrefDOI doi:10.1037/h0061498 APACrefDOI
-
[44]
storn1997differential APACrefauthors Storn, R. \ Price, K. APACrefauthors \ 1997 . Differential Evolution--A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces Differential evolution--a simple and efficient heuristic for global optimization over continuous spaces . Journal of Global Optimization 11 4 341--359 . APACrefDOI doi:10...
-
[45]
Sutton_1990 APACrefauthors Sutton, R S. APACrefauthors \ 1990 . Integrated architectures for learning, planning, and reacting based on approximating dynamic programming Integrated architectures for learning, planning, and reacting based on approximating dynamic programming . Proceedings of the Seventh International Conference on Machine Learning Proceedin...
work page 1990
-
[46]
sutton2018reinforcement APACrefauthors Sutton, R S. \ Barto, A G. APACrefauthors \ 2018 . Reinforcement learning: An introduction Reinforcement learning: An introduction . MIT press
work page 2018
-
[47]
tennie2010evidence APACrefauthors Tennie, C. , Call, J. \ Tomasello, M. APACrefauthors \ 2010 . Evidence for emulation in chimpanzees in social settings using the floating peanut task Evidence for emulation in chimpanzees in social settings using the floating peanut task . PLoS One 5 5 e10544
work page 2010
-
[48]
tomasello_cultural_1999 APACrefauthors Tomasello, M. APACrefauthors \ 1999 . The cultural origins of human cognition The cultural origins of human cognition . Cambridge, MA, US Harvard University Press . Pages: vi, 248
work page 1999
-
[49]
toyokawa_social_2019 APACrefauthors Toyokawa, W. , Whalen, A. \ Laland, K N. APACrefauthors \ 2019 01 . Social learning strategies regulate the wisdom and madness of interactive crowds Social learning strategies regulate the wisdom and madness of interactive crowds . Nature Human Behaviour 3 2 183--193 . APACrefDOI doi:10.1038/s41562-018-0518-x APACrefDOI
-
[50]
uchiyama2023model APACrefauthors Uchiyama, R. , Tennie, C. \ Wu, C M. APACrefauthors \ 2023 . Model-Based Assimilation Transmits and Recombines World Models Model-based assimilation transmits and recombines world models . L. Hunt, C. Summerfield, T. Konkle, E. Fedorenko \ T. Naselaris\ ( ), Proceedings of the 2023 Conference on Cognitive Computational Neu...
-
[51]
urain2025survey APACrefauthors Urain, J. , Mandlekar, A. , Du, Y. , Muhammad, N. , Xu, D. , Fragkiadaki, K. others APACrefauthors \ 2025 . A Survey on Deep Generative Models for Robot Learning From Multimodal Demonstrations A survey on deep generative models for robot learning from multimodal demonstrations . IEEE Transactions on Robotics 42 60--79
work page 2025
-
[52]
velez2023teachers APACrefauthors V \'e lez, N. , Chen, A M. , Burke, T. , Cushman, F A. \ Gershman, S J. APACrefauthors \ 2023 . Teachers recruit mentalizing regions to represent learners’ beliefs Teachers recruit mentalizing regions to represent learners’ beliefs . Proceedings of the National Academy of Sciences 120 22 e2215015120
work page 2023
-
[53]
vikbladh2019hippocampal APACrefauthors Vikbladh, O M. , Meager, M R. , King, J. , Blackmon, K. , Devinsky, O. , Shohamy, D. Daw, N D. APACrefauthors \ 2019 . Hippocampal contributions to model-based planning and spatial memory Hippocampal contributions to model-based planning and spatial memory . Neuron 102 3 683--693
work page 2019
-
[54]
witt2024flexible APACrefauthors Witt, A. , Toyokawa, W. , Lala, K N. , Gaissmaier, W. \ Wu, C M. APACrefauthors \ 2024 . Humans flexibly integrate social information despite interindividual differences in reward Humans flexibly integrate social information despite interindividual differences in reward . Proceedings of the National Academy of Sciences 121 ...
-
[55]
Adaptive mechanisms of social and asocial learning in immersive foraging environments , volume =
wu2025adaptive APACrefauthors Wu, C M. , Deffner, D. , Kahl, B. , Meder, B. , Ho, M K. \ Kurvers, R H. APACrefauthors \ 2025 . Adaptive mechanisms of social and asocial learning in immersive foraging environments Adaptive mechanisms of social and asocial learning in immersive foraging environments . Nature Communications 16 3539 . APACrefDOI doi:10.1038/s...
-
[56]
wu2022representational APACrefauthors Wu, C M. , V \'e lez, N. \ Cushman, F A. APACrefauthors \ 2022 . Representational exchange in human social learning: Balancing efficiency and flexibility Representational exchange in human social learning: Balancing efficiency and flexibility . I C. Dezza, E. Schulz \ C M. Wu\ ( ), The Drive for Knowledge: The Science...
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.