The role of System 1 and System 2 semantic memory structure in human and LLM biases

Giulio Rossetti; Katherine Abramski; Massimo Stella

arxiv: 2604.12816 · v1 · submitted 2026-04-14 · 💻 cs.CL

The role of System 1 and System 2 semantic memory structure in human and LLM biases

Katherine Abramski , Giulio Rossetti , Massimo Stella This is my paper

Pith reviewed 2026-05-10 14:45 UTC · model grok-4.3

classification 💻 cs.CL

keywords semantic memorySystem 1 thinkingSystem 2 thinkingimplicit biaslarge language modelsnetwork analysisgender biashuman cognition

0 comments

The pith

Human semantic memory networks show unique irreducible structures that link to lower implicit bias in deliberative thinking, a pattern absent in LLMs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper builds comparable semantic memory networks from human and LLM data to represent fast associative System 1 thinking and slower deliberative System 2 thinking. It finds that only the human networks resist simplification, indicating people possess conceptual knowledge organizations that machines do not replicate. These human structures consistently relate to reduced gender bias, especially in System 2 networks, while no such reliable link appears in the LLM versions. The work therefore treats semantic organization as a cognitive mechanism that helps regulate bias in humans but not in current language models. This distinction matters because it points to limits in how LLMs acquire and use knowledge for bias mitigation.

Core claim

We model System 1 and System 2 thinking as semantic memory networks with distinct structures, built from comparable datasets generated by both humans and LLMs. We find that semantic memory structures are irreducible only in humans, suggesting that LLMs lack certain types of human-like conceptual knowledge. Moreover, semantic memory structure relates consistently to implicit bias only in humans, with lower levels of bias in System 2 structures.

What carries the argument

Semantic memory networks built separately for System 1 and System 2 processes from human and LLM data, then measured with network metrics to test relations to implicit gender bias.

Load-bearing premise

That the networks extracted from human and LLM datasets truly capture the cognitive split between fast associative and slow deliberative thinking, and that the chosen network measures reflect the parts of structure relevant to bias.

What would settle it

Finding that LLMs prompted with different data or methods produce semantic networks with the same irreducible properties as human ones, or that bias scores in humans show no correlation with those network properties once other variables are controlled.

Figures

Figures reproduced from arXiv: 2604.12816 by Giulio Rossetti, Katherine Abramski, Massimo Stella.

**Figure 2.** Figure 2: Measuring implicit bias with spreading activation. [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗

**Figure 3.** Figure 3: Structural reducibility of multilayer semantic networks. [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗

**Figure 4.** Figure 4: Effect sizes disaggregated by stereotype topic. [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗

**Figure 5.** Figure 5: Effect sizes aggregated by stereotype topics. [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗

**Figure 6.** Figure 6: Normalized activation levels for the free associations layer in humans. [PITH_FULL_IMAGE:figures/full_fig_p018_6.png] view at source ↗

**Figure 7.** Figure 7: Normalized activation levels for the definitions layer in humans. [PITH_FULL_IMAGE:figures/full_fig_p019_7.png] view at source ↗

**Figure 8.** Figure 8: Normalized activation levels for the categorical relations layer in humans. [PITH_FULL_IMAGE:figures/full_fig_p020_8.png] view at source ↗

**Figure 9.** Figure 9: Normalized activation levels for the free associations layer in Mistral. [PITH_FULL_IMAGE:figures/full_fig_p021_9.png] view at source ↗

**Figure 10.** Figure 10: Normalized activation levels for the definitions layer in Mistral. [PITH_FULL_IMAGE:figures/full_fig_p022_10.png] view at source ↗

**Figure 11.** Figure 11: Normalized activation levels for the categorical relations layer in Mistral. [PITH_FULL_IMAGE:figures/full_fig_p023_11.png] view at source ↗

**Figure 12.** Figure 12: Normalized activation levels for the free associations layer in Llama3. [PITH_FULL_IMAGE:figures/full_fig_p024_12.png] view at source ↗

**Figure 13.** Figure 13: Normalized activation levels for the definitions layer in Llama3. [PITH_FULL_IMAGE:figures/full_fig_p025_13.png] view at source ↗

**Figure 14.** Figure 14: Normalized activation levels for the categorical relations layer in Llama3. [PITH_FULL_IMAGE:figures/full_fig_p026_14.png] view at source ↗

read the original abstract

Implicit biases in both humans and large language models (LLMs) pose significant societal risks. Dual process theories propose that biases arise primarily from associative System 1 thinking, while deliberative System 2 thinking mitigates bias, but the cognitive mechanisms that give rise to this phenomenon remain poorly understood. To better understand what underlies this duality in humans, and possibly in LLMs, we model System 1 and System 2 thinking as semantic memory networks with distinct structures, built from comparable datasets generated by both humans and LLMs. We then investigate how these distinct semantic memory structures relate to implicit gender bias using network-based evaluation metrics. We find that semantic memory structures are irreducible only in humans, suggesting that LLMs lack certain types of human-like conceptual knowledge. Moreover, semantic memory structure relates consistently to implicit bias only in humans, with lower levels of bias in System~2 structures. These findings suggest that certain types of conceptual knowledge contribute to bias regulation in humans, but not in LLMs, highlighting fundamental differences between human and machine cognition.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims human semantic networks for System 1/2 are irreducible and bias-linked in ways LLMs cannot match, but this rests on unvalidated assumptions about what prompting elicits.

read the letter

The one thing to know is that this work finds semantic memory structures built from human data are irreducible only for humans and tie to lower implicit gender bias only in the deliberative condition, while LLM versions do not show the same patterns. They generate comparable datasets from humans and models under conditions intended to separate associative from deliberative thinking, convert them to semantic networks, and test network metrics against bias measures. What is new is the direct side-by-side application of semantic network analysis to dual-process accounts of bias in humans versus LLMs. Earlier papers have examined bias in models or used networks for memory structure, but this specific combination and the resulting claims about missing conceptual knowledge in LLMs are not already in the cited literature. The framing is useful because it shifts attention from raw outputs to the organization of conceptual knowledge. The soft spot is the missing validation that the LLM prompting conditions actually produce networks analogous to human System 1 and System 2. The abstract gives no indication they checked against established markers such as response latency, explicit-implicit dissociation, or denser local clustering in the fast condition. Without that, differences could simply reflect prompt style rather than deeper memory architecture, which would make the irreducibility and bias-correlation results artifacts. Sample sizes, exact network construction steps, chosen metrics, and statistical controls are also not described, so it is hard to judge robustness. This is for researchers in cognitive modeling of LLMs and bias mitigation who want ideas about what humans might have that current models lack. A reader already working with semantic networks could extract some methodological pointers, but the central claims need more grounding to be reliable. I would send it to peer review so referees can examine the methods and data directly.

Referee Report

3 major / 2 minor

Summary. The paper models System 1 (associative) and System 2 (deliberative) thinking as semantic memory networks constructed from comparable human and LLM-generated datasets. It applies network-based metrics to examine how these structures relate to implicit gender bias, reporting that the structures are irreducible only in humans and that the structure-bias relation holds only for humans, with lower bias associated with System 2 networks. The authors conclude that certain conceptual knowledge supports bias regulation in humans but not in LLMs.

Significance. If the network constructions and metrics are shown to validly instantiate dual-process distinctions, the results would provide evidence of a qualitative divergence in how humans and LLMs organize conceptual knowledge and regulate bias. The network-analytic approach to linking memory topology with implicit bias is a potentially useful bridge between cognitive science and AI evaluation, though its interpretive power depends on the untested assumption that prompting regimes produce cognitively analogous structures.

major comments (3)

[§3] §3 (Network Construction and Elicitation): The claim that LLM networks under different prompting conditions instantiate System 1 vs. System 2 distinctions analogous to humans is load-bearing for both the irreducibility and bias-correlation results, yet the manuscript provides no validation against established dual-process markers (e.g., response latency, explicit/implicit dissociation, or IAT scores) in the human data, nor demonstrates expected qualitative differences such as higher local clustering in System 1 networks.
[Results] Results, bias-correlation analysis (around Tables 2–4): The reported finding that semantic memory structure relates to implicit bias only in humans rests on network metrics whose sensitivity to dataset size, density, or generation style is not controlled or reported; without these controls it is impossible to rule out that the human-only correlation is an artifact of differing network properties rather than a genuine cognitive difference.
[§4] §4 (Irreducibility claim): The assertion that semantic memory structures are 'irreducible only in humans' requires a precise definition of irreducibility (e.g., via specific topological invariants or embedding dimensionality) and a demonstration that the same metrics applied to LLM networks do not simply reflect surface-level differences in output coherence; the current presentation leaves this distinction underspecified.

minor comments (2)

[Abstract, §2] The abstract and §2 use 'System~2' LaTeX spacing inconsistently; standardize notation throughout.
[Figures] Figure captions should explicitly state the number of participants/LLM generations and the exact prompting templates used to build each network.

Simulated Author's Rebuttal

3 responses · 1 unresolved

We thank the referee for their constructive and detailed comments, which have prompted us to strengthen the methodological justifications and controls in the manuscript. We address each major comment below and indicate the revisions made.

read point-by-point responses

Referee: [§3] The claim that LLM networks under different prompting conditions instantiate System 1 vs. System 2 distinctions analogous to humans is load-bearing for both the irreducibility and bias-correlation results, yet the manuscript provides no validation against established dual-process markers (e.g., response latency, explicit/implicit dissociation, or IAT scores) in the human data, nor demonstrates expected qualitative differences such as higher local clustering in System 1 networks.

Authors: We agree that direct validation against markers such as response latency would provide stronger support for the analogy. The human dataset, based on established semantic memory elicitation protocols, does not contain latency or per-participant IAT data, so such validation is not possible with the current resources. In the revision we have expanded the methods section with a detailed theoretical mapping of the prompting regimes to dual-process theory. We have also added reporting of clustering coefficients and other local metrics, confirming higher clustering in human System 1 networks relative to System 2 (as predicted), with smaller differences observed in the LLM networks. These additions address the qualitative-difference concern while acknowledging the data limitation. revision: partial
Referee: [Results] The reported finding that semantic memory structure relates to implicit bias only in humans rests on network metrics whose sensitivity to dataset size, density, or generation style is not controlled or reported; without these controls it is impossible to rule out that the human-only correlation is an artifact of differing network properties rather than a genuine cognitive difference.

Authors: We accept that explicit controls are necessary. The revised manuscript now includes subsampling procedures that equalize network size and density across all conditions, together with sensitivity analyses that vary generation style. After these controls the human-specific structure-bias correlation remains statistically significant while the LLM correlation does not, indicating that the result is not an artifact of differing network properties. revision: yes
Referee: [§4] The assertion that semantic memory structures are 'irreducible only in humans' requires a precise definition of irreducibility (e.g., via specific topological invariants or embedding dimensionality) and a demonstration that the same metrics applied to LLM networks do not simply reflect surface-level differences in output coherence; the current presentation leaves this distinction underspecified.

Authors: We have revised §4 to supply an explicit operational definition: irreducibility is quantified by the persistence of higher-dimensional topological features (via persistent homology) that cannot be recovered from a lower-dimensional embedding without substantial loss of information. We further demonstrate that human networks retain higher irreducibility scores than LLM networks even after matching for coherence metrics and after comparison to randomized null models, indicating that the distinction is not reducible to surface-level output differences. revision: yes

standing simulated objections not resolved

Direct empirical validation against response latency or explicit/implicit dissociation measures from the same human participants cannot be performed because the source dataset does not contain these variables.

Circularity Check

0 steps flagged

No circularity: empirical network construction and correlation analysis remain independent of inputs.

full rationale

The paper builds semantic memory networks from separate human and LLM datasets under different prompting conditions to represent System 1 versus System 2 structures, then applies network metrics to examine relations with implicit bias. No equations, fitted parameters, or self-citations are shown that would make the reported irreducibility or bias correlations reduce to the construction method by definition. The derivation chain consists of data generation, network extraction, metric computation, and statistical comparison; these steps do not collapse into self-definition or renaming of the input data. The central claims therefore retain independent empirical content.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that dual-process theory applies equally to LLMs and that semantic networks built from task data faithfully capture System 1 versus System 2 distinctions.

axioms (1)

domain assumption Dual process theories accurately describe human cognition with System 1 as associative and System 2 as deliberative.
Invoked to frame the modeling of semantic memory networks for both humans and LLMs.

pith-pipeline@v0.9.0 · 5481 in / 1227 out tokens · 44893 ms · 2026-05-10T14:45:51.056354+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

85 extracted references · 85 canonical work pages · 2 internal anchors

[1]

Abramski, R

K. Abramski, R. Improta, G. Rossetti, and M. Stella. The ”llm world of words” english free association norms generated by large language models.Scientific data, 12(1):803, 2025

work page 2025
[2]

Abramski, G

K. Abramski, G. Rossetti, and M. Stella. A word association network methodology for evaluating implicit biases in llms compared to humans.arXiv preprint arXiv:2510.24488, 2025

work page arXiv 2025
[3]

Acerbi and J

A. Acerbi and J. M. Stubbersfield. Large language models show human-like content biases in transmission chain experiments.Proceedings of the National Academy of Sciences, 120(44):e2313790120, 2023

work page 2023
[4]

Agrawal, T

G. Agrawal, T. Kumarage, Z. Alghamdi, and H. Liu. Can knowledge graphs reduce hallucinations in llms?: A survey. InProceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 3947–3960, 2024

work page 2024
[5]

H. An, X. Liu, and D. Zhang. Learning bias-reduced word embeddings using dictionary definitions. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1139–1152, 2022

work page 2022
[6]

J. R. Anderson. A spreading activation theory of memory.Journal of verbal learning and verbal behavior, 22(3):261–295, 1983

work page 1983
[7]

J. R. Anderson, D. Bothell, C. Lebiere, and M. Matessa. An integrated theory of list memory.Journal of Memory and Language, 38(4):341–380, 1998

work page 1998
[8]

J. R. Anderson, M. Matessa, and C. Lebiere. Act-r: A theory of higher level cognition and its relation to visual attention.Human–Computer Interaction, 12(4):439–462, 1997

work page 1997
[9]

X. Bai, A. Wang, I. Sucholutsky, and T. L. Griffiths. Explicitly unbiased large language models still form biased associations.Proceedings of the National Academy of Sciences, 122(8):e2416228122, 2025

work page 2025
[10]

J. A. Bargh and T. L. Chartrand. The unbearable automaticity of being.American psychologist, 54(7):462, 1999

work page 1999
[11]

A. Barr, E. A. Feigenbaum, and P. R. Cohen.The handbook of artificial intelligence, volume 1. HeurisTech Press, 1981

work page 1981
[12]

E. M. Bender, T. Gebru, A. McMillan-Major, and S. Shmitchell. On the dangers of stochastic parrots: Can language models be too big? InProceedings of the 2021 ACM conference on fairness, accountability, and transparency, pages 610–623, 2021

work page 2021
[13]

Boccaletti, G

S. Boccaletti, G. Bianconi, R. Criado, C. I. Del Genio, J. G´ omez-Gardenes, M. Romance, I. Sendina-Nadal, Z. Wang, and M. Zanin. The structure and dynamics of multilayer networks.Physics reports, 544(1):1–122, 2014

work page 2014
[14]

R. J. Brachman. On the epistemological status of semantic networks. InAssociative networks, pages 3–50. Elsevier, 1979

work page 1979
[15]

Brady, P

O. Brady, P. Nulty, L. Zhang, T. E. Ward, and D. P. McGovern. Dual-process theory and decision-making in large language models.Nature Reviews Psychology, pages 1–16, 2025

work page 2025
[16]

Bursell and F

M. Bursell and F. Olsson. Do we need dual-process theory to understand implicit bias? a study of the nature of implicit bias against muslims.Poetics, 87:101549, 2021

work page 2021
[17]

Castro and C

N. Castro and C. S. Siew. Contributions of modern network science to the cognitive sciences: Revisiting research spirals of representation and process.Proceedings of the Royal Society A, 476(2238):20190825, 2020. 27/31

work page 2020
[18]

Citraro, M

S. Citraro, M. S. Vitevitch, M. Stella, and G. Rossetti. Feature-rich multiplex lexical networks reveal mental strategies of early language learning.Scientific Reports, 13(1):1474, 2023

work page 2023
[19]

A. M. Collins and E. F. Loftus. A spreading-activation theory of semantic processing.Psychological review, 82(6):407, 1975

work page 1975
[20]

A. M. Collins and M. R. Quillian. Retrieval time from semantic memory.Journal of verbal learning and verbal behavior, 8(2):240–247, 1969

work page 1969
[21]

De Deyne, D

S. De Deyne, D. J. Navarro, A. Perfors, M. Brysbaert, and G. Storms. The ”small world of words” english word association norms for over 12,000 cue words.Behavior research methods, 51(3):987–1006, 2019

work page 2019
[22]

De Domenico, V

M. De Domenico, V. Nicosia, A. Arenas, and V. Latora. Structural reducibility of multilayer networks. Nature communications, 6(1):6864, 2015

work page 2015
[23]

De Domenico, A

M. De Domenico, A. Sol´ e-Ribalta, E. Cozzo, M. Kivel¨ a, Y. Moreno, M. A. Porter, S. G´ omez, and A. Arenas. Mathematical formulation of multilayer networks.Physical Review X, 3(4):041022, 2013

work page 2013
[24]

E. S. De Duro, E. Franchino, R. Improta, G. A. Veltri, and M. Stella. Cognitive networks identify ai biases on societal issues in large language models.EPJ Data Science, 15(1):7, 2026

work page 2026
[25]

De Houwer, P

J. De Houwer, P. Van Dessel, and T. Moran. Attitudes as propositional representations.Trends in Cognitive Sciences, 25(10):870–882, 2021

work page 2021
[26]

J. C. de Winter, D. Dodou, and Y. B. Eisma. System 2 thinking in openai’s o1-preview model: Near-perfect performance on a mathematics exam.Computers, 13(11):278, 2024

work page 2024
[27]

P. G. Devine. Stereotypes and prejudice: Their automatic and controlled components.Journal of personality and social psychology, 56(1):5, 1989

work page 1989
[28]

P. G. Devine, P. S. Forscher, A. J. Austin, and W. T. Cox. Long-term reduction in implicit race bias: A prejudice habit-breaking intervention.Journal of experimental social psychology, 48(6):1267–1278, 2012

work page 2012
[29]

J. S. B. Evans. Dual-processing accounts of reasoning, judgment, and social cognition.Annu. Rev. Psychol., 59(1):255–278, 2008

work page 2008
[30]

J. S. B. Evans. Intuition and reasoning: A dual-process perspective.Psychological Inquiry, 21(4):313–326, 2010

work page 2010
[31]

J. S. B. Evans. Dual-process theories of reasoning: Contemporary issues and developmental applications. Developmental review, 31(2-3):86–102, 2011

work page 2011
[32]

J. S. B. Evans and K. E. Stanovich. Dual-process theories of higher cognition: Advancing the debate. Perspectives on psychological science, 8(3):223–241, 2013

work page 2013
[33]

Fodor.The language of thought

J. Fodor.The language of thought. Harvard University Press, 1975

work page 1975
[34]

J. A. Fodor and Z. W. Pylyshyn. Connectionism and cognitive architecture: A critical analysis.Cognition, 28(1-2):3–71, 1988

work page 1988
[35]

Garimella, A

A. Garimella, A. Amarnath, K. Kumar, A. P. Yalla, N. Chhaya, B. V. Srinivasan, et al. He is very intelligent, she is very beautiful? on mitigating social biases in language modelling and generation. In Findings of the association for computational linguistics: ACL-IJCNLP 2021, pages 4534–4545, 2021

work page 2021
[36]

Gawronski and G

B. Gawronski and G. V. Bodenhausen. Associative and propositional processes in evaluation: an integrative review of implicit and explicit attitude change.Psychological bulletin, 132(5):692, 2006

work page 2006
[37]

Gawronski and G

B. Gawronski and G. V. Bodenhausen. The associative–propositional evaluation model: Theory, evidence, and open questions.Advances in experimental social psychology, 44:59–127, 2011. 28/31

work page 2011
[38]

A. G. Greenwald and M. R. Banaji. Implicit social cognition: attitudes, self-esteem, and stereotypes. Psychological review, 102(1):4, 1995

work page 1995
[39]

A. G. Greenwald, D. E. McGhee, and J. L. Schwartz. Measuring individual differences in implicit cognition: the implicit association test.Journal of personality and social psychology, 74(6):1464, 1998

work page 1998
[40]

Hagendorff, S

T. Hagendorff, S. Fabi, and M. Kosinski. Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in chatgpt.Nature Computational Science, 3(10):833–838, 2023

work page 2023
[41]

T. T. Hills and Y. N. Kenett. Is the mind a network? maps, vehicles, and skyhooks in cognitive network science.Topics in Cognitive Science, 14(1):189–208, 2022

work page 2022
[42]

K. A. Hutchison, D. A. Balota, J. H. Neely, M. J. Cortese, E. R. Cohen-Shikora, C.-S. Tse, M. J. Yap, J. J. Bengson, D. Niemeyer, and E. Buchanan. The semantic priming project.Behavior research methods, 45:1099–1114, 2013

work page 2013
[43]

Kahneman.Thinking, fast and slow

D. Kahneman.Thinking, fast and slow. macmillan, 2011

work page 2011
[44]

Chen Gao, Xiaochong Lan, Nian Li, Yuan Yuan, Jingtao Ding, Zhilun Zhou, Fengli Xu, and Yong Li

M. Kamruzzaman and G. L. Kim. Prompting techniques for reducing social bias in llms through system 1 and system 2 cognitive processes.arXiv preprint arXiv:2404.17218, 2024

work page arXiv 2024
[45]

Kaneko and D

M. Kaneko and D. Bollegala. Dictionary-based debiasing of pre-trained word embeddings.arXiv preprint arXiv:2101.09525, 2021

work page arXiv 2021
[46]

Kintsch.Comprehension: A paradigm for cognition

W. Kintsch.Comprehension: A paradigm for cognition. Cambridge university press, 1998

work page 1998
[47]

Kivel¨ a, A

M. Kivel¨ a, A. Arenas, M. Barthelemy, J. P. Gleeson, Y. Moreno, and M. A. Porter. Multilayer networks. Journal of complex networks, 2(3):203–271, 2014

work page 2014
[48]

Kozima and T

H. Kozima and T. Furugori. Similarity between words computed by spreading activation on an english dictionary.arXiv preprint cmp-lg/9601004, 1996

work page arXiv 1996
[49]

Kumar, H

R. Kumar, H. Kumar, and K. Shalini. Detecting and mitigating bias in llms through knowledge graph- augmented training. In2025 International Conference on Artificial Intelligence and Data Engineering (AIDE), pages 608–613. IEEE, 2025

work page 2025
[50]

Z.-Z. Li, D. Zhang, M.-L. Zhang, J. Zhang, Z. Liu, Y. Yao, H. Xu, J. Zheng, P.-J. Wang, X. Chen, et al. From system 1 to system 2: A survey of reasoning large language models.arXiv preprint arXiv:2502.17419, 2025

work page internal anchor Pith review arXiv 2025
[51]

C. Ma, T. Zhao, and M. Okumura. Debiasing large language models with structured knowledge. InFindings of the Association for Computational Linguistics: ACL 2024, pages 10274–10287, 2024

work page 2024
[52]

McRae, G

K. McRae, G. S. Cree, M. S. Seidenberg, and C. McNorgan. Semantic feature production norms for a large set of living and nonliving things.Behavior research methods, 37(4):547–559, 2005

work page 2005
[53]

G. A. Miller. Wordnet: a lexical database for english.Communications of the ACM, 38(11):39–41, 1995

work page 1995
[54]

M. J. Monteith. Self-regulation of prejudiced responses: Implications for progress in prejudice-reduction efforts.Journal of personality and social psychology, 65(3):469, 1993

work page 1993
[55]

M. J. Monteith, C. I. Voils, and L. Ashburn-Nardo. Taking a look underground: Detecting, interpreting, and reacting to implicit racial biases.Social Cognition, 19(4):395–417, 2001

work page 2001
[56]

Moors and J

A. Moors and J. De Houwer. Problems with dividing the realm of processes.Psychological Inquiry, 17(3):199–204, 2006

work page 2006
[57]

Mruthyunjaya, P

V. Mruthyunjaya, P. Pezeshkpour, E. Hruschka, and N. Bhutani. Rethinking language models as symbolic knowledge graphs.arXiv preprint arXiv:2308.13676, 2023. 29/31

work page arXiv 2023
[58]

B. A. Nosek, F. L. Smyth, J. J. Hansen, T. Devos, N. M. Lindner, K. A. Ranganath, C. T. Smith, K. R. Olson, D. Chugh, A. G. Greenwald, et al. Pervasiveness and correlates of implicit attitudes and stereotypes. European review of social psychology, 18(1):36–88, 2007

work page 2007
[59]

E. Pavlick. Symbols and grounding in large language models.Philosophical Transactions of the Royal Society A, 381(2251):20220041, 2023

work page 2023
[60]

B. K. Payne. Conceptualizing control in social cognition: how executive functioning modulates the expression of automatic stereotyping.Journal of personality and social psychology, 89(4):488, 2005

work page 2005
[61]

Perugini

M. Perugini. Predictive models of implicit and explicit attitudes.British Journal of Social Psychology, 44(1):29–45, 2005

work page 2005
[62]

L. M. Reder and J. R. Anderson. A partial resolution of the paradox of interference: The role of integrating knowledge.Cognitive Psychology, 12(4):447–472, 1980

work page 1980
[63]

D. E. Rumelhart, J. L. McClelland, P. R. Group, et al.Parallel distributed processing, volume 1: Explorations in the microstructure of cognition: Foundations. The MIT press, 1986

work page 1986
[64]

C. S. Siew. spreadr: An r package to simulate spreading activation in a network.Behavior Research Methods, 51(2):910–929, 2019

work page 2019
[65]

C. S. Siew, D. U. Wulff, N. M. Beckage, and Y. N. Kenett. Cognitive network science: A review of research on cognition through the lens of network representations, processes, and dynamics.Complexity, 2019, 2019

work page 2019
[66]

H. A. Simon and A. Newell. Human problem solving: The state of the theory in 1970.American psychologist, 26(2):145, 1971

work page 1970
[67]

S. A. Sloman. The empirical case for two systems of reasoning.Psychological bulletin, 119(1):3, 1996

work page 1996
[68]

E. E. Smith, E. J. Shoben, and L. J. Rips. Structure and process in semantic memory: A featural model for semantic decisions.Psychological review, 81(3):214, 1974

work page 1974
[69]

E. R. Smith and J. DeCoster. Dual-process models in social and cognitive psychology: Conceptual integration and links to underlying memory systems.Personality and social psychology review, 4(2):108–131, 2000

work page 2000
[70]

Smolensky

P. Smolensky. On the proper treatment of connectionism.Behavioral and brain sciences, 11(1):1–23, 1988

work page 1988
[71]

J. F. Sowa. Generating language from conceptual graphs. InComputational Linguistics, pages 29–43. Elsevier, 1983

work page 1983
[72]

K. E. Stanovich.Who is rational?: Studies of individual differences in reasoning. Psychology Press, 1999

work page 1999
[73]

Stella, S

M. Stella, S. Citraro, G. Rossetti, D. Marinazzo, Y. N. Kenett, and M. S. Vitevitch. Cognitive modelling of concepts in the mental lexicon with multilayer networks: Insights, advancements, and future challenges. Psychonomic Bulletin & Review, 31(5):1981–2004, 2024

work page 1981
[74]

Steyvers and J

M. Steyvers and J. B. Tenenbaum. The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth.Cognitive science, 29(1):41–78, 2005

work page 2005
[75]

Strack and R

F. Strack and R. Deutsch. Reflective and impulsive determinants of social behavior.Personality and social psychology review, 8(3):220–247, 2004

work page 2004
[76]

Vincent-Lamarre, A

P. Vincent-Lamarre, A. B. Mass´ e, M. Lopes, M. Lord, O. Marcotte, and S. Harnad. The latent structure of dictionaries.Topics in cognitive science, 8(3):625–659, 2016

work page 2016
[77]

J. Wei, Y. Tay, R. Bommasani, C. Raffel, B. Zoph, S. Borgeaud, D. Yogatama, M. Bosma, D. Zhou, D. Metzler, et al. Emergent abilities of large language models.arXiv preprint arXiv:2206.07682, 2022. 30/31

work page internal anchor Pith review arXiv 2022
[78]

J. Wei, X. Wang, D. Schuurmans, M. Bosma, F. Xia, E. Chi, Q. V. Le, D. Zhou, et al. Chain-of-thought prompting elicits reasoning in large language models.Advances in Neural Information Processing Systems, 35:24824–24837, 2022

work page 2022
[79]

W. A. Woods. What’s in a link: Foundations for semantic networks. InRepresentation and understanding, pages 35–82. Elsevier, 1975

work page 1975
[80]

Towards system 2 reasoning in llms: Learning how to think with meta chain-of-thought, 2025

V. Xiang, C. Snell, K. Gandhi, A. Albalak, A. Singh, C. Blagden, D. Phung, R. Rafailov, N. Lile, D. Mahan, et al. Towards system 2 reasoning in llms: Learning how to think with meta chain-of-thought.arXiv preprint arXiv:2501.04682, 2025

work page arXiv 2025

Showing first 80 references.

[1] [1]

Abramski, R

K. Abramski, R. Improta, G. Rossetti, and M. Stella. The ”llm world of words” english free association norms generated by large language models.Scientific data, 12(1):803, 2025

work page 2025

[2] [2]

Abramski, G

K. Abramski, G. Rossetti, and M. Stella. A word association network methodology for evaluating implicit biases in llms compared to humans.arXiv preprint arXiv:2510.24488, 2025

work page arXiv 2025

[3] [3]

Acerbi and J

A. Acerbi and J. M. Stubbersfield. Large language models show human-like content biases in transmission chain experiments.Proceedings of the National Academy of Sciences, 120(44):e2313790120, 2023

work page 2023

[4] [4]

Agrawal, T

G. Agrawal, T. Kumarage, Z. Alghamdi, and H. Liu. Can knowledge graphs reduce hallucinations in llms?: A survey. InProceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 3947–3960, 2024

work page 2024

[5] [5]

H. An, X. Liu, and D. Zhang. Learning bias-reduced word embeddings using dictionary definitions. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1139–1152, 2022

work page 2022

[6] [6]

J. R. Anderson. A spreading activation theory of memory.Journal of verbal learning and verbal behavior, 22(3):261–295, 1983

work page 1983

[7] [7]

J. R. Anderson, D. Bothell, C. Lebiere, and M. Matessa. An integrated theory of list memory.Journal of Memory and Language, 38(4):341–380, 1998

work page 1998

[8] [8]

J. R. Anderson, M. Matessa, and C. Lebiere. Act-r: A theory of higher level cognition and its relation to visual attention.Human–Computer Interaction, 12(4):439–462, 1997

work page 1997

[9] [9]

X. Bai, A. Wang, I. Sucholutsky, and T. L. Griffiths. Explicitly unbiased large language models still form biased associations.Proceedings of the National Academy of Sciences, 122(8):e2416228122, 2025

work page 2025

[10] [10]

J. A. Bargh and T. L. Chartrand. The unbearable automaticity of being.American psychologist, 54(7):462, 1999

work page 1999

[11] [11]

A. Barr, E. A. Feigenbaum, and P. R. Cohen.The handbook of artificial intelligence, volume 1. HeurisTech Press, 1981

work page 1981

[12] [12]

E. M. Bender, T. Gebru, A. McMillan-Major, and S. Shmitchell. On the dangers of stochastic parrots: Can language models be too big? InProceedings of the 2021 ACM conference on fairness, accountability, and transparency, pages 610–623, 2021

work page 2021

[13] [13]

Boccaletti, G

S. Boccaletti, G. Bianconi, R. Criado, C. I. Del Genio, J. G´ omez-Gardenes, M. Romance, I. Sendina-Nadal, Z. Wang, and M. Zanin. The structure and dynamics of multilayer networks.Physics reports, 544(1):1–122, 2014

work page 2014

[14] [14]

R. J. Brachman. On the epistemological status of semantic networks. InAssociative networks, pages 3–50. Elsevier, 1979

work page 1979

[15] [15]

Brady, P

O. Brady, P. Nulty, L. Zhang, T. E. Ward, and D. P. McGovern. Dual-process theory and decision-making in large language models.Nature Reviews Psychology, pages 1–16, 2025

work page 2025

[16] [16]

Bursell and F

M. Bursell and F. Olsson. Do we need dual-process theory to understand implicit bias? a study of the nature of implicit bias against muslims.Poetics, 87:101549, 2021

work page 2021

[17] [17]

Castro and C

N. Castro and C. S. Siew. Contributions of modern network science to the cognitive sciences: Revisiting research spirals of representation and process.Proceedings of the Royal Society A, 476(2238):20190825, 2020. 27/31

work page 2020

[18] [18]

Citraro, M

S. Citraro, M. S. Vitevitch, M. Stella, and G. Rossetti. Feature-rich multiplex lexical networks reveal mental strategies of early language learning.Scientific Reports, 13(1):1474, 2023

work page 2023

[19] [19]

A. M. Collins and E. F. Loftus. A spreading-activation theory of semantic processing.Psychological review, 82(6):407, 1975

work page 1975

[20] [20]

A. M. Collins and M. R. Quillian. Retrieval time from semantic memory.Journal of verbal learning and verbal behavior, 8(2):240–247, 1969

work page 1969

[21] [21]

De Deyne, D

S. De Deyne, D. J. Navarro, A. Perfors, M. Brysbaert, and G. Storms. The ”small world of words” english word association norms for over 12,000 cue words.Behavior research methods, 51(3):987–1006, 2019

work page 2019

[22] [22]

De Domenico, V

M. De Domenico, V. Nicosia, A. Arenas, and V. Latora. Structural reducibility of multilayer networks. Nature communications, 6(1):6864, 2015

work page 2015

[23] [23]

De Domenico, A

M. De Domenico, A. Sol´ e-Ribalta, E. Cozzo, M. Kivel¨ a, Y. Moreno, M. A. Porter, S. G´ omez, and A. Arenas. Mathematical formulation of multilayer networks.Physical Review X, 3(4):041022, 2013

work page 2013

[24] [24]

E. S. De Duro, E. Franchino, R. Improta, G. A. Veltri, and M. Stella. Cognitive networks identify ai biases on societal issues in large language models.EPJ Data Science, 15(1):7, 2026

work page 2026

[25] [25]

De Houwer, P

J. De Houwer, P. Van Dessel, and T. Moran. Attitudes as propositional representations.Trends in Cognitive Sciences, 25(10):870–882, 2021

work page 2021

[26] [26]

J. C. de Winter, D. Dodou, and Y. B. Eisma. System 2 thinking in openai’s o1-preview model: Near-perfect performance on a mathematics exam.Computers, 13(11):278, 2024

work page 2024

[27] [27]

P. G. Devine. Stereotypes and prejudice: Their automatic and controlled components.Journal of personality and social psychology, 56(1):5, 1989

work page 1989

[28] [28]

P. G. Devine, P. S. Forscher, A. J. Austin, and W. T. Cox. Long-term reduction in implicit race bias: A prejudice habit-breaking intervention.Journal of experimental social psychology, 48(6):1267–1278, 2012

work page 2012

[29] [29]

J. S. B. Evans. Dual-processing accounts of reasoning, judgment, and social cognition.Annu. Rev. Psychol., 59(1):255–278, 2008

work page 2008

[30] [30]

J. S. B. Evans. Intuition and reasoning: A dual-process perspective.Psychological Inquiry, 21(4):313–326, 2010

work page 2010

[31] [31]

J. S. B. Evans. Dual-process theories of reasoning: Contemporary issues and developmental applications. Developmental review, 31(2-3):86–102, 2011

work page 2011

[32] [32]

J. S. B. Evans and K. E. Stanovich. Dual-process theories of higher cognition: Advancing the debate. Perspectives on psychological science, 8(3):223–241, 2013

work page 2013

[33] [33]

Fodor.The language of thought

J. Fodor.The language of thought. Harvard University Press, 1975

work page 1975

[34] [34]

J. A. Fodor and Z. W. Pylyshyn. Connectionism and cognitive architecture: A critical analysis.Cognition, 28(1-2):3–71, 1988

work page 1988

[35] [35]

Garimella, A

A. Garimella, A. Amarnath, K. Kumar, A. P. Yalla, N. Chhaya, B. V. Srinivasan, et al. He is very intelligent, she is very beautiful? on mitigating social biases in language modelling and generation. In Findings of the association for computational linguistics: ACL-IJCNLP 2021, pages 4534–4545, 2021

work page 2021

[36] [36]

Gawronski and G

B. Gawronski and G. V. Bodenhausen. Associative and propositional processes in evaluation: an integrative review of implicit and explicit attitude change.Psychological bulletin, 132(5):692, 2006

work page 2006

[37] [37]

Gawronski and G

B. Gawronski and G. V. Bodenhausen. The associative–propositional evaluation model: Theory, evidence, and open questions.Advances in experimental social psychology, 44:59–127, 2011. 28/31

work page 2011

[38] [38]

A. G. Greenwald and M. R. Banaji. Implicit social cognition: attitudes, self-esteem, and stereotypes. Psychological review, 102(1):4, 1995

work page 1995

[39] [39]

A. G. Greenwald, D. E. McGhee, and J. L. Schwartz. Measuring individual differences in implicit cognition: the implicit association test.Journal of personality and social psychology, 74(6):1464, 1998

work page 1998

[40] [40]

Hagendorff, S

T. Hagendorff, S. Fabi, and M. Kosinski. Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in chatgpt.Nature Computational Science, 3(10):833–838, 2023

work page 2023

[41] [41]

T. T. Hills and Y. N. Kenett. Is the mind a network? maps, vehicles, and skyhooks in cognitive network science.Topics in Cognitive Science, 14(1):189–208, 2022

work page 2022

[42] [42]

K. A. Hutchison, D. A. Balota, J. H. Neely, M. J. Cortese, E. R. Cohen-Shikora, C.-S. Tse, M. J. Yap, J. J. Bengson, D. Niemeyer, and E. Buchanan. The semantic priming project.Behavior research methods, 45:1099–1114, 2013

work page 2013

[43] [43]

Kahneman.Thinking, fast and slow

D. Kahneman.Thinking, fast and slow. macmillan, 2011

work page 2011

[44] [44]

Chen Gao, Xiaochong Lan, Nian Li, Yuan Yuan, Jingtao Ding, Zhilun Zhou, Fengli Xu, and Yong Li

M. Kamruzzaman and G. L. Kim. Prompting techniques for reducing social bias in llms through system 1 and system 2 cognitive processes.arXiv preprint arXiv:2404.17218, 2024

work page arXiv 2024

[45] [45]

Kaneko and D

M. Kaneko and D. Bollegala. Dictionary-based debiasing of pre-trained word embeddings.arXiv preprint arXiv:2101.09525, 2021

work page arXiv 2021

[46] [46]

Kintsch.Comprehension: A paradigm for cognition

W. Kintsch.Comprehension: A paradigm for cognition. Cambridge university press, 1998

work page 1998

[47] [47]

Kivel¨ a, A

M. Kivel¨ a, A. Arenas, M. Barthelemy, J. P. Gleeson, Y. Moreno, and M. A. Porter. Multilayer networks. Journal of complex networks, 2(3):203–271, 2014

work page 2014

[48] [48]

Kozima and T

H. Kozima and T. Furugori. Similarity between words computed by spreading activation on an english dictionary.arXiv preprint cmp-lg/9601004, 1996

work page arXiv 1996

[49] [49]

Kumar, H

R. Kumar, H. Kumar, and K. Shalini. Detecting and mitigating bias in llms through knowledge graph- augmented training. In2025 International Conference on Artificial Intelligence and Data Engineering (AIDE), pages 608–613. IEEE, 2025

work page 2025

[50] [50]

Z.-Z. Li, D. Zhang, M.-L. Zhang, J. Zhang, Z. Liu, Y. Yao, H. Xu, J. Zheng, P.-J. Wang, X. Chen, et al. From system 1 to system 2: A survey of reasoning large language models.arXiv preprint arXiv:2502.17419, 2025

work page internal anchor Pith review arXiv 2025

[51] [51]

C. Ma, T. Zhao, and M. Okumura. Debiasing large language models with structured knowledge. InFindings of the Association for Computational Linguistics: ACL 2024, pages 10274–10287, 2024

work page 2024

[52] [52]

McRae, G

K. McRae, G. S. Cree, M. S. Seidenberg, and C. McNorgan. Semantic feature production norms for a large set of living and nonliving things.Behavior research methods, 37(4):547–559, 2005

work page 2005

[53] [53]

G. A. Miller. Wordnet: a lexical database for english.Communications of the ACM, 38(11):39–41, 1995

work page 1995

[54] [54]

M. J. Monteith. Self-regulation of prejudiced responses: Implications for progress in prejudice-reduction efforts.Journal of personality and social psychology, 65(3):469, 1993

work page 1993

[55] [55]

M. J. Monteith, C. I. Voils, and L. Ashburn-Nardo. Taking a look underground: Detecting, interpreting, and reacting to implicit racial biases.Social Cognition, 19(4):395–417, 2001

work page 2001

[56] [56]

Moors and J

A. Moors and J. De Houwer. Problems with dividing the realm of processes.Psychological Inquiry, 17(3):199–204, 2006

work page 2006

[57] [57]

Mruthyunjaya, P

V. Mruthyunjaya, P. Pezeshkpour, E. Hruschka, and N. Bhutani. Rethinking language models as symbolic knowledge graphs.arXiv preprint arXiv:2308.13676, 2023. 29/31

work page arXiv 2023

[58] [58]

B. A. Nosek, F. L. Smyth, J. J. Hansen, T. Devos, N. M. Lindner, K. A. Ranganath, C. T. Smith, K. R. Olson, D. Chugh, A. G. Greenwald, et al. Pervasiveness and correlates of implicit attitudes and stereotypes. European review of social psychology, 18(1):36–88, 2007

work page 2007

[59] [59]

E. Pavlick. Symbols and grounding in large language models.Philosophical Transactions of the Royal Society A, 381(2251):20220041, 2023

work page 2023

[60] [60]

B. K. Payne. Conceptualizing control in social cognition: how executive functioning modulates the expression of automatic stereotyping.Journal of personality and social psychology, 89(4):488, 2005

work page 2005

[61] [61]

Perugini

M. Perugini. Predictive models of implicit and explicit attitudes.British Journal of Social Psychology, 44(1):29–45, 2005

work page 2005

[62] [62]

L. M. Reder and J. R. Anderson. A partial resolution of the paradox of interference: The role of integrating knowledge.Cognitive Psychology, 12(4):447–472, 1980

work page 1980

[63] [63]

D. E. Rumelhart, J. L. McClelland, P. R. Group, et al.Parallel distributed processing, volume 1: Explorations in the microstructure of cognition: Foundations. The MIT press, 1986

work page 1986

[64] [64]

C. S. Siew. spreadr: An r package to simulate spreading activation in a network.Behavior Research Methods, 51(2):910–929, 2019

work page 2019

[65] [65]

C. S. Siew, D. U. Wulff, N. M. Beckage, and Y. N. Kenett. Cognitive network science: A review of research on cognition through the lens of network representations, processes, and dynamics.Complexity, 2019, 2019

work page 2019

[66] [66]

H. A. Simon and A. Newell. Human problem solving: The state of the theory in 1970.American psychologist, 26(2):145, 1971

work page 1970

[67] [67]

S. A. Sloman. The empirical case for two systems of reasoning.Psychological bulletin, 119(1):3, 1996

work page 1996

[68] [68]

E. E. Smith, E. J. Shoben, and L. J. Rips. Structure and process in semantic memory: A featural model for semantic decisions.Psychological review, 81(3):214, 1974

work page 1974

[69] [69]

E. R. Smith and J. DeCoster. Dual-process models in social and cognitive psychology: Conceptual integration and links to underlying memory systems.Personality and social psychology review, 4(2):108–131, 2000

work page 2000

[70] [70]

Smolensky

P. Smolensky. On the proper treatment of connectionism.Behavioral and brain sciences, 11(1):1–23, 1988

work page 1988

[71] [71]

J. F. Sowa. Generating language from conceptual graphs. InComputational Linguistics, pages 29–43. Elsevier, 1983

work page 1983

[72] [72]

K. E. Stanovich.Who is rational?: Studies of individual differences in reasoning. Psychology Press, 1999

work page 1999

[73] [73]

Stella, S

M. Stella, S. Citraro, G. Rossetti, D. Marinazzo, Y. N. Kenett, and M. S. Vitevitch. Cognitive modelling of concepts in the mental lexicon with multilayer networks: Insights, advancements, and future challenges. Psychonomic Bulletin & Review, 31(5):1981–2004, 2024

work page 1981

[74] [74]

Steyvers and J

M. Steyvers and J. B. Tenenbaum. The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth.Cognitive science, 29(1):41–78, 2005

work page 2005

[75] [75]

Strack and R

F. Strack and R. Deutsch. Reflective and impulsive determinants of social behavior.Personality and social psychology review, 8(3):220–247, 2004

work page 2004

[76] [76]

Vincent-Lamarre, A

P. Vincent-Lamarre, A. B. Mass´ e, M. Lopes, M. Lord, O. Marcotte, and S. Harnad. The latent structure of dictionaries.Topics in cognitive science, 8(3):625–659, 2016

work page 2016

[77] [77]

J. Wei, Y. Tay, R. Bommasani, C. Raffel, B. Zoph, S. Borgeaud, D. Yogatama, M. Bosma, D. Zhou, D. Metzler, et al. Emergent abilities of large language models.arXiv preprint arXiv:2206.07682, 2022. 30/31

work page internal anchor Pith review arXiv 2022

[78] [78]

J. Wei, X. Wang, D. Schuurmans, M. Bosma, F. Xia, E. Chi, Q. V. Le, D. Zhou, et al. Chain-of-thought prompting elicits reasoning in large language models.Advances in Neural Information Processing Systems, 35:24824–24837, 2022

work page 2022

[79] [79]

W. A. Woods. What’s in a link: Foundations for semantic networks. InRepresentation and understanding, pages 35–82. Elsevier, 1975

work page 1975

[80] [80]

Towards system 2 reasoning in llms: Learning how to think with meta chain-of-thought, 2025

V. Xiang, C. Snell, K. Gandhi, A. Albalak, A. Singh, C. Blagden, D. Phung, R. Rafailov, N. Lile, D. Mahan, et al. Towards system 2 reasoning in llms: Learning how to think with meta chain-of-thought.arXiv preprint arXiv:2501.04682, 2025

work page arXiv 2025