Intellectual Humility as a Cognitive Filter for AI-Generated Health Misinformation. An Evolutionary Perspective on Epistemic Vigilance

Kacper Zacharski; Maciej Wodzi\'nski; Marcin Moskalewicz; Marcin Rz\k{a}deczka

arxiv: 2606.03377 · v2 · pith:V2YF2YKInew · submitted 2026-06-02 · 💻 cs.HC · cs.CY

Intellectual Humility as a Cognitive Filter for AI-Generated Health Misinformation. An Evolutionary Perspective on Epistemic Vigilance

Marcin Rz\k{a}deczka , Maciej Wodzi\'nski , Kacper Zacharski , Marcin Moskalewicz This is my paper

Pith reviewed 2026-06-28 08:41 UTC · model grok-4.3

classification 💻 cs.HC cs.CY

keywords intellectual humilityAI-generated contenthealth misinformationepistemic vigilancecognitive filtercredibility assessmentpseudoscienceevolutionary psychology

0 comments

The pith

Intellectual humility lowers credibility ratings for AI-generated pseudoscientific health dialogues but shows no link to ratings of accurate ones.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper reports an experiment in which 99 participants read one of three AI-generated dialogues on exercise and mental health that differed in scientific accuracy. Participants who scored higher on a measure of intellectual humility assigned lower credibility to the moderately and strongly pseudoscientific versions, yet their humility scores showed no relation to how credible they found the accurate version. The same humility scores also failed to predict whether participants correctly identified the dialogues as AI-generated. The authors interpret the pattern as evidence that intellectual humility functions as a selective filter tuned to information quality rather than to source detection, and they link this filter to an evolved capacity for epistemic vigilance in uncertain environments.

Core claim

Intellectual humility functions as a selective cognitive filter. Individuals with higher humility scores rated pseudoscientific content as significantly less credible, while showing no correlation with credibility assessments of accurate content. Humility did not predict the ability to identify AI as the source of the dialogues, indicating that epistemic vigilance operates on content quality rather than source attribution. The authors propose that this filter represents an ancestral adaptation that remains effective at detecting exploitation attempts even in AI-generated material.

What carries the argument

Intellectual humility as a selective cognitive filter that reduces perceived credibility of low-quality health information while leaving judgments of accurate information unaffected.

If this is right

Higher intellectual humility protects against accepting AI-generated health misinformation without affecting acceptance of accurate AI content.
The filter works by evaluating content features rather than by detecting the artificial source.
An evolved capacity for epistemic vigilance can still operate on modern AI outputs even though humans never evolved mechanisms to detect AI specifically.
Foundation models used in health communication can be countered at the receiver end by the same humility mechanism that handles ordinary misinformation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Programs that raise intellectual humility might reduce the impact of AI health misinformation across broader populations.
The same selective filter could be tested on AI content in non-health domains such as financial advice or political claims.
If the classification of dialogues drives the result, then studies that vary only surface features while holding scientific accuracy constant would weaken the observed pattern.

Load-bearing premise

The three dialogues were correctly classified by the researchers as scientifically accurate, moderately pseudoscientific, or strongly pseudoscientific, and any credibility differences arise from that classification rather than from other dialogue features.

What would settle it

A new sample of participants rates the same three dialogues after the accuracy labels have been swapped or withheld, and the correlation between humility scores and credibility ratings disappears or reverses.

Figures

Figures reproduced from arXiv: 2606.03377 by Kacper Zacharski, Maciej Wodzi\'nski, Marcin Moskalewicz, Marcin Rz\k{a}deczka.

**Figure 2.** Figure 2: Source Attribution by Dialogue Type [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

read the original abstract

We present experimental findings from a study (N=99) examining how intellectual humility (IH), i.e., the metacognitive awareness of epistemic limitations, affects the evaluation of AI-generated health dialogues varying in scientific rigor. Participants were randomly assigned to evaluate one of three dialogues about exercise and mental health: scientifically accurate, moderately pseudoscientific, or strongly pseudoscientific. Results reveal that IH functions as a selective cognitive filter. Individuals with higher humility scores rated pseudoscientific content as significantly less credible, while showing no correlation with credibility assessments of accurate content. Crucially, humility did not predict the ability to identify AI as the source of dialogues, suggesting that epistemic vigilance operates on content quality rather than source attribution. We interpret these findings through an evolutionary lens, proposing that IH represents an ancestral adaptation for navigating informationally uncertain environments. It remains effective at detecting exploitation attempts in AI-generated content, despite humans lacking evolved mechanisms for detecting AI sources. The study contributes to understanding how foundation models might improve or undermine human epistemic defenses, especially in health communication contexts.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper finds higher intellectual humility correlates with lower credibility ratings for AI-generated pseudoscientific health dialogues but not accurate ones, with the main weakness being unvalidated dialogue classification.

read the letter

The central finding is that participants with higher intellectual humility scores gave lower credibility ratings to the moderately and strongly pseudoscientific AI dialogues on exercise and mental health, while showing no correlation for the accurate dialogue. They also showed no link between humility and detecting the AI source. This selective filter effect is the main empirical claim.

The work applies an existing psychological construct to a new stimulus type—AI-generated health content varying in rigor—and runs a simple three-condition experiment with N=99. The selective pattern is a reasonable extension of prior IH research and the evolutionary framing stays interpretive rather than central.

The weakest part is the dialogue classification. The abstract and stress-test note give no information on how the three versions were assigned to accurate, moderate, or strong pseudoscience categories—no rubric, expert review, inter-rater reliability, or content analysis confirming differences in false claims or evidence strength. Without that, the selective effect could reflect uncontrolled features like length, tone, or specificity instead of scientific accuracy. The abstract also omits effect sizes, exact statistics, demographics, and exclusion criteria, which limits evaluation of the result's robustness.

This is aimed at researchers in cognitive psychology and HCI who study misinformation resistance or epistemic traits. A reader looking for empirical tests of how cognitive filters interact with AI content would find a clear, testable claim worth examining.

I would send it to peer review so the classification procedure and stats can be checked directly; the core idea is straightforward enough to merit that step.

Referee Report

2 major / 1 minor

Summary. The manuscript reports results from an experiment (N=99) in which participants were randomly assigned to rate the credibility of one of three AI-generated dialogues on exercise and mental health, pre-classified by the authors as scientifically accurate, moderately pseudoscientific, or strongly pseudoscientific. The central empirical claim is that intellectual humility (IH) scores correlate negatively with credibility ratings only for the pseudoscientific dialogues, show no correlation for the accurate dialogue, and do not predict participants' ability to identify the dialogues as AI-generated. The authors interpret these patterns as evidence that IH functions as a selective cognitive filter for epistemic vigilance that remains effective against AI-generated health content, framed through an evolutionary lens.

Significance. If the reported selective correlations hold after proper validation of the stimulus materials and full reporting of statistical analyses, the study would offer evidence that trait-level intellectual humility can help individuals discount low-quality AI-generated health information without impairing evaluation of accurate content. This would contribute to HCI and health-communication literatures on human-AI epistemic interactions and could inform design of AI systems that support rather than undermine user vigilance. The evolutionary interpretation remains speculative and is not directly tested by the data.

major comments (2)

[Abstract / Methods (stimulus construction)] The assignment of the three dialogues to the accurate, moderately pseudoscientific, and strongly pseudoscientific conditions is load-bearing for the selective-filter claim, yet the manuscript provides no description of the classification procedure, rubric, expert review, inter-rater agreement, or quantitative content analysis confirming differences in false claims or evidential quality (see Abstract and any Methods section on stimulus construction).
[Abstract / Results] The abstract asserts that higher IH scores produced significantly lower credibility ratings for pseudoscientific content with no correlation for accurate content, but supplies no statistical details, effect sizes, confidence intervals, demographic information, exclusion criteria, or checks for confounds such as dialogue length or emotional valence, preventing evaluation of whether the data support the central claim.

minor comments (1)

[Discussion] The evolutionary framing in the Discussion would benefit from explicit separation between the empirical findings and the interpretive claims, as the data do not test ancestral adaptations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for these constructive comments on stimulus validation and statistical transparency. Both points identify areas where the manuscript can be strengthened for clarity and replicability. We address each below and commit to revisions that provide the requested details without altering the core findings or interpretations.

read point-by-point responses

Referee: [Abstract / Methods (stimulus construction)] The assignment of the three dialogues to the accurate, moderately pseudoscientific, and strongly pseudoscientific conditions is load-bearing for the selective-filter claim, yet the manuscript provides no description of the classification procedure, rubric, expert review, inter-rater agreement, or quantitative content analysis confirming differences in false claims or evidential quality (see Abstract and any Methods section on stimulus construction).

Authors: We agree that explicit documentation of the stimulus classification is essential for evaluating the selective-filter claim. The full manuscript describes the dialogues but does not include the full rubric, expert review process, or inter-rater statistics. In revision we will add a dedicated subsection in Methods detailing the classification procedure, the rubric criteria for scientific accuracy versus pseudoscience, any expert consultation, inter-rater agreement metrics, and quantitative indicators of evidential quality differences across conditions. revision: yes
Referee: [Abstract / Results] The abstract asserts that higher IH scores produced significantly lower credibility ratings for pseudoscientific content with no correlation for accurate content, but supplies no statistical details, effect sizes, confidence intervals, demographic information, exclusion criteria, or checks for confounds such as dialogue length or emotional valence, preventing evaluation of whether the data support the central claim.

Authors: We accept that the abstract and results reporting require greater statistical detail for independent evaluation. The manuscript reports the key correlations but omits effect sizes, CIs, and explicit confound checks in the abstract. We will revise the abstract to include the relevant statistics (r values, p values, effect sizes, CIs) and will expand the Methods and Results sections to report participant demographics, exclusion criteria, and any post-hoc checks for dialogue length, emotional valence, or other potential confounds. revision: yes

Circularity Check

0 steps flagged

No significant circularity; purely empirical study

full rationale

The paper reports an empirical experiment (N=99) measuring participant ratings of AI-generated dialogues on credibility and AI source detection, correlated with intellectual humility scores. No equations, fitted parameters, predictions, or derivations appear in the provided text or abstract. The central claim rests on observed selective correlations between IH and credibility ratings for pseudoscientific vs. accurate content, which are independent measurements from participants rather than quantities defined by the paper's own inputs. No self-citations, ansatzes, or uniqueness theorems are invoked as load-bearing steps. The dialogue classification is a methodological premise but does not reduce any result to a self-defined input by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard assumptions of psychological measurement and experimental design rather than new mathematical axioms or postulated entities.

axioms (2)

domain assumption The intellectual humility scale used produces valid and reliable scores that capture the intended metacognitive trait
The study treats IH scores as the independent variable without reporting validation data for the specific population or context.
domain assumption Expert or researcher classification correctly distinguishes the three dialogues by level of scientific rigor
Condition assignment and interpretation of credibility differences depend on this categorization being accurate.

pith-pipeline@v0.9.1-grok · 5741 in / 1431 out tokens · 29986 ms · 2026-06-28T08:41:38.441711+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 12 canonical work pages

[1]

Current Issues in Personality Psychology , author=

Intellectual humility: an old problem in a new psychological perspective , volume=. Current Issues in Personality Psychology , author=. 2022 , pages=

2022
[2]

All That ' s `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text

Clark, Elizabeth and August, Tal and Serrano, Sofia and Haduong, Nikita and Gururangan, Suchin and Smith, Noah A. All That ' s `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing...

work page doi:10.18653/v1/2021.acl-long.565 2021
[3]

and Buss, David M

Haselton, Martie G. and Buss, David M. , title =. Journal of Personality and Social Psychology , volume =. 2000 , ISSN =. doi:10.1037/0022-3514.78.1.81 , number =

work page doi:10.1037/0022-3514.78.1.81 2000
[4]

and Naaman, Mor , year =

Jakesch, Maurice and Hancock, Jeffrey T. and Naaman, Mor , year =. Human heuristics for AI-generated language are flawed , volume =. Proceedings of the National Academy of Sciences , publisher =. doi:10.1073/pnas.2208839120 , number =

work page doi:10.1073/pnas.2208839120
[5]

Artificial intelligence versus Maya Angelou: Experimental evidence that people cannot differentiate AI-generated from human-written poetry , volume =

K\". Artificial intelligence versus Maya Angelou: Experimental evidence that people cannot differentiate AI-generated from human-written poetry , volume =. 2021 , month = Jan, pages =. doi:10.1016/j.chb.2020.106553 , journal =

work page doi:10.1016/j.chb.2020.106553 2021
[6]

Social Psychological and Personality Science , publisher =

Koetke, Jonah and Schumann, Karina and Porter, Tenelle , title =. Social Psychological and Personality Science , publisher =. 2021 , month = Jan, pages =. doi:10.1177/1948550620988242 , number =

work page doi:10.1177/1948550620988242 2021
[7]

and Rouse, Steven V

Krumrei-Mancuso, Elizabeth J. and Rouse, Steven V. , year =. The Development and Validation of the Comprehensive Intellectual Humility Scale , volume =. Journal of Personality Assessment , publisher =. doi:10.1080/00223891.2015.1068174 , number =

work page doi:10.1080/00223891.2015.1068174 2015
[8]

and Diebels, Kate J

Leary, Mark R. and Diebels, Kate J. and Davisson, Erin K. and Jongman-Sereno, Katrina P. and Isherwood, Jennifer C. and Raimi, Kaitlin T. and Deffler, Samantha A. and Hoyle, Rick H. , year =. Cognitive and Interpersonal Features of Intellectual Humility , volume =. Personality and Social Psychology Bulletin , publisher =. doi:10.1177/0146167217697695 , number =

work page doi:10.1177/0146167217697695
[9]

doi:10.4159/9780674977860 , publisher =

Hugo Mercier and Dan Sperber , title =. doi:10.4159/9780674977860 , publisher =

work page doi:10.4159/9780674977860
[10]

AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably , volume =

Porter, Brian and Machery, Edouard , year =. AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably , volume =. Scientific Reports , publisher =. doi:10.1038/s41598-024-76900-1 , number =

work page doi:10.1038/s41598-024-76900-1
[11]

Epistemic Vigilance , volume =

Sperber, Dan and Clement, Fabrice and Heintz, Christophe and Mascaro, Olivier and Mercier, Hugo and Origgi, Gloria and Wilson, Deirdre , year =. Epistemic Vigilance , volume =. Mind `I&' Language , publisher =. doi:10.1111/j.1468-0017.2010.01394.x , number =

work page doi:10.1111/j.1468-0017.2010.01394.x 2010
[12]

Public Health and Online Misinformation: Challenges and Recommendations , volume =

Swire-Thompson, Briony and Lazer, David , year =. Public Health and Online Misinformation: Challenges and Recommendations , volume =. Annual Review of Public Health , publisher =. doi:10.1146/annurev-publhealth-040119-094127 , number =

work page doi:10.1146/annurev-publhealth-040119-094127
[13]

Aho and Jeffrey D

Alfred V. Aho and Jeffrey D. Ullman , title =. 1972

1972
[14]

Publications Manual , year = "1983", publisher =

1983
[15]

Chandra and Dexter C

Ashok K. Chandra and Dexter C. Kozen and Larry J. Stockmeyer , year = "1981", title =. doi:10.1145/322234.322243

work page doi:10.1145/322234.322243 1981
[16]

Scalable training of

Andrew, Galen and Gao, Jianfeng , booktitle=. Scalable training of
[17]

Dan Gusfield , title =. 1997

1997
[18]

Tetreault , title =

Mohammad Sadegh Rasooli and Joel R. Tetreault , title =. Computing Research Repository , volume =. 2015 , url =

2015
[19]

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =

Ando, Rie Kubota and Zhang, Tong , Issn =. A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =. Journal of Machine Learning Research , Month = dec, Numpages =

[1] [1]

Current Issues in Personality Psychology , author=

Intellectual humility: an old problem in a new psychological perspective , volume=. Current Issues in Personality Psychology , author=. 2022 , pages=

2022

[2] [2]

All That ' s `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text

Clark, Elizabeth and August, Tal and Serrano, Sofia and Haduong, Nikita and Gururangan, Suchin and Smith, Noah A. All That ' s `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing...

work page doi:10.18653/v1/2021.acl-long.565 2021

[3] [3]

and Buss, David M

Haselton, Martie G. and Buss, David M. , title =. Journal of Personality and Social Psychology , volume =. 2000 , ISSN =. doi:10.1037/0022-3514.78.1.81 , number =

work page doi:10.1037/0022-3514.78.1.81 2000

[4] [4]

and Naaman, Mor , year =

Jakesch, Maurice and Hancock, Jeffrey T. and Naaman, Mor , year =. Human heuristics for AI-generated language are flawed , volume =. Proceedings of the National Academy of Sciences , publisher =. doi:10.1073/pnas.2208839120 , number =

work page doi:10.1073/pnas.2208839120

[5] [5]

Artificial intelligence versus Maya Angelou: Experimental evidence that people cannot differentiate AI-generated from human-written poetry , volume =

K\". Artificial intelligence versus Maya Angelou: Experimental evidence that people cannot differentiate AI-generated from human-written poetry , volume =. 2021 , month = Jan, pages =. doi:10.1016/j.chb.2020.106553 , journal =

work page doi:10.1016/j.chb.2020.106553 2021

[6] [6]

Social Psychological and Personality Science , publisher =

Koetke, Jonah and Schumann, Karina and Porter, Tenelle , title =. Social Psychological and Personality Science , publisher =. 2021 , month = Jan, pages =. doi:10.1177/1948550620988242 , number =

work page doi:10.1177/1948550620988242 2021

[7] [7]

and Rouse, Steven V

Krumrei-Mancuso, Elizabeth J. and Rouse, Steven V. , year =. The Development and Validation of the Comprehensive Intellectual Humility Scale , volume =. Journal of Personality Assessment , publisher =. doi:10.1080/00223891.2015.1068174 , number =

work page doi:10.1080/00223891.2015.1068174 2015

[8] [8]

and Diebels, Kate J

Leary, Mark R. and Diebels, Kate J. and Davisson, Erin K. and Jongman-Sereno, Katrina P. and Isherwood, Jennifer C. and Raimi, Kaitlin T. and Deffler, Samantha A. and Hoyle, Rick H. , year =. Cognitive and Interpersonal Features of Intellectual Humility , volume =. Personality and Social Psychology Bulletin , publisher =. doi:10.1177/0146167217697695 , number =

work page doi:10.1177/0146167217697695

[9] [9]

doi:10.4159/9780674977860 , publisher =

Hugo Mercier and Dan Sperber , title =. doi:10.4159/9780674977860 , publisher =

work page doi:10.4159/9780674977860

[10] [10]

AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably , volume =

Porter, Brian and Machery, Edouard , year =. AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably , volume =. Scientific Reports , publisher =. doi:10.1038/s41598-024-76900-1 , number =

work page doi:10.1038/s41598-024-76900-1

[11] [11]

Epistemic Vigilance , volume =

Sperber, Dan and Clement, Fabrice and Heintz, Christophe and Mascaro, Olivier and Mercier, Hugo and Origgi, Gloria and Wilson, Deirdre , year =. Epistemic Vigilance , volume =. Mind `I&' Language , publisher =. doi:10.1111/j.1468-0017.2010.01394.x , number =

work page doi:10.1111/j.1468-0017.2010.01394.x 2010

[12] [12]

Public Health and Online Misinformation: Challenges and Recommendations , volume =

Swire-Thompson, Briony and Lazer, David , year =. Public Health and Online Misinformation: Challenges and Recommendations , volume =. Annual Review of Public Health , publisher =. doi:10.1146/annurev-publhealth-040119-094127 , number =

work page doi:10.1146/annurev-publhealth-040119-094127

[13] [13]

Aho and Jeffrey D

Alfred V. Aho and Jeffrey D. Ullman , title =. 1972

1972

[14] [14]

Publications Manual , year = "1983", publisher =

1983

[15] [15]

Chandra and Dexter C

Ashok K. Chandra and Dexter C. Kozen and Larry J. Stockmeyer , year = "1981", title =. doi:10.1145/322234.322243

work page doi:10.1145/322234.322243 1981

[16] [16]

Scalable training of

Andrew, Galen and Gao, Jianfeng , booktitle=. Scalable training of

[17] [17]

Dan Gusfield , title =. 1997

1997

[18] [18]

Tetreault , title =

Mohammad Sadegh Rasooli and Joel R. Tetreault , title =. Computing Research Repository , volume =. 2015 , url =

2015

[19] [19]

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =

Ando, Rie Kubota and Zhang, Tong , Issn =. A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =. Journal of Machine Learning Research , Month = dec, Numpages =