GPT-4o reads the mind in the eyes

Alessandro Rufo; Chunzhi Yi; Cristina Becchio; Eugenio Scaliti; Fabio Manzi; Guido Manzi; James W. A. Strachan; Krati Saxena; Marco Celotto; Michael S. A. Graziano

arxiv: 2410.22309 · v2 · pith:BME3VW4Wnew · submitted 2024-10-29 · 💻 cs.HC · cs.CY

James W. A. Strachan, Oriana Pansardi, Eugenio Scaliti, Marco Celotto, Krati Saxena, Chunzhi Yi, Fabio Manzi, Alessandro Rufo, Guido Manzi, Michael S. A. Graziano, Stefano Panzeri, Cristina Becchio

keywords: faces, gpt-4o, humans, mind, eyes, mental, processing, test

Large Language Models (LLMs) are capable of reproducing human-like inferences, including inferences about emotions and mental states, from text. Whether this capability extends beyond text to other modalities remains unclear. Humans possess a sophisticated ability to read the mind in the eyes of other people. Here we tested whether this ability is also present in GPT-4o, a multimodal LLM. Using two versions of a widely used theory of mind test, the Reading the Mind in the Eyes Test and the Multiracial Reading the Mind in the Eyes Test, we found that GPT-4o outperformed humans in interpreting mental states from upright faces but underperformed humans when faces were inverted. While humans in our sample showed no difference between White and Non-white faces, GPT-4o's accuracy was higher for White than for Non-white faces. GPT-4o's errors were not random but revealed a highly consistent, yet incorrect, processing of mental-state information across trials, with an orientation-dependent error structure that qualitatively differed from that of humans for inverted faces but not for upright faces. These findings highlight how advanced mental state inference abilities and human-like face processing signatures, such as inversion effects, coexist in GPT-4o alongside substantial differences in information processing compared to humans.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Theory of Mind in Action: The Instruction Inference Task in Dynamic Human-Agent Collaboration
cs.CL · 2025-06 · conditional · novelty 6.0

Tomcat, an LLM agent using few-shot chain-of-thought or commonsense prompting, matches human performance on intent accuracy, action optimality, and planning optimality in a dynamic collaborative task.