Can LLMs Infer Conversational Agent Users' Personality Traits from Chat History?

· 2026 · cs.CL · arXiv 2604.19785

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Sensitive information, such as knowledge about an individual's personality, can be can be misused to influence behavior (e.g., via personalized messaging). To assess to what extent an individual's personality can be inferred from user interactions with LLM-based conversational agents (CAs), we analyze and quantify related privacy risks of using CAs. We collected actual ChatGPT logs from N=668 participants, containing 62,090 individual chats, and report statistics about the different types of shared data and use cases. We fine-tuned RoBERTa-base text classification models to infer personality traits from CA interactions. The findings show that these models achieve trait inference with accuracy (ternary classification) better than random in multiple cases. For example, for extraversion, accuracy improves by +44% relative to the baseline on interactions for relationships and personal reflection. This research highlights how interactions with CAs pose privacy risks and provides fine-grained insights into the level of risk associated with different types of interactions.

representative citing papers

Inferential Privacy Leakage in Anonymized Conversational AI Logs

cs.CY · 2026-05-22 · unverdicted · novelty 6.0

LLM-based inference recovers user age, gender, and country from filtered ChatGPT logs at weighted F1 scores of 0.84-0.90, with median identification from the first 5% of history, driven by stereotype patterns.

citing papers explorer

Showing 1 of 1 citing paper.

Inferential Privacy Leakage in Anonymized Conversational AI Logs cs.CY · 2026-05-22 · unverdicted · none · ref 3 · internal anchor
LLM-based inference recovers user age, gender, and country from filtered ChatGPT logs at weighted F1 scores of 0.84-0.90, with median identification from the first 5% of history, driven by stereotype patterns.

Can LLMs Infer Conversational Agent Users' Personality Traits from Chat History?

fields

years

verdicts

representative citing papers

citing papers explorer