Dark Personality Traits and Online Toxicity: Linking Self-Reports to Reddit Activity
Pith reviewed 2026-05-16 22:59 UTC · model grok-4.3
The pith
Dark personality traits associate with self-reported uncivil online behavior but show no link to actual linguistic toxicity in Reddit comments.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Validated dark personality dimensions correlate consistently with self-reported engagement in uncivil online interactions, but no such dimension significantly predicts text-derived toxicity scores or other linguistic features from the Reddit comments. Self-reported experiences of engaging in or being targeted by toxic behavior, however, show robust associations with negativity, moral framing, and emotional intensity in users' language. The findings indicate a gap between personality traits and their expression in observable linguistic signals, with computational features capturing behavioral engagement but not serving as reliable proxies for underlying traits.
What carries the argument
A secure web application that links validated psychological questionnaire responses collected via Amazon Mechanical Turk to participants' actual Reddit comment history, yielding linguistic and behavioral features from 57K comments.
If this is right
- Computational linguistic features can capture self-reported behavioral engagement in online incivility.
- Validated dark personality measures do not serve as reliable proxies for text-based toxicity within the current feature set.
- Grounding computational approaches in validated psychological measures remains necessary for linking personality to online behavior.
- Richer, context-aware representations are needed to connect stable traits with observable language patterns.
Where Pith is reading between the lines
- Platforms could prioritize behavioral language signals over personality profiles when designing toxicity detection systems.
- Longitudinal tracking of the same users might reveal whether dark traits predict future behavior even if they miss current language patterns.
- Refining toxicity proxies to include thread context or reply chains could close the observed gap between self-reports and text features.
Load-bearing premise
The selected linguistic and behavioral features extracted from Reddit comments are valid and sufficient proxies for online toxicity, and self-report measures accurately reflect real behavior.
What would settle it
A larger replication study in which any validated dark personality dimension significantly predicts text-derived toxicity scores or linguistic markers in Reddit-style data.
Figures
read the original abstract
Dark personality traits have long been associated with antisocial and toxic online behaviors, yet their relationship with observable online activity remains unclear. We investigate the association between validated dark personality measures, self-reported experiences of online incivility, and linguistic and behavioral features extracted from real-world user activity. To this end, we developed a Web application that securely links responses to validated psychological questionnaires collected via Amazon Mechanical Turk with participants' Reddit activity. This yielded a dataset of nearly 57K comments (2.2M tokens) from 114 users, represented through a broad set of linguistic and behavioral features. Our analyses reveal a clear distinction between self-reported and observed behavior. Dark personality traits show consistent associations with self-reported engagement in uncivil interactions. However, no validated dark personality dimension significantly predicts text-derived toxicity or linguistic features. In contrast, self-reported experiences of engaging in or being targeted by toxic behavior are robustly reflected in users' language, exhibiting consistent associations with measures of negativity, moral framing, and emotional intensity. Taken together, these findings highlight a gap between stable personality traits and their manifestation in surface-level linguistic signals. While computational features effectively capture behavioral engagement in online incivility, they do not provide reliable proxies for underlying personality constructs within the present framework. Our results underscore the importance of grounding computational approaches in validated psychological measures and point to the need for richer, context-aware representations to better understand the relationship between personality and online behavior.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript examines associations between validated dark personality trait measures, self-reported experiences of online incivility, and a broad set of linguistic and behavioral features extracted from nearly 57K Reddit comments (2.2M tokens) by 114 MTurk participants whose accounts were securely linked via a custom web app. It reports consistent positive associations between dark traits and self-reported uncivil engagement, but no significant predictions of text-derived toxicity scores or linguistic markers (negativity, moral framing, emotional intensity) by any dark personality dimension. Self-reported incivility experiences, by contrast, show robust correlations with those same linguistic features. The authors conclude that computational features capture behavioral engagement but are unreliable proxies for underlying personality constructs.
Significance. If the dissociation holds after addressing proxy validation, the work offers a valuable empirical demonstration that self-report measures of dark traits align with reported online behavior while observable linguistic signals do not, highlighting limits of current text-based proxies in computational social science. The linkage of validated psychological scales to real Reddit activity on a dataset of this scale is a clear methodological strength and supports calls for richer, context-aware representations.
major comments (3)
- [Methods] Methods section: No details are provided on the specific toxicity classifier (model architecture, training corpus, calibration on Reddit data, or performance metrics such as precision/recall against human annotations for sarcasm and context). This is load-bearing for the central null result on dark traits, as substantial measurement error in the automated features would attenuate correlations with stable traits while permitting coarser self-report associations to remain detectable at N=114.
- [Results] Results section: The analyses involve a broad feature set and multiple personality and linguistic variables, yet no correction for multiple comparisons is mentioned. This affects interpretation of both the reported self-report correlations and the null personality-to-toxicity findings.
- [Discussion] Discussion section: The claim that 'computational features effectively capture behavioral engagement in online incivility' but 'do not provide reliable proxies for underlying personality constructs' requires explicit discussion of feature limitations (e.g., lack of thread context or sarcasm handling) to justify the dissociation as substantive rather than methodological.
minor comments (2)
- [Abstract] Abstract: The sample size (N=114) and comment volume should be stated with greater precision regarding inclusion criteria and any data filtering steps.
- [Methods] The manuscript would benefit from a table summarizing the exact linguistic and behavioral features used and their extraction methods.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments, which highlight important areas for clarification and strengthening of the manuscript. We address each major comment point by point below, outlining the specific revisions we will implement.
read point-by-point responses
-
Referee: [Methods] Methods section: No details are provided on the specific toxicity classifier (model architecture, training corpus, calibration on Reddit data, or performance metrics such as precision/recall against human annotations for sarcasm and context). This is load-bearing for the central null result on dark traits, as substantial measurement error in the automated features would attenuate correlations with stable traits while permitting coarser self-report associations to remain detectable at N=114.
Authors: We agree that the Methods section requires additional detail on the toxicity classifier to allow proper evaluation of the null results. In the revised manuscript, we will expand this section to specify the exact model (including architecture and source), training corpus, any calibration or fine-tuning performed on Reddit-style data, and performance metrics such as precision, recall, and F1 scores, with explicit discussion of sarcasm and context handling. These additions will directly address concerns about measurement error. revision: yes
-
Referee: [Results] Results section: The analyses involve a broad feature set and multiple personality and linguistic variables, yet no correction for multiple comparisons is mentioned. This affects interpretation of both the reported self-report correlations and the null personality-to-toxicity findings.
Authors: The referee is correct that multiple-comparison correction is needed given the number of tests performed. We will revise the Results section to apply FDR correction (or Bonferroni, as appropriate) and report both uncorrected and corrected p-values. This will provide a more rigorous view of the self-report associations; the null findings for dark traits predicting toxicity features remain robust, as none approached significance even without correction. revision: yes
-
Referee: [Discussion] Discussion section: The claim that 'computational features effectively capture behavioral engagement in online incivility' but 'do not provide reliable proxies for underlying personality constructs' requires explicit discussion of feature limitations (e.g., lack of thread context or sarcasm handling) to justify the dissociation as substantive rather than methodological.
Authors: We appreciate this suggestion and will expand the Discussion to include a dedicated paragraph on the limitations of the extracted linguistic features. This will explicitly cover the absence of full thread context, challenges in sarcasm detection, reliance on surface-level markers, and other constraints. These additions will clarify that the observed dissociation reflects substantive differences between self-report and observable signals rather than solely methodological artifacts. revision: yes
Circularity Check
Empirical correlation study with no derivation chain or self-referential reductions
full rationale
The paper is a data-driven empirical analysis that collects self-report questionnaire data via MTurk, links it to real Reddit comments, extracts standard linguistic and behavioral features, and reports statistical associations. No equations, first-principles derivations, fitted parameters renamed as predictions, or uniqueness theorems appear in the abstract or described methods. All central claims rest on observable data and externally validated scales rather than any step that reduces to its own inputs by construction. Self-citations, if present, are not load-bearing for the reported associations. This is a standard non-circular empirical design.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Validated self-report questionnaires accurately capture dark personality traits and experiences of online incivility.
- domain assumption Linguistic and behavioral features extracted from Reddit comments are valid indicators of toxicity.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We computed ... 224 linguistic and behavioral features ... toxicity scores from Perspective API ... LIWC-22 ... NRC Emotion Intensity Lexicon ... FrameAxis ... EmoAtlas ... Spearman rank correlations
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
no validated dark personality dimension significantly predicts text-derived toxicity or linguistic features
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Y. K. Dwivedi, G. Kelly, M. Janssen, N. P. Rana, E. L. Slade, M. Clement, Social media: The good, the bad, and the ugly, Information Systems Frontiers 20 (3) (2018) 419–423
work page 2018
-
[2]
Gillespie, Content moderation, ai, and the question of scale, Big Data & Society 7 (2) (2020)
T. Gillespie, Content moderation, ai, and the question of scale, Big Data & Society 7 (2) (2020)
work page 2020
-
[3]
M. Gambini, T. Fagni, F. Falchi, M. Tesconi, On pushing deepfake tweet detection capabilities to the limits, in: Proceedings of the 14th ACM web science conference 2022, 2022, pp. 154–163
work page 2022
-
[4]
M. Groh, Z. Epstein, C. Firestone, R. Picard, Deepfake detection by human crowds, machines, and machine-informed crowds, Proceedings of the National Academy of Sciences 119 (1) (2022) e2110013119
work page 2022
- [5]
- [6]
-
[7]
D. Graf, T. Yanagida, K. Runions, C. Spiel, Why did you do that? differential types of aggression in offline and in cyberbullying, Computers in Human Behavior 128 (2022) 107107
work page 2022
-
[8]
L. Cima, A. Miaschi, A. Trujillo, M. Avvenuti, F. Dell’Orletta, S. Cresci, Contextu- alized counterspeech: Strategies for adaptation, personalization, and evaluation, in: Proceedings of the ACM on Web Conference 2025, 2025, pp. 5022–5033
work page 2025
- [9]
-
[10]
O. Bogolyubova, P. Panicheva, R. Tikhonov, V. Ivanov, Y. Ledovaya, Dark person- alities on facebook: Harmful online behaviors and language, Computers in human Behavior 78 (2018) 151–159. 29
work page 2018
-
[11]
V. Popa, G. Cola, C. Senette, M. Tesconi, How effective are large language models (llms) at inferring people’s personality based on texts they authored?, in: G. Costa, R. Montanari, M. Carminati, G. Sciarretta (Eds.), Proceedings of the Joint National Conference on Cybersecurity (ITASEC & SERICS 2025), Bologna, Italy, February 03-08, 2025, Vol. 3962 of CE...
work page 2025
-
[12]
K.Y.Koay, W.C.Poon, Students’cyberslackingbehaviourine-learningenvironments: the role of the big five personality traits and situational factors, Journal of Applied Research in Higher Education 15 (2) (2023) 521–536
work page 2023
-
[13]
T. Bowden-Green, J. Hinds, A. Joinson, How is extraversion related to social media use? a literature review, Personality and Individual Differences 164 (2020) 110040
work page 2020
- [14]
-
[15]
N. Gugushvili, K. Täht, E. M. Schruff-Lim, R. A. Ruiter, P. Verduyn, The association between neuroticism and problematic social networking sites use: the role of fear of missing out and self-control, Psychological reports 127 (4) (2024) 1727–1750
work page 2024
-
[16]
N. A. Utami, W. Maharani, I. Atastina, Personality classification of facebook users according to big five personality using svm (support vector machine) method, Procedia Computer Science 179 (2021) 177–184
work page 2021
-
[17]
L. Chen, W. Cai, D. Yan, S. Berkovsky, Eye-tracking-based personality prediction with recommendation interfaces, User Model. User Adapt. Interact. 33 (1) (2023) 121–157
work page 2023
-
[18]
K. L. Meidenbauer, T. Niu, K. W. Choe, A. J. Stier, M. G. Berman, Mouse move- ments reflect personality traits and task attentiveness in online experiments, Journal of Personality 91 (2) (2023) 413–425
work page 2023
- [19]
- [20]
-
[21]
N. Mahadevan, Conceptualizing grandiose and vulnerable narcissism as alternative status-seeking strategies: Insights from hierometer theory, Social and Personality Psy- chology Compass 18 (6) (2024)
work page 2024
- [22]
- [23]
-
[24]
T. Biselli, K. Hartwig, C. Reuter, Mitigating misinformation sharing on social media through personalised nudging, Proceedings of the ACM on Human-Computer Interac- tion 9 (2) (2025) 1–44
work page 2025
- [25]
- [26]
-
[27]
L. Moor, J. R. Anderson, A systematic literature review of the relationship between dark personality traits and antisocial online behaviours, Personality and Individual Differences 144 (2019) 40–55
work page 2019
-
[28]
D. Paulhus, K. M. Williams, The dark triad of personality: Narcissism, machiavellian- ism and psychopathy, Journal of Research in Personality 36 (6) (2002) 556–563
work page 2002
-
[29]
H. Chabrol, N. Van Leeuwen, R. Rodgers, N. Séjourné, Contributions of psychopathic, narcissistic, machiavellian, and sadistic personality traits to juvenile delinquency, Per- sonality and Individual Differences 47 (7) (2009) 734–739
work page 2009
-
[30]
E. Buckels, D. Jones, D. Paulhus, Behavioral confirmation of everyday sadism, Psy- chological science 24 (09 2013)
work page 2013
-
[31]
M. van Geel, A. Goemans, F. Toprak, P. Vedder, Which personality traits are related to traditional bullying and cyberbullying? a study with the big five, dark triad and sadism, Personality and Individual Differences 106 (2017) 231–235
work page 2017
-
[32]
D. Paulhus, E. Buckels, P. Trapnell, D. Jones, Screening for dark personalities, Euro- pean Journal of Psychological Assessment 37 (2020) 208–222
work page 2020
- [33]
- [34]
-
[35]
K. C. Seigfried-Spellar, C. M. Lankford, Personality and online environment factors differ for posters, trolls, lurkers, and confessors on yik yak, Personality and Individual Differences 124 (2018) 54–56. 31
work page 2018
-
[36]
E. Buckels, P. Trapnell, T. Andjelovic, D. Paulhus, Internet trolling and everyday sadism: Parallel effects on pain perception and moral judgment, Journal of Personality 87 (04 2018)
work page 2018
- [37]
-
[38]
Z. G. Gibb, P. G. Devereux, Who does that anyway? predictors and personality correlates of cyberbullying in college, Computers in Human Behavior 38 (2014) 8–16
work page 2014
-
[39]
A. K. Goodboy, M. M. Martin, The personality profile of a cyberbully: Examining the dark triad, Computers in Human Behavior 49 (2015) 1–4
work page 2015
- [40]
-
[41]
K. Kircaburun, P. K. Jonason, M. D. Griffiths, The dark tetrad traits and problematic social media use: The mediating role of cyberbullying and cyberstalking, Personality and Individual Differences 135 (2018) 264–269
work page 2018
-
[42]
K. Kircaburun, Z. Demetrovics, Ş. B. Tosuntaş, Analyzing the links between prob- lematic social media use, dark triad traits, and self-esteem, International Journal of Mental Health and Addiction 17 (6) (2019) 1496–1507
work page 2019
-
[43]
Z. Demircioğlu, A. Göncü Köse, Effects of attachment styles, dark triad, rejection sensitivity, and relationship satisfaction on social media addiction: A mediated model, Current Psychology 40 (01 2021)
work page 2021
-
[44]
D. Preotiuc-Pietro, J. Carpenter, S. Giorgi, L. Ungar, Studying the dark triad of personality through twitter behavior, in: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM ’16, Association for Computing Machinery, New York, NY, USA, 2016, p. 761–770
work page 2016
-
[45]
A. Moskvichev, M. Dubova, S. Menshov, A. Filchenkov, Using linguistic activity in social networks to predict and interpret dark psychological traits, in: Communications in Computer and Information Science, 2018, pp. 16–26
work page 2018
-
[46]
C. Sumner, A. Byers, R. Boochever, C. Sumner, A. Byers, R. Boochever, G. Park, Predicting dark triad personality traits from twitter usage and a linguistic analysis of tweets, Proceedings - 2012 11th International Conference on Machine Learning and Applications, ICMLA 2012 2 (12 2012)
work page 2012
- [47]
-
[48]
V. Balakrishnan, S. Khan, T. Fernandez, H. R. Arabnia, Cyberbullying detection on twitter using big five and dark triad features, Personality and Individual Differences 141 (2019) 252–257
work page 2019
-
[49]
D. Paulhus, S. Vazire, et al., The self-report method, Handbook of research methods in personality psychology 1 (2007) (2007) 224–239
work page 2007
- [50]
-
[51]
K. Chraibi, I. Chaker, A. Zahi, Automatic personality prediction: A systematic map- ping study, in: 2020 IEEE Symposium Series on Computational Intelligence, SSCI 2020, Canberra, Australia, December 1-4, 2020, IEEE, 2020, pp. 2053–2060
work page 2020
-
[52]
A.Vinciarelli, G.Mohammadi, Asurveyofpersonalitycomputing, IEEETrans.Affect. Comput. 5 (3) (2014) 273–291
work page 2014
-
[53]
L. V. Phan, J. F. Rauthmann, Personality computing: New frontiers in personality assessment, Social and Personality Psychology Compass 15 (7) (2021)
work page 2021
-
[54]
L. Yang, S. Li, X. Luo, B. Xu, Y. Geng, Z. Zeng, F. Zhang, H. Lin, Computational personality: a survey, Soft Computing 26 (18) (2022) 9587–9605
work page 2022
-
[55]
G. R. VandenBos (Ed.), APA Dictionary of Psychology, American Psychological As- sociation, 2007
work page 2007
-
[56]
C. C. Saw, A. Inthiran, Designing for trust on e-commerce websites using two of the big five personality traits, Journal of Theoretical and Applied Electronic Commerce Research 17 (2) (2022) 375–393
work page 2022
-
[57]
S. Tardelli, M. Avvenuti, M. Tesconi, S. Cresci, Characterizing social bots spreading financial disinformation, in: Social Computing and Social Media. Design, Ethics, User Behavior, and Social Network Analysis - 12th International Conference, SCSM 2020, Held as Part of the 22nd HCI International Conference, HCII 2020, Copenhagen, Den- mark, July 19-24, 20...
work page 2020
-
[58]
D. Karanatsiou, P. Sermpezis, D. Gruda, K. Kafetsios, I. Dimitriadis, A. Vakali, My tweets bring all the traits to the yard: Predicting personality and relational traits in online social networks, ACM Trans. Web 16 (2) (2022) 10:1–10:26
work page 2022
-
[59]
N. Majumder, S. Poria, A. Gelbukh, E. Cambria, Deep learning-based document mod- eling for personality detection from text, IEEE Intelligent Systems 32 (2) (2017) 74–79. doi:10.1109/MIS.2017.23. 33
-
[60]
X. Zhao, Z. Tang, S. Zhang, Deep personality trait recognition: a survey, Frontiers in psychology 13 (2022) 839619
work page 2022
-
[61]
Z. Mushtaq, S. Ashraf, N. Sabahat, Predicting mbti personality type with k-means clustering and gradient boosting, in: 2020 IEEE 23rd International Multitopic Con- ference (INMIC), 2020, pp. 1–5. URLhttps://api.semanticscholar.org/CorpusID:231684233
work page 2020
-
[62]
M. Ramezani, M. Feizi-Derakhshi, M. A. Balafar, M. Asgari-Chenaghlu, A. Feizi- Derakhshi, N. Nikzad-Khasmakhi, M. Ranjbar-Khadivi, Z. Jahanbakhsh-Nagadeh, E. Zafarani-Moattar, T. Akan, Automatic personality prediction: an enhanced method using ensemble modeling, Neural Computing and Applications 34 (21) (2022) 18369– 18389
work page 2022
-
[63]
J. Xu, W. Tian, G. Lv, S. Liu, Y. Fan, Prediction of the big five personality traits using static facial images of college students with different academic backgrounds, IEEE Access 9 (2021) 76822–76832
work page 2021
-
[64]
L. Hickman, N. Bosch, V. Ng, R. Saef, L. Tay, S. E. Woo, Automated video inter- view personality assessments: Reliability, validity, and generalizability investigations, Journal of Applied Psychology 107 (8) (2022) 1323–1351
work page 2022
-
[65]
M. Lukac, Speech-based personality prediction using deep learning with acoustic and linguistic embeddings, Scientific Reports 14 (12 2024)
work page 2024
-
[66]
Y. Xu, Y. Tang, C. Y. Suen, Review of handwriting analysis for predicting personality traits, in: Structural, Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshops, S+SSPR 2022, Montreal, QC, Canada, August 26-27, 2022, Proceedings, Vol. 13813 of Lecture Notes in Computer Science, Springer, 2022, pp. 54–63
work page 2022
- [67]
-
[68]
Y. Tausczik, J. Pennebaker, The psychological meaning of words: Liwc and comput- erized text analysis methods, Journal of Language and Social Psychology 29 (2010) 24–54
work page 2010
-
[69]
D. Jain, A. Kumar, R. Beniwal, Personality bert: A transformer-based model for personality detection from textual data, in: A. K. Bashir, G. Fortino, A. Khanna, D. Gupta (Eds.), Proceedings of International Conference on Computing and Com- munication Networks, Springer Nature Singapore, Singapore, 2022, pp. 515–522
work page 2022
-
[70]
A. V. Ganesan, Y. K. Lal, A. H. Nilsson, H. A. Schwartz, Systematic evaluation of GPT-3 for zero-shot personality estimation, in: Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, 34 WASSA@ACL 2023, Toronto, Canada, July 14, 2023, Association for Computational Linguistics, 2023, pp. 390–400
work page 2023
-
[71]
T. Zhang, A. Koutsoumpis, J. K. Oostrom, D. Holtrop, S. Ghassemi, R. E. de Vries, Can large language models assess personality from asynchronous video interviews? A comprehensive evaluation of validity, reliability, fairness, and rating patterns, IEEE Transactions on Affective Computing 15 (3) (2024) 1769–1785
work page 2024
- [72]
- [73]
- [74]
-
[75]
L. Katz, C. Harvey, I. S. Baker, C. Howard, The dark side of humanity scale: a reconstruction of the dark tetrad constructs, Acta Psychologica 222 (2022) 103461
work page 2022
-
[76]
L. A. Zezulka, K. Seigfried-Spellar, Differentiating cyberbullies and internet trolls by personality characteristics and self-esteem, Journal of Digital Forensics, Security and Law (2016)
work page 2016
-
[77]
A. Sparavec, E. March, R. Grieve, The dark triad, empathy, and motives to use social media, Personality and Individual Differences 194 (2022) 111647
work page 2022
- [78]
-
[79]
M. E. Athar, Exploring the multidimensional nature of the psychopathy construct in social media context: Insights from instagram, Computers in Human Behavior Reports 17 (2025) 100603
work page 2025
-
[80]
C. Yuan, Y. Hong, J. Wu, Does facebook activity reveal your dark side? using online language features to understand an individual’s dark triad and needs, Behaviour & Information Technology 41 (2) (2022) 292–306
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.