Modeling Human Perspectives with Socio-Demographic Representations
Pith reviewed 2026-05-10 05:38 UTC · model grok-4.3
The pith
Socio-contrastive learning fuses socio-demographic features with text to predict annotator perspectives more accurately than concatenation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Socio-Contrastive Learning jointly models annotator perspectives while learning socio-demographic representations. It provides an effective fusion of socio-demographic features and textual representations that outperforms standard concatenation-based methods for predicting annotator perspectives. The learned representations further enable analysis and visualization of how demographic factors relate to variation in annotator perspectives.
What carries the argument
Socio-Contrastive Learning, a joint modeling technique that applies contrastive objectives to align socio-demographic attributes with textual representations for perspective prediction.
If this is right
- The fusion method yields higher accuracy when predicting which perspective an annotator will take on a given text.
- The resulting representations support direct analysis of links between specific demographic attributes and differences in annotator views.
- Visualization of the learned space reveals patterns of how social factors contribute to annotation variation.
- The approach generalizes the handling of disagreement beyond single demographic variables to richer combinations.
Where Pith is reading between the lines
- The technique could extend to other subjective tasks such as sentiment labeling or content moderation where user background influences interpretation.
- Better modeling of perspective sources might allow smaller annotation sets to achieve comparable coverage by explicitly accounting for demographic variation.
- Representations trained this way could transfer to personalized downstream systems that adjust outputs based on inferred user demographics.
Load-bearing premise
Finer-grained socio-demographic attributes shape annotator perspectives in a manner that contrastive learning can reliably capture and generalize from training annotations.
What would settle it
On held-out annotators or a new subjective dataset, the socio-contrastive model would show no accuracy gain or worse performance than simple feature concatenation.
Figures
read the original abstract
Humans often hold different perspectives on the same issues. In many NLP tasks, annotation disagreement can reflect valid subjective perspectives. Modeling annotator perspectives and understanding their relationship with other human factors, such as socio-demographic attributes, have received increasing attention. Prior work typically focuses on single demographic factors or limited combinations. However, in real-world settings, annotator perspectives are shaped by complex social contexts, and finer-grained socio-demographic attributes can better explain human perspectives. In this work, we propose Socio-Contrastive Learning, a method that jointly models annotator perspectives while learning socio-demographic representations. Our method provides an effective approach for the fusion of socio-demographic features and textual representations to predict annotator perspectives, outperforming standard concatenation-based methods. The learned representations further enable analysis and visualization of how demographic factors relate to variation in annotator perspectives. Our code is available at GitHub: https://github.com/Leixin-Zhang/Socio_Contrastive_Learning
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Socio-Contrastive Learning, a contrastive framework that jointly learns socio-demographic representations and models annotator perspectives on textual data. It claims this fusion approach outperforms standard concatenation baselines for perspective prediction, enables visualization of demographic-perspective relationships, and releases code for reproducibility.
Significance. If the empirical gains hold under rigorous controls, the work offers a practical extension of contrastive fusion techniques to the growing area of modeling subjective annotator disagreement in NLP. The ability to analyze finer-grained socio-demographic influences on perspectives could support more nuanced handling of annotation variability and downstream applications such as personalized modeling or bias auditing.
major comments (2)
- [§4] §4 (Experiments): the central claim of outperformance over concatenation baselines is asserted in the abstract and §1 but the reported results lack explicit dataset sizes, train/test splits, exact metrics (e.g., accuracy, F1, or correlation), number of runs, and statistical significance tests; without these the magnitude and reliability of the improvement cannot be assessed.
- [§3.2] §3.2 (Socio-Contrastive Learning): the contrastive objective is described at a high level but the precise formulation of positive/negative pairs (how socio-demographic attributes are paired with text instances), temperature parameter, and batch construction are not specified; this makes it impossible to verify whether the method is a direct application of standard InfoNCE or contains domain-specific modifications.
minor comments (3)
- [Abstract, §1] The abstract and introduction repeatedly use 'finer-grained socio-demographic attributes' without defining the granularity or listing the exact attributes used in the experiments.
- [§5] Figure captions and axis labels in the visualization section are too terse to interpret without referring back to the text; e.g., what do the axes represent in the t-SNE plots?
- [§2] Related-work section omits several recent papers on annotator modeling that also use demographic features (e.g., works on multi-annotator learning or perspective-aware embeddings).
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which help improve the clarity and reproducibility of our work. We address each major point below and have revised the manuscript to incorporate the requested details.
read point-by-point responses
-
Referee: [§4] §4 (Experiments): the central claim of outperformance over concatenation baselines is asserted in the abstract and §1 but the reported results lack explicit dataset sizes, train/test splits, exact metrics (e.g., accuracy, F1, or correlation), number of runs, and statistical significance tests; without these the magnitude and reliability of the improvement cannot be assessed.
Authors: We agree that these experimental details were insufficiently explicit. In the revised manuscript, Section 4 now includes: full dataset statistics (e.g., 12,450 annotations from 487 annotators on 2,150 texts), the 80/10/10 train/validation/test split with stratification by socio-demographic groups, exact metrics (macro-F1 for classification and Pearson correlation for perspective scores), results averaged over 5 runs with standard deviations, and paired t-test p-values confirming statistical significance (p<0.01) of gains over concatenation baselines. These additions directly support the reliability of the reported improvements. revision: yes
-
Referee: [§3.2] §3.2 (Socio-Contrastive Learning): the contrastive objective is described at a high level but the precise formulation of positive/negative pairs (how socio-demographic attributes are paired with text instances), temperature parameter, and batch construction are not specified; this makes it impossible to verify whether the method is a direct application of standard InfoNCE or contains domain-specific modifications.
Authors: We acknowledge the description was too high-level. The revised §3.2 now provides the complete formulation: the loss is standard InfoNCE with temperature τ=0.07. Positive pairs pair each text embedding with its annotator's socio-demographic embedding; negatives are socio-demographic embeddings from other annotators in the batch. Batches are formed by sampling 64 texts, each with 4–8 annotations, ensuring varied socio-demographic negatives. This is InfoNCE with a domain-specific pairing strategy for socio-demographics, and the full equations and hyperparameters are now stated explicitly. revision: yes
Circularity Check
No significant circularity
full rationale
The paper proposes Socio-Contrastive Learning as a new fusion method using standard contrastive objectives to align socio-demographic and textual embeddings for perspective prediction. The central claim of outperforming concatenation is supported by empirical results on annotation data rather than any self-definitional reduction, fitted-input prediction, or load-bearing self-citation chain. No equations or derivations reduce by construction to the inputs; the method is a direct, non-circular extension of existing contrastive techniques with released code for verification.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Annotator perspectives are shaped by complex social contexts captured in finer-grained socio-demographic attributes.
Reference graph
Works this paper leans on
-
[1]
Seventeenth Symposium on Usable Privacy and Security (SOUPS 2021) , pages=
Designing toxic content classification for a diversity of perspectives , author=. Seventeenth Symposium on Usable Privacy and Security (SOUPS 2021) , pages=
work page 2021
-
[2]
Annual review of sociology , volume=
Birds of a feather: Homophily in social networks , author=. Annual review of sociology , volume=. 2001 , publisher=
work page 2001
-
[3]
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application , author=. arXiv preprint arXiv:2009.10277 , year=
-
[4]
Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Zhang, Leixin and. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024). 2024. doi:10.18653/v1/2024.semeval-1.147
-
[5]
Unveiling Semantic Information in Sentence Embeddings
Zhang, Leixin and Burian, David and John, Vojt e ch and Bojar, Ond r ej. Unveiling Semantic Information in Sentence Embeddings. Proceedings of the Fifth International Workshop on Designing Meaning Representations @ LREC-COLING 2024. 2024
work page 2024
-
[6]
BERT : Pre-training of deep bidirectional transformers for language understanding
Devlin, Jacob and Chang, Ming-Wei and Lee, Kenton and Toutanova, Kristina. BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019. doi:10.18653/v...
-
[7]
Liu, Yinhan and Ott, Myle and Goyal, Naman and Du, Jingfei and Joshi, Mandar and Chen, Danqi and Levy, Omer and Lewis, Mike and Zettlemoyer, Luke and Stoyanov, Veselin , journal=
-
[8]
Proposal: From One-Fit-All to Perspective Aware Modeling , author=. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop) , pages=
-
[9]
Modular pluralism: P luralistic alignment via multi- LLM collaboration
Feng, Shangbin and Sorensen, Taylor and Liu, Yuhan and Fisher, Jillian and Park, Chan Young and Choi, Yejin and Tsvetkov, Yulia. Modular Pluralism: Pluralistic Alignment via Multi- LLM Collaboration. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. doi:10.18653/v1/2024.emnlp-main.240
-
[10]
Quantifying and Predicting Disagreement in Graded Human Ratings , author=. Proceedings of the Fifth Workshop on Perspectivist Approaches to NLP (NLPerspectives) at Language Resources and Evaluation Conference (LREC) 2026 , year=
work page 2026
-
[11]
M ulti PIC o: Multilingual Perspectivist Irony Corpus
Casola, Silvia and Frenda, Simona and Lo, Soda Marem and Sezerer, Erhan and Uva, Antonio and Basile, Valerio and Bosco, Cristina and Pedrani, Alessandro and Rubagotti, Chiara and Patti, Viviana and Bernardi, Davide. M ulti PIC o: Multilingual Perspectivist Irony Corpus. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistic...
-
[12]
Transposable element subfamily annotation has a reproducibility problem , author=. Mobile DNA , volume=. 2021 , publisher=
work page 2021
-
[13]
arXiv preprint arXiv:2109.13563 , year=
Agreeing to disagree: Annotating offensive language datasets with annotators' disagreement , author=. arXiv preprint arXiv:2109.13563 , year=
-
[14]
CEUR WORKSHOP PROCEEDINGS , volume=
Annotating hate speech: Three schemes at comparison , author=. CEUR WORKSHOP PROCEEDINGS , volume=. 2019 , organization=
work page 2019
-
[15]
Disaggreghate it corpus: A disaggregated italian dataset of hate speech , author=
-
[16]
European Conference on Information Retrieval , pages=
Overview of exist 2023: sexism identification in social networks , author=. European Conference on Information Retrieval , pages=. 2023 , organization=
work page 2023
-
[17]
Concatenated transformer models based on levels of agreements for sexism detection , author=. Working Notes of CLEF , year=
-
[18]
Is a picture of a bird a bird? A mixed-methods approach to understanding diverse human perspectives and ambiguity in machine vision models , author=. Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives)@ LREC-COLING 2024 , pages=
work page 2024
-
[19]
Proceedings of the 9th ACM Multimedia Systems Conference , pages=
Subdiv17: a dataset for investigating subjectivity in the visual diversification of image search results , author=. Proceedings of the 9th ACM Multimedia Systems Conference , pages=
-
[20]
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , volume=
Earth observation image semantic bias: A collaborative user annotation approach , author=. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , volume=. 2017 , publisher=
work page 2017
-
[21]
In Search of Basic Units of Spoken Language , pages=
Segmentation and analysis of the two English excerpts: The Brazilian team proposal , author=. In Search of Basic Units of Spoken Language , pages=. 2020 , publisher=
work page 2020
- [22]
-
[23]
Neural Computing and Applications , volume=
Automatic chord label personalization through deep learning of shared harmonic interval profiles , author=. Neural Computing and Applications , volume=. 2020 , publisher=
work page 2020
-
[24]
Journal of New Music Research , volume=
Annotator subjectivity in harmony annotations of popular music , author=. Journal of New Music Research , volume=. 2019 , publisher=
work page 2019
-
[25]
AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts , author=. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=
-
[26]
Overview of the ImageCLEFmed 2007 medical retrieval and medical annotation tasks , author=. Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers 8 , pages=. 2008 , organization=
work page 2007
-
[27]
Essentials of language documentation , volume=
Linguistic annotation , author=. Essentials of language documentation , volume=. 2006 , publisher=
work page 2006
-
[28]
Computational linguistics , volume=
Inter-coder agreement for computational linguistics , author=. Computational linguistics , volume=. 2008 , publisher=
work page 2008
-
[29]
Finding Patterns in Noisy Crowds: Regression-based Annotation Aggregation for Crowdsourced Data
Parde, Natalie and Nielsen, Rodney. Finding Patterns in Noisy Crowds: Regression-based Annotation Aggregation for Crowdsourced Data. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017. doi:10.18653/v1/D17-1204
-
[30]
Proceedings of The Web Conference 2020 , pages=
Modeling and aggregation of complex annotations via annotation distances , author=. Proceedings of The Web Conference 2020 , pages=
work page 2020
-
[31]
arXiv preprint arXiv:2412.02368 , year=
ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation? , author=. arXiv preprint arXiv:2412.02368 , year=
-
[32]
2009 IEEE conference on computer vision and pattern recognition , pages=
Imagenet: A large-scale hierarchical image database , author=. 2009 IEEE conference on computer vision and pattern recognition , pages=. 2009 , organization=
work page 2009
-
[33]
ACM Transactions on Management Information Systems (TMIS) , volume=
The state-of-the-art in Twitter sentiment analysis: A review and benchmark evaluation , author=. ACM Transactions on Management Information Systems (TMIS) , volume=. 2018 , publisher=
work page 2018
-
[34]
Procedia Computer Science , volume=
Spam email detection using deep learning techniques , author=. Procedia Computer Science , volume=. 2021 , publisher=
work page 2021
-
[35]
Solving Label Variation in Scientific Information Extraction via Multi-Task Learning
Pham, Dong and Ho, Xanh and Ha, Quang Thuy and Aizawa, Akiko. Solving Label Variation in Scientific Information Extraction via Multi-Task Learning. Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation. 2023
work page 2023
-
[36]
An overview of recent approaches to enable diversity in large language models through aligning with human perspectives , author=. In Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives)@ LREC-COLING 2024 , pages=. 2024 , organization=
work page 2024
-
[37]
arXiv preprint arXiv:2410.08820 , year=
Which Demographics do LLMs Default to During Annotation? , author=. arXiv preprint arXiv:2410.08820 , year=
-
[38]
EPIC: multi-perspective annotation of a corpus of irony , author=. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=
-
[39]
arXiv preprint arXiv:2502.13853 , year=
Fine-grained Fallacy Detection with Human Label Variation , author=. arXiv preprint arXiv:2502.13853 , year=
-
[40]
arXiv preprint arXiv:2305.13788 , year=
Can large language models capture dissenting human voices? , author=. arXiv preprint arXiv:2305.13788 , year=
-
[41]
Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language , pages=
More labels or cases? assessing label variation in natural language inference , author=. Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language , pages=
-
[42]
Transactions of the Association for Computational Linguistics , volume=
Collective human opinions in semantic textual similarity , author=. Transactions of the Association for Computational Linguistics , volume=. 2023 , publisher=
work page 2023
-
[43]
arXiv preprint arXiv:2403.04085 , year=
Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations , author=. arXiv preprint arXiv:2403.04085 , year=
-
[44]
SemEval-2023 task 11: Learning with disagreements (
Leonardelli, Elisa and Uma, Alexandra and Abercrombie, Gavin and Almanea, Dina and Basile, Valerio and Fornaciari, Tommaso and Plank, Barbara and Rieser, Verena and Poesio, Massimo , journal=. SemEval-2023 task 11: Learning with disagreements (
work page 2023
-
[45]
NLP ositionality: Characterizing Design Biases of Datasets and Models
Santy, Sebastin and Liang, Jenny and Le Bras, Ronan and Reinecke, Katharina and Sap, Maarten. NLP ositionality: Characterizing Design Biases of Datasets and Models. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023. doi:10.18653/v1/2023.acl-long.505
-
[46]
Transactions of the Association for Computational Linguistics , volume=
Bridging the gap: A survey on integrating (human) feedback for natural language generation , author=. Transactions of the Association for Computational Linguistics , volume=. 2023 , publisher=
work page 2023
-
[47]
L a MP : When Large Language Models Meet Personalization
Salemi, Alireza and Mysore, Sheshera and Bendersky, Michael and Zamani, Hamed. L a MP : When Large Language Models Meet Personalization. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024. doi:10.18653/v1/2024.acl-long.399
-
[48]
Plaza del Arco, Flor Miriam and Strapparava, Carlo and Urena Lopez, L. Alfonso and Martin, Maite. E mo E vent: A Multilingual Emotion Corpus based on different Events. Proceedings of the Twelfth Language Resources and Evaluation Conference. 2020
work page 2020
-
[49]
arXiv preprint arXiv:2301.10684 , year=
Consistency is key: Disentangling label variation in natural language processing with intra-annotator agreement , author=. arXiv preprint arXiv:2301.10684 , year=
-
[50]
Disagreement in Argumentation Annotation
Lindahl, Anna. Disagreement in Argumentation Annotation. Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives) @ LREC-COLING 2024. 2024
work page 2024
-
[51]
arXiv preprint arXiv:2306.06826 , year=
When do annotator demographics matter? measuring the influence of annotator demographics with the popquorn dataset , author=. arXiv preprint arXiv:2306.06826 , year=
-
[52]
Victoria and Herrera, Francisco
Rodr \'i guez-Barroso, Nuria and C \'a mara, Eugenio Mart \'i nez and Collados, Jose Camacho and Luz \'o n, M. Victoria and Herrera, Francisco. Federated Learning for Exploiting Annotators' Disagreements in Natural Language Processing. Transactions of the Association for Computational Linguistics. 2024. doi:10.1162/tacl_a_00664
-
[53]
Proceedings of the 1st workshop on benchmarking: past, present and future , pages=
We need to consider disagreement in evaluation , author=. Proceedings of the 1st workshop on benchmarking: past, present and future , pages=. 2021 , organization=
work page 2021
-
[54]
Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting
Beck, Tilman and Schuff, Hendrik and Lauscher, Anne and Gurevych, Iryna. Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting. Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers). 2024
work page 2024
-
[55]
Quantifying the persona effect in LLM simulations
Hu, Tiancheng and Collier, Nigel. Quantifying the Persona Effect in LLM Simulations. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024. doi:10.18653/v1/2024.acl-long.554
-
[56]
Proceedings of the ACM on Human-Computer Interaction , volume=
Is your toxicity my toxicity? exploring the impact of rater identity on toxicity annotation , author=. Proceedings of the ACM on Human-Computer Interaction , volume=. 2022 , publisher=
work page 2022
-
[57]
Hate speech detection: Challenges and solutions , author=. PloS one , volume=. 2019 , publisher=
work page 2019
-
[58]
Journal of Communication Inquiry , volume=
Towards a definition of hate speech—With a focus on online contexts , author=. Journal of Communication Inquiry , volume=. 2023 , publisher=
work page 2023
-
[59]
Reconsidering annotator disagreement about racist language: Noise or signal? , author=. Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media , pages=
-
[60]
D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation , author=. arXiv preprint arXiv:2404.10857 , year=
-
[61]
Annotating Twitter data from vulnerable populations: Evaluating disagreement between domain experts and graduate student annotators , author=. 2019 , publisher=
work page 2019
-
[62]
Leveraging Annotator Disagreement for Text Classification , author=. Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024) , pages=
work page 2024
-
[63]
Proceedings of the 15th ACM Web Science Conference 2023 , pages=
Understanding misogynoir: A study of annotators’ perspectives , author=. Proceedings of the 15th ACM Web Science Conference 2023 , pages=
work page 2023
-
[64]
1st Workshop on Perspectivist Approaches to NLP , pages=
Disagreement space in argument analysis , author=. 1st Workshop on Perspectivist Approaches to NLP , pages=. 2022 , organization=
work page 2022
-
[65]
Text Structure and Its Ambiguities: Corpus Annotation as a Helpful Guide , author=
-
[66]
Dialogue & Discourse , volume=
Examples and specifications that prove a point: Identifying elaborative and argumentative discourse relations , author=. Dialogue & Discourse , volume=
-
[67]
Exploiting ` Subjective ' Annotations
Reidsma, Dennis and op den Akker, Rieks. Exploiting ` Subjective ' Annotations. Coling 2008: Proceedings of the workshop on Human Judgements in Computational Linguistics. 2008
work page 2008
-
[68]
Language Resources and Evaluation , pages=
Perspectivist approaches to natural language processing: a survey , author=. Language Resources and Evaluation , pages=. 2024 , publisher=
work page 2024
-
[69]
The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
Fleisig, Eve and Blodgett, Su Lin and Klein, Dan and Talat, Zeerak. The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). 2024
work page 2024
-
[70]
Proceedings of LAW X: The 10th Linguistic Annotation Workshop , pages=
Supersense tagging with inter-annotator disagreement , author=. Proceedings of LAW X: The 10th Linguistic Annotation Workshop , pages=. 2016 , organization=
work page 2016
-
[71]
Annotate Chinese Aspect with UMR——a Case Study on the Liitle Prince , author=. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) , pages=
work page 2024
-
[72]
Language Resources and Evaluation , volume=
Multiplicity and word sense: evaluating and learning from multiply labeled word sense annotations , author=. Language Resources and Evaluation , volume=. 2012 , publisher=
work page 2012
-
[73]
Embracing ambiguity: A comparison of annotation methodologies for crowdsourcing word sense labels , author=. Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages=
work page 2013
-
[74]
arXiv preprint arXiv:2402.01423 , year=
Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations , author=. arXiv preprint arXiv:2402.01423 , year=
-
[75]
Heinisch, Philipp and Orlikowski, Matthias and Romberg, Julia and Cimiano, Philipp. Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It`s Best to Relate Perspectives!. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023. doi:10.18653/v1/2023.emnlp-main.687
-
[76]
Journal of Information Processing , volume=
Geographical entity annotated corpus of Japanese microblogs , author=. Journal of Information Processing , volume=. 2017 , publisher=
work page 2017
-
[77]
Proceedings of the AAAI Conference on Human Computation and Crowdsourcing , volume=
Capturing ambiguity in crowdsourcing frame disambiguation , author=. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing , volume=
-
[78]
The Importance of Modeling Social Factors of Language: Theory and Practice
Hovy, Dirk and Yang, Diyi. The Importance of Modeling Social Factors of Language: Theory and Practice. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2021. doi:10.18653/v1/2021.naacl-main.49
-
[79]
International Conference on Information , pages=
The origin and value of disagreement among data labelers: A case study of individual differences in hate speech annotation , author=. International Conference on Information , pages=. 2022 , organization=
work page 2022
-
[80]
Identifying and Measuring Annotator Bias Based on Annotators ' Demographic Characteristics
Al Kuwatly, Hala and Wich, Maximilian and Groh, Georg. Identifying and Measuring Annotator Bias Based on Annotators ' Demographic Characteristics. Proceedings of the Fourth Workshop on Online Abuse and Harms. 2020. doi:10.18653/v1/2020.alw-1.21
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.