Annotation Quality in Aspect-Based Sentiment Analysis: A Case Study Comparing Experts, Students, Crowdworkers, and Large Language Model

Christian Wolff; Jakob Fehle; Markus Weinberger; Niklas Donhauser; Nils Constantin Hellwig; Udo Kruschwitz

arxiv: 2605.03624 · v1 · submitted 2026-05-05 · 💻 cs.CL

Annotation Quality in Aspect-Based Sentiment Analysis: A Case Study Comparing Experts, Students, Crowdworkers, and Large Language Model

Niklas Donhauser , Jakob Fehle , Nils Constantin Hellwig , Markus Weinberger , Udo Kruschwitz , Christian Wolff This is my paper

Pith reviewed 2026-05-07 04:14 UTC · model grok-4.3

classification 💻 cs.CL

keywords aspect-based sentiment analysisGerman ABSAannotation qualityinter-annotator agreementlarge language model annotationcrowdworker annotationsdataset constructionunder-resourced NLP

0 comments

The pith

Different sources of annotation for German aspect-based sentiment analysis produce datasets of varying quality that affect how well state-of-the-art models perform on fine-grained sentiment tasks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines how the choice of who creates the labels influences the usefulness of datasets for aspect-based sentiment analysis in German. It re-annotates an existing dataset with experts to create a reference point and then measures how closely student, crowdworker, and large-language-model annotations match that reference, both in direct agreement and in the accuracy of models trained on each version. This comparison matters because high-quality labeled data remains scarce for languages other than English, so clearer information about reliability versus speed helps researchers decide how to allocate limited resources when building new datasets.

Core claim

Re-annotating an existing German ABSA dataset with experts creates a ground truth against which annotations from students, crowdworkers, and LLMs are compared using inter-annotator agreement and downstream performance on aspect category sentiment analysis and target aspect sentiment detection. State-of-the-art models based on BERT, T5, and LLaMA are trained and evaluated under both fine-tuning and in-context learning settings to quantify how annotation source changes task accuracy. The resulting measurements show clear differences in consistency and model effectiveness that point to concrete trade-offs between annotation reliability and the effort required to obtain it.

What carries the argument

Inter-annotator agreement metrics combined with controlled evaluation of model performance on ACSA and TASD subtasks after training on each annotation source.

If this is right

Models trained on expert annotations reach higher accuracy on German ACSA and TASD than models trained on the other sources.
Large language models can supply annotations at lower cost but introduce more variability that reduces final model performance.
Student and crowdworker annotations occupy an intermediate position in both agreement and downstream results.
Dataset builders for other under-resourced languages can use these measured trade-offs to choose annotation methods that match available time and budget.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Hybrid workflows that let LLMs generate initial labels and then route uncertain cases to experts could reduce total cost while preserving most of the quality gain.
The same source-comparison design could be applied to other sequence-labeling tasks such as named-entity recognition or opinion-target extraction in low-resource settings.
Performance differences observed on held-out test data may shrink or grow when the models are tested on entirely new German text from different domains.

Load-bearing premise

That the expert re-annotations form an unbiased and stable reference that fairly represents the correct labels for the texts.

What would settle it

A second independent group of experts re-annotating the same texts and producing labels that agree more with student or LLM annotations than with the first expert set would undermine the chosen ground truth.

Figures

Figures reproduced from arXiv: 2605.03624 by Christian Wolff, Jakob Fehle, Markus Weinberger, Niklas Donhauser, Nils Constantin Hellwig, Udo Kruschwitz.

**Figure 1.** Figure 1: Example annotations of the same text by four groups (crowdworkers, students, LLMs, and view at source ↗

**Figure 2.** Figure 2: Distribution of aspect categories across the five datasets for the TASD and ACSA tasks, reported view at source ↗

**Figure 3.** Figure 3: Label interfaces used for the ACSA and TASD annotation tasks. While not shown in the view at source ↗

read the original abstract

Aspect-Based Sentiment Analysis (ABSA) enables fine-grained opinion analysis by identifying sentiments toward specific aspects or targets within a text. While ABSA has been widely studied for English, research on other languages such as German remains limited, largely due to the lack of high-quality annotated datasets. This paper examines how different annotation sources influence the development of German ABSA. To this end, an existing dataset is re-annotated by experts to establish a ground truth, which serves as a reference for evaluating annotations produced by students, crowdworkers, Large Language Models (LLMs), and experts. Annotation quality is compared using Inter-Annotator Agreement (IAA) and its impact on downstream model performance for different ABSA subtasks. The evaluation focuses on Aspect Category Sentiment Analysis (ACSA) and Target Aspect Sentiment Detection (TASD). We apply State-of-the-Art (SOTA) methods for ABSA, including BERT-, T5-, and LLaMA-based approaches to assess performance differences, spanning fine-tuning and in-context learning with instruction prompts. The findings provide practical insights into trade-offs between annotation reliability and efficiency, offering guidance for dataset construction in under-resourced Natural Language Processing (NLP) scenarios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper compares four annotation sources for German ABSA with both IAA and downstream model performance, but its claims rest on treating expert re-annotations as reliable ground truth without shown checks.

read the letter

This paper compares annotation quality from experts, students, crowdworkers, and LLMs on German ABSA data. It re-annotates an existing dataset with experts as the reference, then measures inter-annotator agreement and how the resulting labels affect model performance on ACSA and TASD using BERT, T5, and LLaMA approaches with both fine-tuning and in-context learning. The main takeaway is practical guidance on reliability versus efficiency trade-offs for building datasets in low-resource languages like German.

Referee Report

1 major / 1 minor

Summary. The paper claims that re-annotating an existing German ABSA dataset with experts establishes a reliable ground truth against which student, crowdworker, and LLM annotations can be compared via IAA and downstream performance on ACSA and TASD subtasks; SOTA models (BERT, T5, LLaMA) in fine-tuning and in-context learning settings then reveal practical reliability-efficiency trade-offs for dataset construction in under-resourced NLP.

Significance. If the expert ground truth is validated, the work supplies concrete guidance for low-resource ABSA dataset creation by quantifying when cheaper sources (LLMs, crowdworkers) can substitute for experts without substantial downstream loss. The dual IAA-plus-task-performance evaluation is a strength that could inform annotation protocols beyond German ABSA.

major comments (1)

[Abstract and §3] Abstract and §3 (Annotation and Evaluation Setup): The central comparisons rest on expert re-annotations as the reference standard, yet no IAA among experts, disagreement-resolution protocol, or comparison to the original dataset labels is reported. Without these metrics the observed gaps for students, crowdworkers, and LLMs cannot be confidently attributed to quality rather than reference bias or task-interpretation differences; this assumption is load-bearing for all trade-off claims.

minor comments (1)

[Abstract] The abstract mentions 'SOTA methods' and 'instruction prompts' without naming the exact model variants, prompt templates, or hyper-parameters used in the in-context learning experiments; this reduces reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive feedback, which highlights an important aspect of our methodology. We address the major comment below and will incorporate revisions to strengthen the validation of our expert ground truth.

read point-by-point responses

Referee: [Abstract and §3] Abstract and §3 (Annotation and Evaluation Setup): The central comparisons rest on expert re-annotations as the reference standard, yet no IAA among experts, disagreement-resolution protocol, or comparison to the original dataset labels is reported. Without these metrics the observed gaps for students, crowdworkers, and LLMs cannot be confidently attributed to quality rather than reference bias or task-interpretation differences; this assumption is load-bearing for all trade-off claims.

Authors: We agree that these details are necessary to rigorously establish the expert re-annotations as a reliable reference standard and to rule out reference bias. In the revised manuscript, we will expand §3 to report inter-annotator agreement among the expert annotators (using metrics such as Fleiss' kappa for multi-label aspects and sentiments), describe the disagreement-resolution protocol (e.g., discussion rounds leading to consensus), and include a direct comparison of the expert re-annotations against the original dataset labels, quantifying differences in aspect coverage and sentiment assignments. These additions will allow readers to assess whether performance gaps for students, crowdworkers, and LLMs stem from annotation quality rather than inconsistencies in the ground truth, thereby reinforcing the validity of our reliability-efficiency trade-off claims for ACSA and TASD. revision: yes

Circularity Check

0 steps flagged

No significant circularity: empirical comparison anchored to expert reference without self-referential derivations

full rationale

The paper presents a purely empirical study that re-annotates an existing German ABSA dataset with experts to create a reference, then measures IAA and downstream ACSA/TASD model performance for student, crowdworker, and LLM annotations against that reference. No equations, fitted parameters, or predictive derivations appear in the described method; the evaluation pipeline does not reduce any output to its inputs by construction. Standard IAA metrics and fine-tuned/in-context model runs constitute independent benchmarks rather than tautological restatements. No load-bearing self-citations, uniqueness theorems, or ansatzes are invoked to justify the core claims. The design is self-contained against the external expert annotations as reference, which is a conventional methodological choice in annotation-quality research and does not create circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that expert annotations can serve as ground truth; no free parameters or invented entities are introduced in the abstract description.

axioms (1)

domain assumption Expert annotations of the existing dataset provide a reliable ground truth for ABSA.
Used as the reference standard for evaluating all other annotation sources.

pith-pipeline@v0.9.0 · 5536 in / 1260 out tokens · 63819 ms · 2026-05-07T04:14:24.832361+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

300 extracted references · 300 canonical work pages · 2 internal anchors

[1]

Toufique and Al Omar, Abdullah and Bhuiyan, Hanif , month = jun, year =

Ara, Jinat and Hasan, Md. Toufique and Al Omar, Abdullah and Bhuiyan, Hanif , month = jun, year =. Understanding. 2020. doi:10.1109/TENSYMP50017.2020.9230712 , abstract =

work page doi:10.1109/tensymp50017.2020.9230712 2020
[2]

Bai, Yinhao and Han, Zhixin and Zhao, Yuhua and Gao, Hang and Zhang, Zhuowei and Wang, Xunzhi and Hu, Mengting , editor =. Is. Findings of the. 2024 , pages =. doi:10.18653/v1/2024.findings-emnlp.460 , abstract =

work page doi:10.18653/v1/2024.findings-emnlp.460 2024
[3]

Overview of the

Basile, Pierpaolo and Croce, Danilo and Basile, Valerio and Polignano, Marco , year =. Overview of the

work page
[4]

Bhoi, Amlaan and Joshi, Sandeep , month = may, year =. Various. doi:10.48550/arXiv.1805.01984 , abstract =

work page doi:10.48550/arxiv.1805.01984
[5]

ACM Comput

A. ACM Comput. Surv. , author =. 2022 , pages =. doi:10.1145/3503044 , abstract =

work page doi:10.1145/3503044 2022
[6]

Brun, Caroline and Nikoulina, Vassilina , editor =. Aspect. Proceedings of the 9th. 2018 , pages =. doi:10.18653/v1/W18-6217 , abstract =

work page doi:10.18653/v1/w18-6217 2018
[7]

Bu, Jiahao and Ren, Lei and Zheng, Shuang and Yang, Yang and Wang, Jingang and Zhang, Fuzheng and Wu, Wei , booktitle=

work page
[8]

Cai, Hongjie and Xia, Rui and Yu, Jianfei , editor =. Aspect-. Proceedings of the 59th. 2021 , pages =. doi:10.18653/v1/2021.acl-long.29 , abstract =

work page doi:10.18653/v1/2021.acl-long.29 2021
[9]

Computer Science Review , author =

Aspect based sentiment analysis using deep learning approaches:. Computer Science Review , author =. 2023 , keywords =. doi:10.1016/j.cosrev.2023.100576 , abstract =

work page doi:10.1016/j.cosrev.2023.100576 2023
[10]

Proceedings of the 13th

Chebolu, Siva Uday Sampreeth and Dernoncourt, Franck and Lipka, Nedim and Solorio, Thamar , year =. Proceedings of the 13th

work page
[11]

Aspect-level

Cheng, Jiajun and Zhao, Shenglin and Zhang, Jiani and King, Irwin and Zhang, Xin and Wang, Hui , month = nov, year =. Aspect-level. Proceedings of the 2017. doi:10.1145/3132847.3133037 , abstract =

work page doi:10.1145/3132847.3133037 2017
[12]

Clematide, Simon and Gindl, Stefan and Klenner, Manfred and Petrakis, Stefanos and Remus, Robert and Ruppenhofer, Josef and Waltinger, Ulli and Wiegand, Michael , booktitle =

work page
[13]

A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20(1):37–46, 1960

A. Educational and Psychological Measurement , author =. 1960 , pages =. doi:10.1177/001316446002000104 , language =

work page doi:10.1177/001316446002000104 1960
[14]

Colucci Cante, Luigi and D’Angelo, Salvatore and Di Martino, Beniamino and Graziano, Mariangela , editor =. Text. Complex,. 2024 , pages =. doi:10.1007/978-3-031-70011-8_33 , abstract =

work page doi:10.1007/978-3-031-70011-8_33 2024
[15]

Companion

de França Costa, Dayan and da Silva, Nadia Felix Felipe , month = apr, year =. Companion. doi:10.1145/3184558.3191828 , abstract =

work page doi:10.1145/3184558.3191828
[16]

De Mattei, Lorenzo and De Martino, Graziella and Iovine, Andrea and Miaschi, Alessio and Polignano, Marco and Rambelli, Giulia , year =

work page
[17]

, month = feb, year =

Ding, Xiaowen and Liu, Bing and Yu, Philip S. , month = feb, year =. A holistic lexicon-based approach to opinion mining , isbn =. Proceedings of the 2008. doi:10.1145/1341531.1341561 , abstract =

work page doi:10.1145/1341531.1341561 2008
[18]

Adaptive recursive neural network for target-dependent twitter sentiment classification , url =

Dong, Li and Wei, Furu and Tan, Chuanqi and Tang, Duyu and Zhou, Ming and Xu, Ke , year =. Adaptive recursive neural network for target-dependent twitter sentiment classification , url =. Proceedings of the 52nd annual meeting of the association for computational linguistics (volume 2:

work page
[19]

Target-oriented

Fan, Zhifang and Wu, Zhen and Dai, Xin-Yu and Huang, Shujian and Chen, Jiajun , editor =. Target-oriented. Proceedings of the 2019. 2019 , pages =. doi:10.18653/v1/N19-1259 , abstract =

work page doi:10.18653/v1/n19-1259 2019
[20]

Fehle, Jakob and Donhauser, Niklas and Kruschwitz, Udo and Hellwig, Nils Constantin and Wolff, Christian , year =. German. 21st

work page
[21]

Fehle, Jakob and Münster, Leonie and Schmidt, Thomas and Wolff, Christian , year =. Aspect-. Proceedings of the 19th conference on natural language processing (konvens 2023) , pages=

work page 2023
[22]

2012 , publisher =

Discovering Statistics Using R , author =. 2012 , publisher =

work page 2012
[23]

Fisher, R. A. , editor =. Statistical. Breakthroughs in. 1992 , doi =

work page 1992
[24]

, volume =

Measuring nominal scale agreement among many raters. , volume =. Psychological bulletin , author =. 1971 , note =

work page 1971
[25]

Journal of the American Statistical Association , author =

The. Journal of the American Statistical Association , author =. 1937 , pages =. doi:10.1080/01621459.1937.10503522 , language =

work page doi:10.1080/01621459.1937.10503522 1937
[26]

Gabryszak, Aleksandra and Thomas, Philippe , year =. Mob. Proceedings of the

work page
[27]

M v P : Multi-view Prompting Improves Aspect Sentiment Tuple Prediction

Gou, Zhibin and Guo, Qingyan and Yang, Yujiu , editor =. Proceedings of the 61st. 2023 , pages =. doi:10.18653/v1/2023.acl-long.240 , abstract =

work page doi:10.18653/v1/2023.acl-long.240 2023
[28]

Metrics for multi-class classifi- cation: an overview,

Grandini, Margherita and Bagli, Enrico and Visani, Giorgio , month = aug, year =. Metrics for. doi:10.48550/arXiv.2008.05756 , abstract =

work page doi:10.48550/arxiv.2008.05756 2008
[29]

Computational Linguistics in the Netherlands Journal , author =

Aspect-based. Computational Linguistics in the Netherlands Journal , author =. 2021 , pages =

work page 2021
[30]

Hamborg, Felix and Donnay, Karsten and Merlo, Paola , year =

work page
[31]

Hellwig, Nils Constantin and Fehle, Jakob and Bink, Markus and Wolff, Christian , booktitle=

work page
[32]

1979 , note =

Scandinavian journal of statistics , author =. 1979 , note =

work page 1979
[33]

Mining and summarizing customer reviews , isbn =

Hu, Minqing and Liu, Bing , month = aug, year =. Mining and summarizing customer reviews , isbn =. Proceedings of the tenth. doi:10.1145/1014052.1014073 , urldate =

work page doi:10.1145/1014052.1014073
[34]

Artificial Intelligence Review , author =

A systematic review of aspect-based sentiment analysis: domains, methods, and trends , volume =. Artificial Intelligence Review , author =. 2024 , keywords =. doi:10.1007/s10462-024-10906-z , abstract =

work page doi:10.1007/s10462-024-10906-z 2024
[35]

Jiang, Qingnan and Chen, Lei and Xu, Ruifeng and Ao, Xiang and Yang, Min , editor =. A. Proceedings of the 2019. 2019 , pages =. doi:10.18653/v1/D19-1654 , abstract =

work page doi:10.18653/v1/d19-1654 2019
[36]

Jun, Yonghyun and Lee, Hwanhee , editor =. Dynamic. Proceedings of the 63rd. 2025 , pages =. doi:10.18653/v1/2025.acl-short.48 , abstract =

work page doi:10.18653/v1/2025.acl-short.48 2025
[37]

and Eckert, Miriam and Clark, Lyndsie and Nicolov, Nicolas , year =

Kessler, Jason S. and Eckert, Miriam and Clark, Lyndsie and Nicolov, Nicolas , year =. The. Proceedings of the 4th

work page
[38]

Klie, Jan-Christoph and Bugert, Michael and Boullosa, Beto and Eckart de Castilho, Richard and Gurevych, Iryna , editor =. The. Proceedings of the 27th. 2018 , pages =

work page 2018
[39]

2024 , pages =

Computational Linguistics , author =. 2024 , pages =

work page 2024
[40]

Computing

Krippendorff, Klaus , year =. Computing

work page
[41]

Overview of the

Lee, Lung-Hao and Yu, Liang-Chih and Wang, Suge and Liao, Jian , editor =. Overview of the. Proceedings of the 10th. 2024 , pages =

work page 2024
[42]

Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019) , year=

Exploiting BERT for End-to-End Aspect-based Sentiment Analysis , author=. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019) , year=

work page 2019
[43]

Mathematics , abstract =

A more fine-grained aspect--sentiment--opinion triplet extraction task , author=. Mathematics , abstract =. 2023 , publisher=

work page 2023
[44]

2022 , url =

Bing Liu , address =. 2022 , url =

work page 2022
[45]

Automated rule selection for aspect extraction in opinion mining , url =

Liu, Qian and Gao, Zhiqiang and Liu, Bing and Zhang, Yuanlin , year =. Automated rule selection for aspect extraction in opinion mining , url =. Twenty-

work page
[46]

Efficient Hybrid Generation Framework for Aspect-Based Sentiment Analysis

Lv, Haoran and Liu, Junyi and Wang, Henan and Wang, Yaoming and Luo, Jixiang and Liu, Yaxiao , editor =. Efficient. Proceedings of the 17th. 2023 , pages =. doi:10.18653/v1/2023.eacl-main.71 , urldate =

work page doi:10.18653/v1/2023.eacl-main.71 2023
[47]

Minaee, Shervin and Mikolov, Tomas and Nikzad, Narjes and Chenaghlu, Meysam and Socher, Richard and Amatriain, Xavier and Gao, Jianfeng , month = feb, year =. Large. doi:10.48550/arXiv.2402.06196 , abstract =

work page internal anchor Pith review doi:10.48550/arxiv.2402.06196
[48]

Human-in-the-

Monarch, Robert Munro , year=. Human-in-the-

work page
[49]

AIP Conference Proceedings , author =

Aspect-based sentiment analysis to review products using. AIP Conference Proceedings , author =. 2017 , pages =. doi:10.1063/1.4994463 , abstract =

work page doi:10.1063/1.4994463 2017
[50]

Comparative Analysis of Deep Natural Networks and Large Language Models for Aspect-Based Sentiment Analysis , year=

Mughal, Nimra and Mujtaba, Ghulam and Shaikh, Sarang and Kumar, Aveenash and Daudpota, Sher Muhammad , journal=. Comparative Analysis of Deep Natural Networks and Large Language Models for Aspect-Based Sentiment Analysis , year=

work page
[51]

New Media & Society , author =

The social construction of datasets:. New Media & Society , author =. 2024 , note =. doi:10.1177/14614448241251797 , abstract =

work page doi:10.1177/14614448241251797 2024
[52]

Proceedings of the AAAI Conference on Artificial Intelligence , author =

Knowing. Proceedings of the AAAI Conference on Artificial Intelligence , author =. 2020 , note =. doi:10.1609/aaai.v34i05.6383 , abstract =

work page doi:10.1609/aaai.v34i05.6383 2020
[53]

International Journal of Approximate Reasoning , author =

Exploiting multiple word embeddings and one-hot character vectors for aspect-based sentiment analysis , volume =. International Journal of Approximate Reasoning , author =. 2018 , keywords =. doi:10.1016/j.ijar.2018.08.003 , abstract =

work page doi:10.1016/j.ijar.2018.08.003 2018
[54]

International workshop on semantic evaluation , author =

Semeval-2016 task 5:. International workshop on semantic evaluation , author =. 2016 , pages =

work page 2016
[55]

S em E val-2015 Task 12: Aspect Based Sentiment Analysis

Pontiki, Maria and Galanis, Dimitris and Papageorgiou, Haris and Manandhar, Suresh and Androutsopoulos, Ion , editor =. Proceedings of the 9th. 2015 , pages =. doi:10.18653/v1/S15-2082 , urldate =

work page doi:10.18653/v1/s15-2082 2015
[56]

S em E val-2014 Task 4: Aspect Based Sentiment Analysis

Pontiki, Maria and Galanis, Dimitris and Pavlopoulos, John and Papageorgiou, Harris and Androutsopoulos, Ion and Manandhar, Suresh , editor =. Proceedings of the 8th. 2014 , pages =. doi:10.3115/v1/S14-2004 , urldate =

work page doi:10.3115/v1/s14-2004 2014
[57]

Data , author =

Datasets for. Data , author =. 2018 , note =. doi:10.3390/data3020015 , language =

work page doi:10.3390/data3020015 2018
[58]

Regatte, Yashwanth Reddy and Gangula, Rama Rohit Reddy and Mamidi, Radhika , editor =. Dataset. Proceedings of the. 2020 , pages =

work page 2020
[59]

Sadia, Azeema and Khan, Fariha and Bashir, Fatima , year=. An. 2018 3rd International electrical engineering conference (IEEC 2018) , pages=

work page 2018
[60]

Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers , abstract =

SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods , author=. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers , abstract =

work page 2016
[61]

Public opinion quarterly , author =

Reliability of content analysis:. Public opinion quarterly , author =. 1955 , note =

work page 1955
[62]

Biometrika , author =

An analysis of variance test for normality (complete samples) , volume =. Biometrika , author =. 1965 , note =

work page 1965
[63]

Proceedings of the

Sidarenka, Uladzimir , editor =. Proceedings of the. 2016 , pages =

work page 2016
[64]

Simmering and Paavo Huoviala , title =

Large language models for aspect-based sentiment analysis , url =. arXiv preprint arXiv:2310.18025 , author =. 2023 , keywords =. doi:10.48550/arXiv.2310.18025 , abstract =

work page doi:10.48550/arxiv.2310.18025 2023
[65]

Exploring

Singhi, Vishal and Chauhan, Charulata and Soni, Piyush Kumar , month = apr, year =. Exploring. 2024. doi:10.1109/I2CT61223.2024.10543612 , abstract =

work page doi:10.1109/i2ct61223.2024.10543612 2024
[66]

Proceedings of the 5th workshop on computational approaches to subjectivity, sentiment and social media analysis , author =

Aspect-level sentiment analysis in czech , url =. Proceedings of the 5th workshop on computational approaches to subjectivity, sentiment and social media analysis , author =. 2014 , pages =

work page 2014
[67]

Stenetorp, Pontus and Pyysalo, Sampo and Topić, Goran and Ohta, Tomoko and Ananiadou, Sophia and Tsujii, Jun'ichi , editor =. brat: a. Proceedings of the. 2012 , pages =

work page 2012
[68]

Biometrika , author =

The probable error of a mean , url =. Biometrika , author =. 1908 , note =

work page 1908
[69]

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) , pages=

Sänger, Mario and Kemmerer, Steffen and Adolphs, Peter and Klinger, Roman and Leser, Ulf , year =. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) , pages=

work page
[70]

Proceedings of the 10th

Tong, Zeliang and Wei, Wei , editor =. Proceedings of the 10th. 2024 , pages =

work page 2024
[71]

Attention is

Vaswani, Ashish and Shazeer, Noam and Parmar, Niki and Uszkoreit, Jakob and Jones, Llion and Gomez, Aidan N and Kaiser, Ł ukasz and Polosukhin, Illia , year =. Attention is. Advances in

work page
[72]

, year =

Wan, Hai and Yang, Yufei and Du, Jianfeng and Liu, Yanan and Qi, Kunxun and Pan, Jeff Z. , year =. Target-aspect-sentiment joint detection for aspect-based sentiment analysis , volume =. Proceedings of the

work page
[73]

Latent aspect rating analysis without aspect keyword supervision , isbn =

Wang, Hongning and Lu, Yue and Zhai, ChengXiang , month = aug, year =. Latent aspect rating analysis without aspect keyword supervision , isbn =. Proceedings of the 17th. doi:10.1145/2020408.2020505 , abstract =

work page doi:10.1145/2020408.2020505
[74]

Wang, Zengzhi and Xie, Qiming and Xia, Rui , month = jul, year =. A. Proceedings of the 46th. doi:10.1145/3539618.3591940 , abstract =

work page doi:10.1145/3539618.3591940
[75]

Applied Soft Computing , author =

A survey on aspect base sentiment analysis methods and challenges , volume =. Applied Soft Computing , author =. 2024 , keywords =. doi:10.1016/j.asoc.2024.112249 , abstract =

work page doi:10.1016/j.asoc.2024.112249 2024
[76]

Workshop Proceedings of the 12th Edition of the KONVENS Conference, Hildesheim, Germany, October 8-10, 2014 , pages=

Saarland University’s participation in the German sentiment analysis shared task (GESTALT) , author=. Workshop Proceedings of the 12th Edition of the KONVENS Conference, Hildesheim, Germany, October 8-10, 2014 , pages=. 2014 , organization=

work page 2014
[77]

Individual

Wilcoxon, Frank , editor =. Individual. Breakthroughs in. 1992 , doi =

work page 1992
[78]

Evaluation of an algorithm for aspect-based opinion mining using a lexicon-based approach , isbn =

Wogenstein, Florian and Drescher, Johannes and Reinel, Dirk and Rill, Sven and Scheidt, Jörg , month = aug, year =. Evaluation of an algorithm for aspect-based opinion mining using a lexicon-based approach , isbn =. Proceedings of the. doi:10.1145/2502069.2502074 , abstract =

work page doi:10.1145/2502069.2502074
[79]

Wu, ChengYan and Ma, Bolei and Liu, Yihong and Zhang, Zheyu and Deng, Ningyuan and Li, Yanshu and Chen, Baolan and Zhang, Yi and Xue, Yun and Plank, Barbara , editor =. M-. Proceedings of the 2025. 2025 , pages =. doi:10.18653/v1/2025.emnlp-main.128 , abstract =

work page doi:10.18653/v1/2025.emnlp-main.128 2025
[80]

Proceedings of the 10th

Xu, Hongling and Zhang, Delong and Zhang, Yice and Xu, Ruifeng , editor =. Proceedings of the 10th. 2024 , pages =

work page 2024

Showing first 80 references.

[1] [1]

Toufique and Al Omar, Abdullah and Bhuiyan, Hanif , month = jun, year =

Ara, Jinat and Hasan, Md. Toufique and Al Omar, Abdullah and Bhuiyan, Hanif , month = jun, year =. Understanding. 2020. doi:10.1109/TENSYMP50017.2020.9230712 , abstract =

work page doi:10.1109/tensymp50017.2020.9230712 2020

[2] [2]

Bai, Yinhao and Han, Zhixin and Zhao, Yuhua and Gao, Hang and Zhang, Zhuowei and Wang, Xunzhi and Hu, Mengting , editor =. Is. Findings of the. 2024 , pages =. doi:10.18653/v1/2024.findings-emnlp.460 , abstract =

work page doi:10.18653/v1/2024.findings-emnlp.460 2024

[3] [3]

Overview of the

Basile, Pierpaolo and Croce, Danilo and Basile, Valerio and Polignano, Marco , year =. Overview of the

work page

[4] [4]

Bhoi, Amlaan and Joshi, Sandeep , month = may, year =. Various. doi:10.48550/arXiv.1805.01984 , abstract =

work page doi:10.48550/arxiv.1805.01984

[5] [5]

ACM Comput

A. ACM Comput. Surv. , author =. 2022 , pages =. doi:10.1145/3503044 , abstract =

work page doi:10.1145/3503044 2022

[6] [6]

Brun, Caroline and Nikoulina, Vassilina , editor =. Aspect. Proceedings of the 9th. 2018 , pages =. doi:10.18653/v1/W18-6217 , abstract =

work page doi:10.18653/v1/w18-6217 2018

[7] [7]

Bu, Jiahao and Ren, Lei and Zheng, Shuang and Yang, Yang and Wang, Jingang and Zhang, Fuzheng and Wu, Wei , booktitle=

work page

[8] [8]

Cai, Hongjie and Xia, Rui and Yu, Jianfei , editor =. Aspect-. Proceedings of the 59th. 2021 , pages =. doi:10.18653/v1/2021.acl-long.29 , abstract =

work page doi:10.18653/v1/2021.acl-long.29 2021

[9] [9]

Computer Science Review , author =

Aspect based sentiment analysis using deep learning approaches:. Computer Science Review , author =. 2023 , keywords =. doi:10.1016/j.cosrev.2023.100576 , abstract =

work page doi:10.1016/j.cosrev.2023.100576 2023

[10] [10]

Proceedings of the 13th

Chebolu, Siva Uday Sampreeth and Dernoncourt, Franck and Lipka, Nedim and Solorio, Thamar , year =. Proceedings of the 13th

work page

[11] [11]

Aspect-level

Cheng, Jiajun and Zhao, Shenglin and Zhang, Jiani and King, Irwin and Zhang, Xin and Wang, Hui , month = nov, year =. Aspect-level. Proceedings of the 2017. doi:10.1145/3132847.3133037 , abstract =

work page doi:10.1145/3132847.3133037 2017

[12] [12]

Clematide, Simon and Gindl, Stefan and Klenner, Manfred and Petrakis, Stefanos and Remus, Robert and Ruppenhofer, Josef and Waltinger, Ulli and Wiegand, Michael , booktitle =

work page

[13] [13]

A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20(1):37–46, 1960

A. Educational and Psychological Measurement , author =. 1960 , pages =. doi:10.1177/001316446002000104 , language =

work page doi:10.1177/001316446002000104 1960

[14] [14]

Colucci Cante, Luigi and D’Angelo, Salvatore and Di Martino, Beniamino and Graziano, Mariangela , editor =. Text. Complex,. 2024 , pages =. doi:10.1007/978-3-031-70011-8_33 , abstract =

work page doi:10.1007/978-3-031-70011-8_33 2024

[15] [15]

Companion

de França Costa, Dayan and da Silva, Nadia Felix Felipe , month = apr, year =. Companion. doi:10.1145/3184558.3191828 , abstract =

work page doi:10.1145/3184558.3191828

[16] [16]

De Mattei, Lorenzo and De Martino, Graziella and Iovine, Andrea and Miaschi, Alessio and Polignano, Marco and Rambelli, Giulia , year =

work page

[17] [17]

, month = feb, year =

Ding, Xiaowen and Liu, Bing and Yu, Philip S. , month = feb, year =. A holistic lexicon-based approach to opinion mining , isbn =. Proceedings of the 2008. doi:10.1145/1341531.1341561 , abstract =

work page doi:10.1145/1341531.1341561 2008

[18] [18]

Adaptive recursive neural network for target-dependent twitter sentiment classification , url =

Dong, Li and Wei, Furu and Tan, Chuanqi and Tang, Duyu and Zhou, Ming and Xu, Ke , year =. Adaptive recursive neural network for target-dependent twitter sentiment classification , url =. Proceedings of the 52nd annual meeting of the association for computational linguistics (volume 2:

work page

[19] [19]

Target-oriented

Fan, Zhifang and Wu, Zhen and Dai, Xin-Yu and Huang, Shujian and Chen, Jiajun , editor =. Target-oriented. Proceedings of the 2019. 2019 , pages =. doi:10.18653/v1/N19-1259 , abstract =

work page doi:10.18653/v1/n19-1259 2019

[20] [20]

Fehle, Jakob and Donhauser, Niklas and Kruschwitz, Udo and Hellwig, Nils Constantin and Wolff, Christian , year =. German. 21st

work page

[21] [21]

Fehle, Jakob and Münster, Leonie and Schmidt, Thomas and Wolff, Christian , year =. Aspect-. Proceedings of the 19th conference on natural language processing (konvens 2023) , pages=

work page 2023

[22] [22]

2012 , publisher =

Discovering Statistics Using R , author =. 2012 , publisher =

work page 2012

[23] [23]

Fisher, R. A. , editor =. Statistical. Breakthroughs in. 1992 , doi =

work page 1992

[24] [24]

, volume =

Measuring nominal scale agreement among many raters. , volume =. Psychological bulletin , author =. 1971 , note =

work page 1971

[25] [25]

Journal of the American Statistical Association , author =

The. Journal of the American Statistical Association , author =. 1937 , pages =. doi:10.1080/01621459.1937.10503522 , language =

work page doi:10.1080/01621459.1937.10503522 1937

[26] [26]

Gabryszak, Aleksandra and Thomas, Philippe , year =. Mob. Proceedings of the

work page

[27] [27]

M v P : Multi-view Prompting Improves Aspect Sentiment Tuple Prediction

Gou, Zhibin and Guo, Qingyan and Yang, Yujiu , editor =. Proceedings of the 61st. 2023 , pages =. doi:10.18653/v1/2023.acl-long.240 , abstract =

work page doi:10.18653/v1/2023.acl-long.240 2023

[28] [28]

Metrics for multi-class classifi- cation: an overview,

Grandini, Margherita and Bagli, Enrico and Visani, Giorgio , month = aug, year =. Metrics for. doi:10.48550/arXiv.2008.05756 , abstract =

work page doi:10.48550/arxiv.2008.05756 2008

[29] [29]

Computational Linguistics in the Netherlands Journal , author =

Aspect-based. Computational Linguistics in the Netherlands Journal , author =. 2021 , pages =

work page 2021

[30] [30]

Hamborg, Felix and Donnay, Karsten and Merlo, Paola , year =

work page

[31] [31]

Hellwig, Nils Constantin and Fehle, Jakob and Bink, Markus and Wolff, Christian , booktitle=

work page

[32] [32]

1979 , note =

Scandinavian journal of statistics , author =. 1979 , note =

work page 1979

[33] [33]

Mining and summarizing customer reviews , isbn =

Hu, Minqing and Liu, Bing , month = aug, year =. Mining and summarizing customer reviews , isbn =. Proceedings of the tenth. doi:10.1145/1014052.1014073 , urldate =

work page doi:10.1145/1014052.1014073

[34] [34]

Artificial Intelligence Review , author =

A systematic review of aspect-based sentiment analysis: domains, methods, and trends , volume =. Artificial Intelligence Review , author =. 2024 , keywords =. doi:10.1007/s10462-024-10906-z , abstract =

work page doi:10.1007/s10462-024-10906-z 2024

[35] [35]

Jiang, Qingnan and Chen, Lei and Xu, Ruifeng and Ao, Xiang and Yang, Min , editor =. A. Proceedings of the 2019. 2019 , pages =. doi:10.18653/v1/D19-1654 , abstract =

work page doi:10.18653/v1/d19-1654 2019

[36] [36]

Jun, Yonghyun and Lee, Hwanhee , editor =. Dynamic. Proceedings of the 63rd. 2025 , pages =. doi:10.18653/v1/2025.acl-short.48 , abstract =

work page doi:10.18653/v1/2025.acl-short.48 2025

[37] [37]

and Eckert, Miriam and Clark, Lyndsie and Nicolov, Nicolas , year =

Kessler, Jason S. and Eckert, Miriam and Clark, Lyndsie and Nicolov, Nicolas , year =. The. Proceedings of the 4th

work page

[38] [38]

Klie, Jan-Christoph and Bugert, Michael and Boullosa, Beto and Eckart de Castilho, Richard and Gurevych, Iryna , editor =. The. Proceedings of the 27th. 2018 , pages =

work page 2018

[39] [39]

2024 , pages =

Computational Linguistics , author =. 2024 , pages =

work page 2024

[40] [40]

Computing

Krippendorff, Klaus , year =. Computing

work page

[41] [41]

Overview of the

Lee, Lung-Hao and Yu, Liang-Chih and Wang, Suge and Liao, Jian , editor =. Overview of the. Proceedings of the 10th. 2024 , pages =

work page 2024

[42] [42]

Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019) , year=

Exploiting BERT for End-to-End Aspect-based Sentiment Analysis , author=. Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019) , year=

work page 2019

[43] [43]

Mathematics , abstract =

A more fine-grained aspect--sentiment--opinion triplet extraction task , author=. Mathematics , abstract =. 2023 , publisher=

work page 2023

[44] [44]

2022 , url =

Bing Liu , address =. 2022 , url =

work page 2022

[45] [45]

Automated rule selection for aspect extraction in opinion mining , url =

Liu, Qian and Gao, Zhiqiang and Liu, Bing and Zhang, Yuanlin , year =. Automated rule selection for aspect extraction in opinion mining , url =. Twenty-

work page

[46] [46]

Efficient Hybrid Generation Framework for Aspect-Based Sentiment Analysis

Lv, Haoran and Liu, Junyi and Wang, Henan and Wang, Yaoming and Luo, Jixiang and Liu, Yaxiao , editor =. Efficient. Proceedings of the 17th. 2023 , pages =. doi:10.18653/v1/2023.eacl-main.71 , urldate =

work page doi:10.18653/v1/2023.eacl-main.71 2023

[47] [47]

Minaee, Shervin and Mikolov, Tomas and Nikzad, Narjes and Chenaghlu, Meysam and Socher, Richard and Amatriain, Xavier and Gao, Jianfeng , month = feb, year =. Large. doi:10.48550/arXiv.2402.06196 , abstract =

work page internal anchor Pith review doi:10.48550/arxiv.2402.06196

[48] [48]

Human-in-the-

Monarch, Robert Munro , year=. Human-in-the-

work page

[49] [49]

AIP Conference Proceedings , author =

Aspect-based sentiment analysis to review products using. AIP Conference Proceedings , author =. 2017 , pages =. doi:10.1063/1.4994463 , abstract =

work page doi:10.1063/1.4994463 2017

[50] [50]

Comparative Analysis of Deep Natural Networks and Large Language Models for Aspect-Based Sentiment Analysis , year=

Mughal, Nimra and Mujtaba, Ghulam and Shaikh, Sarang and Kumar, Aveenash and Daudpota, Sher Muhammad , journal=. Comparative Analysis of Deep Natural Networks and Large Language Models for Aspect-Based Sentiment Analysis , year=

work page

[51] [51]

New Media & Society , author =

The social construction of datasets:. New Media & Society , author =. 2024 , note =. doi:10.1177/14614448241251797 , abstract =

work page doi:10.1177/14614448241251797 2024

[52] [52]

Proceedings of the AAAI Conference on Artificial Intelligence , author =

Knowing. Proceedings of the AAAI Conference on Artificial Intelligence , author =. 2020 , note =. doi:10.1609/aaai.v34i05.6383 , abstract =

work page doi:10.1609/aaai.v34i05.6383 2020

[53] [53]

International Journal of Approximate Reasoning , author =

Exploiting multiple word embeddings and one-hot character vectors for aspect-based sentiment analysis , volume =. International Journal of Approximate Reasoning , author =. 2018 , keywords =. doi:10.1016/j.ijar.2018.08.003 , abstract =

work page doi:10.1016/j.ijar.2018.08.003 2018

[54] [54]

International workshop on semantic evaluation , author =

Semeval-2016 task 5:. International workshop on semantic evaluation , author =. 2016 , pages =

work page 2016

[55] [55]

S em E val-2015 Task 12: Aspect Based Sentiment Analysis

Pontiki, Maria and Galanis, Dimitris and Papageorgiou, Haris and Manandhar, Suresh and Androutsopoulos, Ion , editor =. Proceedings of the 9th. 2015 , pages =. doi:10.18653/v1/S15-2082 , urldate =

work page doi:10.18653/v1/s15-2082 2015

[56] [56]

S em E val-2014 Task 4: Aspect Based Sentiment Analysis

Pontiki, Maria and Galanis, Dimitris and Pavlopoulos, John and Papageorgiou, Harris and Androutsopoulos, Ion and Manandhar, Suresh , editor =. Proceedings of the 8th. 2014 , pages =. doi:10.3115/v1/S14-2004 , urldate =

work page doi:10.3115/v1/s14-2004 2014

[57] [57]

Data , author =

Datasets for. Data , author =. 2018 , note =. doi:10.3390/data3020015 , language =

work page doi:10.3390/data3020015 2018

[58] [58]

Regatte, Yashwanth Reddy and Gangula, Rama Rohit Reddy and Mamidi, Radhika , editor =. Dataset. Proceedings of the. 2020 , pages =

work page 2020

[59] [59]

Sadia, Azeema and Khan, Fariha and Bashir, Fatima , year=. An. 2018 3rd International electrical engineering conference (IEEC 2018) , pages=

work page 2018

[60] [60]

Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers , abstract =

SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods , author=. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers , abstract =

work page 2016

[61] [61]

Public opinion quarterly , author =

Reliability of content analysis:. Public opinion quarterly , author =. 1955 , note =

work page 1955

[62] [62]

Biometrika , author =

An analysis of variance test for normality (complete samples) , volume =. Biometrika , author =. 1965 , note =

work page 1965

[63] [63]

Proceedings of the

Sidarenka, Uladzimir , editor =. Proceedings of the. 2016 , pages =

work page 2016

[64] [64]

Simmering and Paavo Huoviala , title =

Large language models for aspect-based sentiment analysis , url =. arXiv preprint arXiv:2310.18025 , author =. 2023 , keywords =. doi:10.48550/arXiv.2310.18025 , abstract =

work page doi:10.48550/arxiv.2310.18025 2023

[65] [65]

Exploring

Singhi, Vishal and Chauhan, Charulata and Soni, Piyush Kumar , month = apr, year =. Exploring. 2024. doi:10.1109/I2CT61223.2024.10543612 , abstract =

work page doi:10.1109/i2ct61223.2024.10543612 2024

[66] [66]

Proceedings of the 5th workshop on computational approaches to subjectivity, sentiment and social media analysis , author =

Aspect-level sentiment analysis in czech , url =. Proceedings of the 5th workshop on computational approaches to subjectivity, sentiment and social media analysis , author =. 2014 , pages =

work page 2014

[67] [67]

Stenetorp, Pontus and Pyysalo, Sampo and Topić, Goran and Ohta, Tomoko and Ananiadou, Sophia and Tsujii, Jun'ichi , editor =. brat: a. Proceedings of the. 2012 , pages =

work page 2012

[68] [68]

Biometrika , author =

The probable error of a mean , url =. Biometrika , author =. 1908 , note =

work page 1908

[69] [69]

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) , pages=

Sänger, Mario and Kemmerer, Steffen and Adolphs, Peter and Klinger, Roman and Leser, Ulf , year =. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) , pages=

work page

[70] [70]

Proceedings of the 10th

Tong, Zeliang and Wei, Wei , editor =. Proceedings of the 10th. 2024 , pages =

work page 2024

[71] [71]

Attention is

Vaswani, Ashish and Shazeer, Noam and Parmar, Niki and Uszkoreit, Jakob and Jones, Llion and Gomez, Aidan N and Kaiser, Ł ukasz and Polosukhin, Illia , year =. Attention is. Advances in

work page

[72] [72]

, year =

Wan, Hai and Yang, Yufei and Du, Jianfeng and Liu, Yanan and Qi, Kunxun and Pan, Jeff Z. , year =. Target-aspect-sentiment joint detection for aspect-based sentiment analysis , volume =. Proceedings of the

work page

[73] [73]

Latent aspect rating analysis without aspect keyword supervision , isbn =

Wang, Hongning and Lu, Yue and Zhai, ChengXiang , month = aug, year =. Latent aspect rating analysis without aspect keyword supervision , isbn =. Proceedings of the 17th. doi:10.1145/2020408.2020505 , abstract =

work page doi:10.1145/2020408.2020505

[74] [74]

Wang, Zengzhi and Xie, Qiming and Xia, Rui , month = jul, year =. A. Proceedings of the 46th. doi:10.1145/3539618.3591940 , abstract =

work page doi:10.1145/3539618.3591940

[75] [75]

Applied Soft Computing , author =

A survey on aspect base sentiment analysis methods and challenges , volume =. Applied Soft Computing , author =. 2024 , keywords =. doi:10.1016/j.asoc.2024.112249 , abstract =

work page doi:10.1016/j.asoc.2024.112249 2024

[76] [76]

Workshop Proceedings of the 12th Edition of the KONVENS Conference, Hildesheim, Germany, October 8-10, 2014 , pages=

Saarland University’s participation in the German sentiment analysis shared task (GESTALT) , author=. Workshop Proceedings of the 12th Edition of the KONVENS Conference, Hildesheim, Germany, October 8-10, 2014 , pages=. 2014 , organization=

work page 2014

[77] [77]

Individual

Wilcoxon, Frank , editor =. Individual. Breakthroughs in. 1992 , doi =

work page 1992

[78] [78]

Evaluation of an algorithm for aspect-based opinion mining using a lexicon-based approach , isbn =

Wogenstein, Florian and Drescher, Johannes and Reinel, Dirk and Rill, Sven and Scheidt, Jörg , month = aug, year =. Evaluation of an algorithm for aspect-based opinion mining using a lexicon-based approach , isbn =. Proceedings of the. doi:10.1145/2502069.2502074 , abstract =

work page doi:10.1145/2502069.2502074

[79] [79]

Wu, ChengYan and Ma, Bolei and Liu, Yihong and Zhang, Zheyu and Deng, Ningyuan and Li, Yanshu and Chen, Baolan and Zhang, Yi and Xue, Yun and Plank, Barbara , editor =. M-. Proceedings of the 2025. 2025 , pages =. doi:10.18653/v1/2025.emnlp-main.128 , abstract =

work page doi:10.18653/v1/2025.emnlp-main.128 2025

[80] [80]

Proceedings of the 10th

Xu, Hongling and Zhang, Delong and Zhang, Yice and Xu, Ruifeng , editor =. Proceedings of the 10th. 2024 , pages =

work page 2024