Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization

Daniel Deutsch; Elizabeth Clark; João Sedoc; Khyathi Chandu; Lining Zhang; Miruna Clinciu; Saad Mahamood; Sebastian Gehrmann; Simon Mille; Yixin Liu

arxiv: 2212.10397 · v3 · pith:TRFPYAR7new · submitted 2022-12-20 · 💻 cs.CL

Lining Zhang, Simon Mille, Yufang Hou, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Saad Mahamood, Sebastian Gehrmann, Miruna Clinciu, Khyathi Chandu, João Sedoc

classification 💻 cs.CL

keywords workers, annotations, annotators, high-agreement, recruitment, resources, summarization, tasks

To prevent the costly and inefficient use of resources on low-quality annotations, we want a method for creating a pool of dependable annotators who can effectively complete difficult tasks, such as evaluating automatic summarization. Thus, we investigate the recruitment of high-quality Amazon Mechanical Turk workers via a two-step pipeline. We show that we can successfully filter out subpar workers before they carry out the evaluations and obtain high-agreement annotations with similar constraints on resources. Although our workers demonstrate a strong consensus among themselves and CloudResearch workers, their alignment with expert judgments on a subset of the data is not as expected and needs further training in correctness. This paper still serves as a best practice for the recruitment of qualified annotators in other challenging annotation tasks.
