We find that audio preference annotations are reliable in aggregate and exhibit agreement levels comparable to text when sufficient raters are pooled.

Conclusion: This study examined the reliability and characteristics of preference judgments across text and audio modalities.

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Same Words, Different Judgments: How Preferences Vary Across Modalities

cs.SD · 2026-02-26 · unverdicted · novelty 7.0

Human preferences for the same semantic content show near-chance agreement between text and audio, with audio raters using narrower decision thresholds, less length bias, and more user-oriented criteria.

citing papers explorer

Showing 1 of 1 citing paper.

Same Words, Different Judgments: How Preferences Vary Across Modalities

cs.SD · 2026-02-26 · unverdicted · none · ref 11
