Leveraging Large Language Models for Sarcastic Speech Annotation in Sarcasm Detection

2025 · cs.CL · arXiv 2506.00955

1 Pith paper cites this work.


abstract

Sarcasm fundamentally alters meaning through tone and context, yet detecting it in speech remains a challenge due to data scarcity. In addition, existing detection systems often rely on multimodal data, limiting their applicability in contexts where only speech is available. To address this, we propose an annotation pipeline that leverages large language models (LLMs) to generate a sarcasm dataset. Using a publicly available sarcasm-focused podcast, we employ GPT-4o and LLaMA 3 for initial sarcasm annotations, followed by human verification to resolve disagreements. We validate this approach by comparing annotation quality and detection performance on a publicly available sarcasm dataset using a collaborative gating architecture. Finally, we introduce PodSarc, a large-scale sarcastic speech dataset created through this pipeline. The detection model achieves a 73.63% F1 score, demonstrating the dataset's potential as a benchmark for sarcasm detection research.
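The annotation step described in the abstract (two LLM annotators, with human verification to resolve disagreements) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `Annotation` record, the `merge_annotations` helper, the binary sarcastic/non-sarcastic labels, and the rule that agreeing LLM labels are accepted without review are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Annotation:
    """Hypothetical record for one utterance (names are illustrative)."""
    utterance_id: str
    gpt4o_label: bool                   # True = sarcastic, per the first LLM
    llama3_label: bool                  # True = sarcastic, per the second LLM
    human_label: Optional[bool] = None  # filled in only after human review


def merge_annotations(
    items: List[Annotation],
) -> Tuple[Dict[str, bool], List[str]]:
    """Combine two LLM labels per utterance.

    Assumption: when both LLMs agree, their label is accepted as final.
    When they disagree, the human label decides; if no human label is
    available yet, the utterance goes into the review queue.
    """
    final: Dict[str, bool] = {}
    review_queue: List[str] = []
    for item in items:
        if item.gpt4o_label == item.llama3_label:
            final[item.utterance_id] = item.gpt4o_label
        elif item.human_label is not None:
            final[item.utterance_id] = item.human_label
        else:
            review_queue.append(item.utterance_id)
    return final, review_queue


# Toy data, invented purely for illustration.
example_items = [
    Annotation("u1", True, True),
    Annotation("u2", True, False),
    Annotation("u3", False, False),
    Annotation("u4", False, True, human_label=True),
]
example_final, example_queue = merge_annotations(example_items)
print(example_final)  # {'u1': True, 'u3': False, 'u4': True}
print(example_queue)  # ['u2']
```

Routing only the disagreements to a human keeps the manual workload proportional to the disagreement rate rather than to the dataset size, which is presumably the motivation for using two LLMs in the first pass.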

representative citing papers

Leveraging Large Language Models for Sarcastic Speech Annotation in Sarcasm Detection

cs.CL · 2025-06-01 · unverdicted · novelty 4.0

An LLM-assisted annotation pipeline creates the PodSarc sarcastic speech dataset from podcasts and validates it via a collaborative gating detection model reaching 73.63% F1.

citing papers explorer

Showing 1 of 1 citing paper.

Leveraging Large Language Models for Sarcastic Speech Annotation in Sarcasm Detection

cs.CL · 2025-06-01 · unverdicted · none · ref 4 · internal anchor

An LLM-assisted annotation pipeline creates the PodSarc sarcastic speech dataset from podcasts and validates it via a collaborative gating detection model reaching 73.63% F1.
