pith. sign in

arxiv: 2606.00012 · v1 · pith:RD6PZWZ7new · submitted 2026-04-13 · 💻 cs.CL · cs.AI

DraDDP: A Multimodal Multi-Party Dialogue Discourse Parsing Dataset

classification 💻 cs.CL cs.AI
keywords dialoguemulti-partymultimodaldraddpdatasetdiscourseparsingpublicly
0
0 comments X
read the original abstract

Multi-party dialogue discourse parsing aims to identify dependency structures and relation types between utterances in conversations. Previous studies are mostly limited to textual modality or two-party dialogue, failing to meet the multimodal and multi-party settings. In this paper, we construct the first publicly available English multimodal dataset DraDDP for multi-party dialogue discourse parsing, based on American TV dramas. DraDDP contains 495 dialogue segments with 6,374 utterances and 9.1 hours of parallel video content, covering rich multi-party interaction scenarios. Moreover, we establish comprehensive benchmarks by evaluating this task on DraDDP and conducting in-depth analysis on the impact of different modalities. Experimental results demonstrate the value of multimodal information in capturing dialogue structures and relation types. We will publicly release the dataset, annotation guidelines, and code to promote future research in multimodal dialogue understanding.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.