Talking to myself: self-dialogues as data for conversational agents
classification
💻 cs.CL
cs.AI
keywords
dataagentsconversationalself-dialoguescorporacorpusacrossalongside
read the original abstract
Conversational agents are gaining popularity with the increasing ubiquity of smart devices. However, training agents in a data driven manner is challenging due to a lack of suitable corpora. This paper presents a novel method for gathering topical, unstructured conversational data in an efficient way: self-dialogues through crowd-sourcing. Alongside this paper, we include a corpus of 3.6 million words across 23 topics. We argue the utility of the corpus by comparing self-dialogues with standard two-party conversations as well as data from other corpora.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Implicit Discourse Relation Identification for Open-domain Dialogues
Extracts novel corpus of implicit discourse relations from dialogue turns and augments SOTA model with dialogue features.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.