pith. sign in

arxiv: 1809.06641 · v2 · pith:Y45BZ6GYnew · submitted 2018-09-18 · 💻 cs.CL · cs.AI

Talking to myself: self-dialogues as data for conversational agents

classification 💻 cs.CL cs.AI
keywords dataagentsconversationalself-dialoguescorporacorpusacrossalongside
0
0 comments X
read the original abstract

Conversational agents are gaining popularity with the increasing ubiquity of smart devices. However, training agents in a data driven manner is challenging due to a lack of suitable corpora. This paper presents a novel method for gathering topical, unstructured conversational data in an efficient way: self-dialogues through crowd-sourcing. Alongside this paper, we include a corpus of 3.6 million words across 23 topics. We argue the utility of the corpus by comparing self-dialogues with standard two-party conversations as well as data from other corpora.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Implicit Discourse Relation Identification for Open-domain Dialogues

    cs.CL 2019-07 unverdicted novelty 7.0

    Extracts novel corpus of implicit discourse relations from dialogue turns and augments SOTA model with dialogue features.