Continuously Learning Neural Dialogue Management

arxiv: 1606.02689 · v1 · pith:L3QHCFZEnew · submitted 2016-06-08 · 💻 cs.CL · cs.LG

Continuously Learning Neural Dialogue Management

Pei-Hao Su , Milica Gasic , Nikola Mrksic , Lina Rojas-Barahona , Stefan Ultes , David Vandyke , Tsung-Hsien Wen , Steve Young This is my paper

classification 💻 cs.CL cs.LG

keywords dialoguelearningmodelcontinuouslymanagementneuralreinforcementalgorithms

0 comments p. Extension

pith:L3QHCFZE Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{L3QHCFZE}

Prints a linked pith:L3QHCFZE badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised model's effectiveness in the corpus-based evaluation, with user simulation, and with paid human subjects. The use of reinforcement learning further improves the model's performance in both interactive settings, especially under higher-noise conditions.

This paper has not been read by Pith yet.

Continuously Learning Neural Dialogue Management

discussion (0)