pith. sign in

arxiv: 1606.02689 · v1 · pith:L3QHCFZEnew · submitted 2016-06-08 · 💻 cs.CL · cs.LG

Continuously Learning Neural Dialogue Management

classification 💻 cs.CL cs.LG
keywords dialoguelearningmodelcontinuouslymanagementneuralreinforcementalgorithms
0
0 comments X p. Extension
pith:L3QHCFZE Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{L3QHCFZE}

Prints a linked pith:L3QHCFZE badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised model's effectiveness in the corpus-based evaluation, with user simulation, and with paid human subjects. The use of reinforcement learning further improves the model's performance in both interactive settings, especially under higher-noise conditions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.