Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

arxiv: 1706.06210 · v2 · pith:SYL3LYBOnew · submitted 2017-06-19 · 💻 cs.CL · cs.AI

Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

Pawe{\l} Budzianowski , Stefan Ultes , Pei-Hao Su , Nikola Mrk\v{s}i\'c , Tsung-Hsien Wen , I\~nigo Casanueva , Lina Rojas-Barahona , Milica Ga\v{s}i\'c This is my paper

classification 💻 cs.CL cs.AI

keywords dialoguelearningpolicyreinforcementsystemscomplexflatframework

0 comments p. Extension

pith:SYL3LYBO Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{SYL3LYBO}

Prints a linked pith:SYL3LYBO badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Human conversation is inherently complex, often spanning many different topics/domains. This makes policy learning for dialogue systems very challenging. Standard flat reinforcement learning methods do not provide an efficient framework for modelling such dialogues. In this paper, we focus on the under-explored problem of multi-domain dialogue management. First, we propose a new method for hierarchical reinforcement learning using the option framework. Next, we show that the proposed architecture learns faster and arrives at a better policy than the existing flat ones do. Moreover, we show how pretrained policies can be adapted to more complex systems with an additional set of new actions. In doing that, we show that our approach has the potential to facilitate policy optimisation for more sophisticated multi-domain dialogue systems.

This paper has not been read by Pith yet.

Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

discussion (0)