Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning

Maxine Eskenazi; Tiancheng Zhao

arxiv: 1606.02560 · v2 · pith:L6SZKYK2new · submitted 2016-06-08 · 💻 cs.AI · cs.CL· cs.LG

Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning

Tiancheng Zhao , Maxine Eskenazi This is my paper

classification 💻 cs.AI cs.CLcs.LG

keywords learningdialogdeepend-to-endgamemodelproposedreinforcement

0 comments

read the original abstract

This paper presents an end-to-end framework for task-oriented dialog systems using a variant of Deep Recurrent Q-Networks (DRQN). The model is able to interface with a relational database and jointly learn policies for both language understanding and dialog strategy. Moreover, we propose a hybrid algorithm that combines the strength of reinforcement learning and supervised learning to achieve faster learning speed. We evaluated the proposed model on a 20 Question Game conversational game simulator. Results show that the proposed method outperforms the modular-based baseline and learns a distributed representation of the latent dialog state.

This paper has not been read by Pith yet.

Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning

discussion (0)