Deep Conversational Recommender in Travel

Lizi Liao; Minlie Huang; Ryuichi Takanobu; Tat-Seng Chua; Xun Yang; Yunshan Ma

arxiv: 1907.00710 · v1 · pith:K37Q6SMAnew · submitted 2019-06-25 · 💻 cs.CL · cs.AI· cs.IR· cs.LG

Deep Conversational Recommender in Travel

Lizi Liao , Ryuichi Takanobu , Yunshan Ma , Xun Yang , Minlie Huang , Tat-Seng Chua This is my paper

Pith reviewed 2026-05-25 17:10 UTC · model grok-4.3

classification 💻 cs.CL cs.AIcs.IRcs.LG

keywords conversational recommendertravel domainseq2seq modellatent topic modelgraph convolutional networkpointer networktask-oriented dialog

0 comments

The pith

A seq2seq model with latent topics and venue graphs generates travel responses that respect user constraints across multiple turns.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper builds a conversational agent for travel that must juggle hotel bookings, restaurant suggestions, and taxi requests while obeying limits such as price or distance. It augments standard sequence-to-sequence generation with a neural latent topic module that steers the overall conversation direction and a graph convolutional network that links venues to the dialog context. Recommendation results are then inserted via pointer networks so the output stays grounded in the chosen venues. The resulting system is evaluated on a multi-turn travel dialog dataset and reported to exceed a range of baselines.

Core claim

The Deep Conversational Recommender augments seq2seq models with a neural latent topic component to guide response generation and simplify training, combines this with a GCN that models relationships among venues and their fit to dialog context, and uses pointer networks to incorporate the chosen recommendations into the final output, yielding superior performance on a multi-turn task-oriented travel dialog dataset.

What carries the argument

Deep Conversational Recommender (DCR) that fuses a neural latent topic component for global topic control, GCN-based venue modeling for constraint handling, and pointer networks for recommendation insertion inside seq2seq generation.

If this is right

Responses can shift between sub-tasks such as hotel reservation and restaurant recommendation without losing coherence.
User-specified constraints like price or distance are more reliably reflected in the chosen venues.
Training becomes easier because the topic component supplies an auxiliary signal for the generator.
Recommendation results appear naturally inside fluent replies rather than as separate list outputs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same architecture could be tested on other multi-constraint domains such as medical appointment scheduling or product configuration dialogs.
If the GCN venue graph proves central, replacing it with a learned embedding table would serve as a direct test of whether explicit relational structure is required.
Pointer-network insertion may reduce the frequency of hallucinated venue details, which could be measured by entity-level accuracy on held-out dialogs.

Load-bearing premise

The neural latent topic component and GCN-based venue modeling will effectively guide response generation and capture relationships between venues and dialog context.

What would settle it

A controlled ablation on the same travel dialog dataset in which removing the latent topic module or the GCN component produces no drop in automatic or human metrics relative to the full model.

Figures

Figures reproduced from arXiv: 1907.00710 by Lizi Liao, Minlie Huang, Ryuichi Takanobu, Tat-Seng Chua, Xun Yang, Yunshan Ma.

**Figure 2.** Figure 2: The proposed DCR model for travel, which consists of three components. The global topic control component [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: The illustration of convolution operation in the con [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 5.** Figure 5: The entity accuracy scores for each method. Note [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 4.** Figure 4: The BLEU scores for each method. In more detail, we analyse the BLEU score shown in [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 6.** Figure 6: As can be seen, the learned indicators correspond [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗

**Figure 7.** Figure 7: Inferred topic distribution of two example dialog sessions. It shows that some of the topics have been picked [PITH_FULL_IMAGE:figures/full_fig_p010_7.png] view at source ↗

read the original abstract

When traveling to a foreign country, we are often in dire need of an intelligent conversational agent to provide instant and informative responses to our various queries. However, to build such a travel agent is non-trivial. First of all, travel naturally involves several sub-tasks such as hotel reservation, restaurant recommendation and taxi booking etc, which invokes the need for global topic control. Secondly, the agent should consider various constraints like price or distance given by the user to recommend an appropriate venue. In this paper, we present a Deep Conversational Recommender (DCR) and apply to travel. It augments the sequence-to-sequence (seq2seq) models with a neural latent topic component to better guide response generation and make the training easier. To consider the various constraints for venue recommendation, we leverage a graph convolutional network (GCN) based approach to capture the relationships between different venues and the match between venue and dialog context. For response generation, we combine the topic-based component with the idea of pointer networks, which allows us to effectively incorporate recommendation results. We perform extensive evaluation on a multi-turn task-oriented dialog dataset in travel domain and the results show that our method achieves superior performance as compared to a wide range of baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Standard seq2seq plus topics, GCN, and pointers applied to travel dialogs; performance edge is asserted but not broken down in the abstract.

read the letter

The paper builds a conversational recommender for travel by starting with seq2seq, adding a neural latent topic model to steer multi-turn responses across subtasks like hotels and restaurants, using GCN to encode venue relations and user constraints such as price or distance, and feeding recommendation results through pointer networks. That combination is the core of what they present and evaluate on a multi-turn travel dialog dataset, where they report better results than a range of baselines. The motivation for handling global topic control and constraint matching in this domain is clear and the architecture choices line up with the stated problems. The pointer-network step for injecting specific venues into generated text is a practical move. The main weakness is that the abstract supplies no ablation numbers, no metrics, and no description of exactly how the topic distribution conditions the decoder or how dialog context enters the GCN node features. Without those details the performance delta cannot be confidently tied to the added components rather than dataset size or tuning. The stress-test concern holds on the information given. This work is mainly useful to people already building task-oriented dialog systems who want a worked example in the travel domain; it is not a new framework. I would send it for peer review so the experiments and integration can be checked directly.

Referee Report

3 major / 2 minor

Summary. The paper proposes a Deep Conversational Recommender (DCR) for the travel domain that augments seq2seq models with a neural latent topic component to guide response generation, a GCN-based module to capture venue relationships and user constraints (e.g., price, distance), and pointer networks to incorporate recommendations into generated responses. It reports superior performance over baselines on a multi-turn task-oriented travel dialog dataset.

Significance. If the empirical claims hold after proper validation, the architecture could usefully combine topic control with graph-based constraint modeling for task-oriented dialog. The manuscript does not supply machine-checked proofs, reproducible code, or parameter-free derivations.

major comments (3)

[Abstract and §4] Abstract and §4 (Experiments): the central claim of superior performance is asserted without any reported metrics, ablation results, dataset statistics, or error analysis, making it impossible to determine whether the data supports attribution to the latent topic or GCN components.
[§3] §3 (Architecture): the precise fusion mechanism by which the neural latent topic distribution conditions the decoder, and how dialog context is encoded into GCN node features (including constraints), is not specified, so the claimed guidance of response generation cannot be verified or reproduced.
[§4] §4 (Experiments): no ablation studies isolate the contribution of the neural latent topic component versus the GCN-based venue modeling, which is load-bearing for the claim that these additions improve pointer-network output over plain seq2seq baselines.

minor comments (2)

[Abstract] The travel-domain dataset is referenced only generically without citation, size, or split details.
[§3] Notation for the topic distribution and GCN embeddings is introduced without explicit equations or variable definitions in the main text.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive comments. We address each major point below and commit to a major revision that supplies the missing experimental details, architectural specifications, and ablation studies.

read point-by-point responses

Referee: [Abstract and §4] Abstract and §4 (Experiments): the central claim of superior performance is asserted without any reported metrics, ablation results, dataset statistics, or error analysis, making it impossible to determine whether the data supports attribution to the latent topic or GCN components.

Authors: We agree that the submitted manuscript does not report concrete metrics, dataset statistics, ablation results or error analysis. In the revised version we will add these elements to the abstract, §4 and a new appendix, including quantitative results on the travel dialog dataset, dataset size and split statistics, and error analysis that attributes gains to the topic and GCN modules. revision: yes
Referee: [§3] §3 (Architecture): the precise fusion mechanism by which the neural latent topic distribution conditions the decoder, and how dialog context is encoded into GCN node features (including constraints), is not specified, so the claimed guidance of response generation cannot be verified or reproduced.

Authors: We will expand §3 with explicit equations and a diagram showing (i) how the latent topic distribution is fused into the decoder (via concatenation or gating at each step) and (ii) the precise construction of GCN node features from dialog context and user constraints (price, distance, etc.). These additions will make the model fully reproducible. revision: yes
Referee: [§4] §4 (Experiments): no ablation studies isolate the contribution of the neural latent topic component versus the GCN-based venue modeling, which is load-bearing for the claim that these additions improve pointer-network output over plain seq2seq baselines.

Authors: We acknowledge the lack of ablations. The revised §4 will contain systematic ablations that remove the topic component, the GCN module, and both, reporting BLEU, entity F1 and success rate for each variant against the plain seq2seq+pointer baseline, thereby isolating the contribution of each addition. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical architecture with no derivation chain

full rationale

The manuscript describes a seq2seq augmentation via neural latent topics and GCN venue modeling, followed by pointer-network generation, then reports empirical superiority on a travel dialog dataset. No equations, uniqueness theorems, fitted-parameter predictions, or self-citation load-bearing steps appear in the supplied text. The performance claim rests on experimental comparison rather than any reduction of outputs to inputs by construction, satisfying the self-contained criterion.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities; the work relies on standard neural network assumptions not detailed here.

pith-pipeline@v0.9.0 · 5761 in / 1087 out tokens · 25194 ms · 2026-05-25T17:10:11.076038+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

augments the sequence-to-sequence (seq2seq) models with a neural latent topic component ... leverage a graph convolutional network (GCN) based approach ... combine the topic-based component with the idea of pointer networks
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

global topic control component ... GCN-based venue recommendation component ... pointed integration mechanism

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · 4 internal anchors

[1]

A Neural Conversational Model

O. Vinyals and Q. Le, “A neural conversational model,” arXiv preprint arXiv:1506.05869, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[2]

Neural responding machine for short- text conversation,

L. Shang, Z. Lu, and H. Li, “Neural responding machine for short- text conversation,” in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing , 2015, pp. 1577– 1586

work page 2015
[3]

Building end-to-end dialogue systems using generative hier- archical neural network models

I. V . Serban, A. Sordoni, Y. Bengio, A. C. Courville, and J. Pineau, “Building end-to-end dialogue systems using generative hier- archical neural network models.” in Thirty AAAI Conference on Artiﬁcial Intelligence, 2016, pp. 3776–3784

work page 2016
[4]

Learning end-to-end goal-oriented dia- log,

A. Bordes and J. Weston, “Learning end-to-end goal-oriented dia- log,” in The 3nd International Conference on Learning Representations, 2016, pp. 1–14

work page 2016
[5]

A network-based end-to-end trainable task-oriented dialogue system,

T. Wen, D. Vandyke, N. Mrk ˇs´ıc, M. Gaˇs´ıc, L. Rojas-Barahona, P . Su, S. Ultes, and S. Young, “A network-based end-to-end trainable task-oriented dialogue system,” in 15th Conference of the European Chapter of the Association for Computational Linguistics , 2017, pp. 438–449

work page 2017
[6]

Multiwoz - a large-scale multi- domain wizard-of-oz dataset for task-oriented dialogue mod- elling,

P . Budzianowski, T.-H. Wen, B.-H. Tseng, I. Casanueva, S. Ultes, O. Ramadan, and M. Gasic, “Multiwoz - a large-scale multi- domain wizard-of-oz dataset for task-oriented dialogue mod- elling,” in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018, pp. 5016–5026

work page 2018
[7]

End-to-end memory networks,

S. Sukhbaatar, J. Weston, R. Fergus et al. , “End-to-end memory networks,” in Advances in neural information processing systems , 2015, pp. 2440–2448

work page 2015
[8]

Mem2seq: Effectively incor- porating knowledge bases into end-to-end task-oriented dialog systems,

A. Madotto, C.-S. Wu, and P . Fung, “Mem2seq: Effectively incor- porating knowledge bases into end-to-end task-oriented dialog systems,” in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018, pp. 1468–1478

work page 2018
[9]

Towards deep conversational recommendations,

R. Li, S. E. Kahou, H. Schulz, V . Michalski, L. Charlin, and C. Pal, “Towards deep conversational recommendations,” in Advances in Neural Information Processing Systems, 2018, pp. 9748–9758

work page 2018
[10]

Conversational recommender system,

Y. Sun and Y. Zhang, “Conversational recommender system,” in The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018, pp. 235–244

work page 2018
[11]

Latent dirichlet allocation,

D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,” Journal of machine Learning research, pp. 993–1022, 2003

work page 2003
[12]

Partially observable markov decision processes for spoken dialog systems,

J. D. Williams and S. Young, “Partially observable markov decision processes for spoken dialog systems,”Computer Speech & Language, pp. 393–422, 2007

work page 2007
[13]

Sta- tistical dialog management applied to wfst-based dialog systems,

C. Hori, K. Ohtake, T. Misu, H. Kashioka, and S. Nakamura, “Sta- tistical dialog management applied to wfst-based dialog systems,” in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, 2009, pp. 4793–4796

work page 2009
[14]

Pomdp- based statistical spoken dialog systems: A review,

S. Young, M. Ga ˇsi´c, B. Thomson, and J. D. Williams, “Pomdp- based statistical spoken dialog systems: A review,” Proceedings of the IEEE, pp. 1160–1179, 2013

work page 2013
[15]

Dialogue management in the mercury ﬂight reservation system,

S. Seneff and J. Polifroni, “Dialogue management in the mercury ﬂight reservation system,” in Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems, 2000, pp. 11–16

work page 2000
[16]

Let’s go public! taking a spoken dialog system to the real world,

A. Raux, B. Langner, D. Bohus, A. W. Black, and M. Eskenazi, “Let’s go public! taking a spoken dialog system to the real world,” in Ninth European Conference on Speech Communication and Technol- ogy, 2005

work page 2005
[17]

Generative encoder- decoder models for task-oriented spoken dialog systems with chatting capability,

T. Zhao, A. Lu, K. Lee, and M. Eskenazi, “Generative encoder- decoder models for task-oriented spoken dialog systems with chatting capability,” in Proceedings of the 18th Annual SIGdial Meet- ing on Discourse and Dialogue, 2017, pp. 27–36

work page 2017
[18]

Latent intention dialogue models,

T.-H. Wen, Y. Miao, P . Blunsom, and S. Young, “Latent intention dialogue models,” in International Conference on Machine Learning , 2017, pp. 3732–3741

work page 2017
[19]

An introduction to the syntax and content of cyc

C. Matuszek, J. Cabral, M. J. Witbrock, and J. DeOliveira, “An introduction to the syntax and content of cyc.” in AAAI Spring Symposium: Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering, 2006, pp. 44–49

work page 2006
[20]

Key-value memory networks for directly reading docu- ments,

A. Miller, A. Fisch, J. Dodge, A.-H. Karimi, A. Bordes, and J. We- ston, “Key-value memory networks for directly reading docu- ments,” in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016, pp. 1400–1409

work page 2016
[21]

A knowledge-grounded neural con- versation model,

M. Ghazvininejad, C. Brockett, M.-W. Chang, B. Dolan, J. Gao, W.-t. Yih, and M. Galley, “A knowledge-grounded neural con- versation model,” in Thirty-Second AAAI Conference on Artiﬁcial Intelligence, 2018, pp. 5110–5117

work page 2018
[22]

Steering output style and topic in neural response generation,

D. Wang, N. Jojic, C. Brockett, and E. Nyberg, “Steering output style and topic in neural response generation,” in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017, pp. 2140–2150

work page 2017
[23]

Topic Aware Neural Response Generation

C. Xing, W. Wu, Y. Wu, J. Liu, Y. Huang, M. Zhou, and W.-Y. Ma, “Topic augmented neural response generation with a joint atten- tion mechanism. arxiv preprint,” arXiv preprint arXiv:1606.08340 , 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[24]

A personalized system for conversational recommendations,

C. A. Thompson, M. H. Goker, and P . Langley, “A personalized system for conversational recommendations,” Journal of Artiﬁcial Intelligence Research, pp. 393–428, 2004

work page 2004
[25]

Converse-et- impera: Exploiting deep learning and hierarchical reinforcement learning for conversational recommender systems,

C. Greco, A. Suglia, P . Basile, and G. Semeraro, “Converse-et- impera: Exploiting deep learning and hierarchical reinforcement learning for conversational recommender systems,” in Conference of the Italian Association for Artiﬁcial Intelligence , 2017, pp. 372–386

work page 2017
[26]

Towards conversational recommender systems,

K. Christakopoulou, F. Radlinski, and K. Hofmann, “Towards conversational recommender systems,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 815–824

work page 2016
[27]

Towards end-to-end reinforcement learning of dialogue agents for information access,

B. Dhingra, L. Li, X. Li, J. Gao, Y.-N. Chen, F. Ahmed, and L. Deng, “Towards end-to-end reinforcement learning of dialogue agents for information access,” in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017, pp. 484–495

work page 2017
[28]

End-to-end task-completion neural dialogue systems,

X. Li, Y.-N. Chen, L. Li, J. Gao, and A. Celikyilmaz, “End-to-end task-completion neural dialogue systems,” in Proceedings of the Eighth International Joint Conference on Natural Language Processing , 2017, pp. 733–743

work page 2017
[29]

Learning long-term depen- dencies with gradient descent is difﬁcult,

Y. Bengio, P . Simard, and P . Frasconi, “Learning long-term depen- dencies with gradient descent is difﬁcult,” IEEE transactions on neural networks, pp. 157–166, 1994

work page 1994
[30]

TopicRNN: A Recurrent Neural Network with Long-Range Semantic Dependency

A. B. Dieng, C. Wang, J. Gao, and J. Paisley, “Topicrnn: A recurrent neural network with long-range semantic dependency,” arXiv preprint arXiv:1611.01702, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[31]

An introduction to variational methods for graphical models,

M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul, “An introduction to variational methods for graphical models,” Machine learning, pp. 183–233, 1999

work page 1999
[32]

Neural variational inference and learning in belief networks,

A. Mnih and K. Gregor, “Neural variational inference and learning in belief networks,” in International Conference on Machine Learning, 2014, pp. 1791–1799

work page 2014
[33]

Neural variational inference for text processing,

Y. Miao, L. Yu, and P . Blunsom, “Neural variational inference for text processing,” in International Conference on Machine Learning , 2016, pp. 1727–1736

work page 2016
[34]

Auto-encoding variational bayes,

D. P . Kingma and M. Welling, “Auto-encoding variational bayes,” in The 2nd International Conference on Learning Representations, 2013, pp. 1–14

work page 2013
[35]

Graph convolutional neural networks for web-scale recommender systems,

R. Ying, R. He, K. Chen, P . Eksombatchai, W. L. Hamilton, and J. Leskovec, “Graph convolutional neural networks for web-scale recommender systems,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , 2018, pp. 974–983

work page 2018
[36]

Representation Learning on Graphs: Methods and Applications

W. L. Hamilton, R. Ying, and J. Leskovec, “Representation learning on graphs: Methods and applications,” arXiv preprint arXiv:1709.05584, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[37]

Semi-supervised classiﬁcation with graph convolutional networks,

T. N. Kipf and M. Welling, “Semi-supervised classiﬁcation with graph convolutional networks,” in International Conference on Learning Representations, 2016, pp. 1–14

work page 2016
[38]

Empirical evalua- tion of gated recurrent neural networks on sequence modeling,

J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, “Empirical evalua- tion of gated recurrent neural networks on sequence modeling,” in NIPS 2014 Workshop on Deep Learning , 2014

work page 2014
[39]

Point- ing the unknown words,

C. Gulcehre, S. Ahn, R. Nallapati, B. Zhou, and Y. Bengio, “Point- ing the unknown words,” in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics , 2016, pp. 140–149

work page 2016
[40]

Get to the point: Summariza- tion with pointer-generator networks,

A. See, P . J. Liu, and C. D. Manning, “Get to the point: Summariza- tion with pointer-generator networks,” in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017, pp. 1073–1083

work page 2017
[41]

Knowledge- aware multimodal dialogue systems,

L. Liao, Y. Ma, X. He, R. Hong, and T.-s. Chua, “Knowledge- aware multimodal dialogue systems,” in 2018 ACM Multimedia Conference on Multimedia Conference, 2018, pp. 801–809

work page 2018
[42]

Autorec: Au- toencoders meet collaborative ﬁltering,

S. Sedhain, A. K. Menon, S. Sanner, and L. Xie, “Autorec: Au- toencoders meet collaborative ﬁltering,” in Proceedings of the 24th International Conference on World Wide Web, 2015, pp. 111–112

work page 2015
[43]

Neu- ral collaborative ﬁltering,

X. He, L. Liao, H. Zhang, L. Nie, X. Hu, and T.-S. Chua, “Neu- ral collaborative ﬁltering,” in Proceedings of the 26th International Conference on World Wide Web, 2017, pp. 173–182

work page 2017
[44]

C.-W. Liu, R. Lowe, I. V . Serban, M. Noseworthy, L. Charlin, and J. Pineau, “How not to evaluate your dialogue system: An JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, JUNE 2019 12 empirical study of unsupervised evaluation metrics for dialogue response generation.”

work page 2019
[45]

A copy-augmented sequence-to- sequence architecture gives good performance on task-oriented dialogue,

M. Eric and C. Manning, “A copy-augmented sequence-to- sequence architecture gives good performance on task-oriented dialogue,” in Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics , 2017, pp. 468–473

work page 2017
[46]

Glove: Global vectors for word representation,

J. Pennington, R. Socher, and C. Manning, “Glove: Global vectors for word representation,” in Proceedings of the 2014 conference on empirical methods in natural language processing, 2014, pp. 1532–1543

work page 2014
[47]

Adam: A method for stochastic optimization,

D. P . Kingma and B. Jimmy, “Adam: A method for stochastic optimization,” in The 3nd International Conference on Learning Rep- resentations, 2015, pp. 1–14

work page 2015
[48]

Correlated topic models,

D. Blei and J. Lafferty, “Correlated topic models,” Advances in neural information processing systems, p. 147, 2006

work page 2006

[1] [1]

A Neural Conversational Model

O. Vinyals and Q. Le, “A neural conversational model,” arXiv preprint arXiv:1506.05869, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[2] [2]

Neural responding machine for short- text conversation,

L. Shang, Z. Lu, and H. Li, “Neural responding machine for short- text conversation,” in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing , 2015, pp. 1577– 1586

work page 2015

[3] [3]

Building end-to-end dialogue systems using generative hier- archical neural network models

I. V . Serban, A. Sordoni, Y. Bengio, A. C. Courville, and J. Pineau, “Building end-to-end dialogue systems using generative hier- archical neural network models.” in Thirty AAAI Conference on Artiﬁcial Intelligence, 2016, pp. 3776–3784

work page 2016

[4] [4]

Learning end-to-end goal-oriented dia- log,

A. Bordes and J. Weston, “Learning end-to-end goal-oriented dia- log,” in The 3nd International Conference on Learning Representations, 2016, pp. 1–14

work page 2016

[5] [5]

A network-based end-to-end trainable task-oriented dialogue system,

T. Wen, D. Vandyke, N. Mrk ˇs´ıc, M. Gaˇs´ıc, L. Rojas-Barahona, P . Su, S. Ultes, and S. Young, “A network-based end-to-end trainable task-oriented dialogue system,” in 15th Conference of the European Chapter of the Association for Computational Linguistics , 2017, pp. 438–449

work page 2017

[6] [6]

Multiwoz - a large-scale multi- domain wizard-of-oz dataset for task-oriented dialogue mod- elling,

P . Budzianowski, T.-H. Wen, B.-H. Tseng, I. Casanueva, S. Ultes, O. Ramadan, and M. Gasic, “Multiwoz - a large-scale multi- domain wizard-of-oz dataset for task-oriented dialogue mod- elling,” in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018, pp. 5016–5026

work page 2018

[7] [7]

End-to-end memory networks,

S. Sukhbaatar, J. Weston, R. Fergus et al. , “End-to-end memory networks,” in Advances in neural information processing systems , 2015, pp. 2440–2448

work page 2015

[8] [8]

Mem2seq: Effectively incor- porating knowledge bases into end-to-end task-oriented dialog systems,

A. Madotto, C.-S. Wu, and P . Fung, “Mem2seq: Effectively incor- porating knowledge bases into end-to-end task-oriented dialog systems,” in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018, pp. 1468–1478

work page 2018

[9] [9]

Towards deep conversational recommendations,

R. Li, S. E. Kahou, H. Schulz, V . Michalski, L. Charlin, and C. Pal, “Towards deep conversational recommendations,” in Advances in Neural Information Processing Systems, 2018, pp. 9748–9758

work page 2018

[10] [10]

Conversational recommender system,

Y. Sun and Y. Zhang, “Conversational recommender system,” in The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018, pp. 235–244

work page 2018

[11] [11]

Latent dirichlet allocation,

D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,” Journal of machine Learning research, pp. 993–1022, 2003

work page 2003

[12] [12]

Partially observable markov decision processes for spoken dialog systems,

J. D. Williams and S. Young, “Partially observable markov decision processes for spoken dialog systems,”Computer Speech & Language, pp. 393–422, 2007

work page 2007

[13] [13]

Sta- tistical dialog management applied to wfst-based dialog systems,

C. Hori, K. Ohtake, T. Misu, H. Kashioka, and S. Nakamura, “Sta- tistical dialog management applied to wfst-based dialog systems,” in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, 2009, pp. 4793–4796

work page 2009

[14] [14]

Pomdp- based statistical spoken dialog systems: A review,

S. Young, M. Ga ˇsi´c, B. Thomson, and J. D. Williams, “Pomdp- based statistical spoken dialog systems: A review,” Proceedings of the IEEE, pp. 1160–1179, 2013

work page 2013

[15] [15]

Dialogue management in the mercury ﬂight reservation system,

S. Seneff and J. Polifroni, “Dialogue management in the mercury ﬂight reservation system,” in Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems, 2000, pp. 11–16

work page 2000

[16] [16]

Let’s go public! taking a spoken dialog system to the real world,

A. Raux, B. Langner, D. Bohus, A. W. Black, and M. Eskenazi, “Let’s go public! taking a spoken dialog system to the real world,” in Ninth European Conference on Speech Communication and Technol- ogy, 2005

work page 2005

[17] [17]

Generative encoder- decoder models for task-oriented spoken dialog systems with chatting capability,

T. Zhao, A. Lu, K. Lee, and M. Eskenazi, “Generative encoder- decoder models for task-oriented spoken dialog systems with chatting capability,” in Proceedings of the 18th Annual SIGdial Meet- ing on Discourse and Dialogue, 2017, pp. 27–36

work page 2017

[18] [18]

Latent intention dialogue models,

T.-H. Wen, Y. Miao, P . Blunsom, and S. Young, “Latent intention dialogue models,” in International Conference on Machine Learning , 2017, pp. 3732–3741

work page 2017

[19] [19]

An introduction to the syntax and content of cyc

C. Matuszek, J. Cabral, M. J. Witbrock, and J. DeOliveira, “An introduction to the syntax and content of cyc.” in AAAI Spring Symposium: Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering, 2006, pp. 44–49

work page 2006

[20] [20]

Key-value memory networks for directly reading docu- ments,

A. Miller, A. Fisch, J. Dodge, A.-H. Karimi, A. Bordes, and J. We- ston, “Key-value memory networks for directly reading docu- ments,” in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016, pp. 1400–1409

work page 2016

[21] [21]

A knowledge-grounded neural con- versation model,

M. Ghazvininejad, C. Brockett, M.-W. Chang, B. Dolan, J. Gao, W.-t. Yih, and M. Galley, “A knowledge-grounded neural con- versation model,” in Thirty-Second AAAI Conference on Artiﬁcial Intelligence, 2018, pp. 5110–5117

work page 2018

[22] [22]

Steering output style and topic in neural response generation,

D. Wang, N. Jojic, C. Brockett, and E. Nyberg, “Steering output style and topic in neural response generation,” in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017, pp. 2140–2150

work page 2017

[23] [23]

Topic Aware Neural Response Generation

C. Xing, W. Wu, Y. Wu, J. Liu, Y. Huang, M. Zhou, and W.-Y. Ma, “Topic augmented neural response generation with a joint atten- tion mechanism. arxiv preprint,” arXiv preprint arXiv:1606.08340 , 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[24] [24]

A personalized system for conversational recommendations,

C. A. Thompson, M. H. Goker, and P . Langley, “A personalized system for conversational recommendations,” Journal of Artiﬁcial Intelligence Research, pp. 393–428, 2004

work page 2004

[25] [25]

Converse-et- impera: Exploiting deep learning and hierarchical reinforcement learning for conversational recommender systems,

C. Greco, A. Suglia, P . Basile, and G. Semeraro, “Converse-et- impera: Exploiting deep learning and hierarchical reinforcement learning for conversational recommender systems,” in Conference of the Italian Association for Artiﬁcial Intelligence , 2017, pp. 372–386

work page 2017

[26] [26]

Towards conversational recommender systems,

K. Christakopoulou, F. Radlinski, and K. Hofmann, “Towards conversational recommender systems,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 815–824

work page 2016

[27] [27]

Towards end-to-end reinforcement learning of dialogue agents for information access,

B. Dhingra, L. Li, X. Li, J. Gao, Y.-N. Chen, F. Ahmed, and L. Deng, “Towards end-to-end reinforcement learning of dialogue agents for information access,” in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017, pp. 484–495

work page 2017

[28] [28]

End-to-end task-completion neural dialogue systems,

X. Li, Y.-N. Chen, L. Li, J. Gao, and A. Celikyilmaz, “End-to-end task-completion neural dialogue systems,” in Proceedings of the Eighth International Joint Conference on Natural Language Processing , 2017, pp. 733–743

work page 2017

[29] [29]

Learning long-term depen- dencies with gradient descent is difﬁcult,

Y. Bengio, P . Simard, and P . Frasconi, “Learning long-term depen- dencies with gradient descent is difﬁcult,” IEEE transactions on neural networks, pp. 157–166, 1994

work page 1994

[30] [30]

TopicRNN: A Recurrent Neural Network with Long-Range Semantic Dependency

A. B. Dieng, C. Wang, J. Gao, and J. Paisley, “Topicrnn: A recurrent neural network with long-range semantic dependency,” arXiv preprint arXiv:1611.01702, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[31] [31]

An introduction to variational methods for graphical models,

M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul, “An introduction to variational methods for graphical models,” Machine learning, pp. 183–233, 1999

work page 1999

[32] [32]

Neural variational inference and learning in belief networks,

A. Mnih and K. Gregor, “Neural variational inference and learning in belief networks,” in International Conference on Machine Learning, 2014, pp. 1791–1799

work page 2014

[33] [33]

Neural variational inference for text processing,

Y. Miao, L. Yu, and P . Blunsom, “Neural variational inference for text processing,” in International Conference on Machine Learning , 2016, pp. 1727–1736

work page 2016

[34] [34]

Auto-encoding variational bayes,

D. P . Kingma and M. Welling, “Auto-encoding variational bayes,” in The 2nd International Conference on Learning Representations, 2013, pp. 1–14

work page 2013

[35] [35]

Graph convolutional neural networks for web-scale recommender systems,

R. Ying, R. He, K. Chen, P . Eksombatchai, W. L. Hamilton, and J. Leskovec, “Graph convolutional neural networks for web-scale recommender systems,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , 2018, pp. 974–983

work page 2018

[36] [36]

Representation Learning on Graphs: Methods and Applications

W. L. Hamilton, R. Ying, and J. Leskovec, “Representation learning on graphs: Methods and applications,” arXiv preprint arXiv:1709.05584, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[37] [37]

Semi-supervised classiﬁcation with graph convolutional networks,

T. N. Kipf and M. Welling, “Semi-supervised classiﬁcation with graph convolutional networks,” in International Conference on Learning Representations, 2016, pp. 1–14

work page 2016

[38] [38]

Empirical evalua- tion of gated recurrent neural networks on sequence modeling,

J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, “Empirical evalua- tion of gated recurrent neural networks on sequence modeling,” in NIPS 2014 Workshop on Deep Learning , 2014

work page 2014

[39] [39]

Point- ing the unknown words,

C. Gulcehre, S. Ahn, R. Nallapati, B. Zhou, and Y. Bengio, “Point- ing the unknown words,” in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics , 2016, pp. 140–149

work page 2016

[40] [40]

Get to the point: Summariza- tion with pointer-generator networks,

A. See, P . J. Liu, and C. D. Manning, “Get to the point: Summariza- tion with pointer-generator networks,” in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017, pp. 1073–1083

work page 2017

[41] [41]

Knowledge- aware multimodal dialogue systems,

L. Liao, Y. Ma, X. He, R. Hong, and T.-s. Chua, “Knowledge- aware multimodal dialogue systems,” in 2018 ACM Multimedia Conference on Multimedia Conference, 2018, pp. 801–809

work page 2018

[42] [42]

Autorec: Au- toencoders meet collaborative ﬁltering,

S. Sedhain, A. K. Menon, S. Sanner, and L. Xie, “Autorec: Au- toencoders meet collaborative ﬁltering,” in Proceedings of the 24th International Conference on World Wide Web, 2015, pp. 111–112

work page 2015

[43] [43]

Neu- ral collaborative ﬁltering,

X. He, L. Liao, H. Zhang, L. Nie, X. Hu, and T.-S. Chua, “Neu- ral collaborative ﬁltering,” in Proceedings of the 26th International Conference on World Wide Web, 2017, pp. 173–182

work page 2017

[44] [44]

C.-W. Liu, R. Lowe, I. V . Serban, M. Noseworthy, L. Charlin, and J. Pineau, “How not to evaluate your dialogue system: An JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, JUNE 2019 12 empirical study of unsupervised evaluation metrics for dialogue response generation.”

work page 2019

[45] [45]

A copy-augmented sequence-to- sequence architecture gives good performance on task-oriented dialogue,

M. Eric and C. Manning, “A copy-augmented sequence-to- sequence architecture gives good performance on task-oriented dialogue,” in Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics , 2017, pp. 468–473

work page 2017

[46] [46]

Glove: Global vectors for word representation,

J. Pennington, R. Socher, and C. Manning, “Glove: Global vectors for word representation,” in Proceedings of the 2014 conference on empirical methods in natural language processing, 2014, pp. 1532–1543

work page 2014

[47] [47]

Adam: A method for stochastic optimization,

D. P . Kingma and B. Jimmy, “Adam: A method for stochastic optimization,” in The 3nd International Conference on Learning Rep- resentations, 2015, pp. 1–14

work page 2015

[48] [48]

Correlated topic models,

D. Blei and J. Lafferty, “Correlated topic models,” Advances in neural information processing systems, p. 147, 2006

work page 2006