pith. sign in

arxiv: 1807.03367 · v3 · pith:XSMZSTLJnew · submitted 2018-07-09 · 💻 cs.AI · cs.CL· cs.CV· cs.LG

Talk the Walk: Navigating New York City through Grounded Dialogue

classification 💻 cs.AI cs.CLcs.CVcs.LG
keywords tasktouristdatasetdialoguefullgroundedguidelanguage
0
0 comments X
read the original abstract

We introduce "Talk The Walk", the first large-scale dialogue dataset grounded in action and perception. The task involves two agents (a "guide" and a "tourist") that communicate via natural language in order to achieve a common goal: having the tourist navigate to a given target location. The task and dataset, which are described in detail, are challenging and their full solution is an open problem that we pose to the community. We (i) focus on the task of tourist localization and develop the novel Masked Attention for Spatial Convolutions (MASC) mechanism that allows for grounding tourist utterances into the guide's map, (ii) show it yields significant improvements for both emergent and natural language communication, and (iii) using this method, we establish non-trivial baselines on the full task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. CraftAssist: A Framework for Dialogue-enabled Interactive Agents

    cs.AI 2019-07 unverdicted novelty 5.0

    CraftAssist supplies a Minecraft bot, dialogue interface, and data-recording platform intended to support research on agents that execute tasks specified through conversation.