arxiv: 1705.08080 · v2 · pith:7XOQZ24Jnew · submitted 2017-05-23 · 💻 cs.CV · cs.LG · cs.RO

Visual Semantic Planning using Deep Successor Representations

Yuke Zhu, Daniel Gordon, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi

classification 💻 cs.CV cs.LG cs.RO

keywords visual, actions, environment, learning, deep, dynamic, planning, representations

A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world. In this work, we address the problem of visual semantic planning: the task of predicting a sequence of actions from visual observations that transform a dynamic environment from an initial state to a goal state. Doing so entails knowledge about objects and their affordances, as well as actions and their preconditions and effects. We propose learning these through interacting with a visual and dynamic environment. Our proposed solution involves bootstrapping reinforcement learning with imitation learning. To ensure cross-task generalization, we develop a deep predictive model based on successor representations. Our experiments show near-optimal results across a wide range of tasks in the challenging THOR environment.
