pith. sign in

arxiv: 1709.10489 · v3 · pith:SG7AWKCZnew · submitted 2017-09-29 · 💻 cs.LG · cs.AI· cs.RO

Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

classification 💻 cs.LG cs.AIcs.RO
keywords methodsapproachcomplexlearnnavigatenavigationreal-worldrobot
0
0 comments X
read the original abstract

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and $N$-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. End-to-end Decentralized Multi-robot Navigation in Unknown Complex Environments via Deep Reinforcement Learning

    cs.RO 2019-07 unverdicted novelty 4.0

    A DRL method learns decentralized policies for multi-robot navigation from raw lidar in unknown environments via centralized training and decentralized execution.