One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

Jake Bruce; Michael Milford; Niko Suenderhauf; Piotr Mirowski; Raia Hadsell

arxiv: 1711.10137 · v2 · pith:R4GREV47new · submitted 2017-11-28 · 💻 cs.AI · cs.LG· cs.RO

One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

Jake Bruce , Niko Suenderhauf , Piotr Mirowski , Raia Hadsell , Michael Milford This is my paper

classification 💻 cs.AI cs.LGcs.RO

keywords learningenvironmentrobotenvironmentalinteractioninteractivereinforcementworld

0 comments

read the original abstract

Recently, model-free reinforcement learning algorithms have been shown to solve challenging problems by learning from extensive interaction with the environment. A significant issue with transferring this success to the robotics domain is that interaction with the real world is costly, but training on limited experience is prone to overfitting. We present a method for learning to navigate, to a fixed goal and in a known environment, on a mobile robot. The robot leverages an interactive world model built from a single traversal of the environment, a pre-trained visual feature encoder, and stochastic environmental augmentation, to demonstrate successful zero-shot transfer under real-world environmental variations without fine-tuning.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning
cs.RO 2026-06 unverdicted novelty 5.0

Marope applies hierarchical MARL with decentralized lower-level rope policies and a centralized scheduler to achieve cooperative long rope skipping on Unitree G1 humanoids in simulation and reality.