pith. sign in

arxiv: 1711.10137 · v2 · pith:R4GREV47new · submitted 2017-11-28 · 💻 cs.AI · cs.LG· cs.RO

One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

classification 💻 cs.AI cs.LGcs.RO
keywords learningenvironmentrobotenvironmentalinteractioninteractivereinforcementworld
0
0 comments X
read the original abstract

Recently, model-free reinforcement learning algorithms have been shown to solve challenging problems by learning from extensive interaction with the environment. A significant issue with transferring this success to the robotics domain is that interaction with the real world is costly, but training on limited experience is prone to overfitting. We present a method for learning to navigate, to a fixed goal and in a known environment, on a mobile robot. The robot leverages an interactive world model built from a single traversal of the environment, a pre-trained visual feature encoder, and stochastic environmental augmentation, to demonstrate successful zero-shot transfer under real-world environmental variations without fine-tuning.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

    cs.RO 2026-06 unverdicted novelty 5.0

    Marope applies hierarchical MARL with decentralized lower-level rope policies and a centralized scheduler to achieve cooperative long rope skipping on Unitree G1 humanoids in simulation and reality.