One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

· 2017 · cs.AI · arXiv 1711.10137

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Recently, model-free reinforcement learning algorithms have been shown to solve challenging problems by learning from extensive interaction with the environment. A significant issue with transferring this success to the robotics domain is that interaction with the real world is costly, but training on limited experience is prone to overfitting. We present a method for learning to navigate, to a fixed goal and in a known environment, on a mobile robot. The robot leverages an interactive world model built from a single traversal of the environment, a pre-trained visual feature encoder, and stochastic environmental augmentation, to demonstrate successful zero-shot transfer under real-world environmental variations without fine-tuning.

representative citing papers

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

cs.RO · 2026-06-06 · unverdicted · novelty 5.0

Marope applies hierarchical MARL with decentralized lower-level rope policies and a centralized scheduler to achieve cooperative long rope skipping on Unitree G1 humanoids in simulation and reality.

citing papers explorer

Showing 1 of 1 citing paper.

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning cs.RO · 2026-06-06 · unverdicted · none · ref 39 · internal anchor
Marope applies hierarchical MARL with decentralized lower-level rope policies and a centralized scheduler to achieve cooperative long rope skipping on Unitree G1 humanoids in simulation and reality.

One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay

fields

years

verdicts

representative citing papers

citing papers explorer