Marope applies hierarchical MARL with decentralized lower-level rope policies and a centralized scheduler to achieve cooperative long rope skipping on Unitree G1 humanoids in simulation and reality.
One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Recently, model-free reinforcement learning algorithms have been shown to solve challenging problems by learning from extensive interaction with the environment. A significant issue with transferring this success to the robotics domain is that interaction with the real world is costly, but training on limited experience is prone to overfitting. We present a method for learning to navigate, to a fixed goal and in a known environment, on a mobile robot. The robot leverages an interactive world model built from a single traversal of the environment, a pre-trained visual feature encoder, and stochastic environmental augmentation, to demonstrate successful zero-shot transfer under real-world environmental variations without fine-tuning.
fields
cs.RO 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning
Marope applies hierarchical MARL with decentralized lower-level rope policies and a centralized scheduler to achieve cooperative long rope skipping on Unitree G1 humanoids in simulation and reality.