Rpl: Learning robust humanoid perceptive locomotion on challenging terrains

· 2026 · arXiv 2602.03002

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

representative citing papers

Physics-Guided Biomechanical Gait Adaptation for Humanoid Locomotion on Extreme Sloped Terrains

cs.RO · 2026-07-08 · conditional · novelty 6.0

A proprioceptive humanoid policy trained with slope-adaptive ZMP regularization plus biomechanical reward gating traverses outdoor grass slopes to 32.1° without online exteroception.

SceneBot: Contact-Prompted General Humanoid Whole Body Tracking with Scene-Interaction

cs.RO · 2026-06-25 · unverdicted · novelty 6.0

SceneBot conditions a humanoid tracking policy on motion references and contact labels, using reconstructed scene-interaction data to unify free-space locomotion with contact-rich manipulation and terrain tasks.

TAGA: Terrain-aware Active Gaze Learning for Generalizable Agile Humanoid Locomotion

cs.RO · 2026-06-04 · unverdicted · novelty 6.0

TAGA learns terrain-aware active gaze behaviors for humanoid robots via RL alone, enabling generalizable locomotion with 1.2m real-world gap traversal.

MARCH: Model-Assisted Reinforcement Learning for the Perceptive Control of Humanoids over Sparse Footholds

cs.RO · 2026-06-09 · unverdicted · novelty 5.0

MARCH combines simplified-model trajectory generation with CLF-guided teacher RL and vision-policy distillation to enable stable humanoid locomotion over sparse terrain with better sample efficiency than pure model-free methods.

VAIC: Vision-Guided Humanoid Agile Object Interaction Control via Decoupled Commands

cs.RO · 2026-06-08 · unverdicted · novelty 5.0

VAIC distills a teacher policy into a vision-and-proprioception student policy using recurrent adaptation and decoupled commands, enabling diverse real-robot tasks like box carrying and skateboarding that outperform baselines.

LadderMan: Learning Humanoid Perceptive Ladder Climbing

cs.RO · 2026-06-04 · unverdicted · novelty 5.0

A hybrid motion-tracking and imitation-reinforcement pipeline produces a depth-based visuomotor policy that lets humanoids climb varied ladders zero-shot on hardware and perform teleoperated manipulation while climbing.

Global-Local Attention Decomposition for Terrain Encoding in Humanoid Perceptive Locomotion

cs.RO · 2026-05-30 · unverdicted · novelty 5.0

GLAD decomposes terrain encoding via coarse-to-fine attention on elevation maps to separate broad awareness from precise foothold selection in perceptive humanoid locomotion.

Terrain Consistent Reference-Guided RL for Humanoid Navigation Autonomy

cs.RO · 2026-05-15 · unverdicted · novelty 5.0

Terrain-consistent reference modulation during RL training yields SE(2)-controllable humanoid locomotion policies that improve tracking in simulation and enable over 70 m closed-loop autonomous navigation on rough terrain and stairs on the Unitree G1 with onboard computation.

TACT-ful: Multi-Channel Terrain Affordance and Compliance Training for Payload-Robust Perceptive Humanoid Locomotion

cs.RO · 2026-06-06 · unverdicted · novelty 4.0

A multi-channel terrain affordance reward combined with lower-body compliance training via virtual wrenches enables end-to-end PPO-trained humanoid policies to walk at 1 m/s on 0.2 m risers with improved payload robustness.

citing papers explorer

Showing 9 of 9 citing papers.

Physics-Guided Biomechanical Gait Adaptation for Humanoid Locomotion on Extreme Sloped Terrains cs.RO · 2026-07-08 · conditional · none · ref 28
A proprioceptive humanoid policy trained with slope-adaptive ZMP regularization plus biomechanical reward gating traverses outdoor grass slopes to 32.1° without online exteroception.
SceneBot: Contact-Prompted General Humanoid Whole Body Tracking with Scene-Interaction cs.RO · 2026-06-25 · unverdicted · none · ref 19
SceneBot conditions a humanoid tracking policy on motion references and contact labels, using reconstructed scene-interaction data to unify free-space locomotion with contact-rich manipulation and terrain tasks.
TAGA: Terrain-aware Active Gaze Learning for Generalizable Agile Humanoid Locomotion cs.RO · 2026-06-04 · unverdicted · none · ref 55
TAGA learns terrain-aware active gaze behaviors for humanoid robots via RL alone, enabling generalizable locomotion with 1.2m real-world gap traversal.
MARCH: Model-Assisted Reinforcement Learning for the Perceptive Control of Humanoids over Sparse Footholds cs.RO · 2026-06-09 · unverdicted · none · ref 6
MARCH combines simplified-model trajectory generation with CLF-guided teacher RL and vision-policy distillation to enable stable humanoid locomotion over sparse terrain with better sample efficiency than pure model-free methods.
VAIC: Vision-Guided Humanoid Agile Object Interaction Control via Decoupled Commands cs.RO · 2026-06-08 · unverdicted · none · ref 48
VAIC distills a teacher policy into a vision-and-proprioception student policy using recurrent adaptation and decoupled commands, enabling diverse real-robot tasks like box carrying and skateboarding that outperform baselines.
LadderMan: Learning Humanoid Perceptive Ladder Climbing cs.RO · 2026-06-04 · unverdicted · none · ref 2
A hybrid motion-tracking and imitation-reinforcement pipeline produces a depth-based visuomotor policy that lets humanoids climb varied ladders zero-shot on hardware and perform teleoperated manipulation while climbing.
Global-Local Attention Decomposition for Terrain Encoding in Humanoid Perceptive Locomotion cs.RO · 2026-05-30 · unverdicted · none · ref 12
GLAD decomposes terrain encoding via coarse-to-fine attention on elevation maps to separate broad awareness from precise foothold selection in perceptive humanoid locomotion.
Terrain Consistent Reference-Guided RL for Humanoid Navigation Autonomy cs.RO · 2026-05-15 · unverdicted · none · ref 6
Terrain-consistent reference modulation during RL training yields SE(2)-controllable humanoid locomotion policies that improve tracking in simulation and enable over 70 m closed-loop autonomous navigation on rough terrain and stairs on the Unitree G1 with onboard computation.
TACT-ful: Multi-Channel Terrain Affordance and Compliance Training for Payload-Robust Perceptive Humanoid Locomotion cs.RO · 2026-06-06 · unverdicted · none · ref 16
A multi-channel terrain affordance reward combined with lower-body compliance training via virtual wrenches enables end-to-end PPO-trained humanoid policies to walk at 1 m/s on 0.2 m risers with improved payload robustness.

Rpl: Learning robust humanoid perceptive locomotion on challenging terrains

fields

years

verdicts

representative citing papers

citing papers explorer