DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.
Learning perceptive humanoid locomotion over challenging terrain
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.RO 3verdicts
UNVERDICTED 3representative citing papers
CART integrates vision and body sensing via temporal sequences to improve legged robot stability and success rates on complex terrain by 5% in simulation and up to 45% in real-world tests.
An end-to-end policy learns robust humanoid locomotion directly from noisy depth images via high-fidelity sensor simulation, vision-aware distillation from privileged maps, and terrain-specific multi-critic reward shaping.
citing papers explorer
-
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion
DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.
-
CART: Context-Aware Terrain Adaptation using Temporal Sequence Selection for Legged Robots
CART integrates vision and body sensing via temporal sequences to improve legged robot stability and success rates on complex terrain by 5% in simulation and up to 45% in real-world tests.
-
Now You See That: Learning End-to-End Humanoid Locomotion from Raw Pixels
An end-to-end policy learns robust humanoid locomotion directly from noisy depth images via high-fidelity sensor simulation, vision-aware distillation from privileged maps, and terrain-specific multi-critic reward shaping.