Distillation-ppo: A novel two-stage reinforcement learning framework for humanoid robot perceptive locomotion

Qiang Zhang, Gang Han, Jingkai Sun, Wen Zhao, Chenghao Sun, Jiahang Cao, Jiaxu Wang, Yijie Guo, Renjing Xu · 2025 · arXiv 2503.08299

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion

cs.RO · 2025-05-24 · unverdicted · novelty 6.0

DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.

Learning Agile Striker Skills for Humanoid Soccer Robots from Noisy Sensory Input

cs.RO · 2025-12-06 · conditional · novelty 5.0

A four-stage RL system with teacher-student distillation and online constrained adaptation enables humanoid robots to achieve robust ball-kicking accuracy under noisy perception in simulation and on physical hardware.

citing papers explorer

Showing 2 of 2 citing papers.

DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion cs.RO · 2025-05-24 · unverdicted · none · ref 29
DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.
Learning Agile Striker Skills for Humanoid Soccer Robots from Noisy Sensory Input cs.RO · 2025-12-06 · conditional · none · ref 44
A four-stage RL system with teacher-student distillation and online constrained adaptation enables humanoid robots to achieve robust ball-kicking accuracy under noisy perception in simulation and on physical hardware.

Distillation-ppo: A novel two-stage reinforcement learning framework for humanoid robot perceptive locomotion

fields

years

verdicts

representative citing papers

citing papers explorer