Deep reinforcement learning for robotic bipedal locomotion: A brief survey

Bao, L · 2025 · arXiv 2404.17070

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Multi-Gait Learning for Humanoid Robots Using Reinforcement Learning with Selective Adversarial Motion Prior

cs.RO · 2026-04-21 · unverdicted · novelty 6.0

Selective AMP in RL enables a single policy for five humanoid gaits with faster convergence and better performance on stability tasks without losing dynamic agility.

DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion

cs.RO · 2025-05-24 · unverdicted · novelty 6.0

DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.

Toward Seamless Physical Human-Humanoid Interaction: Insights from Control, Intent, and Modeling with a Vision for What Comes Next

cs.RO · 2025-12-08 · unverdicted · novelty 5.0

A literature review of pHHI that proposes a taxonomy of interaction types by modality and engagement level while outlining pathways to integrate control, intent, and modeling for more seamless humanoid-human collaboration.

citing papers explorer

Showing 3 of 3 citing papers.

Multi-Gait Learning for Humanoid Robots Using Reinforcement Learning with Selective Adversarial Motion Prior
cs.RO · 2026-04-21 · unverdicted · none · ref 1
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion
cs.RO · 2025-05-24 · unverdicted · none · ref 34
Toward Seamless Physical Human-Humanoid Interaction: Insights from Control, Intent, and Modeling with a Vision for What Comes Next
cs.RO · 2025-12-08 · unverdicted · none · ref 103
