Learning to walk in minutes using massively parallel deep reinforcement learning,

· 2022

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

browse 6 citing papers

representative citing papers

Stability of Control Lyapunov Function Guided Reinforcement Learning

eess.SY · 2026-05-03 · conditional · novelty 6.0

CLF-guided RL yields exponentially stable optimal controllers, with proofs in continuous and discrete time, numerical checks on double integrator and cart-pole, and implementation on a walking humanoid.

MUJICA: Multi-skill Unified Joint Integration of Control Architecture for Wheeled-Legged Robots

cs.RO · 2026-05-13 · unverdicted · novelty 5.0

A single reinforcement learning policy jointly trains multiple locomotion skills for wheeled-legged robots with DC-motor constraints and learns a proprioceptive skill selector for adaptive behavior.

Reset-Free Reinforcement Learning for Real-World Agile Driving: An Empirical Study

cs.RO · 2026-04-09 · unverdicted · novelty 5.0

Empirical comparison shows a clear sim-to-real gap in reset-free RL for agile driving: TD-MPC2 outperforms the MPPI baseline in the real world while SAC excels in simulation, and residual learning benefits simulation but does not transfer.

Learning Agile Striker Skills for Humanoid Soccer Robots from Noisy Sensory Input

cs.RO · 2025-12-06 · conditional · novelty 5.0

A four-stage RL system with teacher-student distillation and online constrained adaptation enables humanoid robots to achieve robust ball-kicking accuracy under noisy perception in simulation and on physical hardware.

Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

cs.RO · 2026-04-21 · unverdicted · novelty 4.0

Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.

CART: Context-Aware Terrain Adaptation using Temporal Sequence Selection for Legged Robots

cs.RO · 2026-04-15

citing papers explorer

Showing 6 of 6 citing papers.

Stability of Control Lyapunov Function Guided Reinforcement Learning eess.SY · 2026-05-03 · conditional · none · ref 23
CLF-guided RL yields exponentially stable optimal controllers, with proofs in continuous and discrete time, numerical checks on double integrator and cart-pole, and implementation on a walking humanoid.
MUJICA: Multi-skill Unified Joint Integration of Control Architecture for Wheeled-Legged Robots cs.RO · 2026-05-13 · unverdicted · none · ref 31
A single reinforcement learning policy jointly trains multiple locomotion skills for wheeled-legged robots with DC-motor constraints and learns a proprioceptive skill selector for adaptive behavior.
Reset-Free Reinforcement Learning for Real-World Agile Driving: An Empirical Study cs.RO · 2026-04-09 · unverdicted · none · ref 3
Empirical comparison shows a clear sim-to-real gap in reset-free RL for agile driving: TD-MPC2 outperforms the MPPI baseline in the real world while SAC excels in simulation, and residual learning benefits simulation but does not transfer.
Learning Agile Striker Skills for Humanoid Soccer Robots from Noisy Sensory Input cs.RO · 2025-12-06 · conditional · none · ref 7
A four-stage RL system with teacher-student distillation and online constrained adaptation enables humanoid robots to achieve robust ball-kicking accuracy under noisy perception in simulation and on physical hardware.
Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input cs.RO · 2026-04-21 · unverdicted · none · ref 28
Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.
CART: Context-Aware Terrain Adaptation using Temporal Sequence Selection for Legged Robots cs.RO · 2026-04-15 · unreviewed · ref 47

Learning to walk in minutes using massively parallel deep reinforcement learning,

fields

years

verdicts

representative citing papers

citing papers explorer