Learning agile robotic locomotion skills by imitating animals

· 2020 · arXiv 2004.00784

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

representative citing papers

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation

cs.RO · 2026-04-07 · unverdicted · novelty 7.0

ReV is a referring-aware visuomotor policy using coupled diffusion heads for real-time trajectory replanning in robotic manipulation, trained solely via targeted perturbations to expert demonstrations and achieving higher success rates in simulated and real tasks.

X-Morph: Human Motion Priors for Scalable Robot Learning Across Morphologies

cs.RO · 2026-06-29 · unverdicted · novelty 6.0

X-Morph retargets human motions to kinematically plausible references for multiple legged morphologies, trains privileged RL trackers, and distills them into deployable policies that generalize and enable teleoperation and text-conditioned generation.

StairMaster: Learning to Conquer Risky Hollow Stairs for Agile Quadrupedal Robots

cs.RO · 2026-06-24 · unverdicted · novelty 6.0

StairMaster trains an RL policy that lets a Unitree Go2 quadruped climb hollow stairs up to 55 degrees via zero-shot sim-to-real transfer using cross-attention, SRU memory, and active-perception rewards.

Enforcing Human-like Kinematics in Dexterous Piano Playing via Adversarial Posture Regularization

cs.RO · 2026-06-22 · unverdicted · novelty 6.0

Adversarial Posture Regularization matches RL policy posture distributions to casual human piano-playing data to enforce human-like kinematics in dexterous hands, outperforming baselines on cPSI, BSE, and FAC metrics.

Towards Real-time Control of a CartPole System on a Quantum Computer

quant-ph · 2026-05-03 · unverdicted · novelty 6.0

A single-qubit quantum reinforcement learning agent solves CartPole faster than classical networks and quantifies shot-count versus control-frequency requirements for real-time closed-loop control on NISQ hardware, including direct electronics programming to reduce latency.

Learning Gait-Aware Quadruped Locomotion with Temporal Logic Specifications

cs.RO · 2026-07-01 · unverdicted · novelty 5.0

Framework using parameterized Signal Temporal Logic specifications to shape rewards for PPO-based RL, yielding tighter velocity tracking and more stable training than hand-crafted rewards on Barkour quadruped in MuJoCo simulation.

DynaRetarget: Dynamically-Feasible Retargeting using Sampling-Based Trajectory Optimization

cs.RO · 2026-02-06

citing papers explorer

Showing 7 of 7 citing papers.

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation cs.RO · 2026-04-07 · unverdicted · none · ref 26
ReV is a referring-aware visuomotor policy using coupled diffusion heads for real-time trajectory replanning in robotic manipulation, trained solely via targeted perturbations to expert demonstrations and achieving higher success rates in simulated and real tasks.
X-Morph: Human Motion Priors for Scalable Robot Learning Across Morphologies cs.RO · 2026-06-29 · unverdicted · none · ref 16
X-Morph retargets human motions to kinematically plausible references for multiple legged morphologies, trains privileged RL trackers, and distills them into deployable policies that generalize and enable teleoperation and text-conditioned generation.
StairMaster: Learning to Conquer Risky Hollow Stairs for Agile Quadrupedal Robots cs.RO · 2026-06-24 · unverdicted · none · ref 3
StairMaster trains an RL policy that lets a Unitree Go2 quadruped climb hollow stairs up to 55 degrees via zero-shot sim-to-real transfer using cross-attention, SRU memory, and active-perception rewards.
Enforcing Human-like Kinematics in Dexterous Piano Playing via Adversarial Posture Regularization cs.RO · 2026-06-22 · unverdicted · none · ref 24
Adversarial Posture Regularization matches RL policy posture distributions to casual human piano-playing data to enforce human-like kinematics in dexterous hands, outperforming baselines on cPSI, BSE, and FAC metrics.
Towards Real-time Control of a CartPole System on a Quantum Computer quant-ph · 2026-05-03 · unverdicted · none · ref 14
A single-qubit quantum reinforcement learning agent solves CartPole faster than classical networks and quantifies shot-count versus control-frequency requirements for real-time closed-loop control on NISQ hardware, including direct electronics programming to reduce latency.
Learning Gait-Aware Quadruped Locomotion with Temporal Logic Specifications cs.RO · 2026-07-01 · unverdicted · none · ref 3
Framework using parameterized Signal Temporal Logic specifications to shape rewards for PPO-based RL, yielding tighter velocity tracking and more stable training than hand-crafted rewards on Barkour quadruped in MuJoCo simulation.
DynaRetarget: Dynamically-Feasible Retargeting using Sampling-Based Trajectory Optimization cs.RO · 2026-02-06 · unreviewed · ref 13

Learning agile robotic locomotion skills by imitating animals

fields

years

verdicts

representative citing papers

citing papers explorer