Barkour: Benchmarking Animal-level Agility with Quadruped Robots

Adil Dostmohamed; Alejandro Escontrela; Atil Iscen; Baruch Tabanpour; Bauyrjan Jyenis; Carolina Parada; Daniel Freeman; Daniel Zheng; Deepali Jain; Diego Reyes

arxiv: 2305.14654 · v1 · pith:QO5UGCZFnew · submitted 2023-05-24 · 💻 cs.RO · cs.AI

Barkour: Benchmarking Animal-level Agility with Quadruped Robots

Ken Caluwaerts , Atil Iscen , J. Chase Kew , Wenhao Yu , Tingnan Zhang , Daniel Freeman , Kuang-Huei Lee , Lisa Lee

show 36 more authors

Stefano Saliceti Vincent Zhuang Nathan Batchelor Steven Bohez Federico Casarini Jose Enrique Chen Omar Cortes Erwin Coumans Adil Dostmohamed Gabriel Dulac-Arnold Alejandro Escontrela Erik Frey Roland Hafner Deepali Jain Bauyrjan Jyenis Yuheng Kuang Edward Lee Linda Luu Ofir Nachum Ken Oslund Jason Powell Diego Reyes Francesco Romano Feresteh Sadeghi Ron Sloat Baruch Tabanpour Daniel Zheng Michael Neunert Raia Hadsell Nicolas Heess Francesco Nori Jeff Seto Carolina Parada Vikas Sindhwani Vincent Vanhoucke Jie Tan

This is my paper

classification 💻 cs.RO cs.AI

keywords agilityrobotslocomotionrobotskillsvariousagileanimal-level

0 comments

read the original abstract

Animals have evolved various agile locomotion strategies, such as sprinting, leaping, and jumping. There is a growing interest in developing legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agility. We introduce the Barkour benchmark, an obstacle course to quantify agility for legged robots. Inspired by dog agility competitions, it consists of diverse obstacles and a time based scoring mechanism. This encourages researchers to develop controllers that not only move fast, but do so in a controllable and versatile way. To set strong baselines, we present two methods for tackling the benchmark. In the first approach, we train specialist locomotion skills using on-policy reinforcement learning methods and combine them with a high-level navigation controller. In the second approach, we distill the specialist skills into a Transformer-based generalist locomotion policy, named Locomotion-Transformer, that can handle various terrains and adjust the robot's gait based on the perceived environment and robot states. Using a custom-built quadruped robot, we demonstrate that our method can complete the course at half the speed of a dog. We hope that our work represents a step towards creating controllers that enable robots to reach animal-level agility.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution
cs.CL 2023-09 unverdicted novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.
ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders
cs.RO 2026-05 unverdicted novelty 7.0

ARC-RL provides four new MuJoCo continuous-control environments with hexapod and quadruped morphologies inspired by ARC Raiders, a unified multi-component reward without motion capture, CPG expert demonstrators, and e...
ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders
cs.RO 2026-05 accept novelty 6.0

ARC-RL is a new suite of four MuJoCo continuous-control environments featuring game-inspired hexapod and quadruped morphologies, a single closed-form multi-component reward function, CPG demonstrators, and empirical c...
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
cs.RO 2026-02 unverdicted novelty 6.0

A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over...
Learning Gait-Aware Quadruped Locomotion with Temporal Logic Specifications
cs.RO 2026-07 unverdicted novelty 5.0

Framework using parameterized Signal Temporal Logic specifications to shape rewards for PPO-based RL, yielding tighter velocity tracking and more stable training than hand-crafted rewards on Barkour quadruped in MuJoC...
ParkourFormer: Integrating Predictive Supervision and Sequence Modeling into Parkour Locomotion
cs.RO 2026-05 unverdicted novelty 5.0

ParkourFormer achieves 93.85% average success on multi-terrain humanoid parkour by fusing Transformer sequence modeling with supervised future-state prediction.
Robot Squid Game: Quadrupedal Locomotion for Traversing Narrow Tunnels
cs.RO 2026-05 unverdicted novelty 5.0

A teacher-student RL policy distillation approach combined with procedural tunnel generation enables quadruped robots to traverse narrow tunnels consistently in both simulation and real-world tests.
Intelligent Automation for Embodied Benchmark Construction: Pipelines, Embodiments, Simulators, and Trends
cs.RO 2026-06 unverdicted novelty 3.0

Automation in embodied benchmark construction shifts costs from acquisition toward validation, auditability, version control, and long-term governance instead of simply lowering total cost.