pith. sign in

arxiv: 2512.00727 · v2 · pith:R5KPEM7Knew · submitted 2025-11-30 · 💻 cs.RO

Beyond Topology: A Morphological Symmetry Graph Representation for Locomotion Policy Learning

classification 💻 cs.RO
keywords graphlocomotionpolicysymmetrylearningmorphologicalrepresentationcommand
0
0 comments X
read the original abstract

Reinforcement learning has enabled impressive locomotion skills on articulated robots, but common policy representations remain only weakly aligned with robot physics. Generic networks ignore kinematic structure, while graph-based policies encode connectivity without specifying how physical quantities transform across symmetric body parts. We introduce a morphological symmetry graph representation for locomotion policy learning and instantiate it in MS-PPO. Starting from the robot's topological graph, our representation augments each observation and action space with the permutation and sign transformations induced by morphological symmetry. This yields a symmetry-equivariant graph actor and a symmetry-invariant graph critic, enforcing the desired policy and value constraints by construction rather than through reward shaping or data augmentation. We evaluate MS-PPO on a variety of locomotion tasks using both Unitree Go2 quadruped and Unitree G1 humanoid, including command tracking, asymmetric joint failures, out-of-distribution command generalization, and zero-shot sim-to-real deployment. Experiments show improved symmetry generalization, robustness, sample efficiency, and model efficiency over topology- and symmetry-aware baselines.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Morphologically Equivariant Flow Matching for Bimanual Mobile Manipulation

    cs.RO 2026-05 conditional novelty 7.0

    A morphologically equivariant flow matching policy for bimanual robots enforces reflective symmetry to improve sample efficiency and enable zero-shot generalization to mirrored task configurations.

  2. Toward Hardware-Agnostic Quadrupedal World Models via Morphology Conditioning

    cs.RO 2026-04 unverdicted novelty 6.0

    Morphology-conditioned quadrupedal world model enables zero-shot generalization to new robot embodiments for locomotion tasks.

  3. WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems

    cs.LG 2026-03 unverdicted novelty 5.0

    WestWorld introduces a scalable trajectory world model with Sys-MoE routing via system embeddings and structural embeddings for physical knowledge, pretrained on 89 environments to improve zero-shot prediction and rea...