Learning and Transfer of Modulated Locomotor Controllers

Nicolas Heess , Greg Wayne , Yuval Tassa , Timothy Lillicrap , Martin Riedmiller , David Silver

Authors on Pith no claims yet

classification 💻 cs.RO cs.AI

keywords architecturedimensionalnetworkspinaltasksaccesshigh-levellearning

read the original abstract

We study a novel architecture and training procedure for locomotion tasks. A high-frequency, low-level "spinal" network with access to proprioceptive sensors learns sensorimotor primitives by training on simple tasks. This pre-trained module is fixed and connected to a low-frequency, high-level "cortical" network, with access to all sensors, which drives behavior by modulating the inputs to the spinal network. Where a monolithic end-to-end architecture fails completely, learning with a pre-trained spinal module succeeds at multiple high-level tasks, and enables the effective exploration required to learn from sparse rewards. We test our proposed architecture on three simulated bodies: a 16-dimensional swimming snake, a 20-dimensional quadruped, and a 54-dimensional humanoid. Our results are illustrated in the accompanying video at https://youtu.be/sboPYvhpraQ

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

LatentMimic: Terrain-Adaptive Locomotion via Latent Space Imitation
cs.RO 2026-04 unverdicted novelty 6.0

LatentMimic decouples stylistic fidelity from geometric terrain constraints in quadruped locomotion via marginal latent divergence to a mocap prior and a dynamic replay buffer, yielding higher traversal success than m...