Recognition: unknown
Learning and Transfer of Modulated Locomotor Controllers
read the original abstract
We study a novel architecture and training procedure for locomotion tasks. A high-frequency, low-level "spinal" network with access to proprioceptive sensors learns sensorimotor primitives by training on simple tasks. This pre-trained module is fixed and connected to a low-frequency, high-level "cortical" network, with access to all sensors, which drives behavior by modulating the inputs to the spinal network. Where a monolithic end-to-end architecture fails completely, learning with a pre-trained spinal module succeeds at multiple high-level tasks, and enables the effective exploration required to learn from sparse rewards. We test our proposed architecture on three simulated bodies: a 16-dimensional swimming snake, a 20-dimensional quadruped, and a 54-dimensional humanoid. Our results are illustrated in the accompanying video at https://youtu.be/sboPYvhpraQ
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
LatentMimic: Terrain-Adaptive Locomotion via Latent Space Imitation
LatentMimic decouples stylistic fidelity from geometric terrain constraints in quadruped locomotion via marginal latent divergence to a mocap prior and a dynamic replay buffer, yielding higher traversal success than m...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.