TAGA: Terrain-aware Active Gaze Learning for Generalizable Agile Humanoid Locomotion

Fangzhou Xu; Guillaume Sartoretti; Hongtao Wang; Hongyi Li; Mingfeng Fan; Peizhuo Li; Shuhao Liao; Yongbin Jin; Yuhong Cao; Yuxuan Ma


Agile humanoid locomotion across diverse, challenging terrain demands both wide perceptual coverage and precise understanding of local geometry. Motivated by the way humans selectively look at relevant terrain during locomotion, we introduce TAGA, a Terrain-aware Active Gaze learning framework for attention-based humanoid control. By fusing vision, proprioception, and motion commands, our framework guides the model to learn anticipatory cues and actively attend to specific areas of the height scan, passing only these informative regions to the downstream network. This adaptively increases the information density of observations under tight onboard computational constraints, enabling fine-grained perceptive locomotion over larger-scale terrains. We find that such gaze behaviors emerge naturally through reinforcement learning alone, without additional supervision or explicit guidance, and that they significantly improve training efficiency. As a result, the trained policy demonstrates robust and generalizable locomotion in simulation and on hardware, including reliable terrain-aware foothold selection, elevated-platform traversal, competitive sparse-foothold traversal, and the largest reported real-world gap traversal distance of 1.2 m among perceptive humanoid locomotion systems, while maintaining stability under severe perceptual disturbances and environmental interference.
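
To make the idea of attending to specific areas of the height scan concrete, the following is a minimal illustrative sketch, not the TAGA architecture. It assumes a height scan stored as a 2D grid, splits it into square patches, scores each patch against a query vector, and keeps the highest-weighted patches. The names `patch_features` and `gaze_select`, the two-number patch descriptor (mean height and height range), and the hand-picked query are all hypothetical; in the framework described above, the equivalent of the query would be learned from vision, proprioception, and motion commands through reinforcement learning.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def patch_features(height_scan, patch):
    """Split a 2D height scan into patch x patch tiles (row-major order)
    and describe each tile by (mean height, height range)."""
    rows, cols = len(height_scan), len(height_scan[0])
    feats = []
    for r in range(0, rows, patch):
        for c in range(0, cols, patch):
            cells = [height_scan[i][j]
                     for i in range(r, min(r + patch, rows))
                     for j in range(c, min(c + patch, cols))]
            feats.append((sum(cells) / len(cells), max(cells) - min(cells)))
    return feats

def gaze_select(height_scan, query, patch=2, top_k=2):
    """Score every patch against a 2D query (assumed, hand-picked here),
    normalize the scores with softmax, and return the attention weights
    together with the indices of the top_k patches."""
    feats = patch_features(height_scan, patch)
    scores = [query[0] * f[0] + query[1] * f[1] for f in feats]
    weights = softmax(scores)
    ranked = sorted(range(len(feats)), key=lambda i: weights[i], reverse=True)
    return weights, sorted(ranked[:top_k])

# A 4 x 4 scan (heights in meters) with a drop along the upper-right edge.
scan = [
    [0.0, 0.0, 0.0, -0.5],
    [0.0, 0.0, 0.0, -0.5],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
# The query weights height range only, so uneven patches attract attention.
weights, selected = gaze_select(scan, query=(0.0, 4.0), patch=2, top_k=1)
print(selected)  # index of the patch containing the drop
```

Only the selected patches would be handed to a downstream network in this sketch, which is one simple way to raise the information density of a fixed-size observation; the actual selection mechanism in TAGA may differ.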

