An n-dimensional hybrid system embeds into a continuous vector field in m > 2n dimensions, enabling latent Neural ODEs with consistency losses to recover hybrid flows from time series.
Hybrid internal model: Learning agile legged locomotion with simulated robot response
7 Pith papers cite this work. Polarity classification is still indexing.
years
2026 7verdicts
UNVERDICTED 7representative citing papers
FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.
A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.
IMPACT decouples forceful manipulation into task-planning and internal-model predictive control, claiming higher success rates, better generalization to unseen weights, and improved safety and energy efficiency in simulation and real-world tests.
A lightweight RL framework trains terrain-agnostic 3D foothold-tracking policies for humanoids that transfer directly to real-world use as standalone low-level controllers.
A single reinforcement learning policy jointly trains multiple locomotion skills for wheeled-legged robots with DC-motor constraints and learns a proprioceptive skill selector for adaptive behavior.
A two-stage RL framework with a thermal-aware residual policy enables a Unitree A1 quadruped to achieve over 13 minutes of stable locomotion under 3 kg payload versus 5 minutes before overheating with the nominal policy alone.
citing papers explorer
-
Embedding Hybrid Systems into Continuous Latent Vector Fields
An n-dimensional hybrid system embeds into a continuous vector field in m > 2n dimensions, enabling latent Neural ODEs with consistency losses to recover hybrid flows from time series.
-
FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control
FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.
-
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.
-
IMPACT: Learning Internal-Model Predictive Control for Forceful Robotic Manipulation
IMPACT decouples forceful manipulation into task-planning and internal-model predictive control, claiming higher success rates, better generalization to unseen weights, and improved safety and energy efficiency in simulation and real-world tests.
-
Mind Your Steps: A General Learning Framework for Accurate Humanoid Foothold Tracking
A lightweight RL framework trains terrain-agnostic 3D foothold-tracking policies for humanoids that transfer directly to real-world use as standalone low-level controllers.
-
MUJICA: Multi-skill Unified Joint Integration of Control Architecture for Wheeled-Legged Robots
A single reinforcement learning policy jointly trains multiple locomotion skills for wheeled-legged robots with DC-motor constraints and learns a proprioceptive skill selector for adaptive behavior.
-
Learning to Balance Motor Thermal Safety and Quadrupedal Locomotion Performance with Residual Policy
A two-stage RL framework with a thermal-aware residual policy enables a Unitree A1 quadruped to achieve over 13 minutes of stable locomotion under 3 kg payload versus 5 minutes before overheating with the nominal policy alone.