Hybrid internal model: Learning agile legged locomotion with simulated robot response

· 2023 · arXiv 2312.11460

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

representative citing papers

Embedding Hybrid Systems into Continuous Latent Vector Fields

cs.LG · 2026-06-09 · unverdicted · novelty 7.0

An n-dimensional hybrid system embeds into a continuous vector field in m > 2n dimensions, enabling latent Neural ODEs with consistency losses to recover hybrid flows from time series.

FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control

cs.RO · 2026-06-26 · unverdicted · novelty 6.0

FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

cs.RO · 2026-02-17 · unverdicted · novelty 6.0

A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.

IMPACT: Learning Internal-Model Predictive Control for Forceful Robotic Manipulation

cs.RO · 2026-06-09 · unverdicted · novelty 5.0

IMPACT decouples forceful manipulation into task-planning and internal-model predictive control, claiming higher success rates, better generalization to unseen weights, and improved safety and energy efficiency in simulation and real-world tests.

Mind Your Steps: A General Learning Framework for Accurate Humanoid Foothold Tracking

cs.RO · 2026-06-06 · unverdicted · novelty 5.0

A lightweight RL framework trains terrain-agnostic 3D foothold-tracking policies for humanoids that transfer directly to real-world use as standalone low-level controllers.

MUJICA: Multi-skill Unified Joint Integration of Control Architecture for Wheeled-Legged Robots

cs.RO · 2026-05-13 · unverdicted · novelty 5.0

A single reinforcement learning policy jointly trains multiple locomotion skills for wheeled-legged robots with DC-motor constraints and learns a proprioceptive skill selector for adaptive behavior.

Learning to Balance Motor Thermal Safety and Quadrupedal Locomotion Performance with Residual Policy

cs.RO · 2026-05-26 · unverdicted · novelty 4.0 · 2 refs

A two-stage RL framework with a thermal-aware residual policy enables a Unitree A1 quadruped to achieve over 13 minutes of stable locomotion under 3 kg payload versus 5 minutes before overheating with the nominal policy alone.

citing papers explorer

Showing 7 of 7 citing papers.

Embedding Hybrid Systems into Continuous Latent Vector Fields cs.LG · 2026-06-09 · unverdicted · none · ref 26
An n-dimensional hybrid system embeds into a continuous vector field in m > 2n dimensions, enabling latent Neural ODEs with consistency losses to recover hybrid flows from time series.
FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control cs.RO · 2026-06-26 · unverdicted · none · ref 33
FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026-02-17 · unverdicted · none · ref 21
A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.
IMPACT: Learning Internal-Model Predictive Control for Forceful Robotic Manipulation cs.RO · 2026-06-09 · unverdicted · none · ref 59
IMPACT decouples forceful manipulation into task-planning and internal-model predictive control, claiming higher success rates, better generalization to unseen weights, and improved safety and energy efficiency in simulation and real-world tests.
Mind Your Steps: A General Learning Framework for Accurate Humanoid Foothold Tracking cs.RO · 2026-06-06 · unverdicted · none · ref 23
A lightweight RL framework trains terrain-agnostic 3D foothold-tracking policies for humanoids that transfer directly to real-world use as standalone low-level controllers.
MUJICA: Multi-skill Unified Joint Integration of Control Architecture for Wheeled-Legged Robots cs.RO · 2026-05-13 · unverdicted · none · ref 18
A single reinforcement learning policy jointly trains multiple locomotion skills for wheeled-legged robots with DC-motor constraints and learns a proprioceptive skill selector for adaptive behavior.
Learning to Balance Motor Thermal Safety and Quadrupedal Locomotion Performance with Residual Policy cs.RO · 2026-05-26 · unverdicted · none · ref 21 · 2 links
A two-stage RL framework with a thermal-aware residual policy enables a Unitree A1 quadruped to achieve over 13 minutes of stable locomotion under 3 kg payload versus 5 minutes before overheating with the nominal policy alone.

Hybrid internal model: Learning agile legged locomotion with simulated robot response

fields

years

verdicts

representative citing papers

citing papers explorer