hub Canonical reference

Humanplus: Humanoid shadowing and imitation from humans

Zipeng Fu, Qingqing Zhao, Qi Wu, Gordon Wetzstein, Chelsea Finn · 2024 · arXiv 2406.10454

Canonical reference. 80% of citing Pith papers cite this work as background.

17 Pith papers citing it

Background 80% of classified citations

read on arXiv browse 17 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4 baseline 1

citation-polarity summary

background 4 baseline 1

representative citing papers

BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion

cs.RO · 2025-08-11 · conditional · novelty 7.0

BeyondMimic combines compact motion tracking with a unified guided latent diffusion model to master diverse agile behaviors from human demos and solve unseen downstream tasks via test-time classifier guidance.

Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors

cs.RO · 2026-05-21 · unverdicted · novelty 6.0

Imagine2Real enables zero-shot humanoid-object interaction by unifying motions as 4D point trajectories, tracking only base/hands/object keypoints inside a BFM latent space, and training with progressive simple rewards for mocap deployment.

BifrostUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation

cs.RO · 2026-05-05 · unverdicted · novelty 6.0

BifrostUMI enables robot-free human demonstration capture via VR and wrist cameras to train visuomotor policies that predict keypoint trajectories for transfer to humanoid whole-body control through retargeting.

Learn Weightlessness: Imitate Non-Self-Stabilizing Motions on Humanoid Robot

cs.RO · 2026-04-23 · unverdicted · novelty 6.0

The Weightlessness Mechanism lets humanoid robots imitate non-self-stabilizing motions by dynamically relaxing specific joints to exploit passive environmental contacts, generalizing from single demonstrations to varied setups.

RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild

cs.RO · 2026-04-08 · unverdicted · novelty 6.0

RoSHI is a hybrid wearable that combines sparse IMUs and egocentric SLAM to capture accurate full-body 3D pose and shape data in natural environments for robot learning.

Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control

cs.RO · 2026-03-23 · unverdicted · novelty 6.0

NMR uses VAE-based clustered expert physics refinement and a CNN-Transformer to learn dynamics-aware retargeting, eliminating joint jumps and self-collisions on Unitree G1 while accelerating downstream control policies.

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

cs.RO · 2026-02-17 · unverdicted · novelty 6.0

A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.

HUSKY: Humanoid Skateboarding System via Physics-Aware Whole-Body Control

cs.RO · 2026-02-03 · conditional · novelty 6.0

HUSKY combines humanoid-skateboard dynamics modeling with adversarial motion priors and physics-guided lean-to-steer strategies to achieve real-world stable skateboarding on a humanoid robot.

Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations

cs.RO · 2025-07-01 · unverdicted · novelty 6.0

RIGVid shows that filtered AI-generated videos can serve as effective supervision for complex robotic manipulation tasks without any real demonstrations.

DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion

cs.RO · 2025-05-24 · unverdicted · novelty 6.0

DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.

CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

cs.CV · 2025-03-27 · unverdicted · novelty 6.0

CoT-VLA is a 7B VLA that generates future visual frames autoregressively as planning goals before actions, outperforming prior VLAs by 17% on real-world tasks and 6% in simulation.

DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control

cs.RO · 2025-02-09 · unverdicted · novelty 6.0

DexVLA combines a scaled diffusion action expert with embodiment curriculum learning to achieve better generalization and performance than prior VLA models on diverse robot hardware and long-horizon tasks.

HoloMotion-1 Technical Report

cs.RO · 2026-05-14 · unverdicted · novelty 5.0 · 2 refs

HoloMotion-1 trains a MoE Transformer policy on hybrid video and MoCap motion data to achieve robust zero-shot tracking that transfers directly to real humanoid robots.

RPG: Robust Policy Gating for Smooth Multi-Skill Transitions in Humanoid Fighting

cs.RO · 2026-04-23 · unverdicted · novelty 5.0

RPG trains a single policy with transition and timing randomization for stable multi-skill fighting on humanoids, integrated with locomotion for arbitrary-duration combat.

Switch: Learning Agile Skills Switching for Humanoid Robots

cs.RO · 2026-04-16 · unverdicted · novelty 5.0

Switch enables humanoid robots to perform agile, seamless transitions between locomotion skills via a kinematic skill graph, DRL tracking policy, and real-time graph-search scheduler.

Learning Versatile Humanoid Manipulation with Touch Dreaming

cs.RO · 2026-04-14 · conditional · novelty 5.0

HTD, a multimodal transformer policy trained with behavioral cloning and touch dreaming to predict future tactile latents, achieves a 90.9% relative success rate improvement over baselines on five real-world contact-rich humanoid loco-manipulation tasks.

Toward Seamless Physical Human-Humanoid Interaction: Insights from Control, Intent, and Modeling with a Vision for What Comes Next

cs.RO · 2025-12-08 · unverdicted · novelty 5.0

A literature review of pHHI that proposes a taxonomy of interaction types by modality and engagement level while outlining pathways to integrate control, intent, and modeling for more seamless humanoid-human collaboration.

citing papers explorer

Showing 17 of 17 citing papers.

BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion cs.RO · 2025-08-11 · conditional · none · ref 43
BeyondMimic combines compact motion tracking with a unified guided latent diffusion model to master diverse agile behaviors from human demos and solve unseen downstream tasks via test-time classifier guidance.
Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors cs.RO · 2026-05-21 · unverdicted · none · ref 42
Imagine2Real enables zero-shot humanoid-object interaction by unifying motions as 4D point trajectories, tracking only base/hands/object keypoints inside a BFM latent space, and training with progressive simple rewards for mocap deployment.
BifrostUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation cs.RO · 2026-05-05 · unverdicted · none · ref 3
BifrostUMI enables robot-free human demonstration capture via VR and wrist cameras to train visuomotor policies that predict keypoint trajectories for transfer to humanoid whole-body control through retargeting.
Learn Weightlessness: Imitate Non-Self-Stabilizing Motions on Humanoid Robot cs.RO · 2026-04-23 · unverdicted · none · ref 1
The Weightlessness Mechanism lets humanoid robots imitate non-self-stabilizing motions by dynamically relaxing specific joints to exploit passive environmental contacts, generalizing from single demonstrations to varied setups.
RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild cs.RO · 2026-04-08 · unverdicted · none · ref 7
RoSHI is a hybrid wearable that combines sparse IMUs and egocentric SLAM to capture accurate full-body 3D pose and shape data in natural environments for robot learning.
Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control cs.RO · 2026-03-23 · unverdicted · none · ref 2
NMR uses VAE-based clustered expert physics refinement and a CNN-Transformer to learn dynamics-aware retargeting, eliminating joint jumps and self-collisions on Unitree G1 while accelerating downstream control policies.
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026-02-17 · unverdicted · none · ref 10
A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.
HUSKY: Humanoid Skateboarding System via Physics-Aware Whole-Body Control cs.RO · 2026-02-03 · conditional · none · ref 6
HUSKY combines humanoid-skateboard dynamics modeling with adversarial motion priors and physics-guided lean-to-steer strategies to achieve real-world stable skateboarding on a humanoid robot.
Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations cs.RO · 2025-07-01 · unverdicted · none · ref 34
RIGVid shows that filtered AI-generated videos can serve as effective supervision for complex robotic manipulation tasks without any real demonstrations.
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion cs.RO · 2025-05-24 · unverdicted · none · ref 16
DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models cs.CV · 2025-03-27 · unverdicted · none · ref 17
CoT-VLA is a 7B VLA that generates future visual frames autoregressively as planning goals before actions, outperforming prior VLAs by 17% on real-world tasks and 6% in simulation.
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control cs.RO · 2025-02-09 · unverdicted · none · ref 23
DexVLA combines a scaled diffusion action expert with embodiment curriculum learning to achieve better generalization and performance than prior VLA models on diverse robot hardware and long-horizon tasks.
HoloMotion-1 Technical Report cs.RO · 2026-05-14 · unverdicted · none · ref 17 · 2 links
HoloMotion-1 trains a MoE Transformer policy on hybrid video and MoCap motion data to achieve robust zero-shot tracking that transfers directly to real humanoid robots.
RPG: Robust Policy Gating for Smooth Multi-Skill Transitions in Humanoid Fighting cs.RO · 2026-04-23 · unverdicted · none · ref 29
RPG trains a single policy with transition and timing randomization for stable multi-skill fighting on humanoids, integrated with locomotion for arbitrary-duration combat.
Switch: Learning Agile Skills Switching for Humanoid Robots cs.RO · 2026-04-16 · unverdicted · none · ref 12
Switch enables humanoid robots to perform agile, seamless transitions between locomotion skills via a kinematic skill graph, DRL tracking policy, and real-time graph-search scheduler.
Learning Versatile Humanoid Manipulation with Touch Dreaming cs.RO · 2026-04-14 · conditional · none · ref 5
HTD, a multimodal transformer policy trained with behavioral cloning and touch dreaming to predict future tactile latents, achieves a 90.9% relative success rate improvement over baselines on five real-world contact-rich humanoid loco-manipulation tasks.
Toward Seamless Physical Human-Humanoid Interaction: Insights from Control, Intent, and Modeling with a Vision for What Comes Next cs.RO · 2025-12-08 · unverdicted · none · ref 69
A literature review of pHHI that proposes a taxonomy of interaction types by modality and engagement level while outlining pathways to integrate control, intent, and modeling for more seamless humanoid-human collaboration.

Humanplus: Humanoid shadowing and imitation from humans

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer