Being-H0.7 adds future-aware latent reasoning to direct VLA policies via dual-branch alignment on latent queries, matching world-model benefits at VLA efficiency.
Maple: Encoding dexterous robotic manipulation priors learned from egocentric videos
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 2polarities
background 2representative citing papers
EgoDex delivers the largest egocentric dataset with native 3D hand tracking for dexterous manipulation, enabling imitation learning policies for hand trajectory prediction on 194 tasks.
FingerViP equips each finger with a miniature camera and trains a multi-view diffusion policy that achieves 80.8% success on real-world dexterous tasks previously limited by wrist-camera occlusion.
citing papers explorer
-
Being-H0.7: A Latent World-Action Model from Egocentric Videos
Being-H0.7 adds future-aware latent reasoning to direct VLA policies via dual-branch alignment on latent queries, matching world-model benefits at VLA efficiency.
-
EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video
EgoDex delivers the largest egocentric dataset with native 3D hand tracking for dexterous manipulation, enabling imitation learning policies for hand trajectory prediction on 194 tasks.
-
FingerViP: Learning Real-World Dexterous Manipulation with Fingertip Visual Perception
FingerViP equips each finger with a miniature camera and trains a multi-view diffusion policy that achieves 80.8% success on real-world dexterous tasks previously limited by wrist-camera occlusion.