{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2025:TPPT6AL5A2DBU6JEE7XKTHLMYF","short_pith_number":"pith:TPPT6AL5","schema_version":"1.0","canonical_sha256":"9bdf3f017d06861a792427eea99d6cc15b233fa797458d3770b70b71ecef86d9","source":{"kind":"arxiv","id":"2508.08241","version":4},"attestation_state":"computed","paper":{"title":"BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"A compact motion-tracking setup plus classifier-guided latent diffusion lets one humanoid policy master diverse agile skills and solve unseen tasks zero-shot.","cross_cats":[],"primary_cat":"cs.RO","authors_text":"C. Karen Liu, Guy Tevet, Koushil Sreenath, Qiayuan Liao, Takara E. Truong, Xiaoyu Huang, Yuman Gao","submitted_at":"2025-08-11T17:55:26Z","abstract_excerpt":"The human-like form of humanoid robots positions them uniquely to achieve the agility and versatility in motor skills that humans possess. Learning from human demonstrations offers a scalable approach to acquiring these capabilities. However, prior works either produce unnatural motions or rely on motion-specific tuning to achieve satisfactory naturalness. Furthermore, these methods are often motion- or goal-specific, lacking the versatility to compose diverse skills, especially when solving unseen tasks. We present BeyondMimic, a framework that scales to diverse motions and carries the versat"},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":true,"formal_links_present":true},"canonical_record":{"source":{"id":"2508.08241","kind":"arxiv","version":4},"metadata":{"license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","primary_cat":"cs.RO","submitted_at":"2025-08-11T17:55:26Z","cross_cats_sorted":[],"title_canon_sha256":"63a78279064fd2c1e1986f6a8723c1a1c68c03fe5dfe135871379a33e6f07a53","abstract_canon_sha256":"54665c33a379a25e9e298cbc2ff0f76341ba3d92229ff79f30706d0b57985440"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-05-17T23:38:49.825619Z","signature_b64":"ADdG4o9QS+RtNv5mdroLddjTz21N8tCw/ZxhIFmwceznxDa9w0t2uIJoFwJfDyhLDA6SiWHjpBBdyTWHlunYCA==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"9bdf3f017d06861a792427eea99d6cc15b233fa797458d3770b70b71ecef86d9","last_reissued_at":"2026-05-17T23:38:49.825057Z","signature_status":"signed_v1","first_computed_at":"2026-05-17T23:38:49.825057Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"A compact motion-tracking setup plus classifier-guided latent diffusion lets one humanoid policy master diverse agile skills and solve unseen tasks zero-shot.","cross_cats":[],"primary_cat":"cs.RO","authors_text":"C. Karen Liu, Guy Tevet, Koushil Sreenath, Qiayuan Liao, Takara E. Truong, Xiaoyu Huang, Yuman Gao","submitted_at":"2025-08-11T17:55:26Z","abstract_excerpt":"The human-like form of humanoid robots positions them uniquely to achieve the agility and versatility in motor skills that humans possess. Learning from human demonstrations offers a scalable approach to acquiring these capabilities. However, prior works either produce unnatural motions or rely on motion-specific tuning to achieve satisfactory naturalness. Furthermore, these methods are often motion- or goal-specific, lacking the versatility to compose diverse skills, especially when solving unseen tasks. We present BeyondMimic, a framework that scales to diverse motions and carries the versat"},"claims":{"count":4,"items":[{"kind":"strongest_claim","text":"A compact motion-tracking formulation enables mastering a wide range of radically agile behaviors, including aerial cartwheels, spin-kicks, flip-kicks, and sprinting, with a single setup and shared hyperparameters, while a unified latent diffusion model with classifier guidance solves downstream tasks never encountered during training and transfers zero-shot to real hardware.","source":"verdict.strongest_claim","status":"machine_extracted","claim_id":"C1","attestation":"unclaimed"},{"kind":"weakest_assumption","text":"That classifier guidance during diffusion sampling can reliably steer toward novel objectives (motion inpainting, teleoperation, obstacle avoidance) while preserving motion naturalness and stability without task-specific retraining or post-hoc tuning that would undermine the single-setup claim.","source":"verdict.weakest_assumption","status":"machine_extracted","claim_id":"C2","attestation":"unclaimed"},{"kind":"one_line_summary","text":"BeyondMimic combines compact motion tracking with a unified guided latent diffusion model to master diverse agile behaviors from human demos and solve unseen downstream tasks via test-time classifier guidance.","source":"verdict.one_line_summary","status":"machine_extracted","claim_id":"C3","attestation":"unclaimed"},{"kind":"headline","text":"A compact motion-tracking setup plus classifier-guided latent diffusion lets one humanoid policy master diverse agile skills and solve unseen tasks zero-shot.","source":"verdict.pith_extraction.headline","status":"machine_extracted","claim_id":"C4","attestation":"unclaimed"}],"snapshot_sha256":"b1932b5fa9c0fc1b50f5696f609d8b5927742152a6a476c18feb715751c5c95b"},"source":{"id":"2508.08241","kind":"arxiv","version":4},"verdict":{"id":"ffbce39c-98cd-4932-9fbd-67455a74ae05","model_set":{"reader":"grok-4.3"},"created_at":"2026-05-15T23:09:48.346184Z","strongest_claim":"A compact motion-tracking formulation enables mastering a wide range of radically agile behaviors, including aerial cartwheels, spin-kicks, flip-kicks, and sprinting, with a single setup and shared hyperparameters, while a unified latent diffusion model with classifier guidance solves downstream tasks never encountered during training and transfers zero-shot to real hardware.","one_line_summary":"BeyondMimic combines compact motion tracking with a unified guided latent diffusion model to master diverse agile behaviors from human demos and solve unseen downstream tasks via test-time classifier guidance.","pipeline_version":"pith-pipeline@v0.9.0","weakest_assumption":"That classifier guidance during diffusion sampling can reliably steer toward novel objectives (motion inpainting, teleoperation, obstacle avoidance) while preserving motion naturalness and stability without task-specific retraining or post-hoc tuning that would undermine the single-setup claim.","pith_extraction_headline":"A compact motion-tracking setup plus classifier-guided latent diffusion lets one humanoid policy master diverse agile skills and solve unseen tasks zero-shot."},"references":{"count":94,"sample":[{"doi":"","year":2016,"title":"Kuindersma,et al., Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot.Autonomous robots40(3), 429–455 (2016)","work_id":"f4657e60-3e20-474e-af85-92ffb4f9b5f9","ref_index":1,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2023,"title":"P. M. Wensing,et al., Optimization-based control for dynamic legged robots.IEEE Transactions on Robotics40, 43–63 (2023)","work_id":"c03a17e1-3d02-4edc-8edb-cdd0039c033d","ref_index":2,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2003,"title":"Kajita,et al., Biped walking pattern generation by using preview control of zero- moment point, in2003 IEEE international conference on robotics and automation (Cat","work_id":"429a04cc-fac7-49f0-9f99-3a5d8abfb43b","ref_index":3,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2006,"title":"J. Pratt, J. Carff, S. Drakunov, A. Goswami, Capture point: A step toward humanoid push recovery, in2006 6th IEEE-RAS international conference on humanoid robots(Ieee) (2006), pp. 200–207","work_id":"a2e90583-e38a-48f1-abb0-6dbc70a1d361","ref_index":4,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2014,"title":"R. Deits, R. Tedrake, Footstep planning on uneven terrain with mixed-integer convex opti- mization, in2014 IEEE-RAS international conference on humanoid robots(IEEE) (2014), pp. 279–286","work_id":"330d2432-6922-4a72-b969-a66fed94cfad","ref_index":5,"cited_arxiv_id":"","is_internal_anchor":false}],"resolved_work":94,"snapshot_sha256":"dfc0820d0b81916122074258ae9060b0dc75a03b1338ea6b99339567b087db91","internal_anchors":3},"formal_canon":{"evidence_count":3,"snapshot_sha256":"af991a478e87cbb30324b2a16c52dc6f221141b0c4de426c267c9e2f920bdd5b"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"2508.08241","created_at":"2026-05-17T23:38:49.825148+00:00"},{"alias_kind":"arxiv_version","alias_value":"2508.08241v4","created_at":"2026-05-17T23:38:49.825148+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.2508.08241","created_at":"2026-05-17T23:38:49.825148+00:00"},{"alias_kind":"pith_short_12","alias_value":"TPPT6AL5A2DB","created_at":"2026-05-18T12:33:37.589309+00:00"},{"alias_kind":"pith_short_16","alias_value":"TPPT6AL5A2DBU6JE","created_at":"2026-05-18T12:33:37.589309+00:00"},{"alias_kind":"pith_short_8","alias_value":"TPPT6AL5","created_at":"2026-05-18T12:33:37.589309+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":28,"internal_anchor_count":28,"sample":[{"citing_arxiv_id":"2605.22272","citing_title":"Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors","ref_index":2,"is_internal_anchor":true},{"citing_arxiv_id":"2511.07820","citing_title":"SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control","ref_index":30,"is_internal_anchor":true},{"citing_arxiv_id":"2602.03205","citing_title":"HUSKY: Humanoid Skateboarding System via Physics-Aware Whole-Body Control","ref_index":19,"is_internal_anchor":true},{"citing_arxiv_id":"2604.07993","citing_title":"HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation","ref_index":25,"is_internal_anchor":true},{"citing_arxiv_id":"2605.20209","citing_title":"NaP-Control: Navigating Diffusion Prior for Versatile and Fast Character Control","ref_index":21,"is_internal_anchor":true},{"citing_arxiv_id":"2605.19981","citing_title":"CEER: Compliant End-Effector and Root Control as a Unified Interface for Hierarchical Humanoid Loco-Manipulation","ref_index":2,"is_internal_anchor":true},{"citing_arxiv_id":"2604.04539","citing_title":"FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control","ref_index":43,"is_internal_anchor":true},{"citing_arxiv_id":"2605.15517","citing_title":"Terrain Consistent Reference-Guided RL for Humanoid Navigation Autonomy","ref_index":9,"is_internal_anchor":true},{"citing_arxiv_id":"2511.11218","citing_title":"Humanoid Whole-Body Badminton via Multi-Stage Reinforcement Learning","ref_index":2,"is_internal_anchor":true},{"citing_arxiv_id":"2602.11758","citing_title":"HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model","ref_index":34,"is_internal_anchor":true},{"citing_arxiv_id":"2602.15827","citing_title":"Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching","ref_index":20,"is_internal_anchor":true},{"citing_arxiv_id":"2603.02856","citing_title":"Rhythm: Learning Interactive Whole-Body Control for Dual Humanoids","ref_index":26,"is_internal_anchor":true},{"citing_arxiv_id":"2603.22201","citing_title":"Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control","ref_index":36,"is_internal_anchor":true},{"citing_arxiv_id":"2605.03452","citing_title":"BifrostUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation","ref_index":31,"is_internal_anchor":true},{"citing_arxiv_id":"2604.27711","citing_title":"ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control","ref_index":15,"is_internal_anchor":true},{"citing_arxiv_id":"2605.01518","citing_title":"VOFA: Visual Object Goal Pushing with Force-Adaptive Control for Humanoids","ref_index":19,"is_internal_anchor":true},{"citing_arxiv_id":"2605.01427","citing_title":"SixthSense: Task-Agnostic Proprioception-Only Whole-Body Wrench Estimation for Humanoids","ref_index":24,"is_internal_anchor":true},{"citing_arxiv_id":"2605.01234","citing_title":"TT4D: A Pipeline and Dataset for Table Tennis 4D Reconstruction From Monocular Videos","ref_index":30,"is_internal_anchor":true},{"citing_arxiv_id":"2604.21541","citing_title":"X2-N: A Transformable Wheel-legged Humanoid Robot with Dual-mode Locomotion and Manipulation","ref_index":18,"is_internal_anchor":true},{"citing_arxiv_id":"2604.13015","citing_title":"Learning Versatile Humanoid Manipulation with Touch Dreaming","ref_index":2,"is_internal_anchor":true},{"citing_arxiv_id":"2604.12909","citing_title":"Tree Learning: A Multi-Skill Continual Learning Framework for Humanoid Robots","ref_index":10,"is_internal_anchor":true},{"citing_arxiv_id":"2604.09499","citing_title":"Physics-Informed Reinforcement Learning of Spatial Density Velocity Potentials for Map-Free Racing","ref_index":30,"is_internal_anchor":true},{"citing_arxiv_id":"2604.08508","citing_title":"Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation","ref_index":28,"is_internal_anchor":true},{"citing_arxiv_id":"2604.07993","citing_title":"HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation","ref_index":25,"is_internal_anchor":true},{"citing_arxiv_id":"2604.07331","citing_title":"RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild","ref_index":42,"is_internal_anchor":true}]},"formal_canon":{"evidence_count":3,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF","json":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF.json","graph_json":"https://pith.science/api/pith-number/TPPT6AL5A2DBU6JEE7XKTHLMYF/graph.json","events_json":"https://pith.science/api/pith-number/TPPT6AL5A2DBU6JEE7XKTHLMYF/events.json","paper":"https://pith.science/paper/TPPT6AL5"},"agent_actions":{"view_html":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF","download_json":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF.json","view_paper":"https://pith.science/paper/TPPT6AL5","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=2508.08241&json=true","fetch_graph":"https://pith.science/api/pith-number/TPPT6AL5A2DBU6JEE7XKTHLMYF/graph.json","fetch_events":"https://pith.science/api/pith-number/TPPT6AL5A2DBU6JEE7XKTHLMYF/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF/action/timestamp_anchor","attest_storage":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF/action/storage_attestation","attest_author":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF/action/author_attestation","sign_citation":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF/action/citation_signature","submit_replication":"https://pith.science/pith/TPPT6AL5A2DBU6JEE7XKTHLMYF/action/replication_record"}},"created_at":"2026-05-17T23:38:49.825148+00:00","updated_at":"2026-05-17T23:38:49.825148+00:00"}