pith. sign in

Yuke Zhu

Identifiers

  • name variant Yuke Zhu 0.60 · backfill

Papers (40)

  1. GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors cs.RO · 2026 · author #17
  2. Cosmos 3: Omnimodal World Models for Physical AI cs.CV · 2026 · author #289
  3. HumanoidMimicGen: Data Generation for Loco-Manipulation via Whole-Body Planning cs.RO · 2026 · author #10
  4. MotionBricks: Scalable Real-Time Motions with Modular Latent Generative Model and Smart Primitives cs.RO · 2026 · author #15
  5. A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies cs.RO · 2026 · author #5
  6. LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation cs.CV · 2026 · author #4
  7. World Action Models are Zero-shot Policies cs.RO · 2026 · author #34
  8. DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos cs.RO · 2026 · author #28
  9. SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control cs.RO · 2025 · author #28
  10. Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning cs.RO · 2025 · author #99
  11. World Simulation with Video Foundation Models for Physical AI cs.CV · 2025 · author #88
  12. FLARE: Robot Learning with Implicit World Modeling cs.RO · 2025 · author #20
  13. DreamGen: Unlocking Generalization in Robot Learning through Video World Models cs.RO · 2025 · author #27
  14. GR00T N1: An Open Foundation Model for Generalist Humanoid Robots cs.RO · 2025 · author #41
  15. LongVILA: Scaling Long-Context Visual Language Models for Long Videos cs.CV · 2024 · author #16
  16. RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots cs.RO · 2024 · author #8
  17. DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset cs.RO · 2024 · author #98
  18. MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations cs.RO · 2023 · author #7
  19. Eureka: Human-Level Reward Design via Coding Large Language Models cs.RO · 2023 · author #7
  20. Open X-Embodiment: Robotic Learning Datasets and RT-X Models cs.RO · 2023 · author #282
  21. LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning cs.AI · 2023 · author #6
  22. Voyager: An Open-Ended Embodied Agent with Large Language Models cs.AI · 2023 · author #6
  23. What Matters in Learning from Offline Human Demonstrations for Robot Manipulation cs.RO · 2021 · author #9
  24. robosuite: A Modular Simulation Framework and Benchmark for Robot Learning cs.RO · 2020 · author #1
  25. DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion cs.CV · 2019 · author #3
  26. RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation cs.RO · 2018 · author #2
  27. Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks cs.RO · 2018 · author #2
  28. Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration cs.CV · 2018 · author #4
  29. Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision cs.RO · 2018 · author #2
  30. Reinforcement and Imitation Learning for Diverse Visuomotor Skills cs.RO · 2018 · author #1
  31. AI2-THOR: An Interactive 3D Environment for Visual AI cs.CV · 2017 · author #10
  32. Neural Task Programming: Learning to Generalize Across Hierarchical Tasks cs.AI · 2017 · author #3
  33. ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems cs.RO · 2017 · author #4
  34. Visual Semantic Planning using Deep Successor Representations cs.CV · 2017 · author #1
  35. Scene Graph Generation by Iterative Message Passing cs.CV · 2017 · author #2
  36. Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning cs.CV · 2016 · author #1
  37. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations cs.CV · 2016 · author #2
  38. Visual7W: Grounded Question Answering in Images cs.CV · 2015 · author #1
  39. Action Recognition by Hierarchical Mid-level Action Elements cs.CV · 2015 · author #2
  40. Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries cs.CV · 2015 · author #1

Mentions

  • 1511.03416 #1 · backfill · confidence 0.70 Yuke Zhu
  • 1508.07654 #2 · backfill · confidence 0.70 Yuke Zhu
  • 1507.05670 #1 · backfill · confidence 0.70 Yuke Zhu
  • 2606.05160 #17 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2606.02800 #289 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2605.27724 #10 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2511.07820 #28 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2505.15659 #20 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2310.17596 #7 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2408.10188 #16 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2602.06949 #28 · arxiv_oai · confidence 0.70 Yuke Zhu
  • 2505.12705 #27 · arxiv_oai · confidence 0.70 Yuke Zhu

Frequent Coauthors