pith. sign in

Pieter Abbeel

Identifiers

  • name variant Pieter Abbeel 0.60 · backfill

Papers (156)

  1. SARM2: Multi-Task Stage Aware Reward Modeling for Self Improving Robotic Manipulation cs.RO · 2026 · author #8
  2. LadderMan: Learning Humanoid Perceptive Ladder Climbing cs.RO · 2026 · author #4
  3. Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning cs.LG · 2026 · author #7
  4. When Does Non-Uniform Replay Matter in Reinforcement Learning? cs.LG · 2026 · author #5
  5. World Model for Robot Learning: A Comprehensive Survey cs.RO · 2026 · author #15
  6. Offline Materials Optimization with CliqueFlowmer cs.AI · 2026 · author #4
  7. Reward-Conditioned Reinforcement Learning cs.LG · 2026 · author #3
  8. Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026 · author #6
  9. DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos cs.RO · 2026 · author #26
  10. Large Video Planner Enables Generalizable Robot Control cs.RO · 2025 · author #9
  11. DIPOLE: Fusing Vision and Geometry for Robust Visuomotor Generalization cs.RO · 2025 · author #7
  12. SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation cs.RO · 2025 · author #4
  13. Relative Entropy Pathwise Policy Optimization cs.LG · 2025 · author #5
  14. ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation cs.RO · 2025 · author #4
  15. Rodrigues Network for Learning Robot Actions cs.RO · 2025 · author #5
  16. One Step Diffusion via Shortcut Models cs.LG · 2024 · author #4
  17. A StrongREJECT for Empty Jailbreaks cs.LG · 2024 · author #7
  18. World Model on Million-Length Video And Language With Blockwise RingAttention cs.LG · 2024 · author #4
  19. Any-point Trajectory Modeling for Policy Learning cs.RO · 2023 · author #7
  20. Open X-Embodiment: Robotic Learning Datasets and RT-X Models cs.RO · 2023 · author #193
  21. Learning Interactive Real-World Simulators cs.AI · 2023 · author #7
  22. Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own cs.RO · 2023 · author #8
  23. Ring Attention with Blockwise Transformers for Near-Infinite Context cs.CL · 2023 · author #3
  24. The False Promise of Imitating Proprietary LLMs cs.CL · 2023 · author #6
  25. Aligning Text-to-Image Models using Human Feedback cs.LG · 2023 · author #7
  26. Decision Transformer: Reinforcement Learning via Sequence Modeling cs.LG · 2021 · author #7
  27. VideoGPT: Video Generation using VQ-VAE and Transformers cs.CV · 2021 · author #3
  28. Denoising Diffusion Probabilistic Models cs.LG · 2020 · author #3
  29. BagNet: Berkeley Analog Generator with Layout Optimizer Boosted with Deep Neural Networks eess.SP · 2019 · author #3
  30. Benchmarking Model-Based Reinforcement Learning cs.LG · 2019 · author #9
  31. On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference cs.LG · 2019 · author #3
  32. Evaluating Protein Transfer Learning with TAPE cs.LG · 2019 · author #7
  33. Learning latent state representation for speeding up exploration cs.LG · 2019 · author #4
  34. MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies cs.LG · 2019 · author #4
  35. Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules cs.CV · 2019 · author #4
  36. Learning Robotic Manipulation through Visual Planning and Acting cs.RO · 2019 · author #4
  37. Quasi-Direct Drive for Low-Cost Compliant Robotic Manipulation cs.RO · 2019 · author #13
  38. Towards Characterizing Divergence in Deep Q-Learning cs.LG · 2019 · author #3
  39. Domain Randomization for Active Pose Estimation cs.CV · 2019 · author #7
  40. Reinforcement Learning on Variable Impedance Controller for High-Precision Robotic Assembly cs.RO · 2019 · author #7
  41. Preferences Implicit in the State of the World cs.LG · 2019 · author #4
  42. Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight cs.LG · 2019 · author #4
  43. Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design cs.LG · 2019 · author #5
  44. Soft Actor-Critic Algorithms and Applications cs.LG · 2018 · author #10
  45. Guiding Policies with Language via Meta-Learning cs.LG · 2018 · author #7
  46. An Algorithmic Perspective on Imitation Learning cs.RO · 2018 · author #5
  47. Modular Architecture for StarCraft II with Deep Reinforcement Learning cs.AI · 2018 · author #6
  48. One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks cs.LG · 2018 · author #2
  49. Establishing Appropriate Trust via Critical States cs.RO · 2018 · author #3
  50. Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation cs.RO · 2018 · author #3
  51. SFV: Reinforcement Learning of Physical Skills from Videos cs.GR · 2018 · author #4
  52. Model-Based Reinforcement Learning via Meta-Policy Optimization cs.LG · 2018 · author #6
  53. SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning cs.LG · 2018 · author #4
  54. Transfer Learning for Estimating Causal Effects using Neural Networks stat.ML · 2018 · author #6
  55. Variational Option Discovery Algorithms cs.AI · 2018 · author #4
  56. Learning Plannable Representations with Causal InfoGAN cs.LG · 2018 · author #5
  57. Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings cs.LG · 2018 · author #5
  58. The Limits and Potentials of Deep Learning for Robotics cs.RO · 2018 · author #8
  59. Latent Space Policies for Hierarchical Reinforcement Learning cs.LG · 2018 · author #3
  60. DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills cs.GR · 2018 · author #2
  61. Stochastic Adversarial Video Prediction cs.CV · 2018 · author #4
  62. Universal Planning Networks cs.LG · 2018 · author #3
  63. Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning cs.LG · 2018 · author #5
  64. Learning Robotic Assembly from CAD cs.RO · 2018 · author #5
  65. Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines cs.LG · 2018 · author #8
  66. Composable Deep Reinforcement Learning for Robotic Manipulation cs.LG · 2018 · author #5
  67. Accelerated Methods for Deep Reinforcement Learning cs.LG · 2018 · author #2
  68. Some Considerations on Learning to Explore via Meta-Reinforcement Learning cs.AI · 2018 · author #7
  69. Model-Ensemble Trust-Region Policy Optimization cs.LG · 2018 · author #5
  70. Meta-Reinforcement Learning of Structured Exploration Strategies cs.LG · 2018 · author #4
  71. Evolved Policy Gradients cs.LG · 2018 · author #7
  72. One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning cs.LG · 2018 · author #6
  73. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor cs.LG · 2018 · author #3
  74. PixelSNAIL: An Improved Autoregressive Generative Model cs.LG · 2017 · author #4
  75. A Berkeley View of Systems Challenges for AI cs.AI · 2017 · author #14
  76. Safer Classification by Synthesis cs.LG · 2017 · author #5
  77. Interpretable and Pedagogical Examples cs.AI · 2017 · author #2
  78. Meta Learning Shared Hierarchies cs.LG · 2017 · author #4
  79. Asymmetric Actor Critic for Image-Based Robot Learning cs.RO · 2017 · author #5
  80. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization cs.RO · 2017 · author #4
  81. Domain Randomization and Generative Models for Robotic Grasping cs.RO · 2017 · author #11
  82. Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation cs.LG · 2017 · author #7
  83. Synkhronos: a Multi-GPU Theano Extension for Data Parallelism cs.DC · 2017 · author #2
  84. Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments cs.LG · 2017 · author #6
  85. Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation cs.LG · 2017 · author #4
  86. Overcoming Exploration in Reinforcement Learning with Demonstrations cs.LG · 2017 · author #5
  87. One-Shot Visual Imitation Learning via Meta-Learning cs.LG · 2017 · author #4
  88. Learning with Opponent-Learning Awareness cs.AI · 2017 · author #5
  89. Learning Generalized Reactive Policies using Deep Neural Networks cs.AI · 2017 · author #5
  90. Deep Object-Centric Representations for Generalizable Robot Learning cs.RO · 2017 · author #2
  91. Mutual Alignment Transfer Learning cs.AI · 2017 · author #3
  92. Reverse Curriculum Generation for Reinforcement Learning cs.AI · 2017 · author #5
  93. Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation cs.LG · 2017 · author #3
  94. A Simple Neural Attentive Meta-Learner cs.AI · 2017 · author #4
  95. Hindsight Experience Replay cs.LG · 2017 · author #9
  96. Parameter Space Noise for Exploration cs.LG · 2017 · author #8
  97. UCB Exploration via Q-Ensembles cs.LG · 2017 · author #3
  98. Constrained Policy Optimization cs.LG · 2017 · author #4
  99. Automatic Goal Generation for Reinforcement Learning Agents cs.LG · 2017 · author #4
  100. Probabilistically Safe Policy Transfer cs.RO · 2017 · author #5
  101. Equivalence Between Policy Gradients and Soft Q-Learning cs.LG · 2017 · author #3
  102. Stochastic Neural Networks for Hierarchical Reinforcement Learning cs.AI · 2017 · author #3
  103. Learning Visual Servoing with Deep Features and Fitted Q-Iteration cs.LG · 2017 · author #3
  104. One-Shot Imitation Learning cs.AI · 2017 · author #7
  105. Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World cs.RO · 2017 · author #6
  106. Emergence of Grounded Compositional Language in Multi-Agent Populations cs.AI · 2017 · author #2
  107. Prediction and Control with Temporal Segment Models cs.LG · 2017 · author #2
  108. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks cs.LG · 2017 · author #2
  109. Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning cs.AI · 2017 · author #4
  110. Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation cs.CV · 2017 · author #5
  111. Reinforcement Learning with Deep Energy-Based Policies cs.LG · 2017 · author #3
  112. Enabling Robots to Communicate their Objectives cs.RO · 2017 · author #3
  113. Adversarial Attacks on Neural Network Policies cs.LG · 2017 · author #5
  114. Uncertainty-Aware Reinforcement Learning for Collision Avoidance cs.LG · 2017 · author #4
  115. A K-fold Method for Baseline Estimation in Policy Gradient Algorithms cs.AI · 2017 · author #5
  116. Generalizing Skills with Semi-Supervised Reinforcement Learning cs.LG · 2016 · author #4
  117. The Off-Switch Game cs.AI · 2016 · author #3
  118. #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning cs.AI · 2016 · author #9
  119. A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models cs.LG · 2016 · author #3
  120. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #6
  121. Variational Lossy Autoencoder cs.LG · 2016 · author #8
  122. Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model cs.RO · 2016 · author #7
  123. Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States cs.LG · 2016 · author #4
  124. Deep Reinforcement Learning for Tensegrity Robot Locomotion cs.RO · 2016 · author #7
  125. Learning from the Hindsight Plan -- Episodic MPC Improvement cs.RO · 2016 · author #5
  126. Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer cs.LG · 2016 · author #4
  127. Toward a Science of Autonomy for Physical Systems: Paths cs.CY · 2016 · author #1
  128. Learning to Poke by Poking: Experiential Learning of Intuitive Physics cs.CV · 2016 · author #3
  129. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #6
  130. VIME: Variational Information Maximizing Exploration cs.LG · 2016 · author #6
  131. Backprop KF: Learning Discriminative Deterministic State Estimators cs.LG · 2016 · author #4
  132. Benchmarking Deep Reinforcement Learning for Continuous Control cs.LG · 2016 · author #5
  133. Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstration cs.LG · 2016 · author #4
  134. PLATO: Policy Learning using Adaptive Trajectory Optimization cs.LG · 2016 · author #4
  135. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization cs.LG · 2016 · author #3
  136. Value Iteration Networks cs.AI · 2016 · author #5
  137. Inverse Reinforcement Learning via Deep Gaussian Process cs.LG · 2015 · author #3
  138. Adapting Deep Visuomotor Representations with Weak Pairwise Constraints cs.CV · 2015 · author #5
  139. One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors cs.LG · 2015 · author #3
  140. Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration cs.LG · 2015 · author #5
  141. Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search cs.LG · 2015 · author #4
  142. Deep Spatial Autoencoders for Visuomotor Learning cs.LG · 2015 · author #6
  143. Learning Deep Neural Network Policies with Continuous Memory States cs.LG · 2015 · author #5
  144. Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models cs.AI · 2015 · author #3
  145. Gradient Estimation Using Stochastic Computation Graphs cs.LG · 2015 · author #4
  146. High-Dimensional Continuous Control Using Generalized Advantage Estimation cs.LG · 2015 · author #5
  147. End-to-End Training of Deep Visuomotor Policies cs.LG · 2015 · author #4
  148. Trust Region Policy Optimization cs.LG · 2015 · author #5
  149. Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols cs.RO · 2015 · author #5
  150. Learning Contact-Rich Manipulation Skills with Guided Policy Search cs.RO · 2015 · author #3
  151. Arriving on time: estimating travel time distributions on large-scale road networks cs.LG · 2013 · author #7
  152. Large Scale Estimation in Cyberphysical Systems using Streaming Data: a Case Study with Smartphone Traces cs.RO · 2012 · author #4
  153. Discriminative Probabilistic Models for Relational Data cs.LG · 2012 · author #2
  154. Learning Factor Graphs in Polynomial Time & Sample Complexity cs.LG · 2012 · author #1
  155. Safe Exploration in Markov Decision Processes cs.LG · 2012 · author #2
  156. The path inference filter: model-based low-latency map matching of probe vehicle data cs.AI · 2011 · author #2

Mentions

  • 2606.10305 #8 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1511.07111 #5 · backfill · confidence 0.70 Pieter Abbeel
  • 1509.06841 #3 · backfill · confidence 0.70 Pieter Abbeel
  • 1509.06824 #5 · backfill · confidence 0.70 Pieter Abbeel
  • 1509.06791 #4 · backfill · confidence 0.70 Pieter Abbeel
  • 1509.06113 #6 · backfill · confidence 0.70 Pieter Abbeel
  • 2606.05873 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1710.06537 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1610.03518 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1506.02438 #5 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1507.01273 #5 · backfill · confidence 0.70 Pieter Abbeel
  • 1507.00814 #3 · backfill · confidence 0.70 Pieter Abbeel
  • 1506.05254 #4 · backfill · confidence 0.70 Pieter Abbeel
  • 1506.02438 #5 · backfill · confidence 0.70 Pieter Abbeel
  • 1504.00702 #4 · backfill · confidence 0.70 Pieter Abbeel
  • 1502.05477 #5 · backfill · confidence 0.70 Pieter Abbeel
  • 1502.03143 #5 · backfill · confidence 0.70 Pieter Abbeel
  • 1501.05611 #3 · backfill · confidence 0.70 Pieter Abbeel
  • 2511.22445 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1302.6617 #7 · backfill · confidence 0.70 Pieter Abbeel
  • 2605.25210 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 1301.0604 #2 · backfill · confidence 0.70 Pieter Abbeel
  • 1212.3393 #4 · backfill · confidence 0.70 Pieter Abbeel
  • 1207.1366 #1 · backfill · confidence 0.70 Pieter Abbeel
  • 1205.4810 #2 · backfill · confidence 0.70 Pieter Abbeel
  • 1109.1966 #2 · backfill · confidence 0.70 Pieter Abbeel
  • 2603.05066 #3 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2605.10236 #5 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2106.01345 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2305.15717 #6 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2401.00025 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2402.10260 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2602.06949 #26 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2402.08268 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
  • 2310.06114 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel

Frequent Coauthors