Pieter Abbeel

Identifiers

name variant Pieter Abbeel 0.60 · backfill

Papers (156)

SARM2: Multi-Task Stage Aware Reward Modeling for Self Improving Robotic Manipulation cs.RO · 2026 · author #8
LadderMan: Learning Humanoid Perceptive Ladder Climbing cs.RO · 2026 · author #4
Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning cs.LG · 2026 · author #7
When Does Non-Uniform Replay Matter in Reinforcement Learning? cs.LG · 2026 · author #5
World Model for Robot Learning: A Comprehensive Survey cs.RO · 2026 · author #15
Offline Materials Optimization with CliqueFlowmer cs.AI · 2026 · author #4
Reward-Conditioned Reinforcement Learning cs.LG · 2026 · author #3
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026 · author #6
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos cs.RO · 2026 · author #26
Large Video Planner Enables Generalizable Robot Control cs.RO · 2025 · author #9
DIPOLE: Fusing Vision and Geometry for Robust Visuomotor Generalization cs.RO · 2025 · author #7
SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation cs.RO · 2025 · author #4
Relative Entropy Pathwise Policy Optimization cs.LG · 2025 · author #5
ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation cs.RO · 2025 · author #4
Rodrigues Network for Learning Robot Actions cs.RO · 2025 · author #5
One Step Diffusion via Shortcut Models cs.LG · 2024 · author #4
A StrongREJECT for Empty Jailbreaks cs.LG · 2024 · author #7
World Model on Million-Length Video And Language With Blockwise RingAttention cs.LG · 2024 · author #4
Any-point Trajectory Modeling for Policy Learning cs.RO · 2023 · author #7
Open X-Embodiment: Robotic Learning Datasets and RT-X Models cs.RO · 2023 · author #193
Learning Interactive Real-World Simulators cs.AI · 2023 · author #7
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own cs.RO · 2023 · author #8
Ring Attention with Blockwise Transformers for Near-Infinite Context cs.CL · 2023 · author #3
The False Promise of Imitating Proprietary LLMs cs.CL · 2023 · author #6
Aligning Text-to-Image Models using Human Feedback cs.LG · 2023 · author #7
Decision Transformer: Reinforcement Learning via Sequence Modeling cs.LG · 2021 · author #7
VideoGPT: Video Generation using VQ-VAE and Transformers cs.CV · 2021 · author #3
Denoising Diffusion Probabilistic Models cs.LG · 2020 · author #3
BagNet: Berkeley Analog Generator with Layout Optimizer Boosted with Deep Neural Networks eess.SP · 2019 · author #3
Benchmarking Model-Based Reinforcement Learning cs.LG · 2019 · author #9
On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference cs.LG · 2019 · author #3
Evaluating Protein Transfer Learning with TAPE cs.LG · 2019 · author #7
Learning latent state representation for speeding up exploration cs.LG · 2019 · author #4
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies cs.LG · 2019 · author #4
Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules cs.CV · 2019 · author #4
Learning Robotic Manipulation through Visual Planning and Acting cs.RO · 2019 · author #4
Quasi-Direct Drive for Low-Cost Compliant Robotic Manipulation cs.RO · 2019 · author #13
Towards Characterizing Divergence in Deep Q-Learning cs.LG · 2019 · author #3
Domain Randomization for Active Pose Estimation cs.CV · 2019 · author #7
Reinforcement Learning on Variable Impedance Controller for High-Precision Robotic Assembly cs.RO · 2019 · author #7
Preferences Implicit in the State of the World cs.LG · 2019 · author #4
Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight cs.LG · 2019 · author #4
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design cs.LG · 2019 · author #5
Soft Actor-Critic Algorithms and Applications cs.LG · 2018 · author #10
Guiding Policies with Language via Meta-Learning cs.LG · 2018 · author #7
An Algorithmic Perspective on Imitation Learning cs.RO · 2018 · author #5
Modular Architecture for StarCraft II with Deep Reinforcement Learning cs.AI · 2018 · author #6
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks cs.LG · 2018 · author #2
Establishing Appropriate Trust via Critical States cs.RO · 2018 · author #3
Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation cs.RO · 2018 · author #3
SFV: Reinforcement Learning of Physical Skills from Videos cs.GR · 2018 · author #4
Model-Based Reinforcement Learning via Meta-Policy Optimization cs.LG · 2018 · author #6
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning cs.LG · 2018 · author #4
Transfer Learning for Estimating Causal Effects using Neural Networks stat.ML · 2018 · author #6
Variational Option Discovery Algorithms cs.AI · 2018 · author #4
Learning Plannable Representations with Causal InfoGAN cs.LG · 2018 · author #5
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings cs.LG · 2018 · author #5
The Limits and Potentials of Deep Learning for Robotics cs.RO · 2018 · author #8
Latent Space Policies for Hierarchical Reinforcement Learning cs.LG · 2018 · author #3
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills cs.GR · 2018 · author #2
Stochastic Adversarial Video Prediction cs.CV · 2018 · author #4
Universal Planning Networks cs.LG · 2018 · author #3
Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning cs.LG · 2018 · author #5
Learning Robotic Assembly from CAD cs.RO · 2018 · author #5
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines cs.LG · 2018 · author #8
Composable Deep Reinforcement Learning for Robotic Manipulation cs.LG · 2018 · author #5
Accelerated Methods for Deep Reinforcement Learning cs.LG · 2018 · author #2
Some Considerations on Learning to Explore via Meta-Reinforcement Learning cs.AI · 2018 · author #7
Model-Ensemble Trust-Region Policy Optimization cs.LG · 2018 · author #5
Meta-Reinforcement Learning of Structured Exploration Strategies cs.LG · 2018 · author #4
Evolved Policy Gradients cs.LG · 2018 · author #7
One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning cs.LG · 2018 · author #6
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor cs.LG · 2018 · author #3
PixelSNAIL: An Improved Autoregressive Generative Model cs.LG · 2017 · author #4
A Berkeley View of Systems Challenges for AI cs.AI · 2017 · author #14
Safer Classification by Synthesis cs.LG · 2017 · author #5
Interpretable and Pedagogical Examples cs.AI · 2017 · author #2
Meta Learning Shared Hierarchies cs.LG · 2017 · author #4
Asymmetric Actor Critic for Image-Based Robot Learning cs.RO · 2017 · author #5
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization cs.RO · 2017 · author #4
Domain Randomization and Generative Models for Robotic Grasping cs.RO · 2017 · author #11
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation cs.LG · 2017 · author #7
Synkhronos: a Multi-GPU Theano Extension for Data Parallelism cs.DC · 2017 · author #2
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments cs.LG · 2017 · author #6
Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation cs.LG · 2017 · author #4
Overcoming Exploration in Reinforcement Learning with Demonstrations cs.LG · 2017 · author #5
One-Shot Visual Imitation Learning via Meta-Learning cs.LG · 2017 · author #4
Learning with Opponent-Learning Awareness cs.AI · 2017 · author #5
Learning Generalized Reactive Policies using Deep Neural Networks cs.AI · 2017 · author #5
Deep Object-Centric Representations for Generalizable Robot Learning cs.RO · 2017 · author #2
Mutual Alignment Transfer Learning cs.AI · 2017 · author #3
Reverse Curriculum Generation for Reinforcement Learning cs.AI · 2017 · author #5
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation cs.LG · 2017 · author #3
A Simple Neural Attentive Meta-Learner cs.AI · 2017 · author #4
Hindsight Experience Replay cs.LG · 2017 · author #9
Parameter Space Noise for Exploration cs.LG · 2017 · author #8
UCB Exploration via Q-Ensembles cs.LG · 2017 · author #3
Constrained Policy Optimization cs.LG · 2017 · author #4
Automatic Goal Generation for Reinforcement Learning Agents cs.LG · 2017 · author #4
Probabilistically Safe Policy Transfer cs.RO · 2017 · author #5
Equivalence Between Policy Gradients and Soft Q-Learning cs.LG · 2017 · author #3
Stochastic Neural Networks for Hierarchical Reinforcement Learning cs.AI · 2017 · author #3
Learning Visual Servoing with Deep Features and Fitted Q-Iteration cs.LG · 2017 · author #3
One-Shot Imitation Learning cs.AI · 2017 · author #7
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World cs.RO · 2017 · author #6
Emergence of Grounded Compositional Language in Multi-Agent Populations cs.AI · 2017 · author #2
Prediction and Control with Temporal Segment Models cs.LG · 2017 · author #2
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks cs.LG · 2017 · author #2
Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning cs.AI · 2017 · author #4
Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation cs.CV · 2017 · author #5
Reinforcement Learning with Deep Energy-Based Policies cs.LG · 2017 · author #3
Enabling Robots to Communicate their Objectives cs.RO · 2017 · author #3
Adversarial Attacks on Neural Network Policies cs.LG · 2017 · author #5
Uncertainty-Aware Reinforcement Learning for Collision Avoidance cs.LG · 2017 · author #4
A K-fold Method for Baseline Estimation in Policy Gradient Algorithms cs.AI · 2017 · author #5
Generalizing Skills with Semi-Supervised Reinforcement Learning cs.LG · 2016 · author #4
The Off-Switch Game cs.AI · 2016 · author #3
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning cs.AI · 2016 · author #9
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models cs.LG · 2016 · author #3
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #6
Variational Lossy Autoencoder cs.LG · 2016 · author #8
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model cs.RO · 2016 · author #7
Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States cs.LG · 2016 · author #4
Deep Reinforcement Learning for Tensegrity Robot Locomotion cs.RO · 2016 · author #7
Learning from the Hindsight Plan -- Episodic MPC Improvement cs.RO · 2016 · author #5
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer cs.LG · 2016 · author #4
Toward a Science of Autonomy for Physical Systems: Paths cs.CY · 2016 · author #1
Learning to Poke by Poking: Experiential Learning of Intuitive Physics cs.CV · 2016 · author #3
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #6
VIME: Variational Information Maximizing Exploration cs.LG · 2016 · author #6
Backprop KF: Learning Discriminative Deterministic State Estimators cs.LG · 2016 · author #4
Benchmarking Deep Reinforcement Learning for Continuous Control cs.LG · 2016 · author #5
Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstration cs.LG · 2016 · author #4
PLATO: Policy Learning using Adaptive Trajectory Optimization cs.LG · 2016 · author #4
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization cs.LG · 2016 · author #3
Value Iteration Networks cs.AI · 2016 · author #5
Inverse Reinforcement Learning via Deep Gaussian Process cs.LG · 2015 · author #3
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints cs.CV · 2015 · author #5
One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors cs.LG · 2015 · author #3
Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration cs.LG · 2015 · author #5
Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search cs.LG · 2015 · author #4
Deep Spatial Autoencoders for Visuomotor Learning cs.LG · 2015 · author #6
Learning Deep Neural Network Policies with Continuous Memory States cs.LG · 2015 · author #5
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models cs.AI · 2015 · author #3
Gradient Estimation Using Stochastic Computation Graphs cs.LG · 2015 · author #4
High-Dimensional Continuous Control Using Generalized Advantage Estimation cs.LG · 2015 · author #5
End-to-End Training of Deep Visuomotor Policies cs.LG · 2015 · author #4
Trust Region Policy Optimization cs.LG · 2015 · author #5
Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols cs.RO · 2015 · author #5
Learning Contact-Rich Manipulation Skills with Guided Policy Search cs.RO · 2015 · author #3
Arriving on time: estimating travel time distributions on large-scale road networks cs.LG · 2013 · author #7
Large Scale Estimation in Cyberphysical Systems using Streaming Data: a Case Study with Smartphone Traces cs.RO · 2012 · author #4
Discriminative Probabilistic Models for Relational Data cs.LG · 2012 · author #2
Learning Factor Graphs in Polynomial Time & Sample Complexity cs.LG · 2012 · author #1
Safe Exploration in Markov Decision Processes cs.LG · 2012 · author #2
The path inference filter: model-based low-latency map matching of probe vehicle data cs.AI · 2011 · author #2

Mentions

2606.10305 #8 · arxiv_oai · confidence 0.70 Pieter Abbeel
1511.07111 #5 · backfill · confidence 0.70 Pieter Abbeel
1509.06841 #3 · backfill · confidence 0.70 Pieter Abbeel
1509.06824 #5 · backfill · confidence 0.70 Pieter Abbeel
1509.06791 #4 · backfill · confidence 0.70 Pieter Abbeel
1509.06113 #6 · backfill · confidence 0.70 Pieter Abbeel
2606.05873 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
1710.06537 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
1610.03518 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
1506.02438 #5 · arxiv_oai · confidence 0.70 Pieter Abbeel
1507.01273 #5 · backfill · confidence 0.70 Pieter Abbeel
1507.00814 #3 · backfill · confidence 0.70 Pieter Abbeel
1506.05254 #4 · backfill · confidence 0.70 Pieter Abbeel
1506.02438 #5 · backfill · confidence 0.70 Pieter Abbeel
1504.00702 #4 · backfill · confidence 0.70 Pieter Abbeel
1502.05477 #5 · backfill · confidence 0.70 Pieter Abbeel
1502.03143 #5 · backfill · confidence 0.70 Pieter Abbeel
1501.05611 #3 · backfill · confidence 0.70 Pieter Abbeel
2511.22445 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
1302.6617 #7 · backfill · confidence 0.70 Pieter Abbeel
2605.25210 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
1301.0604 #2 · backfill · confidence 0.70 Pieter Abbeel
1212.3393 #4 · backfill · confidence 0.70 Pieter Abbeel
1207.1366 #1 · backfill · confidence 0.70 Pieter Abbeel
1205.4810 #2 · backfill · confidence 0.70 Pieter Abbeel
1109.1966 #2 · backfill · confidence 0.70 Pieter Abbeel
2603.05066 #3 · arxiv_oai · confidence 0.70 Pieter Abbeel
2605.10236 #5 · arxiv_oai · confidence 0.70 Pieter Abbeel
2106.01345 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
2305.15717 #6 · arxiv_oai · confidence 0.70 Pieter Abbeel
2401.00025 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
2402.10260 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
2602.06949 #26 · arxiv_oai · confidence 0.70 Pieter Abbeel
2402.08268 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
2310.06114 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel

Frequent Coauthors

Sergey Levine 55 shared papers
Xi Chen 19 shared papers
Chelsea Finn 16 shared papers
Yan Duan 15 shared papers
John Schulman 13 shared papers
Abhishek Gupta 11 shared papers
Aviv Tamar 11 shared papers
Jitendra Malik 11 shared papers
Igor Mordatch 9 shared papers
Wojciech Zaremba 8 shared papers
Gregory Kahn 7 shared papers
Marcin Andrychowicz 7 shared papers
Rein Houthooft 7 shared papers
Trevor Darrell 7 shared papers
Haoran Geng 6 shared papers
Ilya Sutskever 6 shared papers
Tianhao Zhang 6 shared papers
Tuomas Haarnoja 6 shared papers
Bradly C. Stadie 5 shared papers
Coline Devin 5 shared papers