Pieter Abbeel
Identifiers
- name variant Pieter Abbeel 0.60 · backfill
Papers (156)
- SARM2: Multi-Task Stage Aware Reward Modeling for Self Improving Robotic Manipulation cs.RO · 2026 · author #8
- LadderMan: Learning Humanoid Perceptive Ladder Climbing cs.RO · 2026 · author #4
- Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning cs.LG · 2026 · author #7
- When Does Non-Uniform Replay Matter in Reinforcement Learning? cs.LG · 2026 · author #5
- World Model for Robot Learning: A Comprehensive Survey cs.RO · 2026 · author #15
- Offline Materials Optimization with CliqueFlowmer cs.AI · 2026 · author #4
- Reward-Conditioned Reinforcement Learning cs.LG · 2026 · author #3
- Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026 · author #6
- DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos cs.RO · 2026 · author #26
- Large Video Planner Enables Generalizable Robot Control cs.RO · 2025 · author #9
- DIPOLE: Fusing Vision and Geometry for Robust Visuomotor Generalization cs.RO · 2025 · author #7
- SARM: Stage-Aware Reward Modeling for Long Horizon Robot Manipulation cs.RO · 2025 · author #4
- Relative Entropy Pathwise Policy Optimization cs.LG · 2025 · author #5
- ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation cs.RO · 2025 · author #4
- Rodrigues Network for Learning Robot Actions cs.RO · 2025 · author #5
- One Step Diffusion via Shortcut Models cs.LG · 2024 · author #4
- A StrongREJECT for Empty Jailbreaks cs.LG · 2024 · author #7
- World Model on Million-Length Video And Language With Blockwise RingAttention cs.LG · 2024 · author #4
- Any-point Trajectory Modeling for Policy Learning cs.RO · 2023 · author #7
- Open X-Embodiment: Robotic Learning Datasets and RT-X Models cs.RO · 2023 · author #193
- Learning Interactive Real-World Simulators cs.AI · 2023 · author #7
- Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own cs.RO · 2023 · author #8
- Ring Attention with Blockwise Transformers for Near-Infinite Context cs.CL · 2023 · author #3
- The False Promise of Imitating Proprietary LLMs cs.CL · 2023 · author #6
- Aligning Text-to-Image Models using Human Feedback cs.LG · 2023 · author #7
- Decision Transformer: Reinforcement Learning via Sequence Modeling cs.LG · 2021 · author #7
- VideoGPT: Video Generation using VQ-VAE and Transformers cs.CV · 2021 · author #3
- Denoising Diffusion Probabilistic Models cs.LG · 2020 · author #3
- BagNet: Berkeley Analog Generator with Layout Optimizer Boosted with Deep Neural Networks eess.SP · 2019 · author #3
- Benchmarking Model-Based Reinforcement Learning cs.LG · 2019 · author #9
- On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference cs.LG · 2019 · author #3
- Evaluating Protein Transfer Learning with TAPE cs.LG · 2019 · author #7
- Learning latent state representation for speeding up exploration cs.LG · 2019 · author #4
- MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies cs.LG · 2019 · author #4
- Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules cs.CV · 2019 · author #4
- Learning Robotic Manipulation through Visual Planning and Acting cs.RO · 2019 · author #4
- Quasi-Direct Drive for Low-Cost Compliant Robotic Manipulation cs.RO · 2019 · author #13
- Towards Characterizing Divergence in Deep Q-Learning cs.LG · 2019 · author #3
- Domain Randomization for Active Pose Estimation cs.CV · 2019 · author #7
- Reinforcement Learning on Variable Impedance Controller for High-Precision Robotic Assembly cs.RO · 2019 · author #7
- Preferences Implicit in the State of the World cs.LG · 2019 · author #4
- Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight cs.LG · 2019 · author #4
- Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design cs.LG · 2019 · author #5
- Soft Actor-Critic Algorithms and Applications cs.LG · 2018 · author #10
- Guiding Policies with Language via Meta-Learning cs.LG · 2018 · author #7
- An Algorithmic Perspective on Imitation Learning cs.RO · 2018 · author #5
- Modular Architecture for StarCraft II with Deep Reinforcement Learning cs.AI · 2018 · author #6
- One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks cs.LG · 2018 · author #2
- Establishing Appropriate Trust via Critical States cs.RO · 2018 · author #3
- Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation cs.RO · 2018 · author #3
- SFV: Reinforcement Learning of Physical Skills from Videos cs.GR · 2018 · author #4
- Model-Based Reinforcement Learning via Meta-Policy Optimization cs.LG · 2018 · author #6
- SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning cs.LG · 2018 · author #4
- Transfer Learning for Estimating Causal Effects using Neural Networks stat.ML · 2018 · author #6
- Variational Option Discovery Algorithms cs.AI · 2018 · author #4
- Learning Plannable Representations with Causal InfoGAN cs.LG · 2018 · author #5
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings cs.LG · 2018 · author #5
- The Limits and Potentials of Deep Learning for Robotics cs.RO · 2018 · author #8
- Latent Space Policies for Hierarchical Reinforcement Learning cs.LG · 2018 · author #3
- DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills cs.GR · 2018 · author #2
- Stochastic Adversarial Video Prediction cs.CV · 2018 · author #4
- Universal Planning Networks cs.LG · 2018 · author #3
- Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning cs.LG · 2018 · author #5
- Learning Robotic Assembly from CAD cs.RO · 2018 · author #5
- Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines cs.LG · 2018 · author #8
- Composable Deep Reinforcement Learning for Robotic Manipulation cs.LG · 2018 · author #5
- Accelerated Methods for Deep Reinforcement Learning cs.LG · 2018 · author #2
- Some Considerations on Learning to Explore via Meta-Reinforcement Learning cs.AI · 2018 · author #7
- Model-Ensemble Trust-Region Policy Optimization cs.LG · 2018 · author #5
- Meta-Reinforcement Learning of Structured Exploration Strategies cs.LG · 2018 · author #4
- Evolved Policy Gradients cs.LG · 2018 · author #7
- One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning cs.LG · 2018 · author #6
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor cs.LG · 2018 · author #3
- PixelSNAIL: An Improved Autoregressive Generative Model cs.LG · 2017 · author #4
- A Berkeley View of Systems Challenges for AI cs.AI · 2017 · author #14
- Safer Classification by Synthesis cs.LG · 2017 · author #5
- Interpretable and Pedagogical Examples cs.AI · 2017 · author #2
- Meta Learning Shared Hierarchies cs.LG · 2017 · author #4
- Asymmetric Actor Critic for Image-Based Robot Learning cs.RO · 2017 · author #5
- Sim-to-Real Transfer of Robotic Control with Dynamics Randomization cs.RO · 2017 · author #4
- Domain Randomization and Generative Models for Robotic Grasping cs.RO · 2017 · author #11
- Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation cs.LG · 2017 · author #7
- Synkhronos: a Multi-GPU Theano Extension for Data Parallelism cs.DC · 2017 · author #2
- Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments cs.LG · 2017 · author #6
- Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation cs.LG · 2017 · author #4
- Overcoming Exploration in Reinforcement Learning with Demonstrations cs.LG · 2017 · author #5
- One-Shot Visual Imitation Learning via Meta-Learning cs.LG · 2017 · author #4
- Learning with Opponent-Learning Awareness cs.AI · 2017 · author #5
- Learning Generalized Reactive Policies using Deep Neural Networks cs.AI · 2017 · author #5
- Deep Object-Centric Representations for Generalizable Robot Learning cs.RO · 2017 · author #2
- Mutual Alignment Transfer Learning cs.AI · 2017 · author #3
- Reverse Curriculum Generation for Reinforcement Learning cs.AI · 2017 · author #5
- Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation cs.LG · 2017 · author #3
- A Simple Neural Attentive Meta-Learner cs.AI · 2017 · author #4
- Hindsight Experience Replay cs.LG · 2017 · author #9
- Parameter Space Noise for Exploration cs.LG · 2017 · author #8
- UCB Exploration via Q-Ensembles cs.LG · 2017 · author #3
- Constrained Policy Optimization cs.LG · 2017 · author #4
- Automatic Goal Generation for Reinforcement Learning Agents cs.LG · 2017 · author #4
- Probabilistically Safe Policy Transfer cs.RO · 2017 · author #5
- Equivalence Between Policy Gradients and Soft Q-Learning cs.LG · 2017 · author #3
- Stochastic Neural Networks for Hierarchical Reinforcement Learning cs.AI · 2017 · author #3
- Learning Visual Servoing with Deep Features and Fitted Q-Iteration cs.LG · 2017 · author #3
- One-Shot Imitation Learning cs.AI · 2017 · author #7
- Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World cs.RO · 2017 · author #6
- Emergence of Grounded Compositional Language in Multi-Agent Populations cs.AI · 2017 · author #2
- Prediction and Control with Temporal Segment Models cs.LG · 2017 · author #2
- Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks cs.LG · 2017 · author #2
- Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning cs.AI · 2017 · author #4
- Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation cs.CV · 2017 · author #5
- Reinforcement Learning with Deep Energy-Based Policies cs.LG · 2017 · author #3
- Enabling Robots to Communicate their Objectives cs.RO · 2017 · author #3
- Adversarial Attacks on Neural Network Policies cs.LG · 2017 · author #5
- Uncertainty-Aware Reinforcement Learning for Collision Avoidance cs.LG · 2017 · author #4
- A K-fold Method for Baseline Estimation in Policy Gradient Algorithms cs.AI · 2017 · author #5
- Generalizing Skills with Semi-Supervised Reinforcement Learning cs.LG · 2016 · author #4
- The Off-Switch Game cs.AI · 2016 · author #3
- #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning cs.AI · 2016 · author #9
- A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models cs.LG · 2016 · author #3
- RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning cs.AI · 2016 · author #6
- Variational Lossy Autoencoder cs.LG · 2016 · author #8
- Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model cs.RO · 2016 · author #7
- Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States cs.LG · 2016 · author #4
- Deep Reinforcement Learning for Tensegrity Robot Locomotion cs.RO · 2016 · author #7
- Learning from the Hindsight Plan -- Episodic MPC Improvement cs.RO · 2016 · author #5
- Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer cs.LG · 2016 · author #4
- Toward a Science of Autonomy for Physical Systems: Paths cs.CY · 2016 · author #1
- Learning to Poke by Poking: Experiential Learning of Intuitive Physics cs.CV · 2016 · author #3
- InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets cs.LG · 2016 · author #6
- VIME: Variational Information Maximizing Exploration cs.LG · 2016 · author #6
- Backprop KF: Learning Discriminative Deterministic State Estimators cs.LG · 2016 · author #4
- Benchmarking Deep Reinforcement Learning for Continuous Control cs.LG · 2016 · author #5
- Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstration cs.LG · 2016 · author #4
- PLATO: Policy Learning using Adaptive Trajectory Optimization cs.LG · 2016 · author #4
- Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization cs.LG · 2016 · author #3
- Value Iteration Networks cs.AI · 2016 · author #5
- Inverse Reinforcement Learning via Deep Gaussian Process cs.LG · 2015 · author #3
- Adapting Deep Visuomotor Representations with Weak Pairwise Constraints cs.CV · 2015 · author #5
- One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors cs.LG · 2015 · author #3
- Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration cs.LG · 2015 · author #5
- Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search cs.LG · 2015 · author #4
- Deep Spatial Autoencoders for Visuomotor Learning cs.LG · 2015 · author #6
- Learning Deep Neural Network Policies with Continuous Memory States cs.LG · 2015 · author #5
- Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models cs.AI · 2015 · author #3
- Gradient Estimation Using Stochastic Computation Graphs cs.LG · 2015 · author #4
- High-Dimensional Continuous Control Using Generalized Advantage Estimation cs.LG · 2015 · author #5
- End-to-End Training of Deep Visuomotor Policies cs.LG · 2015 · author #4
- Trust Region Policy Optimization cs.LG · 2015 · author #5
- Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols cs.RO · 2015 · author #5
- Learning Contact-Rich Manipulation Skills with Guided Policy Search cs.RO · 2015 · author #3
- Arriving on time: estimating travel time distributions on large-scale road networks cs.LG · 2013 · author #7
- Large Scale Estimation in Cyberphysical Systems using Streaming Data: a Case Study with Smartphone Traces cs.RO · 2012 · author #4
- Discriminative Probabilistic Models for Relational Data cs.LG · 2012 · author #2
- Learning Factor Graphs in Polynomial Time & Sample Complexity cs.LG · 2012 · author #1
- Safe Exploration in Markov Decision Processes cs.LG · 2012 · author #2
- The path inference filter: model-based low-latency map matching of probe vehicle data cs.AI · 2011 · author #2
Mentions
- 2606.10305 #8 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1511.07111 #5 · backfill · confidence 0.70 Pieter Abbeel
- 1509.06841 #3 · backfill · confidence 0.70 Pieter Abbeel
- 1509.06824 #5 · backfill · confidence 0.70 Pieter Abbeel
- 1509.06791 #4 · backfill · confidence 0.70 Pieter Abbeel
- 1509.06113 #6 · backfill · confidence 0.70 Pieter Abbeel
- 2606.05873 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1710.06537 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1610.03518 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1506.02438 #5 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1507.01273 #5 · backfill · confidence 0.70 Pieter Abbeel
- 1507.00814 #3 · backfill · confidence 0.70 Pieter Abbeel
- 1506.05254 #4 · backfill · confidence 0.70 Pieter Abbeel
- 1506.02438 #5 · backfill · confidence 0.70 Pieter Abbeel
- 1504.00702 #4 · backfill · confidence 0.70 Pieter Abbeel
- 1502.05477 #5 · backfill · confidence 0.70 Pieter Abbeel
- 1502.03143 #5 · backfill · confidence 0.70 Pieter Abbeel
- 1501.05611 #3 · backfill · confidence 0.70 Pieter Abbeel
- 2511.22445 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1302.6617 #7 · backfill · confidence 0.70 Pieter Abbeel
- 2605.25210 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 1301.0604 #2 · backfill · confidence 0.70 Pieter Abbeel
- 1212.3393 #4 · backfill · confidence 0.70 Pieter Abbeel
- 1207.1366 #1 · backfill · confidence 0.70 Pieter Abbeel
- 1205.4810 #2 · backfill · confidence 0.70 Pieter Abbeel
- 1109.1966 #2 · backfill · confidence 0.70 Pieter Abbeel
- 2603.05066 #3 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2605.10236 #5 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2106.01345 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2305.15717 #6 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2401.00025 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2402.10260 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2602.06949 #26 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2402.08268 #4 · arxiv_oai · confidence 0.70 Pieter Abbeel
- 2310.06114 #7 · arxiv_oai · confidence 0.70 Pieter Abbeel
Frequent Coauthors
- Sergey Levine 55 shared papers
- Xi Chen 19 shared papers
- Chelsea Finn 16 shared papers
- Yan Duan 15 shared papers
- John Schulman 13 shared papers
- Abhishek Gupta 11 shared papers
- Aviv Tamar 11 shared papers
- Jitendra Malik 11 shared papers
- Igor Mordatch 9 shared papers
- Wojciech Zaremba 8 shared papers
- Gregory Kahn 7 shared papers
- Marcin Andrychowicz 7 shared papers
- Rein Houthooft 7 shared papers
- Trevor Darrell 7 shared papers
- Haoran Geng 6 shared papers
- Ilya Sutskever 6 shared papers
- Tianhao Zhang 6 shared papers
- Tuomas Haarnoja 6 shared papers
- Bradly C. Stadie 5 shared papers
- Coline Devin 5 shared papers