Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

Ladislau B\"ol\"oni; Pooya Abolghasemi; Rouhollah Rahmatizadeh; Sergey Levine

arxiv: 1707.02920 · v2 · pith:L57YL2PKnew · submitted 2017-07-10 · 💻 cs.LG · cs.AI· cs.RO

Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

Rouhollah Rahmatizadeh , Pooya Abolghasemi , Ladislau B\"ol\"oni , Sergey Levine This is my paper

classification 💻 cs.LG cs.AIcs.RO

keywords taskscontrollermanipulationcomplexdemonstrationimageslearningmulti-task

0 comments

read the original abstract

We propose a technique for multi-task learning from demonstration that trains the controller of a low-cost robotic arm to accomplish several complex picking and placing tasks, as well as non-prehensile manipulation. The controller is a recurrent neural network using raw images as input and generating robot arm trajectories, with the parameters shared across the tasks. The controller also combines VAE-GAN-based reconstruction with autoregressive multimodal action prediction. Our results demonstrate that it is possible to learn complex manipulation tasks, such as picking up a towel, wiping an object, and depositing the towel to its previous position, entirely from raw images with direct behavior cloning. We show that weight sharing and reconstruction-based regularization substantially improve generalization and robustness, and training on multiple tasks simultaneously increases the success rate on all tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

State-Conditional Adversarial Learning: An Off-Policy Visual Domain Transfer Method for End-to-End Imitation Learning
cs.RO 2025-12 unverdicted novelty 5.0

SCAL derives an upper bound on target-domain imitation loss using source loss plus state-conditional latent KL divergence and aligns distributions via a discriminator-based adversarial estimator.