Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping

Alex Irpan; Julian Ibarz; Konstantinos Bousmalis; Kurt Konolige; Laura Downs; Matthew Kelcey; Mrinal Kalakrishnan; Paul Wohlhart; Peter Pastor; Sergey Levine

arxiv: 1709.07857 · v2 · pith:TCVQ7O3Ynew · submitted 2017-09-22 · 💻 cs.LG · cs.AI· cs.CV· cs.RO

Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping

Konstantinos Bousmalis , Alex Irpan , Paul Wohlhart , Yunfei Bai , Matthew Kelcey , Mrinal Kalakrishnan , Laura Downs , Julian Ibarz

show 4 more authors

Peter Pastor Kurt Konolige Sergey Levine Vincent Vanhoucke

This is my paper

classification 💻 cs.LG cs.AIcs.CVcs.RO

keywords adaptationdomainreal-worlddatagraspingsimulatedgeneratedgraspgan

0 comments

read the original abstract

Instrumenting and collecting annotated visual grasping datasets to train modern machine learning algorithms can be extremely time-consuming and expensive. An appealing alternative is to use off-the-shelf simulators to render synthetic data for which ground-truth annotations are generated automatically. Unfortunately, models trained purely on simulated data often fail to generalize to the real world. We study how randomized simulated environments and domain adaptation methods can be extended to train a grasping system to grasp novel objects from raw monocular RGB images. We extensively evaluate our approaches with a total of more than 25,000 physical test grasps, studying a range of simulation conditions and domain adaptation methods, including a novel extension of pixel-level domain adaptation that we term the GraspGAN. We show that, by using synthetic data and domain adaptation, we are able to reduce the number of real-world samples needed to achieve a given level of performance by up to 50 times, using only randomly generated simulated objects. We also show that by using only unlabeled real-world data and our GraspGAN methodology, we obtain real-world grasping performance without any real-world labels that is similar to that achieved with 939,777 labeled real-world samples.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

State-Conditional Adversarial Learning: An Off-Policy Visual Domain Transfer Method for End-to-End Imitation Learning
cs.RO 2025-12 unverdicted novelty 5.0

SCAL derives an upper bound on target-domain imitation loss using source loss plus state-conditional latent KL divergence and aligns distributions via a discriminator-based adversarial estimator.