Learning To Simulate

Manmohan Chandraker; Nataniel Ruiz; Samuel Schulter

arxiv: 1810.02513 · v2 · pith:SWZDOVFSnew · submitted 2018-10-05 · 💻 cs.LG · cs.CV· stat.ML

Learning To Simulate

Nataniel Ruiz , Samuel Schulter , Manmohan Chandraker This is my paper

classification 💻 cs.LG cs.CVstat.ML

keywords dataparameterssimulationsimulatoraccuracyactualapproachdistribution

0 comments

read the original abstract

Simulation is a useful tool in situations where training data for machine learning models is costly to annotate or even hard to acquire. In this work, we propose a reinforcement learning-based method for automatically adjusting the parameters of any (non-differentiable) simulator, thereby controlling the distribution of synthesized data in order to maximize the accuracy of a model trained on that data. In contrast to prior art that hand-crafts these simulation parameters or adjusts only parts of the available parameters, our approach fully controls the simulator with the actual underlying goal of maximizing accuracy, rather than mimicking the real data distribution or randomly generating a large volume of data. We find that our approach (i) quickly converges to the optimal simulation parameters in controlled experiments and (ii) can indeed discover good sets of parameters for an image rendering simulator in actual computer vision applications.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Solving Rubik's Cube with a Robot Hand
cs.LG 2019-10 accept novelty 7.0

Reinforcement learning models trained only in simulation using automatic domain randomization solve Rubik's cube with a real robot hand.