arxiv: 1611.01652 · v2 · pith:JPFZPQNEnew · submitted 2016-11-05 · cs.NE · cs.AI · cs.RO

A Differentiable Physics Engine for Deep Learning in Robotics

classification cs.NE cs.AI cs.RO
keywords deep learning engine optimization robotics methods parameters physics
An important field in robotics is the optimization of controllers. Currently, robots are often treated as a black box in this optimization process, which is the reason why derivative-free optimization methods such as evolutionary algorithms or reinforcement learning are omnipresent. When gradient-based methods are used, models are kept small or rely on finite difference approximations for the Jacobian. This method quickly grows expensive with increasing numbers of parameters, such as found in deep learning. We propose the implementation of a modern physics engine, which can differentiate control parameters. This engine is implemented for both CPU and GPU. Firstly, this paper shows how such an engine speeds up the optimization process, even for small problems. Furthermore, it explains why this is an alternative approach to deep Q-learning, for using deep learning in robotics. Finally, we argue that this is a big step for deep learning in robotics, as it opens up new possibilities to optimize robots, both in hardware and software.
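
The contrast the abstract draws between finite-difference Jacobians and a simulator that differentiates its own control parameters can be illustrated with a toy example. The following is a minimal sketch and not the paper's engine: it assumes a one-dimensional unit point mass driven by a PD controller, and the names `rollout`, `finite_difference`, and `optimize`, as well as the gains `kp` and `kd`, are hypothetical choices made for illustration. `rollout` propagates the derivative of the state with respect to each gain alongside the state itself, so one simulation pass yields the full gradient, whereas `finite_difference` needs two extra simulations per parameter.

```python
def rollout(kp, kd, steps=200, dt=0.01, target=1.0):
    """Simulate a unit point mass under a PD controller (illustrative model).

    Returns (loss, dloss/dkp, dloss/dkd). The gradient is obtained by
    differentiating every time step with the chain rule.
    """
    x, v = 0.0, 0.0
    dx = [0.0, 0.0]  # sensitivities of x w.r.t. (kp, kd)
    dv = [0.0, 0.0]  # sensitivities of v w.r.t. (kp, kd)
    loss, dloss = 0.0, [0.0, 0.0]
    for _ in range(steps):
        err = x - target
        u = -kp * err - kd * v  # control force
        # Derivative of u: explicit dependence on the gain plus the
        # dependence through the current state.
        du = [-err - kp * dx[0] - kd * dv[0],
              -v - kp * dx[1] - kd * dv[1]]
        # Semi-implicit Euler step, applied to state and sensitivities.
        v = v + dt * u
        dv = [dv[i] + dt * du[i] for i in range(2)]
        x = x + dt * v
        dx = [dx[i] + dt * dv[i] for i in range(2)]
        # Accumulate the integrated squared tracking error.
        err = x - target
        loss += dt * err * err
        dloss = [dloss[i] + dt * 2.0 * err * dx[i] for i in range(2)]
    return loss, dloss[0], dloss[1]


def finite_difference(kp, kd, eps=1e-5):
    """Central differences: two extra rollouts for each parameter."""
    g_kp = (rollout(kp + eps, kd)[0] - rollout(kp - eps, kd)[0]) / (2 * eps)
    g_kd = (rollout(kp, kd + eps)[0] - rollout(kp, kd - eps)[0]) / (2 * eps)
    return g_kp, g_kd


def optimize(kp, kd, lr=0.1, iters=5):
    """Plain gradient descent on the controller gains."""
    for _ in range(iters):
        _, g_kp, g_kd = rollout(kp, kd)
        kp -= lr * g_kp
        kd -= lr * g_kd
    return kp, kd
```

With two parameters the cost difference is negligible, but the finite-difference approach needs a number of rollouts proportional to the parameter count. This sketch propagates sensitivities forward for simplicity, and its per-step work also grows with the number of parameters; a reverse-mode (backpropagation) implementation would avoid that, which is what matters for controllers with as many parameters as a deep network.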

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. RigPI: Dynamic Parameter Identification of Rigid Body via VLM-Seeded Differentiable Simulation

    cs.RO 2026-06 unverdicted novelty 6.0

    RigPI combines VLM semantic priors with two-stage gradient optimization in differentiable simulation to identify inertial and frictional parameters of rigid bodies from robot-object interactions.

  2. Ensemble Distributionally Robust Bayesian Optimisation with Continuous Context

    cs.LG 2026-05 unverdicted novelty 6.0

    A tractable ensemble distributionally robust Bayesian optimization method achieves improved sublinear regret bounds under context uncertainty.

  3. Ensemble Distributionally Robust Bayesian Optimisation with Continuous Context

    cs.LG 2026-05 unverdicted novelty 6.0

    EDRBO uses ensemble surrogates and Wasserstein ambiguity sets to robustify BO acquisition functions against context distribution mismatch, with sublinear regret O(γ_T √T) and state-of-the-art empirical results on continuous contexts.

  4. RigPI: Dynamic Parameter Identification of Rigid Body via VLM-Seeded Differentiable Simulation

    cs.RO 2026-06 unverdicted novelty 5.0

    RigPI combines VLM initialization with two-stage gradient-based optimization in differentiable simulation to estimate dynamic parameters of rigid bodies from real robot interactions.

  5. Fast Bayesian equipment condition monitoring via simulation based inference: applications to heat exchanger health

    cs.LG 2026-04 unverdicted novelty 5.0

    Amortized neural posterior estimation via simulation-based inference delivers 82x faster inference than MCMC for heat exchanger fouling and leakage diagnosis while maintaining comparable accuracy on synthetic data.

  6. Integrating Mechanistic and Data-Driven Models for Neurological Disorders through Differentiable Programming

    cs.AI 2026-06 unverdicted novelty 3.0

    This perspective paper categorizes hybrid architectures for combining mechanistic and data-driven models using residual learning, Neural ODEs, and solver-in-the-loop to model neurological disorder progression.