pith. machine review for the scientific record. sign in

arxiv: 1611.03158 · v2 · submitted 2016-11-10 · 💻 cs.LG

Recognition: unknown

Using Neural Networks to Compute Approximate and Guaranteed Feasible Hamilton-Jacobi-Bellman PDE Solutions

Authors on Pith no claims yet
classification 💻 cs.LG
keywords frameworkneuralstateapproximateapproximationcomputationfeasiblefunction
0
0 comments X
read the original abstract

To sidestep the curse of dimensionality when computing solutions to Hamilton-Jacobi-Bellman partial differential equations (HJB PDE), we propose an algorithm that leverages a neural network to approximate the value function. We show that our final approximation of the value function generates near optimal controls which are guaranteed to successfully drive the system to a target state. Our framework is not dependent on state space discretization, leading to a significant reduction in computation time and space complexity in comparison with dynamic programming-based approaches. Using this grid-free approach also enables us to plan over longer time horizons with relatively little additional computation overhead. Unlike many previous neural network HJB PDE approximating formulations, our approximation is strictly conservative and hence any trajectories we generate will be strictly feasible. For demonstration, we specialize our new general framework to the Dubins car model and discuss how the framework can be applied to other models with higher-dimensional state spaces.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Safe Large-Scale Robust Nonlinear MPC in Milliseconds via Reachability-Constrained System Level Synthesis on the GPU

    cs.RO 2026-04 unverdicted novelty 6.0

    GPU-SLS computes safe robust nonlinear MPC policies online in ~20 ms for up to 75D systems by reachability-constrained system level synthesis accelerated via custom GPU QP solvers.