When to trust your model: Model-based policy optimization

Michael Janner, Justin Fu, Marvin Zhang, Sergey Levine · 2019

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Stabilized neural Hamilton--Jacobi--Bellman solvers: Error analysis and applications in model-based reinforcement learning

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

The authors prove a population L2 stability estimate and finite-sample certificate for one policy-evaluation step in a neural HJB solver with learned dynamics, plus multi-step propagation through greedy improvement, with experiments on high-dimensional control tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Stabilized neural Hamilton--Jacobi--Bellman solvers: Error analysis and applications in model-based reinforcement learning cs.LG · 2026-05-08 · unverdicted · none · ref 4
The authors prove a population L2 stability estimate and finite-sample certificate for one policy-evaluation step in a neural HJB solver with learned dynamics, plus multi-step propagation through greedy improvement, with experiments on high-dimensional control tasks.

When to trust your model: Model-based policy optimization

fields

years

verdicts

representative citing papers

citing papers explorer