Algo- rithmic framework for model-based deep reinforcement learning with theoretical guarantees

Yuping Luo, Huazhe Xu, Yuanzhi Li, Yuandong Tian, Trevor Darrell, Tengyu Ma · 2019

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Benchmarking Model-Based Reinforcement Learning

cs.LG · 2019-07-03 · accept · novelty 7.0

Introduces a benchmark suite of over 18 MBRL environments, evaluates multiple algorithms under consistent settings, and identifies three core challenges: dynamics bottleneck, planning horizon dilemma, and early-termination dilemma.

Exploring Model-based Planning with Policy Networks

cs.LG · 2019-06-20 · unverdicted · novelty 7.0

POPLIN combines policy networks with model-predictive planning by optimizing either action sequences or policy parameters, yielding 3x better sample efficiency than PETS, TD3 and SAC on MuJoCo locomotion tasks.

citing papers explorer

Showing 2 of 2 citing papers.

Benchmarking Model-Based Reinforcement Learning cs.LG · 2019-07-03 · accept · none · ref 32
Introduces a benchmark suite of over 18 MBRL environments, evaluates multiple algorithms under consistent settings, and identifies three core challenges: dynamics bottleneck, planning horizon dilemma, and early-termination dilemma.
Exploring Model-based Planning with Policy Networks cs.LG · 2019-06-20 · unverdicted · none · ref 27
POPLIN combines policy networks with model-predictive planning by optimizing either action sequences or policy parameters, yielding 3x better sample efficiency than PETS, TD3 and SAC on MuJoCo locomotion tasks.

Algo- rithmic framework for model-based deep reinforcement learning with theoretical guarantees

fields

years

verdicts

representative citing papers

citing papers explorer