The imagination horizon is H = 15 and the same trajectories are used to update both action and value models

but clip them below 3 free nats as in PlaNet · 2018

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Dream to Control: Learning Behaviors by Latent Imagination

cs.LG · 2019-12-03 · accept · novelty 7.0

Dreamer learns to control from images by imagining and optimizing behaviors in a learned latent world model, outperforming prior methods on 20 visual tasks in data efficiency and final performance.

citing papers explorer

Dream to Control: Learning Behaviors by Latent Imagination

cs.LG · 2019-12-03 · accept · none · ref 51
