Recurrent Environment Simulators

Daan Wierstra; S\'ebastien Racaniere; Shakir Mohamed; Silvia Chiappa

arxiv: 1704.02254 · v2 · pith:SGSKVIUHnew · submitted 2017-04-07 · 💻 cs.AI · cs.LG· stat.ML

Recurrent Environment Simulators

Silvia Chiappa , S\'ebastien Racaniere , Daan Wierstra , Shakir Mohamed This is my paper

classification 💻 cs.AI cs.LGstat.ML

keywords environmentenvironmentshigh-dimensionalimprovemodelsrecurrentsimulatorsused

0 comments

read the original abstract

Models that can simulate how environments change in response to actions can be used by agents to plan and act efficiently. We improve on previous environment simulators from high-dimensional pixel observations by introducing recurrent neural networks that are able to make temporally and spatially coherent predictions for hundreds of time-steps into the future. We present an in-depth analysis of the factors affecting performance, providing the most extensive attempt to advance the understanding of the properties of these models. We address the issue of computationally inefficiency with a model that does not need to generate a high-dimensional image at each time-step. We show that our approach can be used to improve exploration and is adaptable to many diverse environments, namely 10 Atari games, a 3D car racing environment, and complex 3D mazes.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Mastering Atari with Discrete World Models
cs.LG 2020-10 accept novelty 7.0

DreamerV2 reaches human-level performance on 55 Atari games by learning behaviors inside a separately trained discrete-latent world model.
Geometry-aware 4D Video Generation for Robot Manipulation
cs.CV 2025-07 unverdicted novelty 5.0

A geometry-aware 4D video generation model trained with cross-view pointmap alignment to produce spatio-temporally consistent future videos from novel viewpoints for robot manipulation.
Shaping Belief States with Generative Environment Models for RL
cs.LG 2019-06 unverdicted novelty 5.0

Multi-step predictive generative models form stable belief states capturing environment layout and agent pose, yielding higher data efficiency on RL tasks than model-free agents.