Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning

Adam Crespi; Ahmed Khalifa; Arthur Juliani; Danny Lange; Ervin Teng; Hunter Henry; Jonathan Harper; Julian Togelius; Vincent-Pierre Berges

arxiv: 1902.01378 · v2 · pith:2PWYZS4Inew · submitted 2019-02-04 · 💻 cs.AI · cs.LG

classification 💻 cs.AI cs.LG

keywords: environment, obstacle, tower, agent, games, control, environments, human

The rapid pace of recent research in AI has been driven in part by the presence of fast and challenging simulation environments. These environments often take the form of games, with tasks ranging from simple board games to competitive video games. We propose a new benchmark, Obstacle Tower: a high-fidelity, 3D, third-person, procedurally generated environment. An agent playing Obstacle Tower must learn to solve both low-level control and high-level planning problems in tandem while learning from pixels and a sparse reward signal. Unlike other benchmarks such as the Arcade Learning Environment, evaluation of agent performance in Obstacle Tower is based on an agent's ability to perform well on unseen instances of the environment. In this paper we outline the environment and provide a set of baseline results produced by current state-of-the-art Deep RL methods as well as human players. These algorithms fail to produce agents capable of performing near human level.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

ORRB -- OpenAI Remote Rendering Backend
cs.GR 2019-06 unverdicted novelty 4.0

ORRB is an open-source remote rendering backend that pairs Unity3d with MuJoCo for high-throughput, customizable visual domain randomization in robotics environments.