DeepMind Lab

Adam Cain; Adrian Bolton; Amir Sadik; Andrew Lefrancq; Charles Beattie; Demis Hassabis; Denis Teplyashin; Heinrich K\"uttler; Helen King; Joel Z. Leibo

arxiv: 1612.03801 · v2 · pith:TMM2ZANTnew · submitted 2016-12-12 · 💻 cs.AI

DeepMind Lab

Charles Beattie , Joel Z. Leibo , Denis Teplyashin , Tom Ward , Marcus Wainwright , Heinrich K\"uttler , Andrew Lefrancq , Simon Green

show 13 more authors

V\'ictor Vald\'es Amir Sadik Julian Schrittwieser Keith Anderson Sarah York Max Cant Adam Cain Adrian Bolton Stephen Gaffney Helen King Demis Hassabis Shane Legg Stig Petersen

This is my paper

classification 💻 cs.AI

keywords deepmindartificialgameresearchagentsai-designsautonomouscommunity

0 comments

read the original abstract

DeepMind Lab is a first-person 3D game platform designed for research and development of general artificial intelligence and machine learning systems. DeepMind Lab can be used to study how autonomous artificial agents may learn complex tasks in large, partially observed, and visually diverse worlds. DeepMind Lab has a simple and flexible API enabling creative task-designs and novel AI-designs to be explored and quickly iterated upon. It is powered by a fast and widely recognised game engine, and tailored for effective use by the research community.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 17 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Forager: a lightweight testbed for continual learning with partial observability in RL
cs.LG 2026-05 unverdicted novelty 7.0

Forager is a lightweight partially-observable continual RL environment that exposes loss of plasticity in current agents and highlights the value of state construction for ongoing learning.
Mastering Diverse Domains through World Models
cs.AI 2023-01 unverdicted novelty 7.0

DreamerV3 uses world models and robustness techniques to solve over 150 tasks across domains with a single configuration, including Minecraft diamond collection from scratch.
A Generalist Agent
cs.AI 2022-05 accept novelty 7.0

Gato is a multi-modal, multi-task, multi-embodiment generalist policy using one transformer network to handle text, vision, games, and robotics tasks.
Dream to Control: Learning Behaviors by Latent Imagination
cs.LG 2019-12 accept novelty 7.0

Dreamer learns to control from images by imagining and optimizing behaviors in a learned latent world model, outperforming prior methods on 20 visual tasks in data efficiency and final performance.
Memory-Efficient Transfer Learning with Fading Side Networks via Masked Dual Path Distillation
cs.CV 2026-04 unverdicted novelty 6.0

MDPD mutually distills knowledge between a frozen backbone and a learnable side network during fine-tuning, then discards the side network at inference to accelerate speed by at least 25% while preserving accuracy.
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents
cs.CV 2026-04 unverdicted novelty 6.0

GameWorld is a new benchmark providing standardized interfaces, 34 games, 170 tasks, and verifiable outcome metrics to evaluate multimodal large language model agents in video game environments.
Visual prompting reimagined: The power of the Activation Prompts
cs.CV 2026-04 unverdicted novelty 6.0

Activation prompts on intermediate layers outperform input-level visual prompting and parameter-efficient fine-tuning in accuracy and efficiency across 29 datasets.
A Survey of Continual Reinforcement Learning
cs.LG 2025-06 accept novelty 6.0

The paper surveys CRL literature, proposes a taxonomy of methods into four categories based on knowledge storage and transfer, reviews metrics and benchmarks, and outlines challenges and future research directions.
LLaVA-Video: Video Instruction Tuning With Synthetic Data
cs.CV 2024-10 unverdicted novelty 6.0

LLaVA-Video-178K is a new synthetic video instruction dataset that, when combined with existing data to train LLaVA-Video, produces strong results on video understanding benchmarks.
Compressive Transformers for Long-Range Sequence Modelling
cs.LG 2019-11 unverdicted novelty 6.0

Compressive Transformer sets new records on WikiText-103 (17.1 ppl) and Enwik8 (0.97 bpc) via memory compression and introduces the PG-19 long-range language benchmark.
A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark
cs.CV 2019-10 accept novelty 6.0

VTAB is a 19-task benchmark that measures representation quality by few-shot adaptation performance across diverse vision domains, with a controlled large-scale comparison of popular pretraining methods.
Arena: a toolkit for Multi-Agent Reinforcement Learning
cs.LG 2019-07 accept novelty 6.0

Arena introduces a modular Interface design that extends OpenAI Gym wrappers to support complex multi-agent RL scenarios including self-play and cooperative-competitive interactions.
On Evaluation of Embodied Navigation Agents
cs.AI 2018-07 accept novelty 6.0

Consensus recommendations for standardized evaluation measures, problem statements, and benchmarking scenarios in embodied navigation research.
From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments
cs.AI 2026-03 unverdicted novelty 5.0

An empirical literature analysis reveals a bifurcation in RL environments into Semantic Prior (LLM-dominated) and Domain-Specific Generalization ecosystems with distinct cognitive fingerprints.
PVeRA: Probabilistic Vector-Based Random Matrix Adaptation
cs.CV 2025-12 unverdicted novelty 5.0

PVeRA extends VeRA by making its frozen random low-rank matrices probabilistic, enabling better handling of ambiguities and outperforming prior adapters on the VTAB-1k benchmark.
Why Build an Assistant in Minecraft?
cs.AI 2019-07 unverdicted novelty 4.0

A rationale is presented for developing an assistant in Minecraft to advance natural language understanding and dialogue learning.
On Inductive Biases in Deep Reinforcement Learning
cs.LG 2019-07 unverdicted novelty 4.0

Adaptive replacements for domain-specific components in deep RL agents can yield better learning on new tasks without additional tuning.