ViZDoom: A Doom-based AI research platform for visual reinforcement learning

· 2016 · arXiv 2016.786043

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Best Agent Identification for General Game Playing

cs.LG · 2025-07-01 · unverdicted · novelty 6.0

An optimistic confidence-interval ranking procedure for best-arm identification across multiple independent bandits yields lower average simple regret and error probability than prior methods when selecting high-performing agents for each game in GVGAI and Ludii.

CAMAL: Improving Attention Alignment and Faithfulness with Segmentation Masks

eess.IV · 2026-05-08 · unverdicted · novelty 5.0

CAMAL adds an auxiliary regularizer during training that aligns model attention with segmentation masks to improve both spatial accuracy and causal faithfulness of attention in deep learning and deep reinforcement learning vision models.

Optimal Use of Experience in First Person Shooter Environments

cs.LG · 2019-06-24 · unverdicted · novelty 2.0

Empirical tests in VizDoom show multiple DQN updates per step do not improve performance after learning rate adjustment, with a 4:1 update-to-step ratio optimal before significant degradation.

citing papers explorer

Showing 3 of 3 citing papers.

Best Agent Identification for General Game Playing cs.LG · 2025-07-01 · unverdicted · none · ref 32
An optimistic confidence-interval ranking procedure for best-arm identification across multiple independent bandits yields lower average simple regret and error probability than prior methods when selecting high-performing agents for each game in GVGAI and Ludii.
CAMAL: Improving Attention Alignment and Faithfulness with Segmentation Masks eess.IV · 2026-05-08 · unverdicted · none · ref 3
CAMAL adds an auxiliary regularizer during training that aligns model attention with segmentation masks to improve both spatial accuracy and causal faithfulness of attention in deep learning and deep reinforcement learning vision models.
Optimal Use of Experience in First Person Shooter Environments cs.LG · 2019-06-24 · unverdicted · none · ref 9
Empirical tests in VizDoom show multiple DQN updates per step do not improve performance after learning rate adjustment, with a 4:1 update-to-step ratio optimal before significant degradation.

ViZDoom: A Doom-based AI research platform for visual reinforcement learning

fields

years

verdicts

representative citing papers

citing papers explorer