Re-determinizing Information Set Monte Carlo Tree Search in Hanabi

James Goodman

arxiv: 1902.06075 · v2 · pith:BE2NQAD5new · submitted 2019-02-16 · 💻 cs.AI · cs.LG

Re-determinizing Information Set Monte Carlo Tree Search in Hanabi

James Goodman This is my paper

classification 💻 cs.AI cs.LG

keywords competitionhanabiis-mctsinformationre-determinizingcarlomonteopponent

0 comments

read the original abstract

This technical report documents the winner of the Computational Intelligence in Games(CIG) 2018 Hanabi competition. We introduce Re-determinizing IS-MCTS, a novel extension of Information Set Monte Carlo Tree Search (IS-MCTS) that prevents a leakage of hidden information into opponent models that can occur in IS-MCTS, and is particularly severe in Hanabi. Re-determinizing IS-MCTS scores higher in Hanabi for 2-4 players than previously published work at the time of the competition. Given the 40ms competition time limit per move we use a learned evaluation function to estimate leaf node values and avoid full simulations during MCTS. For the Mixed track competition, in which the identity of the other players is unknown, a simple Bayesian opponent model is used that is updated as each game proceeds.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Diverse Agents for Ad-Hoc Cooperation in Hanabi
cs.AI 2019-07 unverdicted novelty 6.0

Quality Diversity algorithms are proposed to generate diverse agent populations for ad-hoc cooperation evaluation in Hanabi, with discussion of metrics and adaptive agent building.