AlphaZero used the set of legal actions obtained from the simulator to mask the prior produced by the network everywhere in the search tree
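
As a minimal sketch of the masking described above: the statement says only that the legal-action set from the simulator masks the network's prior. The renormalization used here (a softmax restricted to legal actions) and the names `mask_prior`, `logits`, and `legal_actions` are assumptions for illustration, not details taken from the source.

```python
import math
from typing import Sequence


def mask_prior(logits: Sequence[float], legal_actions: Sequence[int]) -> list[float]:
    """Return a prior over all actions with zero mass on illegal ones.

    Assumption: the prior is renormalized with a softmax restricted to the
    legal actions. The source does not specify the renormalization scheme.
    """
    legal = set(legal_actions)
    if not legal:
        raise ValueError("at least one legal action is required")
    # Subtract the largest legal logit for numerical stability.
    max_logit = max(logits[a] for a in legal)
    weights = [
        math.exp(logit - max_logit) if action in legal else 0.0
        for action, logit in enumerate(logits)
    ]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical node with four actions, of which only 0 and 2 are legal.
prior = mask_prior([2.0, 1.0, 0.5, -1.0], legal_actions=[0, 2])
```

In a tree search, a function like this would be called at every expanded node, with `legal_actions` supplied by the simulator for that node's state.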

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

cs.LG · 2019-11-19 · accept · novelty 8.0

MuZero matches or exceeds AlphaZero-level performance in Go, Chess, Shogi and sets a new state of the art on 57 Atari games by learning a model that directly supports planning rather than reconstructing full environment dynamics.

citing papers explorer

Showing 1 of 1 citing paper.

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model cs.LG · 2019-11-19 · accept · none · ref 51
