The DeepMind JAX Ecosystem, 2020

Quan, Roman Ring, Francisco Ruiz, Alvaro Sanchez, Laurent Sartran, Rosalia Schneider, Eren Sezener, Stephen Spencer, Srivatsan Srinivasan, Miloš Stanojevi ´c, Wojciech Stokowiec, Luyu Wang, Guangyao Zhou, Fabio Viola · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game

cs.LG · 2026-04-07 · unverdicted · novelty 6.0

AlphaZero modified with separate heads for attacker and defender in Tablut achieves a BayesElo rating of 1235 after 100 self-play iterations with reduced policy entropy.

citing papers explorer

Showing 1 of 1 citing paper.

Reproducing AlphaZero on Tablut: Self-Play RL for an Asymmetric Board Game cs.LG · 2026-04-07 · unverdicted · none · ref 2
AlphaZero modified with separate heads for attacker and defender in Tablut achieves a BayesElo rating of 1235 after 100 self-play iterations with reduced policy entropy.

The DeepMind JAX Ecosystem, 2020

fields

years

verdicts

representative citing papers

citing papers explorer