Deep Counterfactual Regret Minimization

· 2018 · cs.AI · arXiv 1811.00164

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Counterfactual Regret Minimization (CFR) is the leading framework for solving large imperfect-information games. It converges to an equilibrium by iteratively traversing the game tree. In order to deal with extremely large games, abstraction is typically applied before running CFR. The abstracted game is solved with tabular CFR, and its solution is mapped back to the full game. This process can be problematic because aspects of abstraction are often manual and domain specific, abstraction algorithms may miss important strategic nuances of the game, and there is a chicken-and-egg problem because determining a good abstraction requires knowledge of the equilibrium of the game. This paper introduces Deep Counterfactual Regret Minimization, a form of CFR that obviates the need for abstraction by instead using deep neural networks to approximate the behavior of CFR in the full game. We show that Deep CFR is principled and achieves strong performance in large poker games. This is the first non-tabular variant of CFR to be successful in large games.

representative citing papers

How Much Due Diligence Before You Bid? Learning in Intractable Takeover Auctions

cs.AI · 2026-06-28 · unverdicted · novelty 6.0

Self-play RL in a takeover auction model shows optimal due diligence is modest and finite, decreasing with cost and competition, while simple general methods outperform specialized ones in large intractable games.

Unsupervised Causal Abstractions Discovery

cs.LG · 2026-06-17 · unverdicted · novelty 6.0

Low-rank graphs induce latents that form causal abstractions, with identifiability results and a practical objective enabling unsupervised learning of high-level SCMs from low-level measurements.

citing papers explorer

Showing 2 of 2 citing papers after filters.

How Much Due Diligence Before You Bid? Learning in Intractable Takeover Auctions cs.AI · 2026-06-28 · unverdicted · none · ref 1 · internal anchor
Self-play RL in a takeover auction model shows optimal due diligence is modest and finite, decreasing with cost and competition, while simple general methods outperform specialized ones in large intractable games.
Unsupervised Causal Abstractions Discovery cs.LG · 2026-06-17 · unverdicted · none · ref 6 · internal anchor
Low-rank graphs induce latents that form causal abstractions, with identifiability results and a practical objective enabling unsupervised learning of high-level SCMs from low-level measurements.

Deep Counterfactual Regret Minimization

fields

years

verdicts

representative citing papers

citing papers explorer