pith. sign in

arxiv: 1905.10911 · v1 · pith:KBKAKUCJnew · submitted 2019-05-27 · 💻 cs.AI

Policy Based Inference in Trick-Taking Card Games

classification 💻 cs.AI
keywords inferencelargecardgamesinformationtrick-takingalgorithmpolicy
0
0 comments X
read the original abstract

Trick-taking card games feature a large amount of private information that slowly gets revealed through a long sequence of actions. This makes the number of histories exponentially large in the action sequence length, as well as creating extremely large information sets. As a result, these games become too large to solve. To deal with these issues many algorithms employ inference, the estimation of the probability of states within an information set. In this paper, we demonstrate a Policy Based Inference (PI) algorithm that uses player modelling to infer the probability we are in a given state. We perform experiments in the German trick-taking card game Skat, in which we show that this method vastly improves the inference as compared to previous work, and increases the performance of the state-of-the-art Skat AI system Kermit when it is employed into its determinized search algorithm.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Beyond the briscola advantage: a Monte Carlo dominance test for deterministic strategies in two-player Briscola Game

    cs.GT 2026-05 unverdicted novelty 4.0

    Monte Carlo tournaments show that briscola-hoarding and public-information counter strategies dominate the greedy baseline in two-player Briscola regardless of trump-count imbalance.

  2. Imperfect-Information Games on Quantum Computers: A Case Study in Skat

    quant-ph 2024-11 unverdicted novelty 3.0

    Outlines a quantum-register encoding and quantum-counting method to recommend moves in Skat by maximizing expected payoff over possible hidden-card configurations.