Michael Bowling
Identifiers
- name variant Michael Bowling 0.60 · backfill
Papers (54)
- Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning cs.LG · 2024 · author #4
- Monitored Markov Decision Processes cs.LG · 2024 · author #5
- Assessing the Interpretability of Programmatic Policies with Large Language Models cs.AI · 2023 · author #2
- Proper Laplacian Representation Learning cs.LG · 2023 · author #2
- TacticAI: an AI assistant for football tactics cs.LG · 2023 · author #21
- Learning not to Regret cs.GT · 2023 · author #4
- Targeted Search Control in AlphaZero for Effective Policy Improvement cs.AI · 2023 · author #2
- Settling the Reward Hypothesis cs.AI · 2022 · author #1
- Over-communicate no more: Situated RL agents learn concise communication protocols cs.MA · 2022 · author #6
- The Alberta Plan for AI Research cs.AI · 2022 · author #2
- Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration cs.LG · 2022 · author #3
- Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections cs.GT · 2022 · author #5
- Should Models Be Accurate? cs.LG · 2022 · author #5
- Student of Games: A unified learning algorithm for both perfect and imperfect information games cs.AI · 2021 · author #13
- The Partially Observable History Process cs.AI · 2021 · author #3
- Temporal Abstraction in Reinforcement Learning with the Successor Representation cs.LG · 2021 · author #4
- Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games cs.GT · 2021 · author #5
- Solving Common-Payoff Games with Approximate Policy Iteration cs.AI · 2021 · author #8
- Hindsight and Sequential Rationality of Correlated Play cs.GT · 2020 · author #7
- Useful Policy Invariant Shaping from Arbitrary Advice cs.LG · 2020 · author #5
- The Advantage Regret-Matching Actor-Critic cs.AI · 2020 · author #12
- Sound Algorithms in Imperfect Information Games cs.GT · 2020 · author #6
- Marginal Utility for Planning in Continuous or Large Discrete Action Spaces cs.AI · 2020 · author #3
- Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task cs.LG · 2020 · author #4
- Approximate exploitability: Learning a best response in large games cs.LG · 2020 · author #9
- Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization cs.AI · 2019 · author #4
- Low-Variance and Zero-Variance Baselines for Extensive-Form Games cs.GT · 2019 · author #3
- Rethinking Formal Models of Partially Observable Multiagent Decision Making cs.AI · 2019 · author #4
- Ease-of-Teaching and Language Structure from Emergent Communication cs.AI · 2019 · author #2
- The Hanabi Challenge: A New Frontier for AI Research cs.LG · 2019 · author #15
- Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning cs.MA · 2018 · author #8
- Actor-Critic Policy Optimization in Partially Observable Multiagent Environments cs.LG · 2018 · author #7
- Generalization and Regularization in DQN cs.LG · 2018 · author #3
- Solving Large Extensive-Form Games with Strategy Constraints cs.GT · 2018 · author #3
- Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines cs.GT · 2018 · author #6
- Count-Based Exploration with the Successor Representation cs.LG · 2018 · author #3
- The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces cs.AI · 2018 · author #3
- Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents cs.LG · 2017 · author #6
- A Laplacian Framework for Option Discovery in Reinforcement Learning cs.LG · 2017 · author #3
- DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker cs.AI · 2017 · author #10
- Equilibrium Approximation Quality of Current No-Limit Poker Bots cs.GT · 2016 · author #2
- AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games cs.AI · 2016 · author #4
- Learning Purposeful Behaviour in the Absence of Rewards cs.LG · 2016 · author #2
- State of the Art Control of Atari Games Using Shallow Reinforcement Learning cs.LG · 2015 · author #4
- Solving Games with Functional Regret Estimation cs.AI · 2014 · author #4
- Domain-Independent Optimistic Initialization for Reinforcement Learning cs.LG · 2014 · author #3
- Solving Imperfect Information Games Using Decomposition cs.GT · 2013 · author #3
- Partition Tree Weighting cs.IT · 2012 · author #3
- The Arcade Learning Environment: An Evaluation Platform for General Agents cs.AI · 2012 · author #4
- On Local Regret cs.AI · 2012 · author #1
- No-Regret Learning in Extensive-Form Games with Imperfect Recall cs.GT · 2012 · author #5
- A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning cs.LG · 2012 · author #4
- Alignment Based Kernel Learning with a Continuous Set of Base Kernels cs.LG · 2011 · author #3
- Context Tree Switching cs.IT · 2011 · author #4
Mentions
- 2406.19561 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 2310.10833 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 2303.01074 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 2402.06819 #5 · arxiv_oai · confidence 0.70 Michael Bowling
- 2311.06979 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 2112.03178 #13 · arxiv_oai · confidence 0.70 Michael Bowling
- 2310.10553 #21 · arxiv_oai · confidence 0.70 Michael Bowling
- 2212.10420 #1 · arxiv_oai · confidence 0.70 Michael Bowling
- 2110.05740 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 2208.11173 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 2302.12359 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 2004.09677 #9 · arxiv_oai · confidence 0.70 Michael Bowling
- 2211.01480 #6 · arxiv_oai · confidence 0.70 Michael Bowling
- 2102.06973 #5 · arxiv_oai · confidence 0.70 Michael Bowling
- 2012.05874 #7 · arxiv_oai · confidence 0.70 Michael Bowling
- 2206.02036 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 2205.12031 #5 · arxiv_oai · confidence 0.70 Michael Bowling
- 2205.10736 #5 · arxiv_oai · confidence 0.70 Michael Bowling
- 2111.08102 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1906.11110 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 2006.08740 #6 · arxiv_oai · confidence 0.70 Michael Bowling
- 2101.04237 #8 · arxiv_oai · confidence 0.70 Michael Bowling
- 2011.01297 #5 · arxiv_oai · confidence 0.70 Michael Bowling
- 2008.12234 #12 · arxiv_oai · confidence 0.70 Michael Bowling
- 2006.06054 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1810.09026 #7 · arxiv_oai · confidence 0.70 Michael Bowling
- 1912.02967 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 2004.13657 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1810.00123 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1902.00506 #15 · arxiv_oai · confidence 0.70 Michael Bowling
- 1807.11622 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1906.02403 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 1811.01458 #8 · arxiv_oai · confidence 0.70 Michael Bowling
- 1907.09633 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1806.01825 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1809.07893 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1809.03057 #6 · arxiv_oai · confidence 0.70 Michael Bowling
- 1709.06009 #6 · arxiv_oai · confidence 0.70 Michael Bowling
- 1703.00956 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1701.01724 #10 · arxiv_oai · confidence 0.70 Michael Bowling
- 1612.06915 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1612.07547 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 1605.07700 #2 · arxiv_oai · confidence 0.70 Michael Bowling
- 1512.01563 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1205.0288 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1411.7974 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1410.4604 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1303.4441 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1207.4708 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1211.0587 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1206.3318 #1 · arxiv_oai · confidence 0.70 Michael Bowling
- 1205.0622 #5 · arxiv_oai · confidence 0.70 Michael Bowling
- 1112.4607 #3 · arxiv_oai · confidence 0.70 Michael Bowling
- 1111.3182 #4 · arxiv_oai · confidence 0.70 Michael Bowling
- 1411.7974 #4 · backfill · confidence 0.70 Michael Bowling
- 1410.4604 #3 · backfill · confidence 0.70 Michael Bowling
- 1303.4441 #3 · backfill · confidence 0.70 Michael Bowling
- 1211.0587 #3 · backfill · confidence 0.70 Michael Bowling
- 1207.4708 #4 · backfill · confidence 0.70 Michael Bowling
- 1206.3318 #1 · backfill · confidence 0.70 Michael Bowling
- 1205.0622 #5 · backfill · confidence 0.70 Michael Bowling
- 1205.0288 #4 · backfill · confidence 0.70 Michael Bowling
- 1112.4607 #3 · backfill · confidence 0.70 Michael Bowling
- 1111.3182 #4 · backfill · confidence 0.70 Michael Bowling
Frequent Coauthors
- Marc Lanctot 13 shared papers
- Neil Burch 12 shared papers
- Martin Schmid 11 shared papers
- Dustin Morrill 9 shared papers
- Marlos C. Machado 9 shared papers
- Marc G. Bellemare 5 shared papers
- Ryan D'Orazio 5 shared papers
- Elnaz Davoodi 4 shared papers
- Finbarr Timbers 4 shared papers
- James R. Wright 4 shared papers
- Joel Veness 4 shared papers
- Kevin Waugh 4 shared papers
- Nolan Bard 4 shared papers
- Amy Greenwald 3 shared papers
- John D. Martin 3 shared papers
- Karl Tuyls 3 shared papers
- Matej Moravčík 3 shared papers
- Trevor Davis 3 shared papers
- Amy R. Greenwald 2 shared papers
- András György 2 shared papers