Michael Bowling — Pith Author Registry

Identifiers

name variant Michael Bowling 0.60 · backfill

Papers (56)

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning cs.LG · 2024 · author #3
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning cs.LG · 2024 · author #4
Beyond Optimism: Exploration With Partially Observable Rewards cs.LG · 2024 · author #3
Monitored Markov Decision Processes cs.LG · 2024 · author #5
Assessing the Interpretability of Programmatic Policies with Large Language Models cs.AI · 2023 · author #2
Proper Laplacian Representation Learning cs.LG · 2023 · author #2
TacticAI: an AI assistant for football tactics cs.LG · 2023 · author #21
Learning not to Regret cs.GT · 2023 · author #4
Targeted Search Control in AlphaZero for Effective Policy Improvement cs.AI · 2023 · author #2
Settling the Reward Hypothesis cs.AI · 2022 · author #1
Over-communicate no more: Situated RL agents learn concise communication protocols cs.MA · 2022 · author #6
The Alberta Plan for AI Research cs.AI · 2022 · author #2
Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration cs.LG · 2022 · author #3
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections cs.GT · 2022 · author #5
Should Models Be Accurate? cs.LG · 2022 · author #5
Student of Games: A unified learning algorithm for both perfect and imperfect information games cs.AI · 2021 · author #13
The Partially Observable History Process cs.AI · 2021 · author #3
Temporal Abstraction in Reinforcement Learning with the Successor Representation cs.LG · 2021 · author #4
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games cs.GT · 2021 · author #5
Solving Common-Payoff Games with Approximate Policy Iteration cs.AI · 2021 · author #8
Hindsight and Sequential Rationality of Correlated Play cs.GT · 2020 · author #7
Useful Policy Invariant Shaping from Arbitrary Advice cs.LG · 2020 · author #5
The Advantage Regret-Matching Actor-Critic cs.AI · 2020 · author #12
Sound Algorithms in Imperfect Information Games cs.GT · 2020 · author #6
Marginal Utility for Planning in Continuous or Large Discrete Action Spaces cs.AI · 2020 · author #3
Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task cs.LG · 2020 · author #4
Approximate exploitability: Learning a best response in large games cs.LG · 2020 · author #9
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization cs.AI · 2019 · author #4
Low-Variance and Zero-Variance Baselines for Extensive-Form Games cs.GT · 2019 · author #3
Rethinking Formal Models of Partially Observable Multiagent Decision Making cs.AI · 2019 · author #4
Ease-of-Teaching and Language Structure from Emergent Communication cs.AI · 2019 · author #2
The Hanabi Challenge: A New Frontier for AI Research cs.LG · 2019 · author #15
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning cs.MA · 2018 · author #8
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments cs.LG · 2018 · author #7
Generalization and Regularization in DQN cs.LG · 2018 · author #3
Solving Large Extensive-Form Games with Strategy Constraints cs.GT · 2018 · author #3
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines cs.GT · 2018 · author #6
Count-Based Exploration with the Successor Representation cs.LG · 2018 · author #3
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces cs.AI · 2018 · author #3
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents cs.LG · 2017 · author #6
A Laplacian Framework for Option Discovery in Reinforcement Learning cs.LG · 2017 · author #3
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker cs.AI · 2017 · author #10
Equilibrium Approximation Quality of Current No-Limit Poker Bots cs.GT · 2016 · author #2
AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games cs.AI · 2016 · author #4
Learning Purposeful Behaviour in the Absence of Rewards cs.LG · 2016 · author #2
State of the Art Control of Atari Games Using Shallow Reinforcement Learning cs.LG · 2015 · author #4
Solving Games with Functional Regret Estimation cs.AI · 2014 · author #4
Domain-Independent Optimistic Initialization for Reinforcement Learning cs.LG · 2014 · author #3
Solving Imperfect Information Games Using Decomposition cs.GT · 2013 · author #3
Partition Tree Weighting cs.IT · 2012 · author #3
The Arcade Learning Environment: An Evaluation Platform for General Agents cs.AI · 2012 · author #4
On Local Regret cs.AI · 2012 · author #1
No-Regret Learning in Extensive-Form Games with Imperfect Recall cs.GT · 2012 · author #5
A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning cs.LG · 2012 · author #4
Alignment Based Kernel Learning with a Continuous Set of Base Kernels cs.LG · 2011 · author #3
Context Tree Switching cs.IT · 2011 · author #4

Mentions

2406.13909 #3 · arxiv_oai · confidence 0.70 Michael Bowling
2409.01449 #3 · arxiv_oai · confidence 0.70 Michael Bowling
2406.19561 #4 · arxiv_oai · confidence 0.70 Michael Bowling
2310.10833 #2 · arxiv_oai · confidence 0.70 Michael Bowling
2303.01074 #4 · arxiv_oai · confidence 0.70 Michael Bowling
2402.06819 #5 · arxiv_oai · confidence 0.70 Michael Bowling
2311.06979 #2 · arxiv_oai · confidence 0.70 Michael Bowling
2112.03178 #13 · arxiv_oai · confidence 0.70 Michael Bowling
2310.10553 #21 · arxiv_oai · confidence 0.70 Michael Bowling
2212.10420 #1 · arxiv_oai · confidence 0.70 Michael Bowling
2110.05740 #4 · arxiv_oai · confidence 0.70 Michael Bowling
2208.11173 #2 · arxiv_oai · confidence 0.70 Michael Bowling
2302.12359 #2 · arxiv_oai · confidence 0.70 Michael Bowling
2004.09677 #9 · arxiv_oai · confidence 0.70 Michael Bowling
2211.01480 #6 · arxiv_oai · confidence 0.70 Michael Bowling
2102.06973 #5 · arxiv_oai · confidence 0.70 Michael Bowling
2012.05874 #7 · arxiv_oai · confidence 0.70 Michael Bowling
2206.02036 #3 · arxiv_oai · confidence 0.70 Michael Bowling
2205.12031 #5 · arxiv_oai · confidence 0.70 Michael Bowling
2205.10736 #5 · arxiv_oai · confidence 0.70 Michael Bowling
2111.08102 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1906.11110 #4 · arxiv_oai · confidence 0.70 Michael Bowling
2006.08740 #6 · arxiv_oai · confidence 0.70 Michael Bowling
2101.04237 #8 · arxiv_oai · confidence 0.70 Michael Bowling
2011.01297 #5 · arxiv_oai · confidence 0.70 Michael Bowling
2008.12234 #12 · arxiv_oai · confidence 0.70 Michael Bowling
2006.06054 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1810.09026 #7 · arxiv_oai · confidence 0.70 Michael Bowling
1912.02967 #4 · arxiv_oai · confidence 0.70 Michael Bowling
2004.13657 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1810.00123 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1902.00506 #15 · arxiv_oai · confidence 0.70 Michael Bowling
1807.11622 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1906.02403 #2 · arxiv_oai · confidence 0.70 Michael Bowling
1811.01458 #8 · arxiv_oai · confidence 0.70 Michael Bowling
1907.09633 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1806.01825 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1809.07893 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1809.03057 #6 · arxiv_oai · confidence 0.70 Michael Bowling
1709.06009 #6 · arxiv_oai · confidence 0.70 Michael Bowling
1703.00956 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1701.01724 #10 · arxiv_oai · confidence 0.70 Michael Bowling
1612.06915 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1612.07547 #2 · arxiv_oai · confidence 0.70 Michael Bowling
1605.07700 #2 · arxiv_oai · confidence 0.70 Michael Bowling
1512.01563 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1205.0288 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1411.7974 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1410.4604 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1303.4441 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1207.4708 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1211.0587 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1206.3318 #1 · arxiv_oai · confidence 0.70 Michael Bowling
1205.0622 #5 · arxiv_oai · confidence 0.70 Michael Bowling
1112.4607 #3 · arxiv_oai · confidence 0.70 Michael Bowling
1111.3182 #4 · arxiv_oai · confidence 0.70 Michael Bowling
1411.7974 #4 · backfill · confidence 0.70 Michael Bowling
1410.4604 #3 · backfill · confidence 0.70 Michael Bowling
1303.4441 #3 · backfill · confidence 0.70 Michael Bowling
1211.0587 #3 · backfill · confidence 0.70 Michael Bowling
1207.4708 #4 · backfill · confidence 0.70 Michael Bowling
1206.3318 #1 · backfill · confidence 0.70 Michael Bowling
1205.0622 #5 · backfill · confidence 0.70 Michael Bowling
1205.0288 #4 · backfill · confidence 0.70 Michael Bowling
1112.4607 #3 · backfill · confidence 0.70 Michael Bowling
1111.3182 #4 · backfill · confidence 0.70 Michael Bowling

Frequent Coauthors

Marc Lanctot 13 shared papers
Neil Burch 12 shared papers
Martin Schmid 11 shared papers
Dustin Morrill 9 shared papers
Marlos C. Machado 9 shared papers
Marc G. Bellemare 5 shared papers
Ryan D'Orazio 5 shared papers
Elnaz Davoodi 4 shared papers
Finbarr Timbers 4 shared papers
James R. Wright 4 shared papers
Joel Veness 4 shared papers
Kevin Waugh 4 shared papers
Nolan Bard 4 shared papers
Amy Greenwald 3 shared papers
John D. Martin 3 shared papers
Karl Tuyls 3 shared papers
Matej Morav\v{c}\'ik 3 shared papers
Trevor Davis 3 shared papers
Alireza Kazemipour 2 shared papers
Amy R. Greenwald 2 shared papers