pith. sign in

Michael Bowling

Identifiers

  • name variant Michael Bowling 0.60 · backfill

Papers (54)

  1. Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning cs.LG · 2024 · author #4
  2. Monitored Markov Decision Processes cs.LG · 2024 · author #5
  3. Assessing the Interpretability of Programmatic Policies with Large Language Models cs.AI · 2023 · author #2
  4. Proper Laplacian Representation Learning cs.LG · 2023 · author #2
  5. TacticAI: an AI assistant for football tactics cs.LG · 2023 · author #21
  6. Learning not to Regret cs.GT · 2023 · author #4
  7. Targeted Search Control in AlphaZero for Effective Policy Improvement cs.AI · 2023 · author #2
  8. Settling the Reward Hypothesis cs.AI · 2022 · author #1
  9. Over-communicate no more: Situated RL agents learn concise communication protocols cs.MA · 2022 · author #6
  10. The Alberta Plan for AI Research cs.AI · 2022 · author #2
  11. Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration cs.LG · 2022 · author #3
  12. Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections cs.GT · 2022 · author #5
  13. Should Models Be Accurate? cs.LG · 2022 · author #5
  14. Student of Games: A unified learning algorithm for both perfect and imperfect information games cs.AI · 2021 · author #13
  15. The Partially Observable History Process cs.AI · 2021 · author #3
  16. Temporal Abstraction in Reinforcement Learning with the Successor Representation cs.LG · 2021 · author #4
  17. Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games cs.GT · 2021 · author #5
  18. Solving Common-Payoff Games with Approximate Policy Iteration cs.AI · 2021 · author #8
  19. Hindsight and Sequential Rationality of Correlated Play cs.GT · 2020 · author #7
  20. Useful Policy Invariant Shaping from Arbitrary Advice cs.LG · 2020 · author #5
  21. The Advantage Regret-Matching Actor-Critic cs.AI · 2020 · author #12
  22. Sound Algorithms in Imperfect Information Games cs.GT · 2020 · author #6
  23. Marginal Utility for Planning in Continuous or Large Discrete Action Spaces cs.AI · 2020 · author #3
  24. Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task cs.LG · 2020 · author #4
  25. Approximate exploitability: Learning a best response in large games cs.LG · 2020 · author #9
  26. Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization cs.AI · 2019 · author #4
  27. Low-Variance and Zero-Variance Baselines for Extensive-Form Games cs.GT · 2019 · author #3
  28. Rethinking Formal Models of Partially Observable Multiagent Decision Making cs.AI · 2019 · author #4
  29. Ease-of-Teaching and Language Structure from Emergent Communication cs.AI · 2019 · author #2
  30. The Hanabi Challenge: A New Frontier for AI Research cs.LG · 2019 · author #15
  31. Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning cs.MA · 2018 · author #8
  32. Actor-Critic Policy Optimization in Partially Observable Multiagent Environments cs.LG · 2018 · author #7
  33. Generalization and Regularization in DQN cs.LG · 2018 · author #3
  34. Solving Large Extensive-Form Games with Strategy Constraints cs.GT · 2018 · author #3
  35. Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines cs.GT · 2018 · author #6
  36. Count-Based Exploration with the Successor Representation cs.LG · 2018 · author #3
  37. The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces cs.AI · 2018 · author #3
  38. Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents cs.LG · 2017 · author #6
  39. A Laplacian Framework for Option Discovery in Reinforcement Learning cs.LG · 2017 · author #3
  40. DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker cs.AI · 2017 · author #10
  41. Equilibrium Approximation Quality of Current No-Limit Poker Bots cs.GT · 2016 · author #2
  42. AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games cs.AI · 2016 · author #4
  43. Learning Purposeful Behaviour in the Absence of Rewards cs.LG · 2016 · author #2
  44. State of the Art Control of Atari Games Using Shallow Reinforcement Learning cs.LG · 2015 · author #4
  45. Solving Games with Functional Regret Estimation cs.AI · 2014 · author #4
  46. Domain-Independent Optimistic Initialization for Reinforcement Learning cs.LG · 2014 · author #3
  47. Solving Imperfect Information Games Using Decomposition cs.GT · 2013 · author #3
  48. Partition Tree Weighting cs.IT · 2012 · author #3
  49. The Arcade Learning Environment: An Evaluation Platform for General Agents cs.AI · 2012 · author #4
  50. On Local Regret cs.AI · 2012 · author #1
  51. No-Regret Learning in Extensive-Form Games with Imperfect Recall cs.GT · 2012 · author #5
  52. A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning cs.LG · 2012 · author #4
  53. Alignment Based Kernel Learning with a Continuous Set of Base Kernels cs.LG · 2011 · author #3
  54. Context Tree Switching cs.IT · 2011 · author #4

Mentions

  • 2406.19561 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2310.10833 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2303.01074 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2402.06819 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2311.06979 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2112.03178 #13 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2310.10553 #21 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2212.10420 #1 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2110.05740 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2208.11173 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2302.12359 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2004.09677 #9 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2211.01480 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2102.06973 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2012.05874 #7 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2206.02036 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2205.12031 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2205.10736 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2111.08102 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1906.11110 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2006.08740 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2101.04237 #8 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2011.01297 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2008.12234 #12 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2006.06054 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1810.09026 #7 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1912.02967 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2004.13657 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1810.00123 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1902.00506 #15 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1807.11622 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1906.02403 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1811.01458 #8 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1907.09633 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1806.01825 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1809.07893 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1809.03057 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1709.06009 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1703.00956 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1701.01724 #10 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1612.06915 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1612.07547 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1605.07700 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1512.01563 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1205.0288 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1411.7974 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1410.4604 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1303.4441 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1207.4708 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1211.0587 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1206.3318 #1 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1205.0622 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1112.4607 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1111.3182 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1411.7974 #4 · backfill · confidence 0.70 Michael Bowling
  • 1410.4604 #3 · backfill · confidence 0.70 Michael Bowling
  • 1303.4441 #3 · backfill · confidence 0.70 Michael Bowling
  • 1211.0587 #3 · backfill · confidence 0.70 Michael Bowling
  • 1207.4708 #4 · backfill · confidence 0.70 Michael Bowling
  • 1206.3318 #1 · backfill · confidence 0.70 Michael Bowling
  • 1205.0622 #5 · backfill · confidence 0.70 Michael Bowling
  • 1205.0288 #4 · backfill · confidence 0.70 Michael Bowling
  • 1112.4607 #3 · backfill · confidence 0.70 Michael Bowling
  • 1111.3182 #4 · backfill · confidence 0.70 Michael Bowling

Frequent Coauthors