pith. sign in

Michael Bowling

Identifiers

  • name variant Michael Bowling 0.60 · backfill

Papers (56)

  1. Real-Time Recurrent Learning using Trace Units in Reinforcement Learning cs.LG · 2024 · author #3
  2. Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning cs.LG · 2024 · author #4
  3. Beyond Optimism: Exploration With Partially Observable Rewards cs.LG · 2024 · author #3
  4. Monitored Markov Decision Processes cs.LG · 2024 · author #5
  5. Assessing the Interpretability of Programmatic Policies with Large Language Models cs.AI · 2023 · author #2
  6. Proper Laplacian Representation Learning cs.LG · 2023 · author #2
  7. TacticAI: an AI assistant for football tactics cs.LG · 2023 · author #21
  8. Learning not to Regret cs.GT · 2023 · author #4
  9. Targeted Search Control in AlphaZero for Effective Policy Improvement cs.AI · 2023 · author #2
  10. Settling the Reward Hypothesis cs.AI · 2022 · author #1
  11. Over-communicate no more: Situated RL agents learn concise communication protocols cs.MA · 2022 · author #6
  12. The Alberta Plan for AI Research cs.AI · 2022 · author #2
  13. Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration cs.LG · 2022 · author #3
  14. Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections cs.GT · 2022 · author #5
  15. Should Models Be Accurate? cs.LG · 2022 · author #5
  16. Student of Games: A unified learning algorithm for both perfect and imperfect information games cs.AI · 2021 · author #13
  17. The Partially Observable History Process cs.AI · 2021 · author #3
  18. Temporal Abstraction in Reinforcement Learning with the Successor Representation cs.LG · 2021 · author #4
  19. Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games cs.GT · 2021 · author #5
  20. Solving Common-Payoff Games with Approximate Policy Iteration cs.AI · 2021 · author #8
  21. Hindsight and Sequential Rationality of Correlated Play cs.GT · 2020 · author #7
  22. Useful Policy Invariant Shaping from Arbitrary Advice cs.LG · 2020 · author #5
  23. The Advantage Regret-Matching Actor-Critic cs.AI · 2020 · author #12
  24. Sound Algorithms in Imperfect Information Games cs.GT · 2020 · author #6
  25. Marginal Utility for Planning in Continuous or Large Discrete Action Spaces cs.AI · 2020 · author #3
  26. Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task cs.LG · 2020 · author #4
  27. Approximate exploitability: Learning a best response in large games cs.LG · 2020 · author #9
  28. Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization cs.AI · 2019 · author #4
  29. Low-Variance and Zero-Variance Baselines for Extensive-Form Games cs.GT · 2019 · author #3
  30. Rethinking Formal Models of Partially Observable Multiagent Decision Making cs.AI · 2019 · author #4
  31. Ease-of-Teaching and Language Structure from Emergent Communication cs.AI · 2019 · author #2
  32. The Hanabi Challenge: A New Frontier for AI Research cs.LG · 2019 · author #15
  33. Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning cs.MA · 2018 · author #8
  34. Actor-Critic Policy Optimization in Partially Observable Multiagent Environments cs.LG · 2018 · author #7
  35. Generalization and Regularization in DQN cs.LG · 2018 · author #3
  36. Solving Large Extensive-Form Games with Strategy Constraints cs.GT · 2018 · author #3
  37. Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines cs.GT · 2018 · author #6
  38. Count-Based Exploration with the Successor Representation cs.LG · 2018 · author #3
  39. The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces cs.AI · 2018 · author #3
  40. Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents cs.LG · 2017 · author #6
  41. A Laplacian Framework for Option Discovery in Reinforcement Learning cs.LG · 2017 · author #3
  42. DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker cs.AI · 2017 · author #10
  43. Equilibrium Approximation Quality of Current No-Limit Poker Bots cs.GT · 2016 · author #2
  44. AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games cs.AI · 2016 · author #4
  45. Learning Purposeful Behaviour in the Absence of Rewards cs.LG · 2016 · author #2
  46. State of the Art Control of Atari Games Using Shallow Reinforcement Learning cs.LG · 2015 · author #4
  47. Solving Games with Functional Regret Estimation cs.AI · 2014 · author #4
  48. Domain-Independent Optimistic Initialization for Reinforcement Learning cs.LG · 2014 · author #3
  49. Solving Imperfect Information Games Using Decomposition cs.GT · 2013 · author #3
  50. Partition Tree Weighting cs.IT · 2012 · author #3
  51. The Arcade Learning Environment: An Evaluation Platform for General Agents cs.AI · 2012 · author #4
  52. On Local Regret cs.AI · 2012 · author #1
  53. No-Regret Learning in Extensive-Form Games with Imperfect Recall cs.GT · 2012 · author #5
  54. A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning cs.LG · 2012 · author #4
  55. Alignment Based Kernel Learning with a Continuous Set of Base Kernels cs.LG · 2011 · author #3
  56. Context Tree Switching cs.IT · 2011 · author #4

Mentions

  • 2406.13909 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2409.01449 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2406.19561 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2310.10833 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2303.01074 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2402.06819 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2311.06979 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2112.03178 #13 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2310.10553 #21 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2212.10420 #1 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2110.05740 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2208.11173 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2302.12359 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2004.09677 #9 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2211.01480 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2102.06973 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2012.05874 #7 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2206.02036 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2205.12031 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2205.10736 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2111.08102 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1906.11110 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2006.08740 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2101.04237 #8 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2011.01297 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2008.12234 #12 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2006.06054 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1810.09026 #7 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1912.02967 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 2004.13657 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1810.00123 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1902.00506 #15 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1807.11622 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1906.02403 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1811.01458 #8 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1907.09633 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1806.01825 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1809.07893 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1809.03057 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1709.06009 #6 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1703.00956 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1701.01724 #10 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1612.06915 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1612.07547 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1605.07700 #2 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1512.01563 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1205.0288 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1411.7974 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1410.4604 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1303.4441 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1207.4708 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1211.0587 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1206.3318 #1 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1205.0622 #5 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1112.4607 #3 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1111.3182 #4 · arxiv_oai · confidence 0.70 Michael Bowling
  • 1411.7974 #4 · backfill · confidence 0.70 Michael Bowling
  • 1410.4604 #3 · backfill · confidence 0.70 Michael Bowling
  • 1303.4441 #3 · backfill · confidence 0.70 Michael Bowling
  • 1211.0587 #3 · backfill · confidence 0.70 Michael Bowling
  • 1207.4708 #4 · backfill · confidence 0.70 Michael Bowling
  • 1206.3318 #1 · backfill · confidence 0.70 Michael Bowling
  • 1205.0622 #5 · backfill · confidence 0.70 Michael Bowling
  • 1205.0288 #4 · backfill · confidence 0.70 Michael Bowling
  • 1112.4607 #3 · backfill · confidence 0.70 Michael Bowling
  • 1111.3182 #4 · backfill · confidence 0.70 Michael Bowling

Frequent Coauthors