Daniel J. Mankowitz

Identifiers

  • name variant Daniel J. Mankowitz 0.60 · backfill

Papers (26)

  1. Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #969
  2. Nash Learning from Human Feedback stat.ML · 2023 · author #15
  3. Towards practical reinforcement learning for tokamak magnetic control physics.plasm-ph · 2023 · author #16
  4. Optimizing Memory Mapping Using Deep Reinforcement Learning cs.PF · 2023 · author #19
  5. Controlling Commercial Cooling Systems Using Reinforcement Learning cs.LG · 2022 · author #36
  6. COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation cs.LG · 2022 · author #3
  7. MuZero with Self-competition for Rate Control in VP9 Video Compression eess.IV · 2022 · author #14
  8. Competition-Level Code Generation with AlphaCode cs.PL · 2022 · author #21
  9. Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification cs.LG · 2020 · author #1
  10. Balancing Constraints and Rewards with Meta-Gradient D4PG cs.LG · 2020 · author #2
  11. An empirical investigation of the challenges of real-world reinforcement learning cs.LG · 2020 · author #3
  12. Robust Reinforcement Learning for Continuous Control with Model Misspecification cs.LG · 2019 · author #1
  13. Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces cs.LG · 2019 · author #4
  14. Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning cs.LG · 2018 · author #4
  15. Reward Constrained Policy Optimization cs.LG · 2018 · author #2
  16. Soft-Robust Actor-Critic Policy-Gradient cs.LG · 2018 · author #2
  17. Unicorn: Continual Learning with a Universal, Off-policy Agent cs.LG · 2018 · author #1
  18. Learning Robust Options cs.AI · 2018 · author #1
  19. Situationally Aware Options cs.AI · 2017 · author #1
  20. Shallow Updates for Deep Reinforcement Learning cs.AI · 2017 · author #3
  21. Situational Awareness by Risk-Conscious Skills cs.AI · 2016 · author #1
  22. A Deep Hierarchical Approach to Lifelong Learning in Minecraft cs.AI · 2016 · author #4
  23. Adaptive Skills, Adaptive Partitions (ASAP) cs.LG · 2016 · author #1
  24. Iterative Hierarchical Optimization for Misspecified Problems (IHOMP) cs.LG · 2016 · author #1
  25. CFORB: Circular FREAK-ORB Visual Odometry cs.CV · 2015 · author #1
  26. Bootstrapping Skills cs.AI · 2015 · author #1

Mentions

  • 2312.00886 #15 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2305.07440 #19 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2307.11546 #16 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2211.07357 #36 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2204.08957 #3 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2202.06626 #14 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2003.11881 #3 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2010.10644 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 2010.06324 #2 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1906.07516 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1905.09700 #4 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1809.02121 #4 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1805.11074 #2 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1803.04848 #2 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1802.08294 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1802.03236 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1711.07832 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1705.07461 #3 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1604.07255 #4 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1610.02847 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1602.03348 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1602.03351 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1506.05257 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1506.03624 #1 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz
  • 1506.05257 #1 · backfill · confidence 0.70 Daniel J. Mankowitz
  • 1506.03624 #1 · backfill · confidence 0.70 Daniel J. Mankowitz
  • 2203.07814 #21 · arxiv_oai · confidence 0.70 Daniel J. Mankowitz

Frequent Coauthors