pith. machine review for the scientific record. sign in

arxiv: 1805.04874 · v3 · submitted 2018-05-13 · 📊 stat.ML · cs.LG

Recognition: unknown

GAN Q-learning

Authors on Pith no claims yet
classification 📊 stat.ML cs.LG
keywords distributionallearningapproachq-learningreinforcementadversarialalgorithmalternative
0
0 comments X
read the original abstract

Distributional reinforcement learning (distributional RL) has seen empirical success in complex Markov Decision Processes (MDPs) in the setting of nonlinear function approximation. However, there are many different ways in which one can leverage the distributional approach to reinforcement learning. In this paper, we propose GAN Q-learning, a novel distributional RL method based on generative adversarial networks (GANs) and analyze its performance in simple tabular environments, as well as OpenAI Gym. We empirically show that our algorithm leverages the flexibility and blackbox approach of deep learning models while providing a viable alternative to traditional methods.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.