pith. sign in

arxiv: 1906.00431 · v1 · pith:S2JVENJWnew · submitted 2019-06-02 · 💻 cs.LG · cs.AI· stat.ML

An Empirical Study on Hyperparameters and their Interdependence for RL Generalization

classification 💻 cs.LG cs.AIstat.ML
keywords empiricalgeneralizationresultshyperparametersmetricsparametersacrossaction
0
0 comments X
read the original abstract

Recent results in Reinforcement Learning (RL) have shown that agents with limited training environments are susceptible to a large amount of overfitting across many domains. A key challenge for RL generalization is to quantitatively explain the effects of changing parameters on testing performance. Such parameters include architecture, regularization, and RL-dependent variables such as discount factor and action stochasticity. We provide empirical results that show complex and interdependent relationships between hyperparameters and generalization. We further show that several empirical metrics such as gradient cosine similarity and trajectory-dependent metrics serve to provide intuition towards these results.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.