pith. sign in

arxiv: 1706.02999 · v1 · pith:D2BXIYQYnew · submitted 2017-06-09 · 📊 stat.ML · cs.AI· cs.LG

Symmetry Learning for Function Approximation in Reinforcement Learning

classification 📊 stat.ML cs.AIcs.LG
keywords learningsymmetriessymmetryapproximationmethodreinforcementrewardadvances
0
0 comments X
read the original abstract

In this paper we explore methods to exploit symmetries for ensuring sample efficiency in reinforcement learning (RL), this problem deserves ever increasing attention with the recent advances in the use of deep networks for complex RL tasks which require large amount of training data. We introduce a novel method to detect symmetries using reward trails observed during episodic experience and prove its completeness. We also provide a framework to incorporate the discovered symmetries for functional approximation. Finally we show that the use of potential based reward shaping is especially effective for our symmetry exploitation mechanism. Experiments on various classical problems show that our method improves the learning performance significantly by utilizing symmetry information.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Deep deterministic policy gradient with symmetric data augmentation for lateral attitude tracking control of a fixed-wing aircraft

    cs.LG 2024-07 unverdicted novelty 4.0

    Symmetric data augmentation plus dual-critic DDPG accelerates policy convergence for fixed-wing aircraft lateral attitude control under an MDP symmetry assumption.