Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

Frank Hutter; Ilya Loshchilov; Patryk Chrabaszcz

arxiv: 1802.08842 · v1 · pith:MRHV3EZ7new · submitted 2018-02-24 · 💻 cs.NE

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

Patryk Chrabaszcz , Ilya Loshchilov , Frank Hutter This is my paper

classification 💻 cs.NE

keywords algorithmsevolutionstrategiesadvancesalgorithmataribasicbetter

0 comments

read the original abstract

Evolution Strategies (ES) have recently been demonstrated to be a viable alternative to reinforcement learning (RL) algorithms on a set of challenging deep RL problems, including Atari games and MuJoCo humanoid locomotion benchmarks. While the ES algorithms in that work belonged to the specialized class of natural evolution strategies (which resemble approximate gradient RL algorithms, such as REINFORCE), we demonstrate that even a very basic canonical ES algorithm can achieve the same or even better performance. This success of a basic ES algorithm suggests that the state-of-the-art can be advanced further by integrating the many advances made in the field of ES in the last decades. We also demonstrate qualitatively that ES algorithms have very different performance characteristics than traditional RL algorithms: on some games, they learn to exploit the environment and perform much better while on others they can get stuck in suboptimal local minima. Combining their strengths with those of traditional RL algorithms is therefore likely to lead to new advances in the state of the art.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Evolvability ES: Scalable and Direct Optimization of Evolvability
cs.NE 2019-07 unverdicted novelty 6.0

Evolvability ES is an evolutionary strategy variant that directly optimizes for evolvability by maximizing behavioral diversity under mutations, tested on 2D/3D locomotion tasks and shown competitive with MAML.