pith. machine review for the scientific record.

arxiv: 1604.07269 · v1 · submitted 2016-04-25 · 💻 cs.NE · cs.LG

Recognition: unknown

CMA-ES for Hyperparameter Optimization of Deep Neural Networks

keywords: cma-es, optimization, neural, bayesian, deep, hyperparameters, networks, parallel

Hyperparameters of deep neural networks are often optimized by grid search, random search or Bayesian optimization. As an alternative, we propose to use the Covariance Matrix Adaptation Evolution Strategy (CMA-ES), which is known for its state-of-the-art performance in derivative-free optimization. CMA-ES has some useful invariance properties and is friendly to parallel evaluations of solutions. We provide a toy example comparing CMA-ES and state-of-the-art Bayesian optimization algorithms for tuning the hyperparameters of a convolutional neural network for the MNIST dataset on 30 GPUs in parallel.
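The parallel-evaluation pattern the abstract refers to can be illustrated with a minimal sketch. This is not the authors' code and not full CMA-ES: it is a simplified (mu, lambda) evolution strategy with isotropic Gaussian sampling, no covariance matrix adaptation, and a fixed step-size decay, using only the Python standard library. The objective `toy_validation_error`, the function `simple_es`, and all parameter values are hypothetical stand-ins chosen for illustration; in a real setting each evaluation would train a network on its own GPU.

```python
import math
import random
from concurrent.futures import ThreadPoolExecutor


def toy_validation_error(x):
    """Hypothetical stand-in for training a network and returning validation error.

    x[0] plays the role of log10(learning rate), x[1] of a dropout rate.
    """
    return (x[0] + 3.0) ** 2 + (x[1] - 0.5) ** 2


def simple_es(objective, x0, sigma0, popsize=30, generations=40, workers=30, seed=0):
    """Simplified (mu, lambda) evolution strategy with isotropic sampling.

    Not full CMA-ES: there is no covariance matrix adaptation, and the
    step size simply decays geometrically (an assumption of this sketch).
    """
    rng = random.Random(seed)
    mean, sigma = list(x0), sigma0
    mu = popsize // 2
    # Log-rank recombination weights, normalized to sum to 1.
    raw = [math.log(mu + 0.5) - math.log(i + 1) for i in range(mu)]
    weights = [w / sum(raw) for w in raw]
    best_x, best_f = list(mean), objective(mean)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for _ in range(generations):
            # "Ask": sample one candidate per parallel worker.
            candidates = [[m + sigma * rng.gauss(0.0, 1.0) for m in mean]
                          for _ in range(popsize)]
            # Evaluate the whole population concurrently.
            fitness = list(pool.map(objective, candidates))
            # "Tell": rank candidates and recombine the best mu into a new mean.
            ranked = sorted(zip(fitness, candidates), key=lambda fc: fc[0])
            if ranked[0][0] < best_f:
                best_f, best_x = ranked[0][0], list(ranked[0][1])
            mean = [sum(w * c[j] for w, (_, c) in zip(weights, ranked))
                    for j in range(len(mean))]
            sigma *= 0.9  # assumed fixed decay in place of CMA-ES step-size control
    return best_x, best_f


best_x, best_f = simple_es(toy_validation_error, x0=[0.0, 0.0], sigma0=1.0)
print(best_x, best_f)
```

The population size of 30 mirrors the 30 parallel GPUs mentioned in the abstract, so that each generation occupies every worker once. A full CMA-ES additionally adapts the covariance matrix and step size from the ranked samples, which this sketch omits.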

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Importance Sampling Optimization with Laplace Principle

    math.OC · 2026-04 · unverdicted · novelty 7.0

    A Laplace-inspired importance sampling scheme for averaging random search points achieves error of order $n^{-2/(d+2)}$ after $n$ evaluations, improving on the $n^{-1/d}$ rate of standard random search...