Towards Automated Deep Learning: Efficient Joint Neural Architecture and Hyperparameter Search

Aaron Klein; Arber Zela; Frank Hutter; Stefan Falkner

arxiv: 1807.06906 · v1 · pith:GTAF4677new · submitted 2018-07-18 · cs.LG · cs.AI · cs.CV · stat.ML

classification cs.LG · cs.AI · cs.CV · stat.ML

keywords architecture · hyperparameter · neural · search · demonstrate · during · efficient · epochs

While existing work on neural architecture search (NAS) tunes hyperparameters in a separate post-processing step, we demonstrate that architectural choices and other hyperparameter settings interact in a way that can render this separation suboptimal. Likewise, we demonstrate that the common practice of using very few epochs during the main NAS and much larger numbers of epochs during a post-processing step is inefficient due to little correlation in the relative rankings for these two training regimes. To combat both of these problems, we propose to use a recent combination of Bayesian optimization and Hyperband for efficient joint neural architecture and hyperparameter search.
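
As a rough illustration of the budget-based idea the abstract refers to, the sketch below runs successive halving (the building block of Hyperband) over a joint space of architectural choices and training hyperparameters. It is a minimal sketch under stated assumptions, not the authors' method: the search space, the `toy_validation_error` function, and all parameter names are hypothetical, configurations are sampled at random rather than proposed by a Bayesian optimization model, and the synthetic objective ranks configurations identically at every budget, which real training (as the abstract notes) does not.

```python
import random


def sample_config(rng):
    """Draw one joint configuration: architecture and hyperparameters together.

    The search space here is hypothetical and chosen only for illustration.
    """
    return {
        "num_layers": rng.choice([2, 4, 8]),   # architectural choice
        "width": rng.choice([16, 32, 64]),     # architectural choice
        "log10_lr": rng.uniform(-4.0, -1.0),   # training hyperparameter
    }


def toy_validation_error(config, epochs):
    """Synthetic stand-in for training a network for `epochs` epochs.

    The best learning rate depends on the depth, so the architectural and
    hyperparameter dimensions interact. No real network is trained.
    """
    best_log10_lr = -2.0 - 0.1 * config["num_layers"]
    lr_penalty = (config["log10_lr"] - best_log10_lr) ** 2
    capacity_penalty = 1.0 / config["width"]
    return lr_penalty + capacity_penalty + 1.0 / epochs


def successive_halving(configs, min_epochs, max_epochs, eta=3):
    """Evaluate all configs on a small budget, keep the best 1/eta of them,
    multiply the budget by eta, and repeat until the maximum budget is reached.

    Returns the best configuration and the budget it was last evaluated at.
    """
    survivors = list(configs)
    epochs = min_epochs
    while True:
        ranked = sorted(survivors, key=lambda c: toy_validation_error(c, epochs))
        if epochs >= max_epochs or len(ranked) == 1:
            return ranked[0], epochs
        survivors = ranked[: max(1, len(ranked) // eta)]
        epochs = min(epochs * eta, max_epochs)


# Usage: 27 random joint configurations, budgets of 1, 3, 9, and 27 epochs.
rng = random.Random(0)
candidates = [sample_config(rng) for _ in range(27)]
best, final_epochs = successive_halving(candidates, min_epochs=1, max_epochs=27)
print(best, final_epochs)
```

Because architecture and hyperparameters are sampled as a single configuration, no separate tuning pass is needed afterwards; replacing `sample_config` with a model-based proposal step would be the place where the Bayesian optimization component enters.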

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive
cs.CL 2024-02 conditional novelty 6.0

DPOP is a new loss function that prevents DPO from lowering preferred-response likelihoods, outperforms standard DPO on diverse datasets and MT-Bench, and enables Smaug-72B to exceed 80% on the Open LLM Leaderboard.

Spiking Neural Network Architecture Search: A Survey
cs.NE 2025-10 unverdicted novelty 2.0

A survey of Spiking Neural Network architecture search techniques viewed through a hardware/software co-design lens.