Peephole: Predicting Network Performance Before Training
The quest for performant networks has been a significant force driving the advancement of deep learning in recent years. While rewarding, improving network design has never been easy. The large design space, combined with the tremendous cost of network training, poses a major obstacle to this endeavor. In this work, we propose a new approach to this problem: predicting the performance of a network before training, based on its architecture. Specifically, we develop a unified way to encode individual layers into vectors and bring them together to form an integrated description via an LSTM. Taking advantage of the recurrent network's strong expressive power, this method can reliably predict the performance of various network architectures. Our empirical studies show that it not only achieves accurate predictions but also produces consistent rankings across datasets, a key desideratum in performance prediction.
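The pipeline described in the abstract (encode each layer as a vector, fold the sequence through an LSTM, read out a performance estimate) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the layer features, so the layer vocabulary, the `encode_layer` features, the `TinyLSTMPredictor` class, and the sigmoid readout are all hypothetical, and the weights are random and untrained, so the output is only a placeholder score in (0, 1).

```python
import math
import random

# Hypothetical layer vocabulary; the abstract does not list the real one.
LAYER_TYPES = ["conv", "pool", "bn", "relu"]

def encode_layer(layer_type, kernel_size, channels_ratio):
    """Assumed encoding: one-hot layer type plus two scaled numeric features."""
    one_hot = [1.0 if layer_type == t else 0.0 for t in LAYER_TYPES]
    return one_hot + [kernel_size / 7.0, channels_ratio]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def rand_matrix(rng, rows, cols, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

class TinyLSTMPredictor:
    """Single-layer LSTM over layer encodings with a scalar sigmoid readout."""

    def __init__(self, input_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # One weight matrix per gate (input, forget, output, candidate),
        # each applied to the concatenation [x, h].
        self.W = {g: rand_matrix(rng, hidden_dim, input_dim + hidden_dim) for g in "ifog"}
        self.b = {g: [0.0] * hidden_dim for g in "ifog"}
        self.w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_dim)]
        self.b_out = 0.0

    def step(self, x, h, c):
        z = x + h  # list concatenation of input and previous hidden state
        pre = {k: [a + b for a, b in zip(matvec(self.W[k], z), self.b[k])] for k in "ifog"}
        i = [sigmoid(v) for v in pre["i"]]
        f = [sigmoid(v) for v in pre["f"]]
        o = [sigmoid(v) for v in pre["o"]]
        g = [math.tanh(v) for v in pre["g"]]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new

    def predict(self, layers):
        """Run the LSTM over the layer sequence and squash the final state to (0, 1)."""
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        for layer in layers:
            h, c = self.step(encode_layer(*layer), h, c)
        return sigmoid(sum(w * x for w, x in zip(self.w_out, h)) + self.b_out)

# Hypothetical four-layer architecture: (type, kernel size, channel ratio).
arch = [("conv", 3, 1.0), ("bn", 1, 1.0), ("relu", 1, 1.0), ("pool", 2, 1.0)]
model = TinyLSTMPredictor(input_dim=len(LAYER_TYPES) + 2, hidden_dim=8)
score = model.predict(arch)
print(f"untrained placeholder score: {score:.4f}")
```

In a real setting the weights would be fit by regressing the readout against measured accuracies of already-trained networks; the sketch shows only the forward pass, which is the part the abstract describes.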
Forward citations
Cited by 1 Pith paper
- From Regression to Inference: Meta-Learning Predictors for Neural Architecture Search
Meta-learning a Convolutional Neural Process to infer neural architecture performance from context-target splits on synthesized tasks improves top-K ranking and achieves state-of-the-art selection on NAS-Bench-101 and...