Recognition: unknown
An Analysis of Deep Neural Network Models for Practical Applications
read the original abstract
Since the emergence of Deep Neural Networks (DNNs) as a prominent technique in the field of computer vision, the ImageNet classification challenge has played a major role in advancing the state-of-the-art. While accuracy figures have steadily increased, the resource utilisation of winning models has not been properly taken into account. In this work, we present a comprehensive analysis of important metrics in practical applications: accuracy, memory footprint, parameters, operations count, inference time and power consumption. Key findings are: (1) power consumption is independent of batch size and architecture; (2) accuracy and inference time are in a hyperbolic relationship; (3) energy constraint is an upper bound on the maximum achievable accuracy and model complexity; (4) the number of operations is a reliable estimate of the inference time. We believe our analysis provides a compelling set of information that helps design and engineer efficient DNNs.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
DEEP-GAP: Deep-learning Evaluation of Execution Parallelism in GPU Architectural Performance
L4 delivers up to 4.4x higher throughput than T4 for ResNet models, peaks at batch sizes 16-32, and INT8 yields up to 58x gains over CPU baselines.
-
A Transfer Learning Evaluation of Deep Neural Networks for Image Classification
Empirical comparison of transfer learning performance across eleven pre-trained models on five image datasets using accuracy, time, and size metrics.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.