An Analysis of Deep Neural Network Models for Practical Applications

Alfredo Canziani, Adam Paszke, Eugenio Culurciello

arxiv: 1605.07678 · v4 · pith:2NEHPM6Nnew · submitted 2016-05-24 · 💻 cs.CV

classification 💻 cs.CV

keywords accuracy, analysis, inference, time, applications, consumption, deep, dnns

Since the emergence of Deep Neural Networks (DNNs) as a prominent technique in the field of computer vision, the ImageNet classification challenge has played a major role in advancing the state-of-the-art. While accuracy figures have steadily increased, the resource utilisation of winning models has not been properly taken into account. In this work, we present a comprehensive analysis of important metrics in practical applications: accuracy, memory footprint, parameters, operations count, inference time and power consumption. Key findings are: (1) power consumption is independent of batch size and architecture; (2) accuracy and inference time are in a hyperbolic relationship; (3) energy constraint is an upper bound on the maximum achievable accuracy and model complexity; (4) the number of operations is a reliable estimate of the inference time. We believe our analysis provides a compelling set of information that helps design and engineer efficient DNNs.
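
Two of the metrics named in the abstract, parameters and operations count, can be estimated from layer shapes alone. The following is a minimal sketch for a single 2D convolution layer, assuming a square kernel, one bias per output channel, and multiply-accumulates (MACs) as the unit of operation. The `ConvLayer` class, the `totals` helper, and the example layer shape are hypothetical illustrations and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvLayer:
    """Shape of one 2D convolution layer (hypothetical helper, not from the paper)."""
    kernel: int        # side length of the square kernel
    in_channels: int
    out_channels: int
    out_height: int    # spatial size of the output feature map
    out_width: int

    def parameters(self) -> int:
        # One weight per (kernel position, input channel, output channel),
        # plus one bias per output channel.
        weights = self.kernel * self.kernel * self.in_channels * self.out_channels
        return weights + self.out_channels

    def macs(self) -> int:
        # Each output element needs kernel * kernel * in_channels multiply-accumulates.
        per_output = self.kernel * self.kernel * self.in_channels
        return per_output * self.out_channels * self.out_height * self.out_width


def totals(layers):
    """Sum parameters and MACs over a list of layers."""
    return (sum(layer.parameters() for layer in layers),
            sum(layer.macs() for layer in layers))


# Illustrative layer shape only; not a layer from any model in the paper.
example = ConvLayer(kernel=3, in_channels=16, out_channels=32,
                    out_height=28, out_width=28)
params, macs = totals([example])
print(params, macs)
```

Summing these counts over every layer gives a rough, hardware-independent figure for a model. Measured inference time and power still depend on the device and software stack.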

Forward citations

Cited by 8 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

New pointwise convolution in Deep Neural Networks through Extremely Fast and Non Parametric Transforms
cs.CV 2019-06 unverdicted novelty 5.0

Replacing pointwise convolutions with DWHT yields a model with 79.1% fewer parameters, 48.4% fewer FLOPs, and 1.49% higher accuracy than MobileNet-V1 on CIFAR-100.
Green Prompting: Characterizing Prompt-driven Energy Costs of LLM Inference
cs.CL 2025-03 unverdicted novelty 4.0

Empirical tests on three LLMs show prompt semantics and task keywords drive inference energy costs more than length, with varying patterns by task.
Comparison of Neural Network Architectures for Spectrum Sensing
eess.SP 2019-07 unverdicted novelty 4.0

Empirical comparison finds CNN, RNN and BiRNN achieve similar detection performance with abundant data and resources while FC performs worse except under strict complexity constraints.
Benchmarking Physical Performance of Neural Inference Circuits
cs.ET 2019-07 unverdicted novelty 4.0

Authors apply a consistent methodology to benchmark physical performance metrics across neural network architectures and device technologies, identifying promising combinations.
DEEP-GAP: Deep-learning Evaluation of Execution Parallelism in GPU Architectural Performance
cs.PF 2026-04 unverdicted novelty 3.0

L4 delivers up to 4.4x higher throughput than T4 for ResNet models, peaks at batch sizes 16-32, and INT8 yields up to 58x gains over CPU baselines.
A Transfer Learning Evaluation of Deep Neural Networks for Image Classification
cs.CV 2026-05 unverdicted novelty 2.0

Empirical comparison of transfer learning performance across eleven pre-trained models on five image datasets using accuracy, time, and size metrics.
Modern CNNs for IoT Based Farms
cs.CY 2019-07 unverdicted novelty 2.0

A survey of state-of-the-art CNN architectures for agricultural IoT applications that proposes a tailored classification taxonomy and reviews existing research to guide architecture selection.
Deep Learning in the Automotive Industry: Recent Advances and Application Examples
cs.LG 2019-06 unverdicted

An overview of deep learning applications and challenges in the automotive industry, covering ADAS, automated driving, virtual sensing, and data-driven development.