Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

Bartosz Wójcik; Igor Podolak; Jacek Tabor; Klaudia Bałazy; Maciej Wołczyk; Marek Śmieja; Tomasz Trzciński

arxiv: 2106.05409 · v2 · pith:4KUFXBH2new · submitted 2021-06-09 · 💻 cs.LG

Maciej Wołczyk, Bartosz Wójcik, Klaudia Bałazy, Igor Podolak, Jacek Tabor, Marek Śmieja, Tomasz Trzciński

classification 💻 cs.LG

keywords time, early, predictions, exit, inference, methods, neural, return


The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. However, if a particular IC does not decide to return an answer early, its predictions are discarded, with its computations effectively being wasted. To solve this issue, we introduce Zero Time Waste (ZTW), a novel approach in which each IC reuses predictions returned by its predecessors by (1) adding direct connections between ICs and (2) combining previous outputs in an ensemble-like manner. We conduct extensive experiments across various datasets and architectures to demonstrate that ZTW achieves a significantly better accuracy vs. inference time trade-off than other recently proposed early exit methods.
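The inference loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names (`early_exit_predict`, `geometric_mean_ensemble`), the confidence `threshold` exit rule, and the unweighted geometric mean used for the ensemble are all assumptions made for this example. The abstract only states that each IC receives its predecessor's prediction and that previous outputs are combined "in an ensemble-like manner". The last IC stands in for the network's final classifier.

```python
import math
from typing import Callable, List, Optional, Sequence, Tuple

Vector = List[float]
Layer = Callable[[Vector], Vector]
# Hypothetical IC signature: (layer features, previous IC probabilities) -> logits
InternalClassifier = Callable[[Vector, Optional[Vector]], Vector]


def softmax(logits: Sequence[float]) -> Vector:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def geometric_mean_ensemble(prob_history: Sequence[Vector]) -> Vector:
    """Combine all IC outputs seen so far.

    Assumption: an unweighted geometric mean, renormalized to sum to 1.
    """
    n = len(prob_history)
    num_classes = len(prob_history[0])
    log_avg = [
        sum(math.log(p[c] + 1e-12) for p in prob_history) / n
        for c in range(num_classes)
    ]
    return softmax(log_avg)


def early_exit_predict(
    x: Vector,
    layers: Sequence[Layer],
    ics: Sequence[InternalClassifier],
    threshold: float = 0.9,
) -> Tuple[int, int]:
    """Return (predicted class, index of the IC that produced the answer)."""
    if not layers or len(layers) != len(ics):
        raise ValueError("need one IC per layer and at least one layer")

    history: List[Vector] = []
    prev_probs: Optional[Vector] = None
    h = x
    combined: Vector = []
    for i, (layer, ic) in enumerate(zip(layers, ics)):
        h = layer(h)
        # (1) direct connection: the IC also sees its predecessor's output
        prev_probs = softmax(ic(h, prev_probs))
        history.append(prev_probs)
        # (2) ensemble: earlier predictions are reused, not discarded
        combined = geometric_mean_ensemble(history)
        # Assumed exit rule: stop once the combined confidence is high enough
        if max(combined) >= threshold:
            return combined.index(max(combined)), i
    # No IC was confident enough: fall back to the last combined prediction
    return combined.index(max(combined)), len(layers) - 1


if __name__ == "__main__":
    identity: Layer = lambda h: h
    unsure: InternalClassifier = lambda h, prev: [0.0, 0.0]
    sure: InternalClassifier = lambda h, prev: [0.0, 10.0]
    print(early_exit_predict([1.0], [identity, identity], [unsure, sure]))
```

In this toy setup an uncertain first IC does not stop inference, yet its output still contributes to the combined prediction at the second IC, which is the "recycling" the abstract refers to.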
