pith. sign in

arxiv: 1706.00885 · v4 · pith:O3SINTRUnew · submitted 2017-06-03 · 💻 cs.CV · cs.LG

IDK Cascades: Fast Deep Learning by Learning not to Overthink

classification 💻 cs.CV cs.LG
keywords frameworkcascadeslearningdeepmodelpredictionproposedaccuracy
0
0 comments X
read the original abstract

Advances in deep learning have led to substantial increases in prediction accuracy but have been accompanied by increases in the cost of rendering predictions. We conjecture that fora majority of real-world inputs, the recent advances in deep learning have created models that effectively "overthink" on simple inputs. In this paper, we revisit the classic question of building model cascades that primarily leverage class asymmetry to reduce cost. We introduce the "I Don't Know"(IDK) prediction cascades framework, a general framework to systematically compose a set of pre-trained models to accelerate inference without a loss in prediction accuracy. We propose two search based methods for constructing cascades as well as a new cost-aware objective within this framework. The proposed IDK cascade framework can be easily adopted in the existing model serving systems without additional model re-training. We evaluate the proposed techniques on a range of benchmarks to demonstrate the effectiveness of the proposed framework.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Visual Accommodation: Rethinking Image Scale as a Learnable Variable for Object Detection

    cs.CV 2024-12 unverdicted novelty 5.0

    Ciliary-DETR adds a learnable scale predictor and parametric loss objectives to enable test-time image scale adjustment for object detection in a single forward pass.