pith. sign in

arxiv: 1612.00478 · v1 · pith:YAOWZFCUnew · submitted 2016-12-01 · 💻 cs.CV

In Teacher We Trust: Learning Compressed Models for Pedestrian Detection

classification 💻 cs.CV
keywords networklargepedestriandetectionlearningbeendistillationeffective
0
0 comments X
read the original abstract

Deep convolutional neural networks continue to advance the state-of-the-art in many domains as they grow bigger and more complex. It has been observed that many of the parameters of a large network are redundant, allowing for the possibility of learning a smaller network that mimics the outputs of the large network through a process called Knowledge Distillation. We show, however, that standard Knowledge Distillation is not effective for learning small models for the task of pedestrian detection. To improve this process, we introduce a higher-dimensional hint layer to increase information flow. We also estimate the variance in the outputs of the large network and propose a loss function to incorporate this uncertainty. Finally, we attempt to boost the complexity of the small network without increasing its size by using as input hand-designed features that have been demonstrated to be effective for pedestrian detection. We succeed in training a model that contains $400\times$ fewer parameters than the large network while outperforming AlexNet on the Caltech Pedestrian Dataset.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Towards Generalizing Sensorimotor Control Across Weather Conditions

    cs.LG 2019-07 unverdicted novelty 5.0

    A teacher-student framework with domain translation transfers steering control from one weather condition to multiple others using only source-domain labels.