pith. machine review for the scientific record. sign in

arxiv: 1312.6229 · v4 · submitted 2013-12-21 · 💻 cs.CV

Recognition: unknown

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

Authors on Pith no claims yet
classification 💻 cs.CV
keywords detectionlocalizationintegratedapproachconvolutionalframeworklearningnetworks
0
0 comments X
read the original abstract

We present an integrated framework for using Convolutional Networks for classification, localization and detection. We show how a multiscale and sliding window approach can be efficiently implemented within a ConvNet. We also introduce a novel deep learning approach to localization by learning to predict object boundaries. Bounding boxes are then accumulated rather than suppressed in order to increase detection confidence. We show that different tasks can be learned simultaneously using a single shared network. This integrated framework is the winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013) and obtained very competitive results for the detection and classifications tasks. In post-competition work, we establish a new state of the art for the detection task. Finally, we release a feature extractor from our best model called OverFeat.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. DETOUR: A Practical Backdoor Attack against Object Detection

    cs.CR 2026-04 unverdicted novelty 6.0

    DETOUR enables practical backdoor attacks on object detectors by training with rescaled semantic triggers from real-world objects placed at multiple locations to exploit the trigger radiating effect for reliable activ...

  2. Rethinking Atrous Convolution for Semantic Image Segmentation

    cs.CV 2017-06 unverdicted novelty 6.0

    DeepLabv3 improves semantic segmentation by capturing multi-scale context with cascaded or parallel atrous convolutions and adding global context to ASPP, achieving better results on PASCAL VOC 2012 without DenseCRF p...