Locally Scale-Invariant Convolutional Neural Networks

Abhishek Sharma; Angjoo Kanazawa; David Jacobs

arxiv: 1412.5104 · v1 · pith:FKBWQLQSnew · submitted 2014-12-16 · 💻 cs.CV · cs.LG · cs.NE

classification 💻 cs.CV cs.LG cs.NE

keywords learn convnets features data discriminative allows certain convolutional

Convolutional Neural Networks (ConvNets) have shown excellent results on many visual classification tasks. With the exception of ImageNet, these datasets are carefully crafted such that objects are well-aligned at similar scales. Naturally, the feature learning problem gets more challenging as the amount of variation in the data increases, as the models have to learn to be invariant to certain changes in appearance. Recent results on the ImageNet dataset show that, given enough data, ConvNets can learn such invariances, producing very discriminative features [1]. But could we do more (use fewer parameters and less data, and learn more discriminative features) if certain invariances were built into the learning process? In this paper we present a simple model that allows ConvNets to learn features in a locally scale-invariant manner without increasing the number of model parameters. We show on a modified MNIST dataset that, when faced with scale variation, building in scale invariance allows ConvNets to learn more discriminative features with reduced chances of over-fitting.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Interaction-and-Aggregation Network for Person Re-identification
cs.CV 2019-07 unverdicted novelty 6.0

Introduces IA network with SIA and CIA modules to adaptively model spatial and channel feature interdependencies for improved person re-identification on benchmarks.

Affine Disentangled GAN for Interpretable and Robust AV Perception
cs.CV 2019-07 unverdicted novelty 5.0

ADIS-GAN disentangles affine transformations in a GAN to achieve over 98% classification accuracy on MNIST within 30 degrees rotation and over 90% under FGSM and PGD attacks while generating rotation and scaling factors.