
arxiv: 1411.6369 · v1 · pith:P5GU45WSnew · submitted 2014-11-24 · 💻 cs.CV · cs.LG · cs.NE

Scale-Invariant Convolutional Neural Networks

classification 💻 cs.CV cs.LG cs.NE
keywords scale convolutional neural sicnn classification model multi-column network

Even though convolutional neural networks (CNNs) have achieved near-human performance in various computer vision tasks, their ability to tolerate scale variations is limited. The popular practice is to make the model bigger first, and then train it with data augmentation using extensive scale-jittering. In this paper, we propose a scale-invariant convolutional neural network (SiCNN), a model designed to incorporate multi-scale feature extraction and classification into the network structure. SiCNN uses a multi-column architecture, with each column focusing on a particular scale. Unlike previous multi-column strategies, these columns share the same set of filter parameters by a scale transformation among them. This design deals with scale variation without blowing up the model size. Experimental results show that SiCNN detects features at various scales, and the classification result exhibits strong robustness against object scale variations.
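
The shared-filter, multi-column idea in the abstract can be illustrated with a toy sketch. This is not the paper's implementation: the abstract does not specify the scale transformation, so the sketch assumes nearest-neighbour upsampling of one canonical filter with area normalisation, global max pooling per column, and a max across columns. All function names (`rescale_filter`, `column_response`, `multi_column_response`) and the checkerboard test pattern are hypothetical, chosen only so that the behaviour is easy to verify by hand.

```python
# Toy sketch of a two-column network whose columns share one canonical filter.
# Assumption: the "scale transformation" is nearest-neighbour upsampling with
# area normalisation; the paper may use a different transformation.

def rescale_filter(kernel, factor):
    """Derive a column's filter from the shared kernel (integer factor)."""
    size = len(kernel) * factor
    area = factor * factor  # keeps the matched response constant across scales
    return [[kernel[i // factor][j // factor] / area for j in range(size)]
            for i in range(size)]

def upsample(image, factor):
    """Nearest-neighbour upsampling of a 2-D list by an integer factor."""
    return [[image[i // factor][j // factor]
             for j in range(len(image[0]) * factor)]
            for i in range(len(image) * factor)]

def embed(pattern, size, offset):
    """Place a small pattern on a zero background of shape size x size."""
    canvas = [[0] * size for _ in range(size)]
    for i, row in enumerate(pattern):
        for j, value in enumerate(row):
            canvas[offset + i][offset + j] = value
    return canvas

def column_response(image, kernel):
    """Valid cross-correlation followed by global max pooling."""
    k = len(kernel)
    best = float("-inf")
    for r in range(len(image) - k + 1):
        for c in range(len(image[0]) - k + 1):
            score = sum(kernel[i][j] * image[r + i][c + j]
                        for i in range(k) for j in range(k))
            best = max(best, score)
    return best

def multi_column_response(image, base_kernel, factors=(1, 2)):
    """Run every column on the image; return the max and per-column values."""
    per_column = [column_response(image, rescale_filter(base_kernel, f))
                  for f in factors]
    return max(per_column), per_column

# Hypothetical test pattern: a 3x3 checkerboard that the shared filter matches.
BASE = [[1, -1, 1], [-1, 1, -1], [1, -1, 1]]
small = embed(BASE, size=7, offset=2)   # object at the original scale
large = upsample(small, 2)              # same object at twice the scale

small_out, small_cols = multi_column_response(small, BASE)
large_out, large_cols = multi_column_response(large, BASE)
print(small_out, small_cols)
print(large_out, large_cols)
```

In this toy setting the factor-1 column responds most strongly to the small object and the factor-2 column to the enlarged one, so the pooled output is the same for both inputs even though only one 3x3 set of parameters is stored. The exact equality depends on the hand-picked pattern and integer scale factor; a trained network would only approximate it.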

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Generalized Spherical Neural Operators: Green's Function Formulation

    cs.LG 2025-12 unverdicted novelty 6.0

    GSNO uses position-dependent spherical Green's functions to create flexible neural operators that adapt to non-equivariant systems on spheres while keeping spectral efficiency and grid invariance.