pith. sign in

arxiv: 1610.07008 · v1 · pith:N3FHNG5Anew · submitted 2016-10-22 · 💻 cs.CV

Optimization on Submanifolds of Convolution Kernels in CNNs

classification 💻 cs.CV
keywords methodscnnskernelkernelsnormalizationclassificationconvolutionframework
0
0 comments X
read the original abstract

Kernel normalization methods have been employed to improve robustness of optimization methods to reparametrization of convolution kernels, covariate shift, and to accelerate training of Convolutional Neural Networks (CNNs). However, our understanding of theoretical properties of these methods has lagged behind their success in applications. We develop a geometric framework to elucidate underlying mechanisms of a diverse range of kernel normalization methods. Our framework enables us to expound and identify geometry of space of normalized kernels. We analyze and delineate how state-of-the-art kernel normalization methods affect the geometry of search spaces of the stochastic gradient descent (SGD) algorithms in CNNs. Following our theoretical results, we propose a SGD algorithm with assurance of almost sure convergence of the methods to a solution at single minimum of classification loss of CNNs. Experimental results show that the proposed method achieves state-of-the-art performance for major image classification benchmarks with CNNs.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. $\boldsymbol{\lambda}$-Orthogonality Regularization for Compatible Representation Learning

    cs.LG 2025-09 conditional novelty 6.0

    λ-Orthogonality regularization enables distribution-specific adaptation of representations via affine transformations while retaining original learned structures.