pith. sign in

arxiv: 1707.00058 · v1 · pith:LFFRFH5Rnew · submitted 2017-06-30 · 💻 cs.CV

Multiple VLAD encoding of CNNs for image classification

classification 💻 cs.CV
keywords encodingmethodvladcnnsclassificationimagefeaturesframework
0
0 comments X p. Extension
pith:LFFRFH5R Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{LFFRFH5R}

Prints a linked pith:LFFRFH5R badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Despite the effectiveness of convolutional neural networks (CNNs) especially in image classification tasks, the effect of convolution features on learned representations is still limited. It mostly focuses on the salient object of the images, but ignores the variation information on clutter and local. In this paper, we propose a special framework, which is the multiple VLAD encoding method with the CNNs features for image classification. Furthermore, in order to improve the performance of the VLAD coding method, we explore the multiplicity of VLAD encoding with the extension of three kinds of encoding algorithms, which are the VLAD-SA method, the VLAD-LSA and the VLAD-LLC method. Finally, we equip the spatial pyramid patch (SPM) on VLAD encoding to add the spatial information of CNNs feature. In particular, the power of SPM leads our framework to yield better performance compared to the existing method.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.