Information-theoretical label embeddings for large-scale image classification

Fran\c{c}ois Chollet

arxiv: 1607.05691 · v1 · pith:RBIKTXNInew · submitted 2016-07-19 · 💻 cs.CV · cs.LG· stat.ML

Information-theoretical label embeddings for large-scale image classification

Fran\c{c}ois Chollet This is my paper

classification 💻 cs.CV cs.LGstat.ML

keywords classificationmethodregressionfasterimagelabelslogisticproblem

0 comments

read the original abstract

We present a method for training multi-label, massively multi-class image classification models, that is faster and more accurate than supervision via a sigmoid cross-entropy loss (logistic regression). Our method consists in embedding high-dimensional sparse labels onto a lower-dimensional dense sphere of unit-normed vectors, and treating the classification problem as a cosine proximity regression problem on this sphere. We test our method on a dataset of 300 million high-resolution images with 17,000 labels, where it yields considerably faster convergence, as well as a 7% higher mean average precision compared to logistic regression.

This paper has not been read by Pith yet.

Information-theoretical label embeddings for large-scale image classification

discussion (0)