pith. sign in

arxiv: 1812.07627 · v1 · pith:VLGJEVMTnew · submitted 2018-12-18 · 💻 cs.LG · cs.AI· stat.ML

Clustering-Oriented Representation Learning with Attractive-Repulsive Loss

classification 💻 cs.LG cs.AIstat.ML
keywords corellatentlossaccuracyattractive-repulsivebuildingclassificationclustering-oriented
0
0 comments X
read the original abstract

The standard loss function used to train neural network classifiers, categorical cross-entropy (CCE), seeks to maximize accuracy on the training data; building useful representations is not a necessary byproduct of this objective. In this work, we propose clustering-oriented representation learning (COREL) as an alternative to CCE in the context of a generalized attractive-repulsive loss framework. COREL has the consequence of building latent representations that collectively exhibit the quality of natural clustering within the latent space of the final hidden layer, according to a predefined similarity function. Despite being simple to implement, COREL variants outperform or perform equivalently to CCE in a variety of scenarios, including image and news article classification using both feed-forward and convolutional neural networks. Analysis of the latent spaces created with different similarity functions facilitates insights on the different use cases COREL variants can satisfy, where the Cosine-COREL variant makes a consistently clusterable latent space, while Gaussian-COREL consistently obtains better classification accuracy than CCE.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.