Clustering-Oriented Representation Learning with Attractive-Repulsive Loss

Andre Cianflone; Doina Precup; Guillaume Rabusseau; Jackie Chi Kit Cheung; Kian Kenyon-Dean; Lucas Page-Caccia

arxiv: 1812.07627 · v1 · pith:VLGJEVMTnew · submitted 2018-12-18 · 💻 cs.LG · cs.AI· stat.ML

Clustering-Oriented Representation Learning with Attractive-Repulsive Loss

Kian Kenyon-Dean , Andre Cianflone , Lucas Page-Caccia , Guillaume Rabusseau , Jackie Chi Kit Cheung , Doina Precup This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords corellatentlossaccuracyattractive-repulsivebuildingclassificationclustering-oriented

0 comments

read the original abstract

The standard loss function used to train neural network classifiers, categorical cross-entropy (CCE), seeks to maximize accuracy on the training data; building useful representations is not a necessary byproduct of this objective. In this work, we propose clustering-oriented representation learning (COREL) as an alternative to CCE in the context of a generalized attractive-repulsive loss framework. COREL has the consequence of building latent representations that collectively exhibit the quality of natural clustering within the latent space of the final hidden layer, according to a predefined similarity function. Despite being simple to implement, COREL variants outperform or perform equivalently to CCE in a variety of scenarios, including image and news article classification using both feed-forward and convolutional neural networks. Analysis of the latent spaces created with different similarity functions facilitates insights on the different use cases COREL variants can satisfy, where the Cosine-COREL variant makes a consistently clusterable latent space, while Gaussian-COREL consistently obtains better classification accuracy than CCE.

This paper has not been read by Pith yet.

Clustering-Oriented Representation Learning with Attractive-Repulsive Loss

discussion (0)