Heated-Up Softmax Embedding
1 Pith paper cites this work. Polarity classification is still indexing.
abstract
Metric learning aims to learn a distance that is consistent with the semantic meaning of the samples. The problem is generally solved by learning an embedding for each sample such that the embeddings of samples of the same category are compact while the embeddings of samples of different categories are spread out in the feature space. We study the features extracted from the second-to-last layer of a deep neural network-based classifier trained with the cross-entropy loss on top of the softmax layer. We show that training classifiers with different temperature values of the softmax function leads to features with different levels of compactness. Leveraging these insights, we propose a "heating-up" strategy to train a classifier with increasing temperatures, leading the corresponding embeddings to achieve state-of-the-art performance on a variety of metric learning benchmarks.
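To make the role of the softmax temperature concrete, here is a minimal sketch in plain Python. It assumes the common convention of dividing the logits by a temperature T before the softmax; the paper's own parameterization and its heating-up schedule are not given in the abstract and may differ. The function names `softmax_with_temperature` and `cross_entropy` and the example logits are illustrative, not taken from the paper.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Softmax of logits / temperature (assumed convention, not from the paper)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target, temperature=1.0):
    """Cross-entropy loss for one sample with integer class label `target`."""
    probs = softmax_with_temperature(logits, temperature)
    return -math.log(probs[target])


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.1]  # hypothetical classifier outputs for one sample
    for t in (0.5, 1.0, 4.0):
        probs = softmax_with_temperature(logits, t)
        loss = cross_entropy(logits, target=0, temperature=t)
        print(f"T={t}: probs={[round(p, 3) for p in probs]}, loss={loss:.3f}")
```

Under this convention, a lower temperature sharpens the output distribution and a higher one flattens it, which changes how strongly each sample contributes to the gradient during training.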
fields
stat.ML 1
years
2026 1
verdicts
UNVERDICTED 1
representative citing papers
Optimization Dynamics Imprint Semantic Specificity in Contrastive Embedding Norms
Embedding norms in contrastive models encode semantic properties via optimization dynamics under scale-invariant losses.