pith. machine review for the scientific record. sign in

arxiv: 1609.01977 · v2 · submitted 2016-09-07 · 💻 cs.LG

Recognition: unknown

Doubly Stochastic Neighbor Embedding on Spheres

Authors on Pith no claims yet
classification 💻 cs.LG
keywords dataembeddingmethoddoublyproblemsimilaritystochasticvisualization
0
0 comments X
read the original abstract

Stochastic Neighbor Embedding (SNE) methods minimize the divergence between the similarity matrix of a high-dimensional data set and its counterpart from a low-dimensional embedding, leading to widely applied tools for data visualization. Despite their popularity, the current SNE methods experience a crowding problem when the data include highly imbalanced similarities. This implies that the data points with higher total similarity tend to get crowded around the display center. To solve this problem, we introduce a fast normalization method and normalize the similarity matrix to be doubly stochastic such that all the data points have equal total similarities. Furthermore, we show empirically and theoretically that the doubly stochasticity constraint often leads to embeddings which are approximately spherical. This suggests replacing a flat space with spheres as the embedding space. The spherical embedding eliminates the discrepancy between the center and the periphery in visualization, which efficiently resolves the crowding problem. We compared the proposed method (DOSNES) with the state-of-the-art SNE method on three real-world datasets and the results clearly indicate that our method is more favorable in terms of visualization quality.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Reviving In-domain Fine-tuning Methods for Source-Free Cross-domain Few-shot Learning

    cs.CV 2026-05 unverdicted novelty 7.0

    LoRA adapters fix collapsed visual CLS token attention in CLIP for superior cross-domain few-shot learning, and the new Semantic Probe framework revives prompt methods to reach state-of-the-art on four benchmarks.

  2. Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition

    cs.CV 2026-04 unverdicted novelty 5.0

    A reinforcement learning approach adapts general generative models to produce synthetic data that boosts identity recognition accuracy and generalization under privacy constraints.