Some Theoretical Limitations of t-SNE
Pith reviewed 2026-05-10 15:25 UTC · model grok-4.3
The pith
t-SNE loses important features of the data in multiple analyzed scenarios.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We provide a mathematical framework for understanding this loss for t-SNE by establishing a number of results in different scenarios showing how important features of data are lost by using t-SNE.
What carries the argument
A mathematical framework built from results in distinct data scenarios that each isolate a form of feature loss during t-SNE embedding.
Load-bearing premise
The specific scenarios analyzed in the framework are representative of the practical cases where t-SNE is applied and where feature loss would be most problematic.
What would settle it
A dataset constructed exactly according to one of the paper's scenarios in which the t-SNE embedding preserves every feature the framework predicts will be lost.
Figures
read the original abstract
t-SNE has gained popularity as a dimension reduction technique, especially for visualizing data. It is well-known that all dimension reduction techniques may lose important features of the data. We provide a mathematical framework for understanding this loss for t-SNE by establishing a number of results in different scenarios showing how important features of data are lost by using t-SNE.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to provide a mathematical framework for understanding feature loss in t-SNE by establishing a number of results across different scenarios that demonstrate how t-SNE fails to preserve important data features during dimension reduction.
Significance. If the derivations are rigorous and the scenarios capture properties of typical high-dimensional data (such as local neighborhoods and manifold structure), the framework could offer useful theoretical guidance on t-SNE limitations for visualization tasks. However, the absence of explicit theorems, proofs, or scenario details in the provided text prevents assessment of whether these strengths are realized.
major comments (2)
- [Abstract] Abstract: the claim that 'results are established' in different scenarios cannot be evaluated because no theorems, definitions, proofs, or scenario descriptions appear in the manuscript text. This is load-bearing for the central claim of a 'mathematical framework.'
- [Scenarios] Scenarios (throughout): the results purport to show loss of important features, but the skeptic concern is valid—the manuscript must demonstrate that the chosen scenarios reflect the local neighborhood preservation, cluster separation, and high-dimensional noise characteristics typical of real t-SNE applications. Without this justification, the demonstrated losses do not necessarily indicate relevant practical limitations.
minor comments (1)
- [Abstract] The abstract is extremely terse; adding one sentence summarizing the key scenarios and the nature of the lost features would improve readability.
Simulated Author's Rebuttal
We thank the referee for their comments, which identify key areas where the manuscript's presentation can be strengthened. We address each major comment below and will revise the paper accordingly to make the mathematical framework and scenario justifications explicit and self-contained.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that 'results are established' in different scenarios cannot be evaluated because no theorems, definitions, proofs, or scenario descriptions appear in the manuscript text. This is load-bearing for the central claim of a 'mathematical framework.'
Authors: We agree that the submitted manuscript does not contain explicit theorems, formal definitions, or proofs in the main text, which prevents direct evaluation of the claims. This is a genuine limitation of the current version. In the revision, we will add a dedicated 'Main Results' section that states each theorem formally, defines the scenarios (including all parameters and assumptions), and provides proof sketches, with full proofs placed in an appendix. This will allow the mathematical framework to be assessed on its merits. revision: yes
-
Referee: [Scenarios] Scenarios (throughout): the results purport to show loss of important features, but the skeptic concern is valid—the manuscript must demonstrate that the chosen scenarios reflect the local neighborhood preservation, cluster separation, and high-dimensional noise characteristics typical of real t-SNE applications. Without this justification, the demonstrated losses do not necessarily indicate relevant practical limitations.
Authors: We accept that the manuscript currently lacks explicit justification linking the scenarios to typical t-SNE use cases. In the revised version, we will insert a new subsection (likely in the introduction) that motivates each scenario by reference to standard t-SNE applications. We will explain how the constructions preserve local neighborhoods via nearest-neighbor distances, model cluster separation through controlled inter-cluster distances in high dimensions, and incorporate noise via additive high-dimensional perturbations, supported by citations to empirical t-SNE literature. revision: yes
Circularity Check
No circularity: theoretical results derived independently in analyzed scenarios
full rationale
The paper establishes a mathematical framework consisting of results proven in multiple scenarios to demonstrate feature loss under t-SNE. No self-definitional loops, fitted parameters renamed as predictions, or load-bearing self-citations appear in the abstract or described structure. The derivations are presented as independent mathematical statements about specific data configurations rather than tautological restatements of inputs. The skeptic concern regarding scenario representativeness pertains to external validity and applicability rather than internal circularity of the proofs themselves.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
- [1]
-
[2]
A. Auffinger and D. Fletcher, Equilibrium distributions for t-distributed stochastic neighbour embedding , arXiv:2304.03727 [math.PR] (2023)
-
[3]
T. T. Cai and R. Ma, Theoretical foundations of t-sne for visualizing high-dimensional clustered data , Journal of Machine Learning Research 23 (2022), no. 301, 1–54
work page 2022
-
[4]
T. M. Cover and J. A. Thomas, Elements of information theory , Wiley Series in Telecommunications, John Wiley & Sons, Inc., New York, 1991, A Wiley-Interscience Publication. MR1122806 doi:10.1002/0471200611 LIMITATIONS OF T-SNE 19
-
[5]
G. E. Hinton and S. Roweis, Stochastic neighbor embedding, Advances in neural information processing systems 15 (2002)
work page 2002
-
[6]
G. C. Linderman and S. Steinerberger, Clustering with t-SNE, provably , SIAM J. Math. Data Sci. 1 (2019), no. 2, 313–332. MR3955236 doi:10.1137/18M1216134
-
[7]
L. v. d. Maaten and G. Hinton, Visualizing data using t-SNE , Journal of machine learning research 9 (2008), no. Nov, 2579–2605
work page 2008
-
[8]
V. D. Milman and G. Schechtman, Asymptotic theory of finite-dimensional normed spaces , Lecture Notes in Mathematics, vol. 1200, Springer-Verlag, Berlin, 1986, With an appendix by M. Gromov. MR856576 Stanford University, Stanford, CA 94305, USA Email address: rupertli@stanford.edu Massachusetts Institute of Technology, Cambridge, MA 02139, USA Email addre...
work page 1986
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.