MAPLE: Self-Supervised Learning-Enhanced Nonlinear Dimensionality Reduction for Visual Analysis

Andreas Kerren; Angelos Chatzimparmpas; Takanori Fujiwara; Wandrille Duchemin; Zeyang Huang

arxiv: 2601.20173 · v2 · pith:GADJQGWQnew · submitted 2026-01-28 · 💻 cs.LG · cs.HC

MAPLE: Self-Supervised Learning-Enhanced Nonlinear Dimensionality Reduction for Visual Analysis

Zeyang Huang , Takanori Fujiwara , Angelos Chatzimparmpas , Wandrille Duchemin , Andreas Kerren This is my paper

Pith reviewed 2026-05-16 11:02 UTC · model grok-4.3

classification 💻 cs.LG cs.HC

keywords nonlinear dimensionality reductionUMAPself-supervised learningmanifold learningdata visualizationcluster separationMAPLE

0 comments

The pith

MAPLE enhances UMAP with self-supervised learning to yield clearer cluster separations and finer subclusters than standard methods.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

MAPLE is a nonlinear dimensionality reduction method that builds directly on UMAP by adding a self-supervised learning step. It introduces maximum manifold capacity representations to encode manifold geometry more efficiently, compressing variance among similar points while expanding differences among dissimilar ones. This targets data with curved manifolds and high internal cluster variation, such as biological or image datasets. A sympathetic reader would care because the result is improved visual separation of clusters and better resolution of subgroups without a large increase in computing time.

Core claim

MAPLE employs a self-supervised learning approach to more efficiently encode low-dimensional manifold geometry. Central to this approach are maximum manifold capacity representations, which help untangle complex manifolds by compressing variances among locally similar data points while amplifying variance among dissimilar data points. This design is particularly effective for high-dimensional data with substantial intra-cluster variance and curved manifold structures. Qualitative and quantitative evaluations show that MAPLE produces clearer visual cluster separations and finer subcluster resolution than UMAP while maintaining tractable computational cost.

What carries the argument

Maximum manifold capacity representations (MMCRs), which untangle complex manifolds by compressing intra-cluster variance and amplifying inter-cluster variance.

If this is right

MAPLE produces clearer visual cluster separations than UMAP on high-dimensional data.
It resolves finer subclusters within groups that standard UMAP may merge.
The added self-supervised step keeps overall computation tractable.
The approach works especially well on curved manifolds with large intra-group variation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The MMCR enhancement could be tested as a plug-in module for other base reduction algorithms besides UMAP.
In single-cell or image domains the method might surface subtle subtypes that require less manual parameter adjustment.
If MMCRs scale well, the technique could reduce reliance on heavy post-processing of visualization outputs.

Load-bearing premise

That maximum manifold capacity representations can reliably untangle complex manifolds by compressing intra-cluster variance and amplifying inter-cluster variance.

What would settle it

A side-by-side comparison on a high-variance curved-manifold dataset where MAPLE shows no improvement in cluster separation or subcluster detail over UMAP would falsify the central claim.

Figures

Figures reproduced from arXiv: 2601.20173 by Andreas Kerren, Angelos Chatzimparmpas, Takanori Fujiwara, Wandrille Duchemin, Zeyang Huang.

**Figure 2.** Figure 2: The MAPLE pipeline, consisting of two phases: (1) graph construction and (2) graph layout. Phase 1 learns a graph representation of the input [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 2.** Figure 2: 1) Step 1. Fuzzy Graph Construction: Given the learned graph, G, and the embedded data, Z, MAPLE computes fuzzy weights for each edge using a softmax function: wi j = exp(− [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: Layouts of image (MNIST [22], Fashion-MNIST [85], STL-10 [17]) and single-cell datasets ( [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Closer visual inspection of MAPLE layouts on the Fashion-MNIST dataset (same layout as in [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Completion time of MAPLE and UMAP in seconds. Left: runtime with varying feature dimensions (D) at fixed N = 10,000. Right: runtime with varying data sizes (N) at fixed D = 784. Solid and dotted lines indicate neighborhood sizes of k = 15 and k = 30, respectively. Both axes are shown on logarithmic scales, with tick labels 102 and 103 indicate orders of magnitude. execute tensor operations on the integrate… view at source ↗

**Figure 7.** Figure 7: Comparison of MAPLE and UMAP layouts of C. elegans neuron data. (a) Raw neuron subset [60] (20,222 gene dimensions; corresponds to [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗

read the original abstract

We present a new nonlinear dimensionality reduction method, MAPLE, that enhances UMAP by improving manifold modeling. MAPLE employs a self-supervised learning approach to more efficiently encode low-dimensional manifold geometry. Central to this approach are maximum manifold capacity representations (MMCRs), which help untangle complex manifolds by compressing variances among locally similar data points while amplifying variance among dissimilar data points. This design is particularly effective for high-dimensional data with substantial intra-cluster variance and curved manifold structures, such as biological or image data. Our qualitative and quantitative evaluations demonstrate that MAPLE can produce clearer visual cluster separations and finer subcluster resolution than UMAP while maintaining tractable computational cost.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MAPLE adds self-supervision to UMAP for better manifold handling in visual tasks, but the justification for the variance effects is underdeveloped.

read the letter

MAPLE adds a self-supervised layer to UMAP that uses maximum manifold capacity representations to compress variance inside similar groups and spread it between dissimilar ones. The goal is to improve nonlinear reduction on curved manifolds with high internal variation, the kind common in biological or image data. That focus matches a practical need for clearer visualizations without extra compute cost. The evaluations report sharper cluster boundaries and finer subclusters than plain UMAP, backed by both visual examples and quantitative measures, which is the part that could actually help people who rely on these plots for exploration. The main gap is the missing connection between the self-supervised objective and the claimed variance behavior. The abstract states that MMCRs untangle manifolds this way, yet no loss function, derivation, or proof sketch shows why the training produces compression of intra-cluster variance rather than some other reparameterization. Without that link, the observed gains could trace to dataset tuning or implementation choices instead of the new mechanism. The stress-test concern on this point holds up from the description given. The paper is aimed at researchers who already use UMAP for high-dimensional visual analysis in applied domains. A reader working on similar data would get direct value from the comparisons and could test the method themselves. I would send it for peer review so referees can examine the implementation details and see whether the self-supervised step delivers the stated effect or needs stronger grounding.

Referee Report

2 major / 1 minor

Summary. The manuscript presents MAPLE, a nonlinear dimensionality reduction method that augments UMAP via self-supervised learning. Central to the approach are maximum manifold capacity representations (MMCRs), which are claimed to untangle complex manifolds by compressing intra-cluster variance while amplifying inter-cluster variance. The paper asserts that this yields clearer visual cluster separations and finer subcluster resolution than UMAP on high-dimensional data with curved manifolds (e.g., biological or image data), while preserving tractable computational cost.

Significance. If the claimed variance-modulation mechanism and empirical gains are rigorously substantiated, MAPLE would constitute a practical advance in manifold-learning-based visualization tools. The self-supervised enhancement targets a known limitation of UMAP on data with substantial intra-cluster variance, and the emphasis on computational tractability aligns with real-world usage in exploratory analysis.

major comments (2)

[Abstract] Abstract: the central claim that MMCRs 'compress variances among locally similar data points while amplifying variance among dissimilar data points' is presented without an explicit loss function, objective, or derivation showing why the self-supervised objective produces this specific intra-/inter-cluster variance effect rather than a generic re-embedding. This mechanism is load-bearing for the asserted improvement over UMAP.
[Abstract] Abstract: the statement that 'qualitative and quantitative evaluations demonstrate' superior cluster separation lacks any reference to the specific metrics, datasets, or statistical tests used, making it impossible to assess whether the reported gains are robust or merely qualitative.

minor comments (1)

The abstract would be clearer if it briefly indicated the self-supervised architecture or loss used to learn the MMCRs.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback. We address each major comment below, agreeing that the abstract requires clarification on both the mechanism and the evaluation details. Revisions will be made to strengthen these aspects without altering the core contributions.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that MMCRs 'compress variances among locally similar data points while amplifying variance among dissimilar data points' is presented without an explicit loss function, objective, or derivation showing why the self-supervised objective produces this specific intra-/inter-cluster variance effect rather than a generic re-embedding. This mechanism is load-bearing for the asserted improvement over UMAP.

Authors: We acknowledge the abstract's brevity omits the explicit objective. Section 3.2 of the manuscript derives the MMCR self-supervised loss, which maximizes manifold capacity by explicitly minimizing intra-manifold variance (via local similarity compression) and increasing inter-manifold separation (via dissimilarity amplification); this is achieved through a capacity-regularized contrastive formulation rather than generic re-embedding. We will revise the abstract to briefly reference this objective and direct readers to the derivation in the methods. revision: yes
Referee: [Abstract] Abstract: the statement that 'qualitative and quantitative evaluations demonstrate' superior cluster separation lacks any reference to the specific metrics, datasets, or statistical tests used, making it impossible to assess whether the reported gains are robust or merely qualitative.

Authors: The full manuscript reports quantitative results using silhouette score, adjusted Rand index, and trustworthiness on datasets including MNIST, Fashion-MNIST, and single-cell RNA-seq data, with statistical tests via repeated runs and paired t-tests. We will revise the abstract to name these key metrics and datasets, providing context for the robustness of the claimed improvements. revision: yes

Circularity Check

0 steps flagged

No circularity in derivation chain

full rationale

The provided abstract and description contain no equations, loss functions, or explicit derivations. MAPLE is described as enhancing UMAP via self-supervised MMCRs that compress intra-cluster variance, but this is presented as a design principle without any reduction to fitted parameters, self-definitions, or load-bearing self-citations. The central claim of clearer cluster separations is supported by qualitative/quantitative evaluations rather than by construction from inputs. No load-bearing steps reduce to the paper's own fitted values or prior self-citations, making the approach self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

Only the abstract is available, so the ledger is necessarily incomplete; the central claim rests on the unproven effectiveness of MMCRs for manifold untangling.

invented entities (1)

maximum manifold capacity representations (MMCRs) no independent evidence
purpose: to untangle complex manifolds by compressing variances among locally similar points and amplifying variance among dissimilar points
Introduced as the central mechanism in the self-supervised learning approach; no independent evidence or falsifiable prediction is provided in the abstract.

pith-pipeline@v0.9.0 · 5425 in / 1106 out tokens · 36259 ms · 2026-05-16T11:02:55.420638+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

MMCR loss function... L=λ mean ||Z_locali||_* − ||C||_* ... compression force... diversification force
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

maximum manifold capacity representations (MMCRs), which help untangle complex manifolds by compressing variances among locally similar data points while amplifying variance among dissimilar data points

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

93 extracted references · 93 canonical work pages · 4 internal anchors

[1]

Datasets

10x Genomics. Datasets. https://www.10xgenomics.com/datasets, 2025. Accessed: 2025-09-07

work page 2025
[2]

Atzberger, T

D. Atzberger, T. Cech, W. Scheibel, J. D ¨ollner, M. Behrisch, and T. Schreck. A large-scale sensitivity analysis on latent embeddings and dimensionality reductions for text spatializations.IEEE Transactions on Visualization and Computer Graphics, 31(1):305–315, 2025. doi: 10 .1109/TVCG.2024.3456308

work page arXiv 2025
[3]

D. Bear, C. Fan, D. Mrowca, Y . Li, S. Alter, A. Nayebi, J. Schwartz, L. F. Fei-Fei, J. Wu, J. Tenenbaum, and D. L. Yamins. Learning physical graph representations from visual scenes. InAdvances in Neural Information Processing Systems, vol. 33, pp. 6027–6039, 2020

work page 2020
[4]

Becht, L

E. Becht, L. McInnes, J. Healy, C.-A. Dutertre, I. W. Kwok, L. G. Ng, F. Ginhoux, and E. W. Newell. Dimensionality reduction for visualizing single-cell data using UMAP.Nature Biotechnology, 37(1):38–44, 2019. doi: doi.org/10.1038/nbt.4314

work page doi:10.1038/nbt.4314 2019
[5]

Belkin and P

M. Belkin and P. Niyogi. Laplacian eigenmaps for dimensionality reduction and data representation.Neural Computation, 15(6):1373– 1396, 2003. doi: 10.1162/089976603321780317

work page doi:10.1162/089976603321780317 2003
[6]

Deep nearest neighbor anomaly detection,

L. Bergman, N. Cohen, and Y . Hoshen. Deep nearest neighbor anomaly detection.arXiv:2002.10445, 2020. doi: 10.48550/arXiv.2002.10445

work page doi:10.48550/arxiv.2002.10445 2002
[7]

nearest neighbor

K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft. When is “nearest neighbor”’ meaningful? InProc. ICDT, pp. 217–235. Springer, 1999. doi: doi.org/10.1007/3-540-49257-7 15

work page doi:10.1007/3-540-49257-7 1999
[8]

J. N. B ¨ohm, P. Berens, and D. Kobak. Unsupervised visualization of image datasets using contrastive learning.arXiv:2210.09879, 2022. doi: 10.48550/arXiv.2210.09879

work page doi:10.48550/arxiv.2210.09879 2022
[9]

A. P. Bradley. The use of the area under the ROC curve in the evaluation of machine learning algorithms.Pattern Recognition, 30(7):1145–1159,

work page
[10]

doi: 10.1016/S0031-3203(96)00142-2

work page doi:10.1016/s0031-3203(96)00142-2
[11]

J. N. B ¨ohm, P. Berens, and D. Kobak. Attraction-repulsion spectrum in neighbor embeddings.Journal of Machine Learning Research, 23(95):1– 32, 2022

work page 2022
[12]

Communications in Statistics 3, 1–27

T. Cali ´nski and J. Harabasz. A dendrite method for cluster analysis. Communications in Statistics-Theory and Methods, 3(1):1–27, 1974. doi: 10.1080/03610927408827101

work page doi:10.1080/03610927408827101 1974
[13]

Caron, P

M. Caron, P. Bojanowski, A. Joulin, and M. Douze. Deep clustering for unsupervised learning of visual features. InProc. ECCV, September 2018

work page 2018
[14]

Cellarium cell annotation service (CAS)

Cellarium. Cellarium cell annotation service (CAS). https://cellarium. ai/tool/cellarium-cell-annotation-service-cas/, 2025. Accessed: 2025-09- 07

work page 2025
[15]

Chari and L

T. Chari and L. Pachter. The specious art of single-cell genomics.PLOS Computational Biology, 19(8):e1011288, 2023

work page 2023
[16]

T. Chen, S. Kornblith, M. Norouzi, and G. Hinton. A simple framework for contrastive learning of visual representations. InProc. ICML, vol. 119, pp. 1597–1607. PMLR, 2020

work page 2020
[17]

Chung, D

S. Chung, D. D. Lee, and H. Sompolinsky. Classification and geometry of general perceptual manifolds.Physical Review X, 8:031003, 26 pages, Jul 2018. doi: 10.1103/PhysRevX.8.031003

work page doi:10.1103/physrevx.8.031003 2018
[18]

Coates, A

A. Coates, A. Ng, and H. Lee. An analysis of single-layer networks in unsupervised feature learning. InProc. AISTATS, vol. 15, pp. 215–223. PMLR, 2011. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. XX, NO. XX, MONTH 202X 15

work page 2011
[19]

URL https://www.nature.com/articles/s41467-024-53147-y

C. Conwell, J. S. Prince, K. N. Kay, G. A. Alvarez, and T. Konkle. A large-scale examination of inductive biases shaping high-level vi- sual representation in brains and machines.Nature Communications, 15(1):9383, 2024. doi: doi.org/10.1038/s41467-024-53147-y

work page doi:10.1038/s41467-024-53147-y 2024
[20]

J. P. Cunningham and Z. Ghahramani. Linear dimensionality reduction: Survey, insights, and generalizations.Journal of Machine Learning Research, 16(89):2859–2900, 2015

work page 2015
[21]

Damrich, N

S. Damrich, N. B ¨ohm, F. A. Hamprecht, and D. Kobak. From $t$-SNE to UMAP with contrastive learning. InProc. ICLR, 2023

work page 2023
[22]

Damrich and F

S. Damrich and F. A. Hamprecht. On UMAP’s true loss function. In Advances in Neural Information Processing Systems, vol. 34, pp. 5798–

work page
[23]

Curran Associates, Inc., 2021

work page 2021
[24]

L. Deng. The MNIST database of handwritten digit images for machine learning research [best of the web].IEEE Signal Processing Magazine, 29(6):141–142, 2012. doi: 10.1109/MSP.2012.2211477

work page doi:10.1109/msp.2012.2211477 2012
[25]

Espadoto, R

M. Espadoto, R. M. Martins, A. Kerren, N. S. T. Hirata, and A. C. Telea. Toward a quantitative survey of dimension reduction techniques.IEEE Transactions on Visualization and Computer Graphics, 27(3):2153– 2173, 2021. doi: 10.1109/TVCG.2019.2944182

work page doi:10.1109/tvcg.2019.2944182 2021
[26]

Ester, H.-P

M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. InProc. KDD, 6 pages, p. 226–231, 1996

work page 1996
[27]

Etemadpour, B

R. Etemadpour, B. Olk, and L. Linsen. Eye-tracking investigation during visual analysis of projected multidimensional data with 2D scatterplots. InProc. IVAPP, pp. 233–246, 2014

work page 2014
[28]

L. Feng, C. Wang, P. Liu, K. Ge, and J. Zhang. NCLDR: Nearest- neighbor contrastive learning with dual correlation loss for dimension- ality reduction.Neurocomputing, 594:127848, 2024. doi: 10.1016/j. neucom.2024.127848

work page doi:10.1016/j 2024
[29]

Francois, V

D. Francois, V . Wertz, and M. Verleysen. The concentration of fractional distances.IEEE Transactions on Knowledge and Data Engineering, 19(7):873–886, 2007. doi: 10.1109/TKDE.2007.1037

work page doi:10.1109/tkde.2007.1037 2007
[30]

W. Fung, L. Wexler, and M. G. Heiman. Cell-type-specific promoters forC. elegansglia.Journal of Neurogenetics, 34(3-4):335–346, 2020. doi: 10.1080/01677063.2020.1781851

work page doi:10.1080/01677063.2020.1781851 2020
[31]

G. D. Garson.Factor analysis and dimension reduction in R: A social Scientist’s toolkit. Routledge, 2022

work page 2022
[32]

Gisbrecht, A

A. Gisbrecht, A. Schulz, and B. Hammer. Parametric nonlinear dimen- sionality reduction using kernel t-SNE.Neurocomputing, 147:71–82,

work page
[33]

doi: 10.1016/j.neucom.2013.11.045

Advances in Self-Organizing Maps Subtitle of the Special Issue: Selected Papers from WSOM 2012. doi: 10.1016/j.neucom.2013.11.045

work page doi:10.1016/j.neucom.2013.11.045 2012
[34]

Grill, F

J.-B. Grill, F. Strub, F. Altch ´e, C. Tallec, P. Richemond, E. Buchatskaya, C. Doersch, B. Avila Pires, Z. Guo, M. Gheshlaghi Azar, et al. Bootstrap your own latent-a new approach to self-supervised learning.Advances in Neural Information Processing Systems, 33:21271–21284, 2020

work page 2020
[35]

Haghverdi, A

L. Haghverdi, A. T. Lun, M. D. Morgan, and J. C. Marioni. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.Nature Biotechnology, 36(5):421–427, 2018. doi: 10.1038/nbt.4091

work page doi:10.1038/nbt.4091 2018
[36]

Hamerly and C

G. Hamerly and C. Elkan. Learning the k in k-means. InAdvances in Neural Information Processing Systems, vol. 16. MIT Press, 2003

work page 2003
[37]

K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. InProc. CVPR, June 2016

work page 2016
[38]

Healy and L

J. Healy and L. McInnes. Uniform manifold approximation and projection.Nature Reviews Methods Primers, 4(1):82, 2024. doi: 10. 1038/s43586-024-00363-x

work page 2024
[39]

G. E. Hinton and S. Roweis. Stochastic neighbor embedding.Advances in Neural Information Processing systems, 15, 2002

work page 2002
[40]

, volume =

H. Hotelling. Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 24(6):417, 1933. doi: 10.1037/h0071325

work page doi:10.1037/h0071325 1933
[41]

T. Hu, Z. LIU, F. Zhou, W. Wang, and W. Huang. Your contrastive learning is secretly doing stochastic neighbor embedding. InProc. ICLR, 2023

work page 2023
[42]

IEEE Transactions on Pattern Analysis and Machine Intelli- gence15(9), 850–863 (1993).https://doi.org/10.1109/34.232073

D. Huttenlocher, G. Klanderman, and W. Rucklidge. Comparing images using the Hausdorff distance.IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(9):850–863, 1993. doi: 10.1109/34.232073

work page doi:10.1109/34.232073 1993
[43]

B. Isik, V . Lecomte, R. Schaeffer, Y . LeCun, M. Khona, R. Shwartz-Ziv, S. Koyejo, and A. Gromov. An information-theoretic understanding of maximum manifold capacity representations. InProrc. UniReps, 2023

work page 2023
[44]

M. T. Islam and J. W. Fleischer. The shape of attraction in UMAP: Exploring the embedding forces in dimensionality reduction. arXiv:2503.09101, 2025. doi: doi.org/10.48550/arXiv.2503.09101

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2503.09101 2025
[45]

H. Jeon, A. Cho, J. Jang, S. Lee, J. Hyun, H.-K. Ko, J. Jo, and J. Seo. ZADU: A python library for evaluating the reliability of dimensionality reduction embeddings. InProc. VIS, pp. 196–200, 2023. doi: 10.1109/ VIS54172.2023.00048

work page arXiv 2023
[46]

Jeon, Y .-H

H. Jeon, Y .-H. Kuo, M. Aupetit, K.-L. Ma, and J. Seo. Classes are not clusters: Improving label-based evaluation of dimensionality reduction. IEEE Transactions on Visualization and Computer Graphics, 30(1):781– 791, 2024. doi: 10.1109/TVCG.2023.3327187

work page doi:10.1109/tvcg.2023.3327187 2024
[47]

H. Jeon, H. Lee, Y .-H. Kuo, T. Yang, D. Archambault, S. Ko, T. Fu- jiwara, K.-L. Ma, and J. Seo. Unveiling high-dimensional backstage: A survey for reliable visual analytics with dimensionality reduction. In Proc. CHI, article no. 394, 24 pages. ACM, 2025. doi: 10.1145/3706598 .3713551

work page doi:10.1145/3706598 2025
[48]

M. Jung, T. Fujiwara, and J. Jo. GhostUMAP: Measuring pointwise instability in dimensionality reduction. InProc. VIS, pp. 161–165, 2024. doi: 10.1109/VIS55277.2024.00040

work page doi:10.1109/vis55277.2024.00040 2024
[49]

M. Jung, T. Fujiwara, and J. Jo. GhostUMAP2: Measuring and analyzing (r, d)-stability of UMAP.IEEE Transactions on Visualization and Computer Graphics, 2025. doi: 10.48550/arXiv.2507.17174

work page doi:10.48550/arxiv.2507.17174 2025
[50]

Kalantidis, C

Y . Kalantidis, C. E. R. K. Lassance, J. Almaz ´an, and D. Larlus. TLDR: Twin learning for dimensionality reduction.Transactions on Machine Learning Research, 2022

work page 2022
[51]

Kobak and G

D. Kobak and G. C. Linderman. Initialization is critical for preserving global data structure in both t-SNE and UMAP.Nature Biotechnology, 39(2):156–157, Feb. 2021. doi: 10.1038/s41587-020-00809-z

work page doi:10.1038/s41587-020-00809-z 2021
[52]

Kocabas, S

M. Kocabas, S. Karagoz, and E. Akbas. Self-supervised learning of 3D human pose using multi-view geometry. InProc. CVPR, June 2019

work page 2019
[53]

Krizhevsky, I

A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. InAdvances in Neural Information Processing Systems, vol. 25. Curran Associates, Inc., 2012

work page 2012
[54]

X. Liu, F. Zhang, Z. Hou, L. Mian, Z. Wang, J. Zhang, and J. Tang. Self-supervised learning: Generative or contrastive.IEEE Transactions on Knowledge and Data Engineering, 35(1):857–876, 2023. doi: 10. 1109/TKDE.2021.3090866

work page arXiv 2023
[55]

L. v. d. Maaten and G. Hinton. Visualizing data using t-SNE.Journal of Machine Learning Research, 9(Nov):2579–2605, 2008

work page 2008
[56]

Machado, M

A. Machado, M. Behrisch, and A. Telea. Necessary but not sufficient: Limitations of projection quality metrics.Computer Graphics Forum, 44(3):e70101, 2025. doi: 10.1111/cgf.70101

work page doi:10.1111/cgf.70101 2025
[57]

Ma ´ckiewicz and W

A. Ma ´ckiewicz and W. Ratajczak. Principal components analysis (PCA). Computers & Geosciences, 19(3):303–342, 1993

work page 1993
[58]

UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction

L. McInnes, J. Healy, and J. Melville. UMAP: Uniform manifold ap- proximation and projection for dimension reduction.arXiv:1802.03426,

work page internal anchor Pith review Pith/arXiv arXiv
[59]

doi: 10.48550/arXiv.1802.03426

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1802.03426
[60]

Meil ˘a and J

M. Meil ˘a and J. Shi. A random walks view of spectral segmentation. In T. S. Richardson and T. S. Jaakkola, eds.,Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics, vol. R3 ofProceedings of Machine Learning Research, pp. 203–208. PMLR, 04–07 Jan 2001. Reissued by PMLR on 31 March 2021

work page 2001
[61]

S. Mika, B. Sch ¨olkopf, A. Smola, K.-R. M ¨uller, M. Scholz, and G. R ¨atsch. Kernel PCA and de-noising in feature spaces.Advances in neural information processing systems, 11, 1998

work page 1998
[62]

K. R. Moon, D. van Dijk, Z. Wang, S. Gigante, D. B. Burkhardt, W. S. Chen, K. Yim, A. v. d. Elzen, M. J. Hirn, R. R. Coifman, N. B. Ivanova, G. Wolf, and S. Krishnaswamy. Visualizing structure and transitions in high-dimensional biological data.Nature Biotechnology, 37(12):1482– 1492, Dec. 2019. doi: 10.1038/s41587-019-0336-3

work page doi:10.1038/s41587-019-0336-3 2019
[63]

Narayan, B

A. Narayan, B. Berger, and H. Cho. Assessing single-cell transcriptomic variability through density-preserving data visualization.Nature Biotech- nology, 39(6):765–774, June 2021. doi: 10.1038/s41587-020-00801-7

work page doi:10.1038/s41587-020-00801-7 2021
[64]

J. S. Packer, Q. Zhu, C. Huynh, P. Sivaramakrishnan, E. Preston, H. Dueck, D. Stefanik, K. Tan, C. Trapnell, J. Kim, et al. A lineage- resolved molecular atlas ofC. elegansembryogenesis at single-cell resolution.Science, 365(6459):eaax1971, 2019. doi: 10.1126/science. aax1971

work page doi:10.1126/science 2019
[65]

Paszke, S

A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala. PyTorch: An imperative style, high- performance deep learning library. InAdvances in Neural Information Processing Sys...

work page 2019
[66]

F. V . Paulovich, L. G. Nonato, R. Minghim, and H. Levkowitz. Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping.IEEE Transactions on Visualization and Computer Graphics, 14(3):564–575, 2008. doi: 10 .1109/TVCG.2007.70443

work page arXiv 2008
[67]

Pedregosa, G

F. Pedregosa, G. Varoquaux, A. Gramfort, et al. Scikit-learn: Machine learning in Python.Journal of Machine Learning Research, 12:2825– 2830, 2011

work page 2011
[68]

X. Peng, C. Lu, Z. Yi, and H. Tang. Connections between nuclear- norm and Frobenius-norm-based representations.IEEE Transactions on IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. XX, NO. XX, MONTH 202X 16 Neural Networks and Learning Systems, 29(1):218–224, 2016. doi: 10. 1109/TNNLS.2016.2608834

work page arXiv 2016
[69]

Rodr ´ıguez G´alvez, A

B. Rodr ´ıguez G´alvez, A. Blaas, P. Rodriguez, A. Golinski, X. Suau, J. Ramapuram, D. Busbridge, and L. Zappella. The role of entropy and reconstruction in multi-view self-supervised learning. InProc. ICML, vol. 202, pp. 29143–29160. PMLR, 2023

work page 2023
[70]

2011, Neural Comput., 23, 1661, 10.1162/NECO\_a\_00142

T. Sainburg, L. McInnes, and T. Q. Gentner. Parametric UMAP embeddings for representation and semisupervised learning.Neural Computation, 33(11):2881–2907, 10 2021. doi: 10.1162/neco a 01434

work page doi:10.1162/neco 2021
[71]

Schaeffer, V

R. Schaeffer, V . Lecomte, D. B. Pai, A. Carranza, B. Isik, A. Unell, M. Khona, T. Yerxa, Y . LeCun, S. Chung, et al. Towards an improved understanding and utilization of maximum manifold capacity represen- tations.arXiv:2406.09366, 2024. doi: 10.48550/arXiv.2406.09366

work page doi:10.48550/arxiv.2406.09366 2024
[72]

Sivaramakrishnan, C

P. Sivaramakrishnan, C. Watkins, and J. I. Murray. Transcript accu- mulation rates in the earlyCaenorhabditis elegansembryo.Science Advances, 9(34):eadi1270, 2023. doi: 10.1126/sciadv.adi1270

work page doi:10.1126/sciadv.adi1270 2023
[73]

S. Sun, J. Zhu, Y . Ma, and X. Zhou. Accuracy, robustness and scalability of dimensionality reduction methods for single-cell rna-seq analysis. Genome Biology, 20(1):269, 2019. doi: doi.org/10.1186/s13059-019 -1898-6

work page doi:10.1186/s13059-019 2019
[74]

A. A. Taha and A. Hanbury. Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool.BMC Medical Imaging, 15(1):29, 2015. doi: doi.org/10.1186/s12880-015-0068-x

work page doi:10.1186/s12880-015-0068-x 2015
[75]

J. Tang, J. Liu, M. Zhang, and Q. Mei. Visualizing large-scale and high- dimensional data. InProc. WWW, 11 pages, p. 287–297, 2016. doi: 10 .1145/2872427.2883041

work page arXiv 2016
[76]

J. B. Tenenbaum, V . d. Silva, and J. C. Langford. A global ge- ometric framework for nonlinear dimensionality reduction.Science, 290(5500):2319–2323, 2000. doi: 10.1126/science.290.5500.2319

work page doi:10.1126/science.290.5500.2319 2000
[77]

Trapnell

C. Trapnell. Constructing single-cell trajectories. https:// cole-trapnell-lab.github.io/monocle3/docs/trajectories/, 2022. Accessed: 2025-09-07

work page 2022
[78]

Trapnell, D

C. Trapnell, D. Cacchiarelli, J. Grimsby, P. Pokharel, S. Li, M. Morse, N. J. Lennon, K. J. Livak, T. S. Mikkelsen, and J. L. Rinn. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells.Nature Biotechnology, 32(4):381–386, 2014. doi: doi.org/10.1038/nbt.2859

work page doi:10.1038/nbt.2859 2014
[79]

Y .-H. H. Tsai, Y . Wu, R. Salakhutdinov, and L.-P. Morency. Self- supervised learning from a multi-view perspective. InProc. ICLR, 2021

work page 2021
[80]

van der Maaten, E

L. van der Maaten, E. Postma, and J. van den Herik. Dimensionality re- duction: A comparative review.Journal of Machine Learning Research, 10:66–71, 2009

work page 2009

Showing first 80 references.

[1] [1]

Datasets

10x Genomics. Datasets. https://www.10xgenomics.com/datasets, 2025. Accessed: 2025-09-07

work page 2025

[2] [2]

Atzberger, T

D. Atzberger, T. Cech, W. Scheibel, J. D ¨ollner, M. Behrisch, and T. Schreck. A large-scale sensitivity analysis on latent embeddings and dimensionality reductions for text spatializations.IEEE Transactions on Visualization and Computer Graphics, 31(1):305–315, 2025. doi: 10 .1109/TVCG.2024.3456308

work page arXiv 2025

[3] [3]

D. Bear, C. Fan, D. Mrowca, Y . Li, S. Alter, A. Nayebi, J. Schwartz, L. F. Fei-Fei, J. Wu, J. Tenenbaum, and D. L. Yamins. Learning physical graph representations from visual scenes. InAdvances in Neural Information Processing Systems, vol. 33, pp. 6027–6039, 2020

work page 2020

[4] [4]

Becht, L

E. Becht, L. McInnes, J. Healy, C.-A. Dutertre, I. W. Kwok, L. G. Ng, F. Ginhoux, and E. W. Newell. Dimensionality reduction for visualizing single-cell data using UMAP.Nature Biotechnology, 37(1):38–44, 2019. doi: doi.org/10.1038/nbt.4314

work page doi:10.1038/nbt.4314 2019

[5] [5]

Belkin and P

M. Belkin and P. Niyogi. Laplacian eigenmaps for dimensionality reduction and data representation.Neural Computation, 15(6):1373– 1396, 2003. doi: 10.1162/089976603321780317

work page doi:10.1162/089976603321780317 2003

[6] [6]

Deep nearest neighbor anomaly detection,

L. Bergman, N. Cohen, and Y . Hoshen. Deep nearest neighbor anomaly detection.arXiv:2002.10445, 2020. doi: 10.48550/arXiv.2002.10445

work page doi:10.48550/arxiv.2002.10445 2002

[7] [7]

nearest neighbor

K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft. When is “nearest neighbor”’ meaningful? InProc. ICDT, pp. 217–235. Springer, 1999. doi: doi.org/10.1007/3-540-49257-7 15

work page doi:10.1007/3-540-49257-7 1999

[8] [8]

J. N. B ¨ohm, P. Berens, and D. Kobak. Unsupervised visualization of image datasets using contrastive learning.arXiv:2210.09879, 2022. doi: 10.48550/arXiv.2210.09879

work page doi:10.48550/arxiv.2210.09879 2022

[9] [9]

A. P. Bradley. The use of the area under the ROC curve in the evaluation of machine learning algorithms.Pattern Recognition, 30(7):1145–1159,

work page

[10] [10]

doi: 10.1016/S0031-3203(96)00142-2

work page doi:10.1016/s0031-3203(96)00142-2

[11] [11]

J. N. B ¨ohm, P. Berens, and D. Kobak. Attraction-repulsion spectrum in neighbor embeddings.Journal of Machine Learning Research, 23(95):1– 32, 2022

work page 2022

[12] [12]

Communications in Statistics 3, 1–27

T. Cali ´nski and J. Harabasz. A dendrite method for cluster analysis. Communications in Statistics-Theory and Methods, 3(1):1–27, 1974. doi: 10.1080/03610927408827101

work page doi:10.1080/03610927408827101 1974

[13] [13]

Caron, P

M. Caron, P. Bojanowski, A. Joulin, and M. Douze. Deep clustering for unsupervised learning of visual features. InProc. ECCV, September 2018

work page 2018

[14] [14]

Cellarium cell annotation service (CAS)

Cellarium. Cellarium cell annotation service (CAS). https://cellarium. ai/tool/cellarium-cell-annotation-service-cas/, 2025. Accessed: 2025-09- 07

work page 2025

[15] [15]

Chari and L

T. Chari and L. Pachter. The specious art of single-cell genomics.PLOS Computational Biology, 19(8):e1011288, 2023

work page 2023

[16] [16]

T. Chen, S. Kornblith, M. Norouzi, and G. Hinton. A simple framework for contrastive learning of visual representations. InProc. ICML, vol. 119, pp. 1597–1607. PMLR, 2020

work page 2020

[17] [17]

Chung, D

S. Chung, D. D. Lee, and H. Sompolinsky. Classification and geometry of general perceptual manifolds.Physical Review X, 8:031003, 26 pages, Jul 2018. doi: 10.1103/PhysRevX.8.031003

work page doi:10.1103/physrevx.8.031003 2018

[18] [18]

Coates, A

A. Coates, A. Ng, and H. Lee. An analysis of single-layer networks in unsupervised feature learning. InProc. AISTATS, vol. 15, pp. 215–223. PMLR, 2011. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. XX, NO. XX, MONTH 202X 15

work page 2011

[19] [19]

URL https://www.nature.com/articles/s41467-024-53147-y

C. Conwell, J. S. Prince, K. N. Kay, G. A. Alvarez, and T. Konkle. A large-scale examination of inductive biases shaping high-level vi- sual representation in brains and machines.Nature Communications, 15(1):9383, 2024. doi: doi.org/10.1038/s41467-024-53147-y

work page doi:10.1038/s41467-024-53147-y 2024

[20] [20]

J. P. Cunningham and Z. Ghahramani. Linear dimensionality reduction: Survey, insights, and generalizations.Journal of Machine Learning Research, 16(89):2859–2900, 2015

work page 2015

[21] [21]

Damrich, N

S. Damrich, N. B ¨ohm, F. A. Hamprecht, and D. Kobak. From $t$-SNE to UMAP with contrastive learning. InProc. ICLR, 2023

work page 2023

[22] [22]

Damrich and F

S. Damrich and F. A. Hamprecht. On UMAP’s true loss function. In Advances in Neural Information Processing Systems, vol. 34, pp. 5798–

work page

[23] [23]

Curran Associates, Inc., 2021

work page 2021

[24] [24]

L. Deng. The MNIST database of handwritten digit images for machine learning research [best of the web].IEEE Signal Processing Magazine, 29(6):141–142, 2012. doi: 10.1109/MSP.2012.2211477

work page doi:10.1109/msp.2012.2211477 2012

[25] [25]

Espadoto, R

M. Espadoto, R. M. Martins, A. Kerren, N. S. T. Hirata, and A. C. Telea. Toward a quantitative survey of dimension reduction techniques.IEEE Transactions on Visualization and Computer Graphics, 27(3):2153– 2173, 2021. doi: 10.1109/TVCG.2019.2944182

work page doi:10.1109/tvcg.2019.2944182 2021

[26] [26]

Ester, H.-P

M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. InProc. KDD, 6 pages, p. 226–231, 1996

work page 1996

[27] [27]

Etemadpour, B

R. Etemadpour, B. Olk, and L. Linsen. Eye-tracking investigation during visual analysis of projected multidimensional data with 2D scatterplots. InProc. IVAPP, pp. 233–246, 2014

work page 2014

[28] [28]

L. Feng, C. Wang, P. Liu, K. Ge, and J. Zhang. NCLDR: Nearest- neighbor contrastive learning with dual correlation loss for dimension- ality reduction.Neurocomputing, 594:127848, 2024. doi: 10.1016/j. neucom.2024.127848

work page doi:10.1016/j 2024

[29] [29]

Francois, V

D. Francois, V . Wertz, and M. Verleysen. The concentration of fractional distances.IEEE Transactions on Knowledge and Data Engineering, 19(7):873–886, 2007. doi: 10.1109/TKDE.2007.1037

work page doi:10.1109/tkde.2007.1037 2007

[30] [30]

W. Fung, L. Wexler, and M. G. Heiman. Cell-type-specific promoters forC. elegansglia.Journal of Neurogenetics, 34(3-4):335–346, 2020. doi: 10.1080/01677063.2020.1781851

work page doi:10.1080/01677063.2020.1781851 2020

[31] [31]

G. D. Garson.Factor analysis and dimension reduction in R: A social Scientist’s toolkit. Routledge, 2022

work page 2022

[32] [32]

Gisbrecht, A

A. Gisbrecht, A. Schulz, and B. Hammer. Parametric nonlinear dimen- sionality reduction using kernel t-SNE.Neurocomputing, 147:71–82,

work page

[33] [33]

doi: 10.1016/j.neucom.2013.11.045

Advances in Self-Organizing Maps Subtitle of the Special Issue: Selected Papers from WSOM 2012. doi: 10.1016/j.neucom.2013.11.045

work page doi:10.1016/j.neucom.2013.11.045 2012

[34] [34]

Grill, F

J.-B. Grill, F. Strub, F. Altch ´e, C. Tallec, P. Richemond, E. Buchatskaya, C. Doersch, B. Avila Pires, Z. Guo, M. Gheshlaghi Azar, et al. Bootstrap your own latent-a new approach to self-supervised learning.Advances in Neural Information Processing Systems, 33:21271–21284, 2020

work page 2020

[35] [35]

Haghverdi, A

L. Haghverdi, A. T. Lun, M. D. Morgan, and J. C. Marioni. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.Nature Biotechnology, 36(5):421–427, 2018. doi: 10.1038/nbt.4091

work page doi:10.1038/nbt.4091 2018

[36] [36]

Hamerly and C

G. Hamerly and C. Elkan. Learning the k in k-means. InAdvances in Neural Information Processing Systems, vol. 16. MIT Press, 2003

work page 2003

[37] [37]

K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. InProc. CVPR, June 2016

work page 2016

[38] [38]

Healy and L

J. Healy and L. McInnes. Uniform manifold approximation and projection.Nature Reviews Methods Primers, 4(1):82, 2024. doi: 10. 1038/s43586-024-00363-x

work page 2024

[39] [39]

G. E. Hinton and S. Roweis. Stochastic neighbor embedding.Advances in Neural Information Processing systems, 15, 2002

work page 2002

[40] [40]

, volume =

H. Hotelling. Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 24(6):417, 1933. doi: 10.1037/h0071325

work page doi:10.1037/h0071325 1933

[41] [41]

T. Hu, Z. LIU, F. Zhou, W. Wang, and W. Huang. Your contrastive learning is secretly doing stochastic neighbor embedding. InProc. ICLR, 2023

work page 2023

[42] [42]

IEEE Transactions on Pattern Analysis and Machine Intelli- gence15(9), 850–863 (1993).https://doi.org/10.1109/34.232073

D. Huttenlocher, G. Klanderman, and W. Rucklidge. Comparing images using the Hausdorff distance.IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(9):850–863, 1993. doi: 10.1109/34.232073

work page doi:10.1109/34.232073 1993

[43] [43]

B. Isik, V . Lecomte, R. Schaeffer, Y . LeCun, M. Khona, R. Shwartz-Ziv, S. Koyejo, and A. Gromov. An information-theoretic understanding of maximum manifold capacity representations. InProrc. UniReps, 2023

work page 2023

[44] [44]

M. T. Islam and J. W. Fleischer. The shape of attraction in UMAP: Exploring the embedding forces in dimensionality reduction. arXiv:2503.09101, 2025. doi: doi.org/10.48550/arXiv.2503.09101

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2503.09101 2025

[45] [45]

H. Jeon, A. Cho, J. Jang, S. Lee, J. Hyun, H.-K. Ko, J. Jo, and J. Seo. ZADU: A python library for evaluating the reliability of dimensionality reduction embeddings. InProc. VIS, pp. 196–200, 2023. doi: 10.1109/ VIS54172.2023.00048

work page arXiv 2023

[46] [46]

Jeon, Y .-H

H. Jeon, Y .-H. Kuo, M. Aupetit, K.-L. Ma, and J. Seo. Classes are not clusters: Improving label-based evaluation of dimensionality reduction. IEEE Transactions on Visualization and Computer Graphics, 30(1):781– 791, 2024. doi: 10.1109/TVCG.2023.3327187

work page doi:10.1109/tvcg.2023.3327187 2024

[47] [47]

H. Jeon, H. Lee, Y .-H. Kuo, T. Yang, D. Archambault, S. Ko, T. Fu- jiwara, K.-L. Ma, and J. Seo. Unveiling high-dimensional backstage: A survey for reliable visual analytics with dimensionality reduction. In Proc. CHI, article no. 394, 24 pages. ACM, 2025. doi: 10.1145/3706598 .3713551

work page doi:10.1145/3706598 2025

[48] [48]

M. Jung, T. Fujiwara, and J. Jo. GhostUMAP: Measuring pointwise instability in dimensionality reduction. InProc. VIS, pp. 161–165, 2024. doi: 10.1109/VIS55277.2024.00040

work page doi:10.1109/vis55277.2024.00040 2024

[49] [49]

M. Jung, T. Fujiwara, and J. Jo. GhostUMAP2: Measuring and analyzing (r, d)-stability of UMAP.IEEE Transactions on Visualization and Computer Graphics, 2025. doi: 10.48550/arXiv.2507.17174

work page doi:10.48550/arxiv.2507.17174 2025

[50] [50]

Kalantidis, C

Y . Kalantidis, C. E. R. K. Lassance, J. Almaz ´an, and D. Larlus. TLDR: Twin learning for dimensionality reduction.Transactions on Machine Learning Research, 2022

work page 2022

[51] [51]

Kobak and G

D. Kobak and G. C. Linderman. Initialization is critical for preserving global data structure in both t-SNE and UMAP.Nature Biotechnology, 39(2):156–157, Feb. 2021. doi: 10.1038/s41587-020-00809-z

work page doi:10.1038/s41587-020-00809-z 2021

[52] [52]

Kocabas, S

M. Kocabas, S. Karagoz, and E. Akbas. Self-supervised learning of 3D human pose using multi-view geometry. InProc. CVPR, June 2019

work page 2019

[53] [53]

Krizhevsky, I

A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. InAdvances in Neural Information Processing Systems, vol. 25. Curran Associates, Inc., 2012

work page 2012

[54] [54]

X. Liu, F. Zhang, Z. Hou, L. Mian, Z. Wang, J. Zhang, and J. Tang. Self-supervised learning: Generative or contrastive.IEEE Transactions on Knowledge and Data Engineering, 35(1):857–876, 2023. doi: 10. 1109/TKDE.2021.3090866

work page arXiv 2023

[55] [55]

L. v. d. Maaten and G. Hinton. Visualizing data using t-SNE.Journal of Machine Learning Research, 9(Nov):2579–2605, 2008

work page 2008

[56] [56]

Machado, M

A. Machado, M. Behrisch, and A. Telea. Necessary but not sufficient: Limitations of projection quality metrics.Computer Graphics Forum, 44(3):e70101, 2025. doi: 10.1111/cgf.70101

work page doi:10.1111/cgf.70101 2025

[57] [57]

Ma ´ckiewicz and W

A. Ma ´ckiewicz and W. Ratajczak. Principal components analysis (PCA). Computers & Geosciences, 19(3):303–342, 1993

work page 1993

[58] [58]

UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction

L. McInnes, J. Healy, and J. Melville. UMAP: Uniform manifold ap- proximation and projection for dimension reduction.arXiv:1802.03426,

work page internal anchor Pith review Pith/arXiv arXiv

[59] [59]

doi: 10.48550/arXiv.1802.03426

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1802.03426

[60] [60]

Meil ˘a and J

M. Meil ˘a and J. Shi. A random walks view of spectral segmentation. In T. S. Richardson and T. S. Jaakkola, eds.,Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics, vol. R3 ofProceedings of Machine Learning Research, pp. 203–208. PMLR, 04–07 Jan 2001. Reissued by PMLR on 31 March 2021

work page 2001

[61] [61]

S. Mika, B. Sch ¨olkopf, A. Smola, K.-R. M ¨uller, M. Scholz, and G. R ¨atsch. Kernel PCA and de-noising in feature spaces.Advances in neural information processing systems, 11, 1998

work page 1998

[62] [62]

K. R. Moon, D. van Dijk, Z. Wang, S. Gigante, D. B. Burkhardt, W. S. Chen, K. Yim, A. v. d. Elzen, M. J. Hirn, R. R. Coifman, N. B. Ivanova, G. Wolf, and S. Krishnaswamy. Visualizing structure and transitions in high-dimensional biological data.Nature Biotechnology, 37(12):1482– 1492, Dec. 2019. doi: 10.1038/s41587-019-0336-3

work page doi:10.1038/s41587-019-0336-3 2019

[63] [63]

Narayan, B

A. Narayan, B. Berger, and H. Cho. Assessing single-cell transcriptomic variability through density-preserving data visualization.Nature Biotech- nology, 39(6):765–774, June 2021. doi: 10.1038/s41587-020-00801-7

work page doi:10.1038/s41587-020-00801-7 2021

[64] [64]

J. S. Packer, Q. Zhu, C. Huynh, P. Sivaramakrishnan, E. Preston, H. Dueck, D. Stefanik, K. Tan, C. Trapnell, J. Kim, et al. A lineage- resolved molecular atlas ofC. elegansembryogenesis at single-cell resolution.Science, 365(6459):eaax1971, 2019. doi: 10.1126/science. aax1971

work page doi:10.1126/science 2019

[65] [65]

Paszke, S

A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala. PyTorch: An imperative style, high- performance deep learning library. InAdvances in Neural Information Processing Sys...

work page 2019

[66] [66]

F. V . Paulovich, L. G. Nonato, R. Minghim, and H. Levkowitz. Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping.IEEE Transactions on Visualization and Computer Graphics, 14(3):564–575, 2008. doi: 10 .1109/TVCG.2007.70443

work page arXiv 2008

[67] [67]

Pedregosa, G

F. Pedregosa, G. Varoquaux, A. Gramfort, et al. Scikit-learn: Machine learning in Python.Journal of Machine Learning Research, 12:2825– 2830, 2011

work page 2011

[68] [68]

X. Peng, C. Lu, Z. Yi, and H. Tang. Connections between nuclear- norm and Frobenius-norm-based representations.IEEE Transactions on IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. XX, NO. XX, MONTH 202X 16 Neural Networks and Learning Systems, 29(1):218–224, 2016. doi: 10. 1109/TNNLS.2016.2608834

work page arXiv 2016

[69] [69]

Rodr ´ıguez G´alvez, A

B. Rodr ´ıguez G´alvez, A. Blaas, P. Rodriguez, A. Golinski, X. Suau, J. Ramapuram, D. Busbridge, and L. Zappella. The role of entropy and reconstruction in multi-view self-supervised learning. InProc. ICML, vol. 202, pp. 29143–29160. PMLR, 2023

work page 2023

[70] [70]

2011, Neural Comput., 23, 1661, 10.1162/NECO\_a\_00142

T. Sainburg, L. McInnes, and T. Q. Gentner. Parametric UMAP embeddings for representation and semisupervised learning.Neural Computation, 33(11):2881–2907, 10 2021. doi: 10.1162/neco a 01434

work page doi:10.1162/neco 2021

[71] [71]

Schaeffer, V

R. Schaeffer, V . Lecomte, D. B. Pai, A. Carranza, B. Isik, A. Unell, M. Khona, T. Yerxa, Y . LeCun, S. Chung, et al. Towards an improved understanding and utilization of maximum manifold capacity represen- tations.arXiv:2406.09366, 2024. doi: 10.48550/arXiv.2406.09366

work page doi:10.48550/arxiv.2406.09366 2024

[72] [72]

Sivaramakrishnan, C

P. Sivaramakrishnan, C. Watkins, and J. I. Murray. Transcript accu- mulation rates in the earlyCaenorhabditis elegansembryo.Science Advances, 9(34):eadi1270, 2023. doi: 10.1126/sciadv.adi1270

work page doi:10.1126/sciadv.adi1270 2023

[73] [73]

S. Sun, J. Zhu, Y . Ma, and X. Zhou. Accuracy, robustness and scalability of dimensionality reduction methods for single-cell rna-seq analysis. Genome Biology, 20(1):269, 2019. doi: doi.org/10.1186/s13059-019 -1898-6

work page doi:10.1186/s13059-019 2019

[74] [74]

A. A. Taha and A. Hanbury. Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool.BMC Medical Imaging, 15(1):29, 2015. doi: doi.org/10.1186/s12880-015-0068-x

work page doi:10.1186/s12880-015-0068-x 2015

[75] [75]

J. Tang, J. Liu, M. Zhang, and Q. Mei. Visualizing large-scale and high- dimensional data. InProc. WWW, 11 pages, p. 287–297, 2016. doi: 10 .1145/2872427.2883041

work page arXiv 2016

[76] [76]

J. B. Tenenbaum, V . d. Silva, and J. C. Langford. A global ge- ometric framework for nonlinear dimensionality reduction.Science, 290(5500):2319–2323, 2000. doi: 10.1126/science.290.5500.2319

work page doi:10.1126/science.290.5500.2319 2000

[77] [77]

Trapnell

C. Trapnell. Constructing single-cell trajectories. https:// cole-trapnell-lab.github.io/monocle3/docs/trajectories/, 2022. Accessed: 2025-09-07

work page 2022

[78] [78]

Trapnell, D

C. Trapnell, D. Cacchiarelli, J. Grimsby, P. Pokharel, S. Li, M. Morse, N. J. Lennon, K. J. Livak, T. S. Mikkelsen, and J. L. Rinn. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells.Nature Biotechnology, 32(4):381–386, 2014. doi: doi.org/10.1038/nbt.2859

work page doi:10.1038/nbt.2859 2014

[79] [79]

Y .-H. H. Tsai, Y . Wu, R. Salakhutdinov, and L.-P. Morency. Self- supervised learning from a multi-view perspective. InProc. ICLR, 2021

work page 2021

[80] [80]

van der Maaten, E

L. van der Maaten, E. Postma, and J. van den Herik. Dimensionality re- duction: A comparative review.Journal of Machine Learning Research, 10:66–71, 2009

work page 2009