pith. sign in

arxiv: 2311.03087 · v3 · pith:QFU72ECFnew · submitted 2023-11-06 · 💻 cs.LG · math.AT

Persistent Homology for High-dimensional Data Based on Spectral Methods

classification 💻 cs.LG math.AT
keywords homologypersistentdatadistanceshigh-dimensionalspectraltopologyallow
0
0 comments X
read the original abstract

Persistent homology is a popular computational tool for analyzing the topology of point clouds, such as the presence of loops or voids. However, many real-world datasets with low intrinsic dimensionality reside in an ambient space of much higher dimensionality. We show that in this case traditional persistent homology becomes very sensitive to noise and fails to detect the correct topology. The same holds true for existing refinements of persistent homology. As a remedy, we find that spectral distances on the k-nearest-neighbor graph of the data, such as diffusion distance and effective resistance, allow to detect the correct topology even in the presence of high-dimensional noise. Moreover, we derive a novel closed-form formula for effective resistance, and describe its relation to diffusion distances. Finally, we apply these methods to high-dimensional single-cell RNA-sequencing data and show that spectral distances allow robust detection of cell cycle loops.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Improved convergence rate of kNN graph Laplacians: differentiable self-tuned affinity

    stat.ML 2024-10 unverdicted novelty 7.0

    kNN graph Laplacians with self-tuned affinity achieve operator pointwise convergence to the manifold operator at rate O(N^{-2/(d+6)}) when epsilon and k scale optimally.

  2. Beyond Explained Variance: A Cautionary Tale of PCA

    cond-mat.stat-mech 2026-05 unverdicted novelty 4.0

    PCA suggested clustering in fossil teeth data on a nonlinear manifold, but t-SNE and persistent homology show a ring structure with no clustering, supported by a unit-circle generative model whose arcsine distance dis...

  3. Beyond Explained Variance: A Cautionary Tale of PCA

    cond-mat.stat-mech 2026-05 unverdicted novelty 4.0

    PCA scatterplots misleadingly indicate clusters in Kuehneotherium teeth data, whereas t-SNE and persistent homology detect a ring-like one-dimensional manifold, backed by a generative model of uniform sampling from a ...