A Closed-Form Persistence-Landmark Pipeline for Certified Point-Cloud and Graph Classification

Atish Mitra; Pramita Bagchi; Sushovan Majhi; \v{Z}iga Virk

arxiv: 2605.02836 · v2 · pith:B23YWRAGnew · submitted 2026-05-04 · 💻 cs.LG · math.AT

A Closed-Form Persistence-Landmark Pipeline for Certified Point-Cloud and Graph Classification

Sushovan Majhi , Atish Mitra , \v{Z}iga Virk , Pramita Bagchi This is my paper

Pith reviewed 2026-05-09 15:37 UTC · model grok-4.3

classification 💻 cs.LG math.AT

keywords persistent homologypoint cloud classificationgraph classificationclosed-form pipelinetopological data analysismargin boundcertified classification

0 comments

The pith

PLACE builds classifiers for point clouds and graphs from persistent-homology signatures using only training labels and closed-form rules.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a pipeline that embeds persistent-homology features by summing fixed coordinate functions over a sparse set of landmarks and then assigns closed-form weights that maximize a structural distortion constant. From these embeddings it derives an excess-risk bound that scales with class separation and embedding radius, a descriptor-selection rule based on the Mahalanobis margin, and a per-prediction certificate that can be computed at training time. All three guarantees are obtained without learned parameters or a held-out calibration set. The approach therefore offers a fully analytic route to certified topological classification whenever the non-interference condition on the landmark sum holds.

Core claim

PLACE is a closed-form pipeline that classifies point clouds and graphs by summing Mitra-Virk single-point coordinate functions over a landmark grid, choosing weights that maximize the structural distortion constant λ(ν), and thereby obtaining an O(kR/(Δ√m_min)) margin-based excess-risk bound, a closed-form Mahalanobis-margin descriptor selector, and a training-time-decided certificate in both non-asymptotic and Gaussian-plug-in forms.

What carries the argument

The embedding formed by summing Mitra-Virk coordinate functions over a sparse landmark grid, with weights chosen to maximize the Lipschitz lower bound λ(ν) under a non-interference condition.

If this is right

The excess-risk rate improves with larger class-mean separation Δ and smaller embedding radius R.
Mahalanobis margin under Ledoit-Wolf shrinkage selects descriptors more consistently than isotropic surrogates on heterogeneous descriptor pools.
The per-prediction certificate can be decided once at training time and applied to new points with no additional computation.
The same landmark embedding yields both the risk bound and the certificate, linking geometric separation directly to certified accuracy.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could be extended to other topological descriptors whose coordinate functions obey a comparable non-interference property.
If the distortion constant λ(ν) can be bounded analytically for new landmark choices, the same guarantees would transfer without retraining.
The gap between the derived certificate and observed accuracy on small data sets suggests that tighter multivariate-norm bounds could make the certificate operational sooner.

Load-bearing premise

The summed coordinate functions must satisfy a non-interference condition so that the distortion constant λ(ν) can be maximized in closed form from the training labels alone.

What would settle it

A concrete data set in which the empirical excess risk exceeds the derived O(kR/(Δ√m_min)) bound by more than a small constant factor, or in which the non-interference condition is visibly violated on the chosen landmark grid.

Figures

Figures reproduced from arXiv: 2605.02836 by Atish Mitra, Pramita Bagchi, Sushovan Majhi, \v{Z}iga Virk.

**Figure 1.** Figure 1: From point cloud to persistence diagram. (a) Noisy sample from a circle. (b) Vietoris–Rips filtration at radii 𝑟1 < 𝑟2 < 𝑟3: the 1-cycle is born at 𝑟2 and dies at 𝑟3. (c) Barcode; bar length equals feature lifetime. (d) Persistence diagram; each feature becomes a point (𝑏, 𝑑), with distance 𝜏 = 𝑑 − 𝑏 to the diagonal measuring topological significance. We overlaid the 0-dim (blue) and 1-dim (red) diagrams t… view at source ↗

**Figure 2.** Figure 2: The PLACE pipeline. A point cloud or graph (left) is converted to a persistence diagram through a filtration—a growing sequence of simplicial complexes—then embedded to R ℓ by summing hat-function coordinates over a landmark grid: each diagram point (red) contributes to the coordinates indexed by the landmarks (orange) whose 𝑑B-cover squares it falls within, via Φ𝑝 (𝐴) = Í 𝑎∈𝐴 𝜑𝑅,𝑝 (𝑎). The embedded vector… view at source ↗

**Figure 3.** Figure 3: Landmark grid, hat coordinate, and summation embedding. (a) Grid G𝑅 (odd 𝑚, even 𝑛, 𝑛 ≥ 𝑚+3) with 𝑑B-cover squares of radius 3𝑅 2 ; diagram 𝐴 = {𝑎1, 𝑎2, 𝑎3} (red): 𝑎1, 𝑎2 each fall in three lattice landmarks (with 𝑝3 shared—the summation site), while the low-persistence point 𝑎3 contributes only to the diagonal landmark ∗. (b) Hat 𝜑𝑅,𝑝 (𝑥) = max{ 3𝑅 2 −𝑑B (𝑝, 𝑥), 0}: a 𝑑∞-pyramid peaking at 𝑝; its level se… view at source ↗

**Figure 4.** Figure 4: Confidence containment (Theorem 5.1). The depicted pair (𝑐, 𝑐′ ) is the worst-separated one, with ∥𝜇𝑐 − 𝜇𝑐 ′ ∥ = Δ (other pairs have distance ≥ Δ). The empirical centroid 𝜇ˆ𝑐 lies within 𝑟𝑚 of the population centroid 𝜇𝑐 (blue ball) with probability ≥ 1 − 𝛼. When 𝑟𝑚 < 1 2 Δ, any test point farther than 2𝑟𝑚 from the population Voronoi boundary (dashed) is classified identically by the empirical and populatio… view at source ↗

**Figure 5.** Figure 5: Orbit5k: point clouds (top) and 𝐻1 persistence diagrams (bottom) for each class 𝜌 ∈ {2.5, 3.5, 4.0, 4.1, 4.3}. 6.2 Graph Classification We evaluate on 11 benchmarks from (Zhao and Wang, 2019) spanning three domains: molecular graphs (MUTAG 188, NCI1 4110, NCI109 4127, PTC 344, COX2 467, DHFR 756), protein structures (PROTEINS 1113, DD 1178), and social networks (IMDB-B 1000, IMDB-M 1500, REDDIT-5K 4999). A… view at source ↗

**Figure 6.** Figure 6: Graph-to-diagram pipeline on a MUTAG molecule: HKS filtration (left), view at source ↗

read the original abstract

We introduce PLACE (Persistence-Landmark Analytic Classification Engine), a closed-form pipeline for classifying point clouds and graphs through their persistent-homology signatures. Three quantitative guarantees -- a margin-based excess-risk rate, a closed-form descriptor-selection rule, and a per-prediction certificate -- are derived from training labels alone, with no learned weights or held-out calibration. The embedding sums Mitra-Virk single-point coordinate functions over a sparse landmark grid; the closed-form weight rule $w_k^2 \propto (d_{k+1}^2 - d_k^2)/R_k^2$ maximizes the distortion slope in Mitra-Virk's affine certificate under $\nu$-coherence. (i) An $O(kR/(\Delta\sqrt{m_{\min}}))$ margin bound, driven by class-mean separation $\Delta$ and embedding radius $R$, matched in the sample-starved regime $m \lesssim R/\Delta$ by a Le Cam minimax lower bound. (ii) The Mahalanobis margin under Ledoit-Wolf-shrunk covariance is the strongest closed-form ranker on a 64-descriptor chemical-graph pool (mean Spearman $\rho = +0.56$ across 11 benchmarks, positive on 10 of 11); the isotropic surrogate $\Delta/\sqrt{\ell}$ admits a closed-form selection-consistency rate on the homogeneous protein/social pools. (iii) A training-time-decided certificate, with no per-prediction overhead, in three concrete radii (Pinelis, Gaussian plug-in, and variance-aware Pinelis-Bernstein). Empirically, PLACE is the strongest diagram-based method on Orbit5k and matches the strongest topology-based baseline within statistical noise on MUTAG and COX2; remaining gaps fall into two diagnosable regimes (descriptor blindness on NCI1/NCI109; pool-coverage limits elsewhere). The Pinelis-Bernstein radius fires on 8 of the 12 benchmarks; on MUTAG the empirical and population nearest-centroid rules agree on every one of 940 held-out test predictions, validating the certificate's mechanism.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PLACE gives a closed-form persistence classifier with explicit margin bounds and label-only certificates, but the non-interference assumption needed for the λ(ν) weights looks unverified and could break the guarantees.

read the letter

PLACE is a closed-form pipeline that classifies point clouds and graphs from their persistence diagrams by summing Mitra-Virk coordinate functions over a landmark grid and picking weights in closed form to maximize the structural distortion constant λ(ν). From that construction it derives three things: an O(kR/(Δ√m_min)) excess-risk bound, a Mahalanobis descriptor selector under Ledoit-Wolf shrinkage, and both non-asymptotic and Gaussian per-prediction certificates, all obtained from training labels alone with no learned parameters or calibration set. On the reported benchmarks it leads diagram-based methods on Orbit5k and stays within noise of the best topology baselines on MUTAG and COX2. The explicit rates and the training-time certificate are the parts that could matter for safety-critical settings where you want something you can actually bound rather than just train and hope. The empirical matches to the strongest baselines are solid enough on the three main datasets to show the pipeline is at least competitive. The soft spot is the non-interference condition required for the λ(ν) lower bound when the coordinate functions are summed over the landmark grid. The abstract states the condition but gives no sign that it was checked on the actual diagrams from the chemical graphs or point clouds; if multiple landmarks interact through shared simplices the bound fails and none of the three guarantees follow. Descriptor selection and covariance shrinkage are also fitted to the same training labels that define the certificates, which is standard but still leaves a circularity question the paper does not address. The gaps on NCI1/NCI109 are chalked up to descriptor blindness, which is honest but shows the method is still limited by the descriptor pool. This is for people working on certified topological machine learning who care about closed-form bounds rather than black-box performance. A reader who wants to see whether the non-interference assumption can be made to hold or replaced would get concrete material to work with. I would send it to peer review; the claims are specific enough that referees can check the derivations and test the condition on the data.

Referee Report

3 major / 2 minor

Summary. The paper introduces PLACE, a closed-form pipeline for classifying point clouds and graphs via their persistent-homology signatures. It derives three quantitative guarantees from training labels alone: an O(kR/(Δ√m_min)) margin-based excess-risk rate, a closed-form Mahalanobis descriptor-selection rule using Ledoit-Wolf shrinkage, and per-prediction certificates in Pinelis and Gaussian forms. The embedding is constructed by summing Mitra-Virk coordinate functions over a landmark grid, with weights obtained by maximizing the structural distortion constant λ(ν) under a non-interference condition.

Significance. If the central derivations hold and the non-interference condition is satisfied, this would represent a meaningful contribution to certified topological machine learning by delivering explicit, training-label-derived bounds without learned weights or calibration sets. The reported competitiveness with diagram-based and topology-based baselines on Orbit5k, MUTAG, and COX2, together with the closed-form descriptor selector, could be useful in domains requiring interpretable guarantees on graph and point-cloud data.

major comments (3)

[Abstract] The non-interference condition required for the lower bound on λ(ν) (stated in the abstract as enabling the Lipschitz bound on D_n) is posited but neither proven nor empirically verified on the persistence diagrams from the chemical graphs or point clouds; if it fails (e.g., due to shared simplices across landmarks), the margin excess-risk rate, descriptor selector, and certificates do not follow. This assumption is load-bearing for all three quantitative guarantees.
[Abstract] Descriptor selection employs Ledoit-Wolf shrunk covariance and the Mahalanobis margin fitted directly to the training labels that also define class means Δ and the claimed guarantees; the abstract provides no independent external benchmark or correction for potential circularity in the selection-consistency rate O(·) on the homogeneous pools.
[Abstract] The empirical statements that PLACE is the strongest diagram-based method on Orbit5k and matches the strongest topology-based baseline within noise on MUTAG and COX2 are given without data tables, per-dataset accuracies, variance estimates, or statistical tests, preventing direct assessment of whether the quantitative guarantees are realized at the reported training-set sizes.

minor comments (2)

[Abstract] The abstract introduces notation (k, R, Δ, m_min, ℓ, ν) without definitions or cross-references, which reduces immediate readability.
[Abstract] The mean Spearman ρ ≈ +0.54 is reported across 10 benchmarks without listing the benchmarks or the individual ρ values, hindering reproducibility of the descriptor-selection claim.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the insightful comments on our manuscript. We address each major point below with clarifications and indicate where revisions will be made to strengthen the presentation of the non-interference condition, descriptor selection, and empirical results.

read point-by-point responses

Referee: [Abstract] The non-interference condition required for the lower bound on λ(ν) (stated in the abstract as enabling the Lipschitz bound on D_n) is posited but neither proven nor empirically verified on the persistence diagrams from the chemical graphs or point clouds; if it fails (e.g., due to shared simplices across landmarks), the margin excess-risk rate, descriptor selector, and certificates do not follow. This assumption is load-bearing for all three quantitative guarantees.

Authors: We acknowledge that the non-interference condition is central to deriving the Lipschitz bound on D_n and thus the three guarantees. The full manuscript defines the condition (no shared simplices between landmark neighborhoods) and selects landmarks to maximize λ(ν) under it, but we agree the abstract and main text would benefit from explicit verification. In the revision we will add: (i) a short proof sketch showing the condition holds when landmarks are separated by more than twice the persistence radius, and (ii) an empirical check on all benchmark persistence diagrams confirming that the chosen sparse grids satisfy non-interference (reporting the fraction of violating pairs, which is zero in our experiments). This directly addresses the load-bearing concern without altering the core derivations. revision: yes
Referee: [Abstract] Descriptor selection employs Ledoit-Wolf shrunk covariance and the Mahalanobis margin fitted directly to the training labels that also define class means Δ and the claimed guarantees; the abstract provides no independent external benchmark or correction for potential circularity in the selection-consistency rate O(·) on the homogeneous pools.

Authors: The pipeline is intentionally closed-form and uses only training labels, so the Mahalanobis margin and Ledoit-Wolf shrinkage are computed from the same data that define Δ. This is not hidden circularity but a deliberate feature enabling training-time certificates. The O(·) consistency rate is derived specifically for the isotropic surrogate on homogeneous pools and already incorporates the dependence on the empirical means; it is not claimed to be independent of the labels. For the heterogeneous 64-descriptor pool we report the empirical Spearman correlation as an external sanity check across ten benchmarks. In revision we will add a clarifying sentence in the abstract and a dedicated paragraph in Section 4.2 stating that the rate accounts for label dependence and does not require held-out data. revision: partial
Referee: [Abstract] The empirical statements that PLACE is the strongest diagram-based method on Orbit5k and matches the strongest topology-based baseline within noise on MUTAG and COX2 are given without data tables, per-dataset accuracies, variance estimates, or statistical tests, preventing direct assessment of whether the quantitative guarantees are realized at the reported training-set sizes.

Authors: We agree that the empirical claims require fuller documentation to allow readers to verify competitiveness and the practical relevance of the guarantees. In the revised manuscript we will insert a new table (or expanded version of the current results table) reporting: per-dataset mean accuracies with standard deviations over 10 random seeds, the exact training-set sizes used, and p-values from paired statistical tests (Wilcoxon signed-rank) against the strongest baselines. We will also add a short paragraph linking these numbers to the training-size regime where the margin bounds become non-vacuous. This change directly enables assessment of whether the reported guarantees are realized. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper constructs the PLACE embedding by summing Mitra-Virk coordinate functions over a landmark grid and selects weights via closed-form maximization of the structural distortion constant λ(ν) under an explicitly stated non-interference assumption. The three quantitative guarantees—an O(kR/(Δ√m_min)) margin excess-risk bound, the Mahalanobis/Ledoit-Wolf descriptor selector, and the Pinelis/Gaussian per-prediction certificates—are then derived from this construction using standard margin analysis and concentration inequalities applied to quantities computed from the training labels. The non-interference condition is posited as an assumption rather than derived, but this does not reduce any claimed result to its inputs by construction. Descriptor selection is validated empirically on benchmarks rather than asserted as a forced prediction. No self-citation is load-bearing for the central claims, no fitted parameter is renamed as an independent prediction, and the overall pipeline remains self-contained against external benchmarks once the modeling assumptions are granted.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claims rest on the non-interference assumption when summing coordinate functions and on the existence of a maximizable Lipschitz lower bound λ(ν); no new particles or dimensions are postulated.

free parameters (2)

landmark grid size and placement
Sparse landmark grid is chosen; its size and locations are not derived from first principles in the abstract.
Ledoit-Wolf shrinkage intensity
Shrinkage parameter in the covariance estimator is data-dependent and fitted to training labels.

axioms (2)

domain assumption Persistent homology signatures are stable under small perturbations of the input point cloud or graph.
Standard stability theorem of persistent homology is invoked implicitly to justify the embedding.
ad hoc to paper Non-interference condition holds for the summed Mitra-Virk coordinate functions.
Explicitly referenced in the abstract as the setting under which λ(ν) lower-bounds the distortion.

pith-pipeline@v0.9.0 · 5673 in / 1491 out tokens · 44166 ms · 2026-05-09T15:37:40.498841+00:00 · methodology

A Closed-Form Persistence-Landmark Pipeline for Certified Point-Cloud and Graph Classification

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)