Hyperboloid GPLVM for Discovering Continuous Hierarchies via Nonparametric Estimation

Keisuke Maeda; Koshi Watanabe; Miki Haseyama; Takahiro Ogawa

arxiv: 2410.16698 · v2 · submitted 2024-10-22 · 💻 cs.LG

Hyperboloid GPLVM for Discovering Continuous Hierarchies via Nonparametric Estimation

Koshi Watanabe , Keisuke Maeda , Takahiro Ogawa , Miki Haseyama This is my paper

Pith reviewed 2026-05-23 19:14 UTC · model grok-4.3

classification 💻 cs.LG

keywords hyperbolic geometrygaussian process latent variable modeldimensionality reductionhierarchical datanonparametric estimationhyperboloid embeddinggenerative modeling

0 comments

The pith

Hyperboloid Gaussian process latent variable models embed high-dimensional hierarchical data continuously using nonparametric estimation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes hyperboloid GP latent variable models to embed high-dimensional hierarchical data in low-dimensional hyperbolic spaces while maintaining implicit continuity in the hierarchies. Traditional neighbor embedding methods often break these continuous relations. By employing generative modeling with Gaussian processes, the approach enables effective hierarchical embedding and addresses hyperparameter tuning issues through nonparametric methods. Three variants are developed using different estimation strategies, and learning algorithms incorporate Riemannian optimization and approximation schemes.

Core claim

hGP-LVMs embed high-dimensional hierarchical data with implicit continuity via nonparametric estimation using generative GP modeling in hyperboloid space.

What carries the argument

Hyperboloid Gaussian process latent variable models (hGP-LVMs) that use generative GP modeling for embedding in hyperbolic geometry.

Load-bearing premise

Adopting generative modeling with the GP brings effective hierarchical embedding and resolves ill-posed hyperparameter tuning.

What would settle it

A comparison where hGP-LVMs fail to maintain continuous hierarchical relations better than existing neighbor embedding methods on standard hierarchical datasets.

Figures

Figures reproduced from arXiv: 2410.16698 by Keisuke Maeda, Koshi Watanabe, Miki Haseyama, Takahiro Ogawa.

**Figure 1.** Figure 1: An illustration of hyperboloid Gaussian process latent variable models (hGP-LVMs). We learn the latent variables on the Lorentz model and visualize them on the Poincaré ball. Gaussian process (GP) latent variable models (GPLVMs) (Lawrence, 2005; Titsias and Lawrence, 2010) are one of the representatives in the nonparametric data-embedding methods. GP-LVMs assume the GP decoder of the observed variables (R… view at source ↗

**Figure 2.** Figure 2: GP prior comparison between the hyperboloid exponential kernel (upper, κ = 5) and Euclidean exponential kernel (bottom). The color gives the value of the sampled GP. We input the latent variables on the Poincaré ball model when M = L Q (left) and those on the unit circle when M = E (right). if dM(x, x ′ ) is a conditionally negative definite (CND) metric (Feragen et al., 2015). Since the hyperbolic metric… view at source ↗

**Figure 3.** Figure 3: An illustration of the synthetic binary tree [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Qualitative results on the SBT dataset (d = 6). (a) embedding comparison between generative models, (b) embedding of hGP-LVM with different length scales, and (c) color code of embedding. validated the effectiveness of hGP-LVM toward generative models with the experiment typically used for evaluating hyperbolic generative models. Embedding Comparison with Different Lengthscale In Section 3.1, we state th… view at source ↗

**Figure 5.** Figure 5: Experimental results on the scRNA-seq dataset. (a) The canonical hematopoietic cell lineage tree. (b) Two-dimensional embedding of UMAP, PoincaréMap, Sparse hGP-LVM, and Bayesian hGP-LVM. The colors correspond to those of the lineage tree. (c) The error bar plot of comparative methods. We ran the same experiment 30 times and computed the mean error with standard deviation. (k = 5) as the local metric to us… view at source ↗

read the original abstract

Dimensionality reduction (DR) offers a useful representation of complex high-dimensional data. Recent DR methods focus on hyperbolic geometry to derive a faithful low-dimensional representation of hierarchical data. However, existing methods are based on neighbor embedding, frequently ruining the continual relation of the hierarchies. This paper presents hyperboloid Gaussian process (GP) latent variable models (hGP-LVMs) to embed high-dimensional hierarchical data with implicit continuity via nonparametric estimation. We adopt generative modeling using the GP, which brings effective hierarchical embedding and executes ill-posed hyperparameter tuning. This paper presents three variants that employ original point, sparse point, and Bayesian estimations. We establish their learning algorithms by incorporating the Riemannian optimization and active approximation scheme of GP-LVM. For Bayesian inference, we further introduce the reparameterization trick to realize Bayesian latent variable learning. In the last part of this paper, we apply hGP-LVMs to several datasets and show their ability to represent high-dimensional hierarchies in low-dimensional spaces.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

hGP-LVM adapts GP-LVM to the hyperboloid with three estimation variants and Riemannian optimization, but the continuity and tuning benefits rest on standard adaptations without standout new evidence.

read the letter

The main takeaway is that this paper moves Gaussian process latent variable models onto the hyperboloid to embed hierarchical data while preserving continuity through nonparametric estimation. They introduce three variants—point estimation, sparse point, and Bayesian with a reparameterization trick—plus Riemannian optimization drawn from existing GP-LVM machinery. That combination is the concrete new element here. The generative GP framing is presented as a way to sidestep the continuity loss common in neighbor-embedding hyperbolic methods and to make hyperparameter choices less ill-posed. The paper walks through the learning algorithms for each variant and ends with applications to several datasets showing the embeddings. Those steps are clear and build directly on prior GP-LVM work without internal contradictions. The experiments are the expected next step after the method description. The softer part is the strength of the supporting claims. The abstract ties the continuity and tuning advantages to the GP choice, yet the provided description does not include quantitative comparisons or ablations that isolate how much the generative model improves continuity over existing hyperbolic DR baselines. Without those numbers it is hard to judge whether the claimed resolution of the tuning problem holds up in practice or is mostly inherited from the GP-LVM template. The central construction itself looks coherent. This paper is aimed at people already working on hyperbolic dimensionality reduction or hierarchical representations in machine learning. A reader focused on manifold learning or GP models would find the specific adaptations and the three variants useful to examine. It is worth sending for peer review because the algorithmic details are laid out and the motivation is consistent, even if the empirical case needs tightening.

Referee Report

2 major / 3 minor

Summary. The paper proposes hyperboloid Gaussian process latent variable models (hGP-LVMs) to perform dimensionality reduction on high-dimensional hierarchical data. It uses generative GP modeling in hyperbolic (hyperboloid) space to embed data while preserving implicit continuity in hierarchies via nonparametric estimation, addressing limitations of neighbor-embedding approaches. Three variants are developed—point estimation, sparse point estimation, and Bayesian estimation—with learning procedures that incorporate Riemannian optimization, active approximation, and the reparameterization trick for the Bayesian case. Experiments on several datasets are used to illustrate the embeddings.

Significance. If the continuity-preserving property holds under the GP construction and the variants scale without introducing new instabilities, the work could provide a useful nonparametric alternative to existing hyperbolic DR methods for tree-structured data. The explicit use of generative modeling and Riemannian tools is a clear strength, as is the provision of multiple estimation regimes.

major comments (2)

[§3.2] §3.2, the sparse variant: the active approximation scheme is stated to inherit from standard GP-LVM, but no derivation or bound is given showing that the inducing-point selection preserves the hyperboloid geometry or the continuity property claimed in the abstract; this is load-bearing for the scalability claim.
[§4] §4, experimental section: the reported visualizations and qualitative hierarchy recovery are not accompanied by a quantitative continuity metric (e.g., a hierarchy violation score or geodesic-distance correlation with ground-truth tree depth); without it the central claim that neighbor-embedding methods “ruin the continual relation” cannot be directly compared.

minor comments (3)

The notation for the hyperboloid manifold and its tangent-space operations is introduced without a short self-contained definition or reference to a standard source; this affects readability for readers outside hyperbolic geometry.
[Algorithm 1] Algorithm 1 (point estimation) and Algorithm 2 (Bayesian) list the same Riemannian optimizer step; a brief note on any differences in step-size scheduling or retraction would clarify the distinction.
The abstract states that GP modeling “executes ill-posed hyperparameter tuning,” yet the text does not quantify how many hyperparameters are actually optimized versus fixed; a short table of free vs. fixed parameters per variant would help.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive summary and recommendation for minor revision. We respond to each major comment below.

read point-by-point responses

Referee: [§3.2] §3.2, the sparse variant: the active approximation scheme is stated to inherit from standard GP-LVM, but no derivation or bound is given showing that the inducing-point selection preserves the hyperboloid geometry or the continuity property claimed in the abstract; this is load-bearing for the scalability claim.

Authors: We acknowledge that §3.2 states the active approximation inherits from standard GP-LVM without providing an explicit derivation or bound confirming preservation of hyperboloid geometry and the continuity property under inducing-point selection. The construction places inducing points on the manifold and applies the same Riemannian optimization as the point-estimate variant, but the manuscript does not derive a formal guarantee. In the revision we will add a short paragraph in §3.2 supplying a first-order argument that the inducing-point scheme preserves the nonparametric GP prior on the hyperboloid and therefore the continuity property. revision: yes
Referee: [§4] §4, experimental section: the reported visualizations and qualitative hierarchy recovery are not accompanied by a quantitative continuity metric (e.g., a hierarchy violation score or geodesic-distance correlation with ground-truth tree depth); without it the central claim that neighbor-embedding methods “ruin the continual relation” cannot be directly compared.

Authors: We agree that the absence of a quantitative continuity metric limits direct comparison with neighbor-embedding baselines. The current experiments rely on qualitative visualization of hierarchy recovery. In the revised manuscript we will augment §4 with at least one quantitative measure (e.g., Spearman correlation between embedding geodesic distances and ground-truth tree depths, or a hierarchy-violation count) on the synthetic and real tree-structured datasets. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper presents hGP-LVMs as an adaptation of existing GP-LVM machinery to hyperboloid geometry, introducing three estimation variants (point, sparse, Bayesian) along with Riemannian optimization, active approximation, and the reparameterization trick. No load-bearing derivation step is shown reducing to a self-definition, a fitted parameter renamed as prediction, or a self-citation chain. The motivation for using generative GP modeling is stated as a methodological choice that addresses hierarchical continuity and hyperparameter issues, without equations or claims that collapse by construction to the inputs. The construction remains self-contained against external GP-LVM benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

Abstract-only; ledger populated from stated assumptions in the summary. Relies on GP generative modeling for hierarchies and hyperbolic suitability without independent verification visible.

free parameters (1)

GP hyperparameters
Tuned via the proposed method; abstract notes ill-posed tuning as motivation for GP approach.

axioms (2)

domain assumption Hyperbolic geometry faithfully represents hierarchical data structures
Core motivation stated in abstract for using hyperboloid model.
domain assumption GP generative modeling resolves ill-posed hyperparameter tuning for hierarchical embedding
Directly stated as benefit of adopting GP in the abstract.

pith-pipeline@v0.9.0 · 5713 in / 1255 out tokens · 24726 ms · 2026-05-23T19:14:50.798382+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We adopt generative modeling using the GP, which brings effective hierarchical embedding... hyperboloid exponential kernel k_M(x,x′)=σ exp(−d_M(x,x′)/κ)
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Lorentz model L^Q ... d_LQ(x,x′)=cosh⁻¹(−⟨x,x′⟩_LQ)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

53 extracted references · 53 canonical work pages · 2 internal anchors

[1]

Amid and M

Amid, E. and Warmuth, M. K. (2019). TriMap : Large-scale dimensionality reduction using triplets. arXiv preprint arXiv:1910.00204

work page arXiv 2019
[2]

G., Schoep, J., Acar, E., Van Noord, N., and Mettes, P

Atigh, M. G., Schoep, J., Acar, E., Van Noord, N., and Mettes, P. (2022). Hyperbolic image segmentation. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition , pages 4453--4462

work page 2022
[3]

Azangulov, I., Smolensky, A., Terenin, A., and Borovitskiy, V. (2022). Stationary kernels and Gaussian processes on Lie groups and their homogeneous spaces i: the compact case. arXiv preprint arXiv:2208.14960

work page arXiv 2022
[4]

Azangulov, I., Smolensky, A., Terenin, A., and Borovitskiy, V. (2023). Stationary kernels and Gaussian processes on Lie groups and their homogeneous spaces ii: non-compact symmetric spaces. arXiv preprint arXiv:2301.13088

work page arXiv 2023
[5]

Bauer, M., Van der Wilk, M., and Rasmussen, C. E. (2016). Understanding probabilistic sparse Gaussian process approximations. Advances in Neural Information Processing Systems , 29:1--9

work page 2016
[6]

W., Ng, L

Becht, E., McInnes, L., Healy, J., Dutertre, C.-A., Kwok, I. W., Ng, L. G., Ginhoux, F., and Newell, E. W. (2019). Dimensionality reduction for visualizing single-cell data using UMAP . Nature Biotechnology , 37(1):38--44

work page 2019
[7]

Borovitskiy, V., Terenin, A., Mostowsky, P., et al. (2020). Mat \'e rn Gaussian processes on Riemannian manifolds. Advances in Neural Information Processing Systems , 33:12426--12437

work page 2020
[8]

Chami, I., Wolf, A., Juan, D.-C., Sala, F., Ravi, S., and R \'e , C. (2020). Low-dimensional hyperbolic knowledge graph embeddings. arXiv preprint arXiv:2005.00545

work page arXiv 2020
[9]

Cho, S., Lee, J., Park, J., and Kim, D. (2022). A rotated hyperbolic wrapped normal distribution for hierarchical representation learning. Advances in Neural Information Processing Systems , 35:17831--17843

work page 2022
[10]

P., and Mattos, C

de Souza, D., Mesquita, D., Gomes, J. P., and Mattos, C. L. (2021). Learning GPLVM with arbitrary kernels using the unscented transformation. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 451--459

work page 2021
[11]

Desai, K., Nickel, M., Rajpurohit, T., Johnson, J., and Vedantam, S. R. (2023). Hyperbolic image-text representations. In Proceedings of the International Conference on Machine Learning , pages 7694--7731

work page 2023
[12]

Fan, X., Yang, C.-H., and Vemuri, B. (2023). Horospherical decision boundaries for large margin classification in hyperbolic space. Advances in Neural Information Processing Systems , 36:1--11

work page 2023
[13]

Fang, P., Harandi, M., and Petersson, L. (2021). Kernel methods in hyperbolic spaces. In Proceedings of the IEEE/CVF International Conference on Computer Vision , pages 10665--10674

work page 2021
[14]

Feragen, A., Lauze, F., and Hauberg, S. (2015). Geodesic exponential kernels: When curvature and linearity conflict. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 3032--3042

work page 2015
[15]

Ganea, O., B \'e cigneul, G., and Hofmann, T. (2018). Hyperbolic neural networks. Advances in Neural Information Processing Systems , 31:1--11

work page 2018
[16]

Higgins, I., Matthey, L., Pal, A., Burgess, C., Glorot, X., Botvinick, M., Mohamed, S., and Lerchner, A. (2017). beta- VAE : Learning basic visual concepts with a constrained variational framework. In Proceedings of the International Conference on Learning Representations , pages 1--22

work page 2017
[17]

Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology , 24(6):417--441

work page 1933
[18]

Istas, J. (2012). Manifold indexed fractional fields*. ESAIM: Probability and Statistics , 16:222--276

work page 2012
[19]

Jaquier, N., Rozo, L., Gonz \'a lez-Duque, M., Borovitskiy, V., and Asfour, T. (2022). Bringing robotics taxonomies to continuous domains via GPLVM on hyperbolic manifolds. arXiv preprint arXiv:2210.01672

work page arXiv 2022
[20]

Jensen, K., Kao, T.-C., Tripodi, M., and Hennequin, G. (2020). Manifold GPLVMs for discovering non- Euclidean latent structure in neural data. Advances in Neural Information Processing Systems , 33:22580--22592

work page 2020
[21]

Kingma, D. P. and Welling, M. (2013). Auto-encoding variational Bayes . arXiv preprint arXiv:1312.6114

work page internal anchor Pith review Pith/arXiv arXiv 2013
[22]

Klimovskaia, A., Lopez-Paz, D., Bottou, L., and Nickel, M. (2020). Poincar \'e maps for analyzing complex hierarchies in single-cell data. Nature Communications , 11(1):2966

work page 2020
[23]

and Berens, P

Kobak, D. and Berens, P. (2019). The art of using t-SNE for single-cell transcriptomics. Nature Communications , 10(1):5416

work page 2019
[24]

Lalchand, V., Ravuri, A., and Lawrence, N. D. (2022). Generalised GPLVM with stochastic variational inference. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 7841--7864

work page 2022
[25]

Lawrence, N. (2005). Probabilistic non-linear principal component analysis with Gaussian process latent variable models. Journal of Machine Learning Research , 6(11):1--34

work page 2005
[26]

Lawrence, N. D. (2007). Learning for larger datasets with the Gaussian process latent variable model. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 243--250

work page 2007
[27]

Luecken, M. D. and Theis, F. J. (2019). Current best practices in single-cell RNA -seq analysis: a tutorial. Molecular Systems Biology , 15(6):e8746

work page 2019
[28]

and Feragen, A

Mallasto, A. and Feragen, A. (2018). Wrapped Gaussian process regression on Riemannian manifolds. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 5580--5588

work page 2018
[29]

J., Tomioka, R., and Teh, Y

Mathieu, E., Le Lan, C., Maddison, C. J., Tomioka, R., and Teh, Y. W. (2019). Continuous hierarchical representations with Poincar \'e variational auto-encoders. Advances in Neural Information Processing Systems , 32:1--12

work page 2019
[30]

McInnes, L., Healy, J., and Melville, J. (2018). UMAP : Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426

work page internal anchor Pith review Pith/arXiv arXiv 2018
[31]

R., Van Dijk, D., Wang, Z., Gigante, S., Burkhardt, D

Moon, K. R., Van Dijk, D., Wang, Z., Gigante, S., Burkhardt, D. B., Chen, W. S., Yim, K., Elzen, A. v. d., Hirn, M. J., Coifman, R. R., et al. (2019). Visualizing structure and transitions in high-dimensional biological data. Nature Biotechnology , 37(12):1482--1492

work page 2019
[32]

Moreno-Mu \ n oz, P., Feldager, C., and Hauberg, S. (2022). Revisiting active sets for Gaussian process decoders. Advances in Neural Information Processing Systems , 35:6603--6614

work page 2022
[33]

Nagano, Y., Yamaguchi, S., Fujita, Y., and Koyama, M. (2019). A wrapped normal distribution on hyperbolic space for gradient-based learning. In Proceedings of the International Conference on Machine Learning , pages 4693--4702

work page 2019
[34]

and Kiela, D

Nickel, M. and Kiela, D. (2017). Poincar \'e embeddings for learning hierarchical representations. Advances in Neural Information Processing Systems , 30:1--10

work page 2017
[35]

and Kiela, D

Nickel, M. and Kiela, D. (2018). Learning continuous hierarchies in the Lorentz model of hyperbolic geometry. In Proceedings of the International Conference on Machine Learning , pages 3779--3788

work page 2018
[36]

Niu, M., Dai, Z., Cheung, P., and Wang, Y. (2023). Intrinsic Gaussian process on unknown manifolds with probabilistic metrics. Journal of Machine Learning Research , 24(104):1--42

work page 2023
[37]

A., Kenigsberg, E., Keren-Shaul, H., Winter, D., Lara-Astiaso, D., Gury, M., Weiner, A., et al

Paul, F., Arkin, Y., Giladi, A., Jaitin, D. A., Kenigsberg, E., Keren-Shaul, H., Winter, D., Lara-Astiaso, D., Gury, M., Weiner, A., et al. (2015). Transcriptional heterogeneity and lineage commitment in myeloid progenitors. Cell , 163(7):1663--1677

work page 2015
[38]

Peng, W., Varanka, T., Mostafa, A., Shi, H., and Zhao, G. (2021). Hyperbolic deep neural networks: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence , 44(12):10023--10044

work page 2021
[39]

Rasmussen, C. E. and Williams, C. K. (2006). Gaussian Processes for Machine Learning . MIT Press

work page 2006
[40]

Sala, F., De Sa, C., Gu, A., and R \'e , C. (2018). Representation tradeoffs for hyperbolic embeddings. In Proceedings of the International Conference on Machine Learning , pages 4460--4469

work page 2018
[41]

and Deisenroth, M

Salimbeni, H. and Deisenroth, M. (2017). Doubly stochastic variational inference for deep Gaussian processes. Advances in Neural Information Processing Systems , 30:1--12

work page 2017
[42]

Titsias, M. (2009). Variational learning of inducing variables in sparse Gaussian processes. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 567--574

work page 2009
[43]

and Lawrence, N

Titsias, M. and Lawrence, N. D. (2010). Bayesian Gaussian process latent variable model. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 844--851

work page 2010
[44]

and Hinton, G

Van der Maaten, L. and Hinton, G. (2008). Visualizing data using t-SNE . Journal of Machine Learning Research , 9(11):1--25

work page 2008
[45]

Wang, Y., Huang, H., Rudin, C., and Shaposhnik, Y. (2021). Understanding how dimension reduction tools work: an empirical approach to deciphering t-SNE, UMAP, TriMAP, and PaCMAP for data visualization. Journal of Machine Learning Research , 22(201):1--73

work page 2021
[46]

Yang, M., Fang, P., and Xue, H. (2023). Expanding the hyperbolic kernels: a curvature-aware isometric embedding view. In Proceedings of the International Joint Conference on Artificial Intelligence , pages 4469--4477

work page 2023
[47]

X., Terry, J

Zheng, G. X., Terry, J. M., Belgrader, P., Ryvkin, P., Bent, Z. W., Wilson, R., Ziraldo, S. B., Wheeler, T. D., McDermott, G. P., Zhu, J., et al. (2017). Massively parallel digital transcriptional profiling of single cells. Nature Communications , 8(1):14049

work page 2017
[48]

M., Kerren, A., Hirata, N

Espadoto, M., Martins, R. M., Kerren, A., Hirata, N. S., and Telea, A. C. (2019). Toward a quantitative survey of dimension reduction techniques. IEEE Transactions on Visualization and Computer Graphics , 27(3):2153--2173

work page 2019
[49]

GPy : A Gaussian process framework in Python

GPy (2012). GPy : A Gaussian process framework in Python . Available: http://github.com/SheffieldML/GPy

work page 2012
[50]

A., Paulovich, F

Joia, P., Coimbra, D., Cuminato, J. A., Paulovich, F. V., and Nonato, L. G. (2011). Local affine multidimensional projection. IEEE Transactions on Visualization and Computer Graphics , 17(12):2563--2571

work page 2011
[51]

Sloane, N. J. (2007). The on-line encyclopedia of integer sequences. In Proceedings of the International Conference on Towards Mechanized Mathematical Assistants , pages 130--130

work page 2007
[52]

and Kaski, S

Venna, J. and Kaski, S. (2001). Neighborhood preservation in nonlinear projection methods: An experimental study. In Proceedings of the International Conference on Artificial Neural Networks , pages 485--491

work page 2001
[53]

and Tao, Q

Zu, X. and Tao, Q. (2022). SpaceMAP : Visualizing high-dimensional data by space expansion. In Proceedings of the International Conference on Machine Learning , pages 27707--27723

work page 2022

[1] [1]

Amid and M

Amid, E. and Warmuth, M. K. (2019). TriMap : Large-scale dimensionality reduction using triplets. arXiv preprint arXiv:1910.00204

work page arXiv 2019

[2] [2]

G., Schoep, J., Acar, E., Van Noord, N., and Mettes, P

Atigh, M. G., Schoep, J., Acar, E., Van Noord, N., and Mettes, P. (2022). Hyperbolic image segmentation. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition , pages 4453--4462

work page 2022

[3] [3]

Azangulov, I., Smolensky, A., Terenin, A., and Borovitskiy, V. (2022). Stationary kernels and Gaussian processes on Lie groups and their homogeneous spaces i: the compact case. arXiv preprint arXiv:2208.14960

work page arXiv 2022

[4] [4]

Azangulov, I., Smolensky, A., Terenin, A., and Borovitskiy, V. (2023). Stationary kernels and Gaussian processes on Lie groups and their homogeneous spaces ii: non-compact symmetric spaces. arXiv preprint arXiv:2301.13088

work page arXiv 2023

[5] [5]

Bauer, M., Van der Wilk, M., and Rasmussen, C. E. (2016). Understanding probabilistic sparse Gaussian process approximations. Advances in Neural Information Processing Systems , 29:1--9

work page 2016

[6] [6]

W., Ng, L

Becht, E., McInnes, L., Healy, J., Dutertre, C.-A., Kwok, I. W., Ng, L. G., Ginhoux, F., and Newell, E. W. (2019). Dimensionality reduction for visualizing single-cell data using UMAP . Nature Biotechnology , 37(1):38--44

work page 2019

[7] [7]

Borovitskiy, V., Terenin, A., Mostowsky, P., et al. (2020). Mat \'e rn Gaussian processes on Riemannian manifolds. Advances in Neural Information Processing Systems , 33:12426--12437

work page 2020

[8] [8]

Chami, I., Wolf, A., Juan, D.-C., Sala, F., Ravi, S., and R \'e , C. (2020). Low-dimensional hyperbolic knowledge graph embeddings. arXiv preprint arXiv:2005.00545

work page arXiv 2020

[9] [9]

Cho, S., Lee, J., Park, J., and Kim, D. (2022). A rotated hyperbolic wrapped normal distribution for hierarchical representation learning. Advances in Neural Information Processing Systems , 35:17831--17843

work page 2022

[10] [10]

P., and Mattos, C

de Souza, D., Mesquita, D., Gomes, J. P., and Mattos, C. L. (2021). Learning GPLVM with arbitrary kernels using the unscented transformation. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 451--459

work page 2021

[11] [11]

Desai, K., Nickel, M., Rajpurohit, T., Johnson, J., and Vedantam, S. R. (2023). Hyperbolic image-text representations. In Proceedings of the International Conference on Machine Learning , pages 7694--7731

work page 2023

[12] [12]

Fan, X., Yang, C.-H., and Vemuri, B. (2023). Horospherical decision boundaries for large margin classification in hyperbolic space. Advances in Neural Information Processing Systems , 36:1--11

work page 2023

[13] [13]

Fang, P., Harandi, M., and Petersson, L. (2021). Kernel methods in hyperbolic spaces. In Proceedings of the IEEE/CVF International Conference on Computer Vision , pages 10665--10674

work page 2021

[14] [14]

Feragen, A., Lauze, F., and Hauberg, S. (2015). Geodesic exponential kernels: When curvature and linearity conflict. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 3032--3042

work page 2015

[15] [15]

Ganea, O., B \'e cigneul, G., and Hofmann, T. (2018). Hyperbolic neural networks. Advances in Neural Information Processing Systems , 31:1--11

work page 2018

[16] [16]

Higgins, I., Matthey, L., Pal, A., Burgess, C., Glorot, X., Botvinick, M., Mohamed, S., and Lerchner, A. (2017). beta- VAE : Learning basic visual concepts with a constrained variational framework. In Proceedings of the International Conference on Learning Representations , pages 1--22

work page 2017

[17] [17]

Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology , 24(6):417--441

work page 1933

[18] [18]

Istas, J. (2012). Manifold indexed fractional fields*. ESAIM: Probability and Statistics , 16:222--276

work page 2012

[19] [19]

Jaquier, N., Rozo, L., Gonz \'a lez-Duque, M., Borovitskiy, V., and Asfour, T. (2022). Bringing robotics taxonomies to continuous domains via GPLVM on hyperbolic manifolds. arXiv preprint arXiv:2210.01672

work page arXiv 2022

[20] [20]

Jensen, K., Kao, T.-C., Tripodi, M., and Hennequin, G. (2020). Manifold GPLVMs for discovering non- Euclidean latent structure in neural data. Advances in Neural Information Processing Systems , 33:22580--22592

work page 2020

[21] [21]

Kingma, D. P. and Welling, M. (2013). Auto-encoding variational Bayes . arXiv preprint arXiv:1312.6114

work page internal anchor Pith review Pith/arXiv arXiv 2013

[22] [22]

Klimovskaia, A., Lopez-Paz, D., Bottou, L., and Nickel, M. (2020). Poincar \'e maps for analyzing complex hierarchies in single-cell data. Nature Communications , 11(1):2966

work page 2020

[23] [23]

and Berens, P

Kobak, D. and Berens, P. (2019). The art of using t-SNE for single-cell transcriptomics. Nature Communications , 10(1):5416

work page 2019

[24] [24]

Lalchand, V., Ravuri, A., and Lawrence, N. D. (2022). Generalised GPLVM with stochastic variational inference. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 7841--7864

work page 2022

[25] [25]

Lawrence, N. (2005). Probabilistic non-linear principal component analysis with Gaussian process latent variable models. Journal of Machine Learning Research , 6(11):1--34

work page 2005

[26] [26]

Lawrence, N. D. (2007). Learning for larger datasets with the Gaussian process latent variable model. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 243--250

work page 2007

[27] [27]

Luecken, M. D. and Theis, F. J. (2019). Current best practices in single-cell RNA -seq analysis: a tutorial. Molecular Systems Biology , 15(6):e8746

work page 2019

[28] [28]

and Feragen, A

Mallasto, A. and Feragen, A. (2018). Wrapped Gaussian process regression on Riemannian manifolds. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 5580--5588

work page 2018

[29] [29]

J., Tomioka, R., and Teh, Y

Mathieu, E., Le Lan, C., Maddison, C. J., Tomioka, R., and Teh, Y. W. (2019). Continuous hierarchical representations with Poincar \'e variational auto-encoders. Advances in Neural Information Processing Systems , 32:1--12

work page 2019

[30] [30]

McInnes, L., Healy, J., and Melville, J. (2018). UMAP : Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426

work page internal anchor Pith review Pith/arXiv arXiv 2018

[31] [31]

R., Van Dijk, D., Wang, Z., Gigante, S., Burkhardt, D

Moon, K. R., Van Dijk, D., Wang, Z., Gigante, S., Burkhardt, D. B., Chen, W. S., Yim, K., Elzen, A. v. d., Hirn, M. J., Coifman, R. R., et al. (2019). Visualizing structure and transitions in high-dimensional biological data. Nature Biotechnology , 37(12):1482--1492

work page 2019

[32] [32]

Moreno-Mu \ n oz, P., Feldager, C., and Hauberg, S. (2022). Revisiting active sets for Gaussian process decoders. Advances in Neural Information Processing Systems , 35:6603--6614

work page 2022

[33] [33]

Nagano, Y., Yamaguchi, S., Fujita, Y., and Koyama, M. (2019). A wrapped normal distribution on hyperbolic space for gradient-based learning. In Proceedings of the International Conference on Machine Learning , pages 4693--4702

work page 2019

[34] [34]

and Kiela, D

Nickel, M. and Kiela, D. (2017). Poincar \'e embeddings for learning hierarchical representations. Advances in Neural Information Processing Systems , 30:1--10

work page 2017

[35] [35]

and Kiela, D

Nickel, M. and Kiela, D. (2018). Learning continuous hierarchies in the Lorentz model of hyperbolic geometry. In Proceedings of the International Conference on Machine Learning , pages 3779--3788

work page 2018

[36] [36]

Niu, M., Dai, Z., Cheung, P., and Wang, Y. (2023). Intrinsic Gaussian process on unknown manifolds with probabilistic metrics. Journal of Machine Learning Research , 24(104):1--42

work page 2023

[37] [37]

A., Kenigsberg, E., Keren-Shaul, H., Winter, D., Lara-Astiaso, D., Gury, M., Weiner, A., et al

Paul, F., Arkin, Y., Giladi, A., Jaitin, D. A., Kenigsberg, E., Keren-Shaul, H., Winter, D., Lara-Astiaso, D., Gury, M., Weiner, A., et al. (2015). Transcriptional heterogeneity and lineage commitment in myeloid progenitors. Cell , 163(7):1663--1677

work page 2015

[38] [38]

Peng, W., Varanka, T., Mostafa, A., Shi, H., and Zhao, G. (2021). Hyperbolic deep neural networks: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence , 44(12):10023--10044

work page 2021

[39] [39]

Rasmussen, C. E. and Williams, C. K. (2006). Gaussian Processes for Machine Learning . MIT Press

work page 2006

[40] [40]

Sala, F., De Sa, C., Gu, A., and R \'e , C. (2018). Representation tradeoffs for hyperbolic embeddings. In Proceedings of the International Conference on Machine Learning , pages 4460--4469

work page 2018

[41] [41]

and Deisenroth, M

Salimbeni, H. and Deisenroth, M. (2017). Doubly stochastic variational inference for deep Gaussian processes. Advances in Neural Information Processing Systems , 30:1--12

work page 2017

[42] [42]

Titsias, M. (2009). Variational learning of inducing variables in sparse Gaussian processes. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 567--574

work page 2009

[43] [43]

and Lawrence, N

Titsias, M. and Lawrence, N. D. (2010). Bayesian Gaussian process latent variable model. In Proceedings of the International Conference on Artificial Intelligence and Statistics , pages 844--851

work page 2010

[44] [44]

and Hinton, G

Van der Maaten, L. and Hinton, G. (2008). Visualizing data using t-SNE . Journal of Machine Learning Research , 9(11):1--25

work page 2008

[45] [45]

Wang, Y., Huang, H., Rudin, C., and Shaposhnik, Y. (2021). Understanding how dimension reduction tools work: an empirical approach to deciphering t-SNE, UMAP, TriMAP, and PaCMAP for data visualization. Journal of Machine Learning Research , 22(201):1--73

work page 2021

[46] [46]

Yang, M., Fang, P., and Xue, H. (2023). Expanding the hyperbolic kernels: a curvature-aware isometric embedding view. In Proceedings of the International Joint Conference on Artificial Intelligence , pages 4469--4477

work page 2023

[47] [47]

X., Terry, J

Zheng, G. X., Terry, J. M., Belgrader, P., Ryvkin, P., Bent, Z. W., Wilson, R., Ziraldo, S. B., Wheeler, T. D., McDermott, G. P., Zhu, J., et al. (2017). Massively parallel digital transcriptional profiling of single cells. Nature Communications , 8(1):14049

work page 2017

[48] [48]

M., Kerren, A., Hirata, N

Espadoto, M., Martins, R. M., Kerren, A., Hirata, N. S., and Telea, A. C. (2019). Toward a quantitative survey of dimension reduction techniques. IEEE Transactions on Visualization and Computer Graphics , 27(3):2153--2173

work page 2019

[49] [49]

GPy : A Gaussian process framework in Python

GPy (2012). GPy : A Gaussian process framework in Python . Available: http://github.com/SheffieldML/GPy

work page 2012

[50] [50]

A., Paulovich, F

Joia, P., Coimbra, D., Cuminato, J. A., Paulovich, F. V., and Nonato, L. G. (2011). Local affine multidimensional projection. IEEE Transactions on Visualization and Computer Graphics , 17(12):2563--2571

work page 2011

[51] [51]

Sloane, N. J. (2007). The on-line encyclopedia of integer sequences. In Proceedings of the International Conference on Towards Mechanized Mathematical Assistants , pages 130--130

work page 2007

[52] [52]

and Kaski, S

Venna, J. and Kaski, S. (2001). Neighborhood preservation in nonlinear projection methods: An experimental study. In Proceedings of the International Conference on Artificial Neural Networks , pages 485--491

work page 2001

[53] [53]

and Tao, Q

Zu, X. and Tao, Q. (2022). SpaceMAP : Visualizing high-dimensional data by space expansion. In Proceedings of the International Conference on Machine Learning , pages 27707--27723

work page 2022