Riemannian Networks over Full-Rank Correlation Matrices
Pith reviewed 2026-05-20 11:39 UTC · model grok-4.3
The pith
Neural networks can be defined directly on the manifold of full-rank correlation matrices using five geometries.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We introduce Riemannian networks over the manifold of full-rank correlation matrices by leveraging five recently developed correlation geometries. Basic layers including multinomial logistic regression, fully connected, and convolutional layers are extended to these geometries, and accurate backpropagation methods are presented for two of them. Experiments comparing the resulting networks against existing SPD and Grassmannian networks demonstrate their effectiveness.
What carries the argument
The five correlation geometries on the full-rank correlation manifold, which allow neural-network layers to be extended while remaining on the manifold and respecting its normalization.
If this is right
- Convolutional layers can process correlation-matrix inputs directly without first embedding them into a larger SPD space.
- Accurate gradients can be computed through the network for at least two of the correlation geometries.
- Performance gains appear when the same task is solved on correlation matrices rather than on SPD or Grassmannian representations.
- Correlation matrices function as a scale-normalized alternative to SPD matrices inside manifold-valued deep learning.
Where Pith is reading between the lines
- Domains that already record pairwise correlations, such as financial time series or functional connectivity, could adopt these networks without first inflating the data to full SPD matrices.
- The same layer-extension pattern might be applied to additional operations such as attention or pooling once the basic building blocks are shown to work.
- If the correlation manifold admits a simpler or cheaper geometry than the SPD manifold for certain tasks, training cost per epoch could drop while accuracy rises.
Load-bearing premise
The five correlation geometries permit stable and useful extensions of neural-network operations including convolution and backpropagation that preserve manifold properties and produce measurable gains over prior manifold networks.
What would settle it
A dataset of correlation matrices on which a network built with these extended layers either fails to train stably or produces lower accuracy than an otherwise identical SPD-network baseline would falsify the claim.
Figures
read the original abstract
Representations on the Symmetric Positive Definite (SPD) manifold have garnered significant attention across different applications. In contrast, the manifold of full-rank correlation matrices, a normalized alternative to SPD matrices, remains largely underexplored. This paper introduces Riemannian networks over the correlation manifold, leveraging five recently developed correlation geometries. We systematically extend basic layers, including Multinomial Logistic Regression (MLR), Fully Connected (FC), and convolutional layers, to these geometries. Besides, we present methods for accurate backpropagation for two correlation geometries. Experiments comparing our approach against existing SPD and Grassmannian networks demonstrate its effectiveness.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces Riemannian networks over the manifold of full-rank correlation matrices by leveraging five recently developed correlation geometries. It systematically extends basic layers including Multinomial Logistic Regression (MLR), Fully Connected (FC), and convolutional layers to these geometries, presents methods for accurate backpropagation for two of the geometries, and reports experiments comparing the approach to existing SPD and Grassmannian networks to demonstrate effectiveness.
Significance. If the central constructions hold and the experiments are reproducible, the work would be a useful contribution by providing a normalized alternative to SPD-based manifold networks. The systematic layer extensions across multiple correlation geometries and the explicit backpropagation derivations for two geometries represent concrete engineering progress that could enable new applications in domains where correlation structure (rather than scale) is the primary signal.
major comments (1)
- Abstract: the central claim requires stable, manifold-preserving extensions of layers and backpropagation across all five correlation geometries, yet the manuscript provides explicit accurate backpropagation derivations for only two geometries. For the remaining three, no equivalent procedure is given for computing Riemannian gradients or performing updates while enforcing the correlation-matrix constraints (unit diagonal, positive semidefinite). This gap directly threatens the weakest assumption that the five geometries permit stable and useful extensions yielding measurable gains over prior manifold networks.
minor comments (1)
- The abstract provides no quantitative results, dataset descriptions, or implementation details, which limits the ability to assess the strength of the experimental claims from the summary alone.
Simulated Author's Rebuttal
We thank the referee for their careful reading of our manuscript and for providing constructive feedback. We address the major comment below.
read point-by-point responses
-
Referee: Abstract: the central claim requires stable, manifold-preserving extensions of layers and backpropagation across all five correlation geometries, yet the manuscript provides explicit accurate backpropagation derivations for only two geometries. For the remaining three, no equivalent procedure is given for computing Riemannian gradients or performing updates while enforcing the correlation-matrix constraints (unit diagonal, positive semidefinite). This gap directly threatens the weakest assumption that the five geometries permit stable and useful extensions yielding measurable gains over prior manifold networks.
Authors: We appreciate the referee pointing out this important aspect. The manuscript does extend the basic layers (MLR, FC, and convolutional) to all five correlation geometries in a manifold-preserving manner, as described in Sections 3 and 4 of the paper. The backpropagation methods with explicit derivations are provided for two geometries to ensure accurate and efficient computation of Riemannian gradients. For the other three geometries, the Riemannian gradients can be obtained by projecting the Euclidean gradients onto the tangent space using the metric tensor defined for each geometry, followed by retraction operations that inherently enforce the unit diagonal and positive semidefiniteness constraints, as these are built into the manifold definitions. We will revise the manuscript to include a more detailed general framework for backpropagation applicable to all five geometries, along with pseudocode or additional explanations for the remaining three to improve clarity and address this concern directly. revision: yes
Circularity Check
No significant circularity; derivations build on external prior geometries
full rationale
The paper takes five correlation geometries as given inputs from recent external developments and extends standard layers (MLR, FC, convolution) plus backpropagation procedures to operate on the correlation manifold. No equations or claims reduce the network constructions or experimental results back to the input geometries by definition or self-fit. Backpropagation derivations are presented as new contributions for two geometries rather than tautological renamings. The chain remains self-contained against the cited external manifold structures.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We systematically extend basic layers, including Multinomial Logistic Regression (MLR), Fully Connected (FC), and convolutional layers, to these geometries. Besides, we present methods for accurate backpropagation for two correlation geometries.
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
PHCM is derived from the product of hyperbolic spaces
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Wang, Rui and Jin, Shaocheng and Chen, Ziheng and Luo, Xiaoqing and Wu, Xiao-Jun , booktitle =. Learning to Normalize on the
-
[2]
Riemannian High-Order Pooling for Brain Foundation Models , author =. ICLR , year =
-
[3]
Channel-wise topology refinement graph convolution for skeleton-based action recognition , author =. ICCV , year =
- [4]
-
[5]
Pouliquen, Can and Massias, Mathurin and Vayer, Titouan , booktitle =
-
[6]
Liu, Jun and Shahroudy, Amir and Perez, Mauricio and Wang, Gang and Duan, Ling-Yu and Kot, Alex C , journal =
-
[7]
Li, Shanglin and Kawanabe, Motoaki and Kobler, Reinmar J , booktitle =
-
[8]
Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of
Ziheng Chen and Yue Song and Xiaojun Wu and Gaowen Liu and Nicu Sebe , booktitle =. Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of
- [9]
-
[10]
Neural networks on Symmetric Spaces of Noncompact Type , author =. ICLR , year =
-
[11]
Mind2mind: transfer learning for
Fr. Mind2mind: transfer learning for. Geometric Science of Information: 5th International Conference, GSI 2021, Paris, France, July 21--23, 2021, Proceedings 5 , pages=. 2021 , organization=
work page 2021
-
[12]
Geodesics and curvature of the quotient-affine metrics on full-rank correlation matrices , author=. Geometric Science of Information: 5th International Conference, GSI 2021, Paris, France, July 21--23, 2021, Proceedings 5 , pages=. 2021 , organization=
work page 2021
-
[13]
Chen, Ziheng and Song, Yue and Xiaojun and Sebe, Nicu , booktitle =
- [14]
-
[15]
Modeling graphs beyond hyperbolic: Graph neural networks in symmetric positive definite matrices , author=. ECML PKDD , year=
-
[16]
Rolling rotations for recognizing human actions from
Vemulapalli, Raviteja and Chellapa, Rama , booktitle =. Rolling rotations for recognizing human actions from
-
[17]
Fully Hyperbolic Convolutional Neural Networks for Computer Vision , author =. ICLR , year =
- [18]
-
[19]
Deep Geodesic Canonical Correlation Analysis for Covariance-Based Neuroimaging Data , author =. ICLR , year =
- [20]
- [21]
-
[22]
Hyperbolic graph convolutional neural networks , author =. NeurIPS , year =
-
[23]
Hyperbolic Geometric Latent Diffusion Model for Graph Generation , author =. ICML , year =
- [24]
-
[25]
Building Neural Networks on Matrix Manifolds: A
Nguyen, Xuan Son and Yang, Shuo , booktitle =. Building Neural Networks on Matrix Manifolds: A
-
[26]
Pan, Yue-Ting and Chou, Jing-Lun and Wei, Chun-Shu , booktitle =
-
[27]
A globally convergent numerical algorithm for computing the centre of mass on compact
Manton, Jonathan H , booktitle=. A globally convergent numerical algorithm for computing the centre of mass on compact. 2004 , organization=
work page 2004
-
[28]
2019 20th International Radar Symposium (IRS) , pages=
Complex-valued neural networks for fully-temporal micro-Doppler classification , author=. 2019 20th International Radar Symposium (IRS) , pages=. 2019 , organization=
work page 2019
-
[29]
Second-order networks in PyTorch , author=. Geometric Science of Information: 4th International Conference, GSI 2019, Toulouse, France, August 27--29, 2019, Proceedings 4 , pages=. 2019 , organization=
work page 2019
-
[30]
Exploring complex time-series representations for Riemannian machine learning of radar data , author=. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pages=. 2019 , organization=
work page 2019
-
[31]
2020 IEEE Radar Conference (RadarConf20) , pages=
Deep learning and information geometry for drone micro-Doppler radar classification , author=. 2020 IEEE Radar Conference (RadarConf20) , pages=. 2020 , organization=
work page 2020
-
[32]
2019 International Radar Conference (RADAR) , pages=
A Hermitian Positive Definite neural network for micro-Doppler complex covariance processing , author=. 2019 International Radar Conference (RADAR) , pages=. 2019 , organization=
work page 2019
-
[33]
Gaussian distributions on the space of symmetric positive definite matrices from
Barbaresco, Fr. Gaussian distributions on the space of symmetric positive definite matrices from. Geometric Science of Information: 5th International Conference , year=
-
[34]
Is affine-invariance well defined on
Thanwerdas, Yann and Pennec, Xavier , booktitle=. Is affine-invariance well defined on
-
[35]
Riemannian Local Mechanism for
Chen, Ziheng and Xu, Tianyang and Wu, Xiao-Jun and Wang, Rui and Huang, Zhiwu and Kittler, Josef , booktitle =. Riemannian Local Mechanism for
-
[36]
Exploration of balanced metrics on symmetric positive definite matrices , author=. Geometric Science of Information: 4th International Conference, GSI 2019, Toulouse, France, August 27--29, 2019, Proceedings 4 , pages=. 2019 , organization=
work page 2019
-
[37]
Human action recognition by representing
Vemulapalli, Raviteja and Arrate, Felipe and Chellappa, Rama , booktitle =. Human action recognition by representing
-
[38]
Huang, Zhiwu and Wan, Chengde and Probst, Thomas and Van Gool, Luc , booktitle =. Deep learning on
-
[39]
Nguyen, Xuan Son , booktitle =. A
-
[40]
2019 IEEE International Conference on Image Processing (ICIP) , pages=
Frame Attention Networks for Facial Expression Recognition in Videos , author=. 2019 IEEE International Conference on Image Processing (ICIP) , pages=. 2019 , organization=
work page 2019
-
[41]
Proceedings of the 20th ACM International Conference on Multimodal Interaction , pages=
Emotiw 2018: Audio-video, Student Engagement and Group-level Affect Prediction , author=. Proceedings of the 20th ACM International Conference on Multimodal Interaction , pages=
work page 2018
-
[42]
First-person Hand Action Benchmark with
Garcia-Hernando, Guillermo and Yuan, Shanxin and Baek, Seungryul and Kim, Tae-Kyun , booktitle =. First-person Hand Action Benchmark with
-
[43]
Deepkspd: Learning kernel-matrix-based
Engin, Melih and Wang, Lei and Zhou, Luping and Liu, Xinwang , booktitle =. Deepkspd: Learning kernel-matrix-based
-
[44]
A review of kernels on covariance matrices for
Yger, Florian , booktitle=. A review of kernels on covariance matrices for. 2013 , organization=
work page 2013
-
[45]
Rectified linear units improve restricted boltzmann machines , author =. ICML , year =
-
[46]
Batch normalization: Accelerating deep network training by reducing internal covariate shift , author =. ICML , year =
- [47]
-
[48]
Covariance Discriminative Learning: A Natural and Efficient Approach to Image Set Classification , author =. CVPR , year =
-
[49]
Huang, Zhiwu and Wang, Ruiping and Shan, Shiguang and Li, Xianqiu and Chen, Xilin , booktitle =. Log-
-
[50]
Huang, Zhiwu and Van Gool, Luc , booktitle =. A
- [51]
-
[52]
Bloom, Victoria and Makris, Dimitrios and Argyriou, Vasileios , booktitle=
-
[53]
GeomNet: A Neural Network Based on
Nguyen, Xuan Son , booktitle =. GeomNet: A Neural Network Based on
-
[54]
Dilated Convolutional Neural Networks for Sequential Manifold-valued Data , author =. ICCV , year =
-
[55]
Why Approximate Matrix Square Root Outperforms Accurate
Song, Yue and Sebe, Nicu and Wang, Wei , booktitle =. Why Approximate Matrix Square Root Outperforms Accurate
-
[56]
Nguyen, Xuan Son , booktitle =. The
-
[57]
Wang, Rui and Wu, Xiao-Jun and Chen, Ziheng and Xu, Tianyang and Kittler, Josef , booktitle=
-
[58]
Hyperplane margin classifiers on the multinomial manifold , author =. ICML , year =
-
[59]
Nguyen, Xuan Son and Brun, Luc and L. A neural network based on. CVPR , year =
- [60]
-
[61]
Delving deep into rectifiers: Surpassing human-level performance on imagenet classification , author =. ICCV , year =
-
[62]
Huang, Zhiwu and Wu, Jiqing and Van Gool, Luc , booktitle =. Building deep networks on
-
[63]
Momentum batch normalization for deep learning with small batch size , author =. ECCV , year =
-
[64]
Domain-specific batch normalization for unsupervised domain adaptation , author =. CVPR , year =
- [65]
-
[66]
Wang, Rui and Hu, Chen and Chen, Ziheng and Wu, Xiao-Jun and Song, Xiaoning , booktitle =. A
-
[67]
Riemannian Multinomial Logistics Regression for
Chen, Ziheng and Song, Yue and Liu, Gaowen and Kompella, Ramana Rao and Wu, Xiaojun and Sebe, Nicu , booktitle =. Riemannian Multinomial Logistics Regression for
- [68]
- [69]
-
[70]
Fr. Les. Annales de l'institut Henri Poincar
-
[71]
Shahroudy, Amir and Liu, Jun and Ng, Tian-Tsong and Wang, Gang , booktitle =
-
[72]
A Statistical Recurrent Model on the Manifold of Symmetric Positive Definite Matrices , author =. NeurIPS , year =
-
[73]
Kobler, Reinmar and Hirayama, Jun-ichiro and Zhao, Qibin and Kawanabe, Motoaki , booktitle =
- [74]
-
[75]
Trivializations for gradient-based optimization on manifolds , author =. NeurIPS , year =
- [76]
-
[77]
Kang, Huan and Li, Hui and Xu, Tianyang and Wu, Xiao-Jun and Wang, Rui and Cheng, Chunyang and Kittler, Josef , journal =
-
[78]
Constructing brain functional networks from
Jalili, Mahdi and Knyazeva, Maria G , journal=. Constructing brain functional networks from. 2011 , publisher=
work page 2011
-
[79]
Frontiers in neuroscience , volume=
An efficient and reliable statistical method for estimating functional connectivity in large scale brain networks using partial correlation , author=. Frontiers in neuroscience , volume=. 2016 , publisher=
work page 2016
-
[80]
Fast and accurate multivariate
Baldassi, Carlo and Zamparo, Marco and Feinauer, Christoph and Procaccini, Andrea and Zecchina, Riccardo and Weigt, Martin and Pagnani, Andrea , journal=. Fast and accurate multivariate. 2014 , publisher=
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.