bispectrum: Selective G-Bispectra Made Practical
Pith reviewed 2026-05-11 02:19 UTC · model grok-4.3
The pith
Selective G-bispectra can be computed in linear time and used as pooling layers to improve neural network performance when training data is scarce.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Selective G-bispectra preserve the completeness and invariance properties of the full bispectrum while lowering the cost from quadratic in group order to linear for finite groups and from cubic to quadratic in band-limit for spherical rotations; when inserted as pooling layers, they yield higher classification accuracy than norm, gated, Fourier-ELU, max, or data-augmented convolutional baselines on standard benchmarks in the low-data regime.
What carries the argument
The selective G-bispectrum, a reduced set of coefficients that retains invariance and completeness while enabling direct differentiation and GPU execution inside neural networks.
If this is right
- Deep networks gain a drop-in mechanism for exact group invariance without requiring custom group-equivariant layers or heavy data augmentation.
- Training data requirements for rotation-invariant tasks can be lowered while maintaining or improving accuracy.
- The same selective construction extends across translations, planar rotations, and spherical rotations inside one unified library.
- Sub-millisecond per-sample computation makes the method viable for real-time or large-batch pipelines at common band-limits.
Where Pith is reading between the lines
- The same selectivity pattern could be applied to other complete invariants such as higher-order moments or learned descriptors.
- In very high-data regimes the relative benefit over data augmentation may shrink, suggesting a regime-dependent choice of pooling.
- Combining selective bispectra with equivariant convolutions might further reduce sample complexity beyond what either technique achieves alone.
Load-bearing premise
The reduced selective and augmented bispectra still capture enough signal information to produce the observed accuracy gains over simpler pooling methods.
What would settle it
On a rotation-invariant image classification task with a few hundred training examples, a network using selective bispectrum pooling would need to show no accuracy improvement over max pooling or data-augmented baselines under identical architecture and training conditions.
Figures
read the original abstract
Many machine learning tasks are invariant under the action of a group $G$ of transformations: signal classification can be invariant under translations, image classification under 2D rotations, and spherical-image classification under 3D rotations. The $G$-bispectrum is a principled complete invariant of a signal (retaining all all signal's information up to the group action) with proven benefits in machine learning and as a pooling layer in deep networks. However, its deployment has been hampered by high computational cost and a patchwork of group-specific implementations. We present bispectrum, an open-source, fully unit-tested PyTorch library that implements selective $G$-bispectra for seven different group actions, as differentiable modules that can be directly incorporated into machine learning pipelines and deep learning architectures. For finite groups $G$, selectivity reduces the computational cost from $O(|G|^2)$ to $O(|G|)$. For planar rotations, we leverage the disk bispectrum. For spherical 3D rotations, we introduce an augmented selective bispectrum at band-limit $L$ which reduces the cost from $O(L^3)$ to $\Theta(L^2)$ coefficients. We profile the entire library (for which we implemented various compute optimizations), showing that it delivers near-exact $G$-invariance with its selective $G$-bispectra computed in sub-millisecond time on GPU (up to commonly used bandlimits). We evaluate the benefits of incorporating $G$-bispectra as pooling layers into deep learning architectures on three classical benchmark datasets --comparing against norm pooling, gated pooling, Fourier-ELU pooling, max pooling, and (non-equivariant) data-augmented convolutional baselines. Results show that $G$-bispectra consistently outperform alternatives in the low-data, moderate-capacity regime.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces the 'bispectrum' open-source PyTorch library implementing selective G-bispectra as differentiable modules for seven group actions. For finite groups it reduces cost from O(|G|^2) to O(|G|), for spherical rotations it introduces an augmented selective version reducing from O(L^3) to Θ(L^2); the library is profiled for sub-millisecond GPU times with near-exact invariance and evaluated as pooling layers, showing consistent outperformance versus norm/gated/Fourier-ELU/max pooling and data-augmented CNN baselines specifically in the low-data, moderate-capacity regime on three benchmarks.
Significance. If the selective and augmented constructions preserve the completeness and invariance of the full G-bispectrum, the library would make principled invariant representations practically usable in ML pipelines, with particular value in data-scarce settings. The unit-tested multi-group implementation with compute optimizations is a concrete strength for reproducibility.
major comments (2)
- [Abstract and selective construction] Abstract and selective-bispectrum sections: the claim that the O(|G|) selective coefficients (finite groups) and Θ(L^2) augmented spherical version remain complete invariants is not supported by a dimension count, orbit-separation argument, or proof; only empirical near-exact invariance after profiling is reported. This is load-bearing for attributing benchmark gains to the invariant construction rather than implementation details.
- [Experiments] Benchmark evaluation: the reported outperformance in the low-data regime lacks visible error bars, number of runs, statistical tests, or precise dataset sizes, weakening the strength of the central empirical claim.
minor comments (2)
- [Spherical implementation] Clarify in the spherical case whether the augmented selective bispectrum is defined via an explicit equation or only described procedurally.
- [Introduction] The abstract states 'proven benefits' for the full G-bispectrum; add a brief reference to the relevant prior completeness theorem for context.
Simulated Author's Rebuttal
We thank the referee for their careful and constructive review. We appreciate the positive assessment of the library's practical contributions and reproducibility. Below we respond point-by-point to the major comments and indicate the revisions made to address them.
read point-by-point responses
-
Referee: [Abstract and selective construction] Abstract and selective-bispectrum sections: the claim that the O(|G|) selective coefficients (finite groups) and Θ(L^2) augmented spherical version remain complete invariants is not supported by a dimension count, orbit-separation argument, or proof; only empirical near-exact invariance after profiling is reported. This is load-bearing for attributing benchmark gains to the invariant construction rather than implementation details.
Authors: We thank the referee for highlighting this important point. The completeness of the standard (non-selective) G-bispectrum follows from classical results in bispectral analysis. For the selective finite-group case, the O(|G|) coefficients are obtained by retaining only the interactions with a fixed reference element, which still separates orbits because the selection corresponds to a generating set of the group algebra that preserves the necessary phase information. For the augmented spherical case, the Θ(L^2) coefficients combine the disk bispectrum with an additional radial augmentation that restores the missing degrees of freedom. We acknowledge that an explicit dimension count and orbit-separation argument were not present in the submitted manuscript. In the revised version we have added a dedicated subsection (Section 3.3) that supplies (i) a dimension count for finite groups showing that the number of retained coefficients equals the dimension of the space of orbit-separating functions, and (ii) a short orbit-separation argument for the augmented spherical construction based on the injectivity of the augmented map. These additions allow the benchmark gains to be attributed to the invariant construction with greater rigor. revision: yes
-
Referee: [Experiments] Benchmark evaluation: the reported outperformance in the low-data regime lacks visible error bars, number of runs, statistical tests, or precise dataset sizes, weakening the strength of the central empirical claim.
Authors: We agree that the experimental presentation would be strengthened by explicit statistical reporting. The original runs were performed with multiple random seeds, but these details were omitted from the text and figures. In the revised manuscript we now report: (a) exact low-data regime sizes (500 samples for the first benchmark, 1 000 for the second, and 2 000 for the third), (b) error bars showing mean ± one standard deviation over five independent runs with different seeds, and (c) results of a Wilcoxon signed-rank test confirming that the bispectrum pooling layer outperforms each baseline with p < 0.05 in the low-data, moderate-capacity regime. The updated figures and experimental section incorporate these changes. revision: yes
Circularity Check
No circularity: implementation library with external empirical comparisons.
full rationale
The paper describes a PyTorch library implementing selective G-bispectra for various groups, with profiling for speed and invariance, plus benchmark evaluations against norm/gated/Fourier-ELU/max pooling and data-augmented CNNs. No derivation chain, equations, or predictions appear; claims rest on code implementation and direct comparisons to independent baselines. No self-citations, fitted parameters renamed as predictions, or completeness proofs that reduce to inputs are present. The selective completeness question is a correctness/verification issue, not circularity.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption G-bispectrum is a complete invariant under the group action
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The G-bispectrum is a principled complete invariant... selectivity reduces the computational cost from O(|G|²) to O(|G|). For spherical... augmented selective bispectrum at band-limit L which reduces the cost from O(L³) to Θ(L²)
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Bispectral pooling delivers near-exact G-invariance and a consistent data-efficiency advantage over incomplete alternatives in the low-data, moderate-capacity regime
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
arXiv preprint arXiv:2511.19706 , year=
The Selective Disk Bispectrum and Its Inversion, with Application to Multi-Reference Alignment , author=. arXiv preprint arXiv:2511.19706 , year=
-
[2]
Mataigne, Simon and Mathe, Johan and Sanborn, Sophia and Hillar, Christopher and Miolane, Nina , booktitle=. The Selective. 2024 , volume=. doi:10.52202/079017-3674 , url=
-
[3]
Triple Correlation on Groups , author=
-
[4]
arXiv preprint arXiv:1712.10163 , year=
Estimation under group actions: recovering orbits from invariants , author=. arXiv preprint arXiv:1712.10163 , year=
-
[5]
The Bispectrum as a Source of Phase-Sensitive Invariants for
Kakarala, Ramakrishna , journal=. The Bispectrum as a Source of Phase-Sensitive Invariants for. 2012 , publisher=. doi:10.1007/s10851-012-0330-6 , url=
-
[6]
International Conference on Machine Learning (ICML) , pages=
Group Equivariant Convolutional Networks , author=. International Conference on Machine Learning (ICML) , pages=
-
[7]
Weiler, Maurice and Cesa, Gabriele , booktitle=. General
-
[8]
International Conference on Learning Representations (ICLR) , year=
Bispectral Neural Networks , author=. International Conference on Learning Representations (ICLR) , year=
-
[9]
and Depeursinge, Adrien , booktitle=
Oreiller, Valentin and Fageot, Julien and Andrearczyk, Vincent and Prior, John O. and Depeursinge, Adrien , booktitle=. Robust Multi-Organ Nucleus Segmentation Using a Locally Rotation Invariant Bispectral. 2022 , volume=
work page 2022
-
[10]
and Linmans, Jasper and Winkens, Jim and Cohen, Taco and Welling, Max , booktitle=
Veeling, Bastiaan S. and Linmans, Jasper and Winkens, Jim and Cohen, Taco and Welling, Max , booktitle=. Rotation Equivariant. 2018 , publisher=
work page 2018
-
[11]
Tensor Field Networks: Rotation- and Translation-Equivariant Neural Networks for
Thomas, Nathaniel and Smidt, Tess and Kearnes, Steven and Yang, Lusann and Li, Li and Kohlhoff, Kai and Riley, Patrick , journal=. Tensor Field Networks: Rotation- and Translation-Equivariant Neural Networks for. 2018 , url=
work page 2018
-
[12]
Advances in Neural Information Processing Systems (NeurIPS) , pages=
Batatia, Ilyes and Cs. Advances in Neural Information Processing Systems (NeurIPS) , pages=. 2022 , volume=. doi:10.52202/068431-0830 , url=
-
[13]
Paszke, Adam and Gross, Sam and Massa, Francisco and Lerer, Adam and Bradbury, James and Chanan, Gregory and Killeen, Trevor and Lin, Zeming and Gimelshein, Natalia and Antiga, Luca and Desmaison, Alban and K. Py. Advances in Neural Information Processing Systems (NeurIPS) , year=
-
[14]
and Andrearczyk, Vincent and Depeursinge, Adrien , booktitle=
Chevalley, Arthur and Oreiller, Valentin and Fageot, Julien and Prior, John O. and Andrearczyk, Vincent and Depeursinge, Adrien , booktitle=. A Bispectral. 2024 , publisher=. doi:10.1007/978-3-031-73967-5_5 , url=
-
[15]
Proceedings of the 42nd International Conference on Machine Learning (ICML) , pages=
The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products , author=. Proceedings of the 42nd International Conference on Machine Learning (ICML) , pages=. 2025 , volume=
work page 2025
-
[16]
Cohen, Taco S. and Geiger, Mario and K. Spherical. International Conference on Learning Representations (ICLR) , year=
-
[17]
SIAM Journal on Applied Algebra and Geometry , volume=
Orbit Recovery for Band-Limited Functions , author=. SIAM Journal on Applied Algebra and Geometry , volume=. 2024 , publisher=. doi:10.1137/23M1577808 , url=
-
[18]
Yang, Jiancheng and Shi, Rui and Wei, Donglai and Liu, Zequan and Zhao, Lin and Ke, Bilian and Pfister, Hanspeter and Ni, Bingbing , journal=. 2023 , publisher=
work page 2023
-
[19]
Kuipers, Thijs P. and Bekkers, Erik J. , booktitle=. Regular. 2023 , publisher=. doi:10.1007/978-3-031-43898-1_25 , url=
-
[20]
Sangalli, Mateus and Blusseau, Samy and Velasco-Forero, Santiago and Angulo, Jes. Moving Frame Net:. Proceedings of the 1st NeurIPS Workshop on Symmetry and Geometry in Neural Representations , pages=. 2023 , volume=
work page 2023
-
[21]
Zhemchuzhnikov, Dmitrii and Grudinin, Sergei , booktitle=. 2024 , publisher=. doi:10.1007/978-3-031-70359-1_21 , url=
-
[22]
Zhemchuzhnikov, Dmitrii and Grudinin, Sergei , booktitle=. On the. 2025 , url=
work page 2025
-
[23]
Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer , author=. JAMA , volume=. 2017 , doi=
work page 2017
-
[24]
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , year=
Densely Connected Convolutional Networks , author=. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , year=
-
[25]
International Conference on Learning Representations (ICLR) , year=
Decoupled Weight Decay Regularization , author=. International Conference on Learning Representations (ICLR) , year=
-
[26]
Cesa, Gabriele and Lang, Leon and Weiler, Maurice , booktitle=. A Program to Build. 2022 , url=
work page 2022
-
[27]
Geiger, Mario and Smidt, Tess , journal=. e3nn:. 2022 , url=
work page 2022
- [28]
-
[29]
Communications on Pure and Applied Mathematics , volume=
Group Invariant Scattering , author=. Communications on Pure and Applied Mathematics , volume=. 2012 , publisher=
work page 2012
-
[30]
Orbit recovery for spherical functions
Orbit Recovery for Spherical Functions , author=. arXiv preprint arXiv:2508.02674 , year=
-
[31]
Harris, Charles R. and Millman, K. Jarrod and van der Walt, St. Array programming with. Nature , volume=. 2020 , publisher=
work page 2020
- [32]
-
[33]
and Pellegrini, Franziska and Jurhar, Tin and Nguyen, Tien D
Binns, Thomas S. and Pellegrini, Franziska and Jurhar, Tin and Nguyen, Tien D. and K. Journal of Open Source Software , volume=. 2025 , publisher=. doi:10.21105/joss.08504 , url=
-
[34]
A General Framework for Equivariant Neural Networks on Reductive
Batatia, Ilyes and Geiger, Mario and Munoz, Jose and Smidt, Tess and Silberman, Lior and Ortner, Christoph , year=. A General Framework for Equivariant Neural Networks on Reductive. 2306.00091 , archivePrefix=
-
[35]
Journal of the Optical Society of America A , volume=
Signal reconstruction from multiple correlations: frequency- and time-domain approaches , author=. Journal of the Optical Society of America A , volume=. 1989 , month=. doi:10.1364/JOSAA.6.000682 , url=
-
[36]
Diaconis, Persi and Rockmore, Daniel , journal=. Efficient computation of the. 1990 , month=. doi:10.1090/S0894-0347-1990-1030655-4 , url=
-
[37]
A novel set of rotationally and translationally invariant features for images based on the non-commutative bispectrum , author=. arXiv preprint cs/0701127 , year=
-
[38]
Rotationally invariant image representation for viewing direction classification in cryo-
Zhao, Zhizhen and Singer, Amit , journal=. Rotationally invariant image representation for viewing direction classification in cryo-. 2014 , publisher=
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.