Algebraic Priors for Approximately Equivariant Networks

Jamie Vicary; Pietro Li\`o; Riccardo Ali

arxiv: 2506.08244 · v2 · pith:ER4JJGU6new · submitted 2025-06-09 · 💻 cs.LG · cs.AI· stat.ML

Algebraic Priors for Approximately Equivariant Networks

Riccardo Ali , Pietro Li\`o , Jamie Vicary This is my paper

Pith reviewed 2026-05-22 00:03 UTC · model grok-4.3

classification 💻 cs.LG cs.AIstat.ML

keywords equivariant neural networksgroup representation theoryregular representationauxiliary losssymmetry inductive biasfinite groupslatent space structureparameter-free methods

0 comments

The pith

For an equivariant encoder over a finite group, the latent space must almost surely contain one copy of its regular representation for each linearly independent data orbit.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that any encoder respecting a finite group's symmetries will embed the group's regular representation in its latent space, with one copy per independent orbit of the input data. This algebraic requirement lets the authors add an auxiliary loss that encourages the desired structure with no extra learnable parameters. A reader would care because the method replaces complex, parameter-heavy equivariant architectures with a lightweight inductive bias that still matches or exceeds their performance on benchmarks.

Core claim

For an equivariant encoder over a finite group, the latent space must almost surely contain one copy of its regular representation for each linearly independent data orbit. The authors leverage this fact by imposing the regular representation as an inductive bias through an auxiliary loss that adds no learnable parameters. Extensive evaluations show the resulting models match or outperform specialized equivariant networks, including some designed for infinite groups, while an ablation confirms the regular representation outperforms trivial and defining representation baselines.

What carries the argument

The regular representation of the finite group, imposed as an auxiliary loss on the latent space to enforce the required algebraic structure without adding parameters.

If this is right

The auxiliary loss provides a parameter-free way to inject finite-group symmetry into existing networks.
The same loss can be used during training to encourage approximate equivariance.
The approach applies empirically even when the underlying group is infinite.
Regular representation consistently outperforms both trivial and defining representations in ablation tests.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could reduce the engineering effort needed to build symmetry-aware models for new data domains.
When orbits are known to be dependent, a modified loss or representation choice might still be derived from the same representation-theoretic starting point.
The auxiliary loss might be combined with other regularizers to handle groups that are only partially known or approximate.

Load-bearing premise

The encoder is exactly equivariant or trained toward equivariance, and the data orbits are treated as linearly independent.

What would settle it

Train an exactly equivariant encoder on a dataset whose orbits are linearly dependent and check whether the latent space still contains one copy of the regular representation per orbit.

Figures

Figures reproduced from arXiv: 2506.08244 by Jamie Vicary, Pietro Li\`o, Riccardo Ali.

**Figure 1.** Figure 1: Generic architecture with input space X , latent space Z and output space Y, carrying group actions ρX , ρY on the input and output spaces, and potentially an action ρZ on the latent space. When we consider the problem of designing a neural network to solve a given task, we commonly observe the existence of a symmetry group G that acts naturally on the training data.1 For purposes of discussion, we illustr… view at source ↗

**Figure 2.** Figure 2: Approximately-equivariant dynamics of smoke plumes [9]. A rich body of work in machine learning aims to design neural network architectures with improved performance for invariant, equivariant, or approximately equivariant tasks (see Section 2 for a brief survey.) Early work on Convolutional Neural Networks showed the power of translation invariance [15], and more recent methods have included Steerable Con… view at source ↗

**Figure 3.** Figure 3: Visualisation of our learned encoder E, decoder D and latent action ρbZ on input vector x. We motivate this loss function as follows. Component (i) ensures that E, D are appropriately trained for the underlying task. Component (ii) encourages E, D to be equivariant with respect to the learned action ρbZ on the latent space and the fixed action ρY on the output space. This component re-uses the underlying t… view at source ↗

**Figure 4.** Figure 4: Complex eigenvalues of the real-valued matrix [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: The group we use here is D1 = {1, a | a 2 = 1} and, for a data point x, we define the group action ρX (a)(x) to be the data point with the font swapped, but the rotation and scaling unchanged. In particular, with reference to images [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗

**Figure 6.** Figure 6: Examples of training data for the DDMNIST experiment with [PITH_FULL_IMAGE:figures/full_fig_p016_6.png] view at source ↗

**Figure 7.** Figure 7: Examples of a velocity field and its augmentations with and without reorientation. Rotating [PITH_FULL_IMAGE:figures/full_fig_p020_7.png] view at source ↗

read the original abstract

Equivariant neural networks incorporate symmetries through group actions, embedding them as an inductive bias to improve performance. Existing methods learn an equivariant action on the latent space, or design architectures that are equivariant by construction. These approaches often deliver strong empirical results but can involve architecture-specific constraints, large parameter counts, and high computational cost. We challenge the paradigm of complex equivariant architectures with a parameter-free approach grounded in group representation theory. We prove that for an equivariant encoder over a finite group, the latent space must almost surely contain one copy of its regular representation for each linearly independent data orbit, which we explore with a number of empirical studies. Leveraging this foundational algebraic insight, we impose the group's regular representation as an inductive bias via an auxiliary loss, adding no learnable parameters. Our extensive evaluation shows that this method matches or outperforms specialized models in several cases, even those for infinite groups. We further validate our choice of the regular representation through an ablation study, showing it consistently outperforms defining and trivial group representation baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents a parameter-free approach to approximately equivariant networks based on group representation theory. The central claim is that an equivariant encoder over a finite group must almost surely include one copy of the regular representation in its latent space for each linearly independent data orbit. The authors derive an auxiliary loss from this insight to enforce the regular representation as an inductive bias and validate the method through empirical studies on various tasks, showing competitive or superior performance compared to specialized equivariant architectures, along with an ablation confirming the choice of regular representation.

Significance. If the theoretical result is correct, this work offers a significant simplification for building symmetry-aware models by avoiding the need for custom architectures or extra parameters. The empirical evidence that this simple prior can match or exceed more elaborate methods, including for infinite groups, highlights its potential impact. The inclusion of a proof grounded in representation theory and a dedicated ablation study are strengths that enhance the paper's contribution to the field of equivariant machine learning.

major comments (2)

The proof that the latent space contains the regular representation assumes exact equivariance and linearly independent orbits. Given that the method is applied to approximately equivariant networks, a discussion or bound on how deviations from exact equivariance affect the presence of the regular representation would be necessary to fully support the central claim.
While the ablation study shows the regular representation outperforms trivial and defining representations, the manuscript should report variance across multiple runs or statistical tests to confirm that the observed improvements are significant and not due to random variation.

minor comments (2)

The phrase 'a number of empirical studies' is vague; specifying the number and types of experiments would improve the summary.
Ensure that all figures and tables are clearly labeled and referenced in the text for better readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback and positive recommendation for minor revision. We address each major comment below.

read point-by-point responses

Referee: The proof that the latent space contains the regular representation assumes exact equivariance and linearly independent orbits. Given that the method is applied to approximately equivariant networks, a discussion or bound on how deviations from exact equivariance affect the presence of the regular representation would be necessary to fully support the central claim.

Authors: The theoretical result establishes that exact equivariance implies the presence of the regular representation in the latent space. For the approximately equivariant setting, our approach uses an auxiliary loss to impose this as a soft prior. We will add a new subsection discussing the robustness to approximate equivariance, noting that the representation theory result provides a strong inductive bias even when equivariance is not exact, as supported by our empirical results on tasks where perfect equivariance is not achieved. While a rigorous bound on the deviation is challenging without additional assumptions, we will provide a qualitative analysis based on the stability of representations under small perturbations. revision: partial
Referee: While the ablation study shows the regular representation outperforms trivial and defining representations, the manuscript should report variance across multiple runs or statistical tests to confirm that the observed improvements are significant and not due to random variation.

Authors: We agree that this would strengthen the paper. We will update the ablation study and main results to include standard deviations computed over multiple random seeds and add statistical significance tests (such as Wilcoxon signed-rank tests) where appropriate. revision: yes

Circularity Check

0 steps flagged

Core algebraic claim follows from standard representation theory; no load-bearing circularity

full rationale

The paper's central result—that an equivariant encoder's latent space must almost surely contain one copy of the regular representation per linearly independent data orbit—is presented as a direct consequence of standard facts from finite group representation theory applied to the definition of equivariance. The auxiliary loss is then introduced as a parameter-free way to impose this structure. No step reduces a prediction or uniqueness claim to a fitted input, self-citation chain, or ansatz smuggled from prior work by the same authors. The derivation remains self-contained against external benchmarks in representation theory, yielding only a minor self-citation score of 2 with no impact on the independence of the main claim.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work rests on standard results from finite group representation theory. No free parameters are introduced; the auxiliary loss is derived directly from the representation-theoretic fact. No new entities are postulated.

axioms (1)

standard math Standard facts from the representation theory of finite groups, including the structure of the regular representation and its decomposition properties.
Invoked to prove that the latent space of an equivariant encoder must contain copies of the regular representation.

pith-pipeline@v0.9.0 · 5707 in / 1235 out tokens · 34164 ms · 2026-05-22T00:03:57.713169+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We prove that for an equivariant encoder over a finite group, the latent space must almost surely contain one copy of its regular representation for each linearly independent data orbit

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages · 3 internal anchors

[1]

Symmetry-based disentangled representation learning requires interaction with environments, 2019.arXiv:1904.00243

Hugo Caselles-Dupré, Michael Garcia-Ortiz, and David Filliat. Symmetry-based disentangled representation learning requires interaction with environments, 2019.arXiv:1904.00243

work page arXiv 2019
[2]

Steerable CNNs

Taco S. Cohen and Max Welling. Steerable CNNs, 2016.arXiv:1612.08498

work page internal anchor Pith review Pith/arXiv arXiv 2016
[3]

The MNIST database of handwritten digit images for machine learning research

Li Deng. The MNIST database of handwritten digit images for machine learning research. IEEE Signal Processing Magazine, 29(6):141–142, 2012

work page 2012
[4]

Equivariant neural rendering, 2020.arXiv:2006.07630

Emilien Dupont, Miguel Angel Bautista, Alex Colburn, Aditya Sankar, Carlos Guestrin, Josh Susskind, and Qi Shan. Equivariant neural rendering, 2020.arXiv:2006.07630

work page arXiv 2020
[5]

Residual pathway priors for soft equivariance constraints

Marc Finzi, Gregory Benton, and Andrew G Wilson. Residual pathway priors for soft equivariance constraints. In M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan, editors,Advances in Neural Information Processing Systems, volume 34, pages 30037–30049. Curran Associates, Inc., 2021. URL: https://proceedings.neurips.cc/ paper_files/p...

work page 2021
[6]

Universal eigenvarieties, triangu line Galois representations, and p-adic Lang- lands functoriality

William Fulton and Joe Harris. Representation Theory. Springer New York, 2004. doi: 10.1007/978-1-4612-0979-9. 10

work page doi:10.1007/978-1-4612-0979-9 2004
[7]

Canonical decomposition of steerable functions.Journal of Mathematical Imaging and Vision, 9:83–95, 1998

Yacov Hel-Or and Patrick C Teo. Canonical decomposition of steerable functions.Journal of Mathematical Imaging and Vision, 9:83–95, 1998

work page 1998
[8]

Towards a Definition of Disentangled Representations

Irina Higgins, David Amos, David Pfau, Sébastien Racanière, Loïc Matthey, Danilo Rezende, and Alexander Lerchner. Towards a definition of disentangled representations, 2018.arXiv: 1812.02230

work page internal anchor Pith review Pith/arXiv arXiv 2018
[9]

PhiFlow: A differentiable PDE solving framework for deep learning via physical simulations

Philipp Holl, Vladlen Koltun, Kiwon Um, and Nils Thuerey. PhiFlow: A differentiable PDE solving framework for deep learning via physical simulations. InNeurIPS workshop, volume 2, 2020

work page 2020
[10]

Turbulence, coherent structures, dynamical systems and symmetry

Philip Holmes. Turbulence, coherent structures, dynamical systems and symmetry. Cambridge University Press, 2012

work page 2012
[11]

Cambridge University Press, 2001.doi:10.1017/cbo9780511814532

Gordon James and Martin Liebeck.Representations and Characters of Groups. Cambridge University Press, 2001.doi:10.1017/cbo9780511814532

work page doi:10.1017/cbo9780511814532 2001
[12]

Learning group actions on latent representations

Yinzhu Jin, Aman Shrivastava, and Tom Fletcher. Learning group actions on latent representations. InThe Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. URL: https://openreview.net/forum?id=HGNTcy4eEp

work page 2024
[13]

Adam: A Method for Stochastic Optimization

Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization, 2017. URL: https://arxiv.org/abs/1412.6980, arXiv:1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2017
[14]

Learning multiple layers of features from tiny images

Alex Krizhevsky. Learning multiple layers of features from tiny images. 2009. URL: https://api.semanticscholar.org/CorpusID:18268744

work page 2009
[15]

MIT Press, Cambridge, MA, USA, 1998

Yann LeCun and Yoshua Bengio.Convolutional networks for images, speech, and time series, page 255–258. MIT Press, Cambridge, MA, USA, 1998

work page 1998
[16]

Typography-MNIST (TMNIST): an MNIST-style image dataset to categorize glyphs and font-styles, 2022.arXiv:2202.08112

Nimish Magre and Nicholas Brown. Typography-MNIST (TMNIST): an MNIST-style image dataset to categorize glyphs and font-styles, 2022.arXiv:2202.08112

work page arXiv 2022
[17]

Learning disentangled representations and group structure of dynamical environments.Advances in Neural Information Processing Systems, 33:19727–19737, 2020.arXiv:2002.06991

Robin Quessard, Thomas Barrett, and William Clements. Learning disentangled representations and group structure of dynamical environments.Advances in Neural Information Processing Systems, 33:19727–19737, 2020.arXiv:2002.06991

work page arXiv 2020
[18]

Khoshgoftaar

Christopher Shorten and Taghi M. Khoshgoftaar. A survey on image data augmentation for deep learning. Journal of Big Data, 6:60, 2019. doi:10.1186/s40537-019-0197-0

work page doi:10.1186/s40537-019-0197-0 2019
[19]

A probabilistic approach to learning the degree of equivariance in steerable CNNs

Lars Veefkind and Gabriele Cesa. A probabilistic approach to learning the degree of equivariance in steerable CNNs. In41st International Conference on Machine Learning (ICML 2024), 2024. URL: https://openreview.net/forum?id=49vHLSxjzy, arXiv:2406.03946

work page arXiv 2024
[20]

Equivariant Q learning in spatial action spaces

Dian Wang, Robin Walters, Xupeng Zhu, and Robert Platt. Equivariant Q learning in spatial action spaces. In 5th Annual Conference on Robot Learning, 2021. URL: https://openreview.net/forum?id=IScz42A3iCI

work page 2021
[21]

Approximately equivariant networks for imperfectly symmetric dynamics

Rui Wang, Robin Walters, and Rose Yu. Approximately equivariant networks for imperfectly symmetric dynamics. InInternational Conference on Machine Learning, pages 23078–23091. PMLR, 2022. arXiv:2201.11969

work page arXiv 2022
[22]

Self-supervised learning disentangled group representation as feature

Tan Wang, Zhongqi Yue, Jianqiang Huang, Qianru Sun, and Hanwang Zhang. Self-supervised learning disentangled group representation as feature. Advances in Neural Information Processing Systems, 34:18225–18240, 2021.arXiv:2110.15255

work page arXiv 2021
[23]

Disentangled representation learning, 2024

Xin Wang, Hong Chen, Si’ao Tang, Zihao Wu, and Wenwu Zhu. Disentangled representation learning, 2024. arXiv:2211.11695

work page arXiv 2024
[24]

General E(2)-Equivariant Steerable CNNs

Maurice Weiler and Gabriele Cesa. General E(2)-Equivariant Steerable CNNs. InConference on Neural Information Processing Systems (NeurIPS), 2019. arXiv:1911.08251. 11

work page arXiv 2019
[25]

Medmnist v2 - a large-scale lightweight benchmark for 2d and 3d biomedical image classification.Scientific Data, 10(1), January 2023.doi:10.1038/s41597-022-01721-8

Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, and Bingbing Ni. Medmnist v2 - a large-scale lightweight benchmark for 2d and 3d biomedical image classification.Scientific Data, 10(1), January 2023.doi:10.1038/s41597-022-01721-8

work page doi:10.1038/s41597-022-01721-8 2023
[26]

Towards building a group-based unsupervised representation disentanglement framework, 2022.arXiv:2102.10303

Tao Yang, Xuanchi Ren, Yuwang Wang, Wenjun Zeng, and Nanning Zheng. Towards building a group-based unsupervised representation disentanglement framework, 2022.arXiv:2102.10303. 12 A Code The code to run all the experiments in this paper is available at the following location: • https://github.com/rick-ali/parameter-free-approximate-equivariance In the REA...

work page arXiv 2022

[1] [1]

Symmetry-based disentangled representation learning requires interaction with environments, 2019.arXiv:1904.00243

Hugo Caselles-Dupré, Michael Garcia-Ortiz, and David Filliat. Symmetry-based disentangled representation learning requires interaction with environments, 2019.arXiv:1904.00243

work page arXiv 2019

[2] [2]

Steerable CNNs

Taco S. Cohen and Max Welling. Steerable CNNs, 2016.arXiv:1612.08498

work page internal anchor Pith review Pith/arXiv arXiv 2016

[3] [3]

The MNIST database of handwritten digit images for machine learning research

Li Deng. The MNIST database of handwritten digit images for machine learning research. IEEE Signal Processing Magazine, 29(6):141–142, 2012

work page 2012

[4] [4]

Equivariant neural rendering, 2020.arXiv:2006.07630

Emilien Dupont, Miguel Angel Bautista, Alex Colburn, Aditya Sankar, Carlos Guestrin, Josh Susskind, and Qi Shan. Equivariant neural rendering, 2020.arXiv:2006.07630

work page arXiv 2020

[5] [5]

Residual pathway priors for soft equivariance constraints

Marc Finzi, Gregory Benton, and Andrew G Wilson. Residual pathway priors for soft equivariance constraints. In M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan, editors,Advances in Neural Information Processing Systems, volume 34, pages 30037–30049. Curran Associates, Inc., 2021. URL: https://proceedings.neurips.cc/ paper_files/p...

work page 2021

[6] [6]

Universal eigenvarieties, triangu line Galois representations, and p-adic Lang- lands functoriality

William Fulton and Joe Harris. Representation Theory. Springer New York, 2004. doi: 10.1007/978-1-4612-0979-9. 10

work page doi:10.1007/978-1-4612-0979-9 2004

[7] [7]

Canonical decomposition of steerable functions.Journal of Mathematical Imaging and Vision, 9:83–95, 1998

Yacov Hel-Or and Patrick C Teo. Canonical decomposition of steerable functions.Journal of Mathematical Imaging and Vision, 9:83–95, 1998

work page 1998

[8] [8]

Towards a Definition of Disentangled Representations

Irina Higgins, David Amos, David Pfau, Sébastien Racanière, Loïc Matthey, Danilo Rezende, and Alexander Lerchner. Towards a definition of disentangled representations, 2018.arXiv: 1812.02230

work page internal anchor Pith review Pith/arXiv arXiv 2018

[9] [9]

PhiFlow: A differentiable PDE solving framework for deep learning via physical simulations

Philipp Holl, Vladlen Koltun, Kiwon Um, and Nils Thuerey. PhiFlow: A differentiable PDE solving framework for deep learning via physical simulations. InNeurIPS workshop, volume 2, 2020

work page 2020

[10] [10]

Turbulence, coherent structures, dynamical systems and symmetry

Philip Holmes. Turbulence, coherent structures, dynamical systems and symmetry. Cambridge University Press, 2012

work page 2012

[11] [11]

Cambridge University Press, 2001.doi:10.1017/cbo9780511814532

Gordon James and Martin Liebeck.Representations and Characters of Groups. Cambridge University Press, 2001.doi:10.1017/cbo9780511814532

work page doi:10.1017/cbo9780511814532 2001

[12] [12]

Learning group actions on latent representations

Yinzhu Jin, Aman Shrivastava, and Tom Fletcher. Learning group actions on latent representations. InThe Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. URL: https://openreview.net/forum?id=HGNTcy4eEp

work page 2024

[13] [13]

Adam: A Method for Stochastic Optimization

Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization, 2017. URL: https://arxiv.org/abs/1412.6980, arXiv:1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2017

[14] [14]

Learning multiple layers of features from tiny images

Alex Krizhevsky. Learning multiple layers of features from tiny images. 2009. URL: https://api.semanticscholar.org/CorpusID:18268744

work page 2009

[15] [15]

MIT Press, Cambridge, MA, USA, 1998

Yann LeCun and Yoshua Bengio.Convolutional networks for images, speech, and time series, page 255–258. MIT Press, Cambridge, MA, USA, 1998

work page 1998

[16] [16]

Typography-MNIST (TMNIST): an MNIST-style image dataset to categorize glyphs and font-styles, 2022.arXiv:2202.08112

Nimish Magre and Nicholas Brown. Typography-MNIST (TMNIST): an MNIST-style image dataset to categorize glyphs and font-styles, 2022.arXiv:2202.08112

work page arXiv 2022

[17] [17]

Learning disentangled representations and group structure of dynamical environments.Advances in Neural Information Processing Systems, 33:19727–19737, 2020.arXiv:2002.06991

Robin Quessard, Thomas Barrett, and William Clements. Learning disentangled representations and group structure of dynamical environments.Advances in Neural Information Processing Systems, 33:19727–19737, 2020.arXiv:2002.06991

work page arXiv 2020

[18] [18]

Khoshgoftaar

Christopher Shorten and Taghi M. Khoshgoftaar. A survey on image data augmentation for deep learning. Journal of Big Data, 6:60, 2019. doi:10.1186/s40537-019-0197-0

work page doi:10.1186/s40537-019-0197-0 2019

[19] [19]

A probabilistic approach to learning the degree of equivariance in steerable CNNs

Lars Veefkind and Gabriele Cesa. A probabilistic approach to learning the degree of equivariance in steerable CNNs. In41st International Conference on Machine Learning (ICML 2024), 2024. URL: https://openreview.net/forum?id=49vHLSxjzy, arXiv:2406.03946

work page arXiv 2024

[20] [20]

Equivariant Q learning in spatial action spaces

Dian Wang, Robin Walters, Xupeng Zhu, and Robert Platt. Equivariant Q learning in spatial action spaces. In 5th Annual Conference on Robot Learning, 2021. URL: https://openreview.net/forum?id=IScz42A3iCI

work page 2021

[21] [21]

Approximately equivariant networks for imperfectly symmetric dynamics

Rui Wang, Robin Walters, and Rose Yu. Approximately equivariant networks for imperfectly symmetric dynamics. InInternational Conference on Machine Learning, pages 23078–23091. PMLR, 2022. arXiv:2201.11969

work page arXiv 2022

[22] [22]

Self-supervised learning disentangled group representation as feature

Tan Wang, Zhongqi Yue, Jianqiang Huang, Qianru Sun, and Hanwang Zhang. Self-supervised learning disentangled group representation as feature. Advances in Neural Information Processing Systems, 34:18225–18240, 2021.arXiv:2110.15255

work page arXiv 2021

[23] [23]

Disentangled representation learning, 2024

Xin Wang, Hong Chen, Si’ao Tang, Zihao Wu, and Wenwu Zhu. Disentangled representation learning, 2024. arXiv:2211.11695

work page arXiv 2024

[24] [24]

General E(2)-Equivariant Steerable CNNs

Maurice Weiler and Gabriele Cesa. General E(2)-Equivariant Steerable CNNs. InConference on Neural Information Processing Systems (NeurIPS), 2019. arXiv:1911.08251. 11

work page arXiv 2019

[25] [25]

Medmnist v2 - a large-scale lightweight benchmark for 2d and 3d biomedical image classification.Scientific Data, 10(1), January 2023.doi:10.1038/s41597-022-01721-8

Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, and Bingbing Ni. Medmnist v2 - a large-scale lightweight benchmark for 2d and 3d biomedical image classification.Scientific Data, 10(1), January 2023.doi:10.1038/s41597-022-01721-8

work page doi:10.1038/s41597-022-01721-8 2023

[26] [26]

Towards building a group-based unsupervised representation disentanglement framework, 2022.arXiv:2102.10303

Tao Yang, Xuanchi Ren, Yuwang Wang, Wenjun Zeng, and Nanning Zheng. Towards building a group-based unsupervised representation disentanglement framework, 2022.arXiv:2102.10303. 12 A Code The code to run all the experiments in this paper is available at the following location: • https://github.com/rick-ali/parameter-free-approximate-equivariance In the REA...

work page arXiv 2022