pith. sign in

arxiv: 2601.21645 · v2 · pith:TP2YK3TJnew · submitted 2026-01-29 · 💻 cs.LG · math.CT· math.RT

Identifiable Equivariant Networks are Layerwise Equivariant

classification 💻 cs.LG math.CTmath.RT
keywords equivariantnetworksend-to-endactionsequivariancefunctiongroupidentifiable
0
0 comments X
read the original abstract

We investigate the relation between end-to-end equivariance and layerwise equivariance in deep neural networks. We prove the following: For a network whose end-to-end function is equivariant with respect to group actions on the input and output spaces, there is a parameter choice yielding the same end-to-end function such that its layers are equivariant with respect to some group actions on the latent spaces. Our result assumes that the parameters of the model are identifiable in an appropriate sense. This identifiability property has been established in the literature for a large class of networks, to which our results apply immediately, while it is conjectural for others. The theory we develop is grounded in an abstract formalism, and is therefore architecture-agnostic. Overall, our results provide a mathematical explanation for the emergence of equivariant structures in the weights of neural networks during training -- a phenomenon that is consistently observed in practice.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Most ReLU Networks Admit Identifiable Parameters

    cs.LG 2026-05 accept novelty 8.0

    For ReLU networks with input and hidden widths at least 2, most parameters are identifiable up to symmetry, so the functional dimension equals the parameter count minus the number of hidden neurons.

  2. Most ReLU Networks Admit Identifiable Parameters

    cs.LG 2026-05 unverdicted novelty 8.0

    For ReLU networks with width at least two in input and hidden layers, an open set of parameters is identifiable, implying functional dimension equals parameter count minus hidden neurons.