Equivariant Transformer Networks

Gregory Valiant; Kai Sheng Tai; Peter Bailis

arxiv: 1901.11399 · v2 · pith:4RG4HYNBnew · submitted 2019-01-25 · 💻 cs.CV · cs.LG· stat.ML

Equivariant Transformer Networks

Kai Sheng Tai , Peter Bailis , Gregory Valiant This is my paper

classification 💻 cs.CV cs.LGstat.ML

keywords equivariantimprovetransformationgroupsmodelrobustnesstowardsachieving

0 comments

read the original abstract

How can prior knowledge on the transformation invariances of a domain be incorporated into the architecture of a neural network? We propose Equivariant Transformers (ETs), a family of differentiable image-to-image mappings that improve the robustness of models towards pre-defined continuous transformation groups. Through the use of specially-derived canonical coordinate systems, ETs incorporate functions that are equivariant by construction with respect to these transformations. We show empirically that ETs can be flexibly composed to improve model robustness towards more complicated transformation groups in several parameters. On a real-world image classification task, ETs improve the sample efficiency of ResNet classifiers, achieving relative improvements in error rate of up to 15% in the limited data regime while increasing model parameter count by less than 1%.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Unified Framework for Vision Transformers Equivariant to Discrete Subgroups of $\mathrm{O}(2)$
cs.CV 2026-06 unverdicted novelty 7.0

A unified family of vision transformers equivariant to arbitrary discrete subgroups of O(2), with embedding and expressivity theorems, a D6 construction using hexagonal patches, and experiments on aerial images in low...
Risk-Controlled Post-Processing of Decision Policies
stat.ML 2026-05 unverdicted novelty 7.0

Risk-controlled post-processing yields a threshold-structured policy that follows the baseline except where an oracle fallback sharply reduces conditional violation risk, achieving O(log n/n) expected excess risk in i...