RECON: Robust symmetry discovery via Explicit Canonical Orientation Normalization

Alonso Urbano; David W. Romero; Max Zimmer; Sebastian Pokutta

arxiv: 2505.13289 · v5 · submitted 2025-05-19 · 💻 cs.LG · cs.CV

RECON: Robust symmetry discovery via Explicit Canonical Orientation Normalization

Alonso Urbano , David W. Romero , Max Zimmer , Sebastian Pokutta This is my paper

Pith reviewed 2026-05-22 14:13 UTC · model grok-4.3

classification 💻 cs.LG cs.CV

keywords symmetry discoverycanonical orientation normalizationclass-pose agnosticpose distributionsout-of-distribution detectiontest-time canonicalizationgroup invariancemolecular ensembles

0 comments

The pith

RECON corrects arbitrary canonical orientations with a right translation to discover instance-specific symmetries without predefined groups.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces RECON, a class-pose agnostic method that normalizes orientations by applying a simple right translation to any training-dependent canonical representation. This correction produces natural alignments with the data itself rather than an arbitrary training choice. The resulting alignments support unsupervised recovery of pose distributions that vary per instance. They also flag out-of-distribution poses and serve as an add-on layer that adds invariance to existing models at test time. A reader would care because many real datasets contain symmetries that do not match any fixed group chosen in advance, and this approach handles them without labels or retraining.

Core claim

RECON corrects arbitrary canonicals via a simple right translation, yielding natural, data-aligned canonicalizations. This enables unsupervised discovery of instance-specific pose distributions, detection of out-of-distribution poses, and a plug-and-play test-time canonicalization layer that can be attached to any pre-trained model to infuse group invariance without retraining. The method is validated on images and molecular ensembles, where it demonstrates accurate symmetry discovery and matches or outperforms other canonicalizations on downstream classification tasks.

What carries the argument

The RECON normalization, a class-pose agnostic canonical orientation correction that applies an explicit right translation to align arbitrary canonical representations with observed data symmetries.

If this is right

Instance-specific pose distributions can be discovered unsupervised from data after training.
Poses that fall outside the training distribution can be flagged by measuring alignment quality.
Any pre-trained model can receive a test-time layer that adds group invariance without retraining.
Classification accuracy on images and molecular ensembles matches or exceeds prior canonicalization methods.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same right-translation correction could be tested on point-cloud or graph data where instance symmetries also deviate from fixed groups.
RECON might be combined with existing equivariant architectures by supplying better initial canonicals for group actions.
Performance gains in low-data settings could be measured by comparing models with and without the test-time layer.
Failure cases on data with symmetries that are not representable by translations would highlight the boundary of the method.

Load-bearing premise

Instance-specific symmetries in real data can be recovered and aligned by applying a right translation to an arbitrary training-dependent canonical representation without knowing the exact transformation group or having labeled pose information.

What would settle it

A synthetic dataset with known instance-specific symmetries that require a correction other than a right translation in representation space, where RECON produces misaligned canonicals and cannot recover the correct pose distributions.

read the original abstract

Real world data often exhibits unknown, instance-specific symmetries that rarely exactly match a transformation group $G$ fixed a priori. Class-pose decompositions aim to create disentangled representations by factoring inputs into invariant features and a pose $g\in G$ defined relative to a training-dependent, arbitrary canonical representation. We introduce RECON, a class-pose agnostic canonical orientation normalization that corrects arbitrary canonicals via a simple right translation, yielding natural, data-aligned canonicalizations. This enables (i) unsupervised discovery of instance-specific pose distributions, (ii) detection of out-of-distribution poses and (iii) a plug-and-play test-time canonicalization layer. This layer can be attached on top of any pre-trained model to infuse group invariance, improving its performance without retraining. We validate on images and molecular ensembles, demonstrating accurate symmetry discovery, and matching or outperforming other canonicalizations in downstream classification.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

RECON gives a workable right-translation fix for arbitrary canonicals in class-pose setups, but the claim that it always recovers instance-specific poses without group knowledge or extra conditions looks under-supported.

read the letter

The main point is that RECON normalizes any training-dependent canonical orientation by a right translation to produce a data-aligned version. This is meant to let you discover instance-specific pose distributions unsupervised, spot out-of-distribution poses, and add a test-time invariance layer to existing models without retraining. The abstract and method description frame it as a direct, explicit correction rather than something fitted from parameters, which keeps the circularity low on paper.

Referee Report

1 major / 0 minor

Summary. The manuscript introduces RECON, a class-pose agnostic canonical orientation normalization technique. It claims that applying a simple right translation to any training-dependent arbitrary canonical representation produces natural, data-aligned canonicalizations. This enables (i) unsupervised discovery of instance-specific pose distributions, (ii) detection of out-of-distribution poses, and (iii) a plug-and-play test-time canonicalization layer attachable to pre-trained models to infuse group invariance and improve performance without retraining. Validation is reported on image and molecular ensemble data, with claims of accurate symmetry discovery and matching or outperforming other canonicalizations in downstream classification.

Significance. If the right-translation correction mechanism is shown to be identifiable and effective from data alone, RECON would provide a practical advance for handling unknown instance-specific symmetries in real data, allowing plug-and-play invariance infusion into existing models and unsupervised symmetry discovery without labeled poses or a priori knowledge of G. The reported empirical results on images and molecules indicate potential utility for classification tasks.

major comments (1)

Abstract: The central claim that a single right translation to an arbitrary canonical yields natural data-aligned canonicalizations enabling unsupervised instance-specific pose discovery without knowledge of G or labels is load-bearing. No derivation or identifiability argument is supplied showing that the correcting translation element can be recovered solely from the data distribution when the pre-trained representation may entangle class features with pose in a non-equivariant manner or when real-data symmetries are only approximate and not closed under the implicit group operation.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful review and constructive feedback on our manuscript. We address the major comment regarding the identifiability of the right-translation correction below, providing an honest assessment and indicating planned revisions.

read point-by-point responses

Referee: Abstract: The central claim that a single right translation to an arbitrary canonical yields natural data-aligned canonicalizations enabling unsupervised instance-specific pose discovery without knowledge of G or labels is load-bearing. No derivation or identifiability argument is supplied showing that the correcting translation element can be recovered solely from the data distribution when the pre-trained representation may entangle class features with pose in a non-equivariant manner or when real-data symmetries are only approximate and not closed under the implicit group operation.

Authors: We acknowledge that the current manuscript does not include a complete formal derivation or identifiability proof for the right-translation correction, which is a valid observation. The work primarily develops the practical algorithm and demonstrates its utility through experiments on images and molecular data. In the revised version, we will add a dedicated theoretical subsection that sketches the recovery of the translation under the assumption of approximate equivariance in the learned features: the correcting element is the unique right translation that aligns the empirical pose distribution (estimated unsupervised from the data) to a canonical mode. We will also explicitly discuss the limitations in cases of strong class-pose entanglement or only approximate symmetries, noting that the method remains practically effective as shown by the reported results, even if full group closure does not hold. This revision will strengthen the load-bearing claim without altering the core contributions. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected in RECON derivation

full rationale

The paper introduces RECON as a constructive normalization technique that applies an explicit right translation to correct arbitrary training-dependent canonical representations. This is presented directly in the abstract and method as a class-pose agnostic correction yielding data-aligned canonicals, without any equations or steps that reduce the claimed outputs (unsupervised pose discovery, OOD detection, plug-and-play invariance) back to fitted parameters or self-citations by construction. No load-bearing self-citation chains, ansatzes smuggled via prior work, or renaming of known results appear in the provided description. The approach is validated empirically on images and molecular ensembles as an independent proposal, making the derivation self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review based on abstract only; no specific free parameters, axioms, or invented entities are identifiable from the provided text. The approach appears to build on standard group theory concepts such as right translations without introducing new postulated entities.

pith-pipeline@v0.9.0 · 5683 in / 1112 out tokens · 44256 ms · 2026-05-22T14:13:35.036346+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/ArithmeticFromLogic.lean embed_add, embed_injective, LogicNat orbit structure echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

Proposition 3.1 ... the empirical distribution ˆµ[x] corresponding to the normalized samples ψ([x])ˆΓ⁻¹[x] approximates the target distribution µ[x] ... right-multiplying by the inverse ˆΓ⁻¹[x] corrects this offset and centers the Fréchet mean at the identity
IndisputableMonolith/Cost/FunctionalEquation.lean Jcost_unit0, washburn_uniqueness_aczel echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

canonical orientation normalization ... yielding natural, data-aligned canonicalizations

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.