Equivariant Splitting: Self-supervised learning from incomplete data
Pith reviewed 2026-05-18 10:50 UTC · model grok-4.3
The pith
A new definition of equivariance for reconstruction networks makes splitting losses unbiased estimators of the supervised loss from single incomplete observations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors introduce a new definition of equivariance tailored to reconstruction networks and prove that, when a network satisfies this property, self-supervised splitting losses yield unbiased estimates of the supervised loss under a single incomplete observation model.
What carries the argument
The new equivariance definition for reconstruction networks, which ensures the splitting loss remains an unbiased estimator of the full supervised loss.
If this is right
- Enables training of reconstruction networks without any ground-truth references in highly underdetermined inverse problems.
- Delivers state-of-the-art results on image inpainting, accelerated magnetic resonance imaging, sparse-view computed tomography, and compressive sensing.
- Applies to settings with a single incomplete observation model where prior self-supervised methods often introduce bias.
Where Pith is reading between the lines
- The approach could be tested on inverse problems outside medical imaging, such as astronomical image reconstruction or remote sensing.
- Slightly relaxed versions of the equivariance condition might still preserve most of the unbiasedness benefit while easing network design.
- Integration with additional physics-based constraints could further improve results on domain-specific tasks.
Load-bearing premise
The new definition of equivariance is sufficient to guarantee that the splitting loss stays unbiased even when only one incomplete observation is available per sample.
What would settle it
A direct numerical check that the self-supervised loss equals the supervised loss for equivariant networks but deviates measurably when the network violates the proposed equivariance condition.
read the original abstract
Self-supervised learning for inverse problems allows to train a reconstruction network from noise and/or incomplete data alone. These methods have the potential of enabling learning-based solutions when obtaining ground-truth references for training is expensive or even impossible. In this paper, we propose a new self-supervised learning strategy devised for the challenging setting where measurements are observed via a single incomplete observation model. We introduce a new definition of equivariance in the context of reconstruction networks, and show that the combination of self-supervised splitting losses and equivariant reconstruction networks results in unbiased estimates of the supervised loss. Through a series of experiments on image inpainting, accelerated magnetic resonance imaging, sparse-view computed tomography, and compressive sensing, we demonstrate that the proposed loss achieves state-of-the-art performance in settings with highly rank-deficient forward models. The code is available at https://github.com/vsechaud/Equivariant-Splitting
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces a new definition of equivariance tailored to reconstruction networks and claims that combining it with self-supervised splitting losses yields unbiased estimates of the supervised loss under a single incomplete observation model. Experiments on image inpainting, accelerated MRI, sparse-view CT, and compressive sensing demonstrate state-of-the-art performance for highly rank-deficient forward models, with code released.
Significance. If the unbiasedness result holds, the work would meaningfully extend self-supervised learning to inverse problems with limited or no ground-truth data, particularly where measurements are severely incomplete. Reproducibility via the linked GitHub repository is a positive factor.
major comments (2)
- [§3 and §4] §3 (Method) and §4 (Theory): The proof that the stated equivariance definition implies E[L_split(f_θ, y)] = E[L_sup(f_θ, x)] exactly is not visible in sufficient detail. The derivation must explicitly show how cross terms arising from the rank-deficient measurement operator A cancel under the new equivariance; without this step the central unbiasedness claim remains unverified for arbitrary A.
- [Definition 1] Definition 1 (equivariance for reconstruction networks): Clarify whether the definition encodes invariance under the null-space components of A. If it is only of the form f_θ(P y) = P f_θ(y), it is not immediate that this commutes with the splitting operator in a manner guaranteeing unbiasedness for every rank-deficient forward model.
minor comments (2)
- [Table 2] Table 2 (quantitative results): Include standard deviations across multiple random seeds or runs to substantiate the SOTA claims.
- [Figure 3] Figure 3 (qualitative examples): Add error maps or difference images to make visual comparisons with baselines more precise.
Simulated Author's Rebuttal
We thank the referee for their detailed and constructive feedback on our manuscript. We address each of the major comments below and have made revisions to improve the clarity of the theoretical contributions.
read point-by-point responses
-
Referee: [§3 and §4] §3 (Method) and §4 (Theory): The proof that the stated equivariance definition implies E[L_split(f_θ, y)] = E[L_sup(f_θ, x)] exactly is not visible in sufficient detail. The derivation must explicitly show how cross terms arising from the rank-deficient measurement operator A cancel under the new equivariance; without this step the central unbiasedness claim remains unverified for arbitrary A.
Authors: We appreciate the referee highlighting the need for greater explicitness in the derivation. The original Section 4 presented the main steps of the unbiasedness argument, but we agree that the cancellation of cross terms induced by the rank-deficient operator A merits a more granular treatment. In the revised manuscript we have expanded Section 4.2 with a complete, line-by-line derivation: we begin from the definition of the splitting loss, substitute the measurement model y = A x, invoke the new equivariance property to replace f_θ(y) by its projection onto the range of A, and show that all inner-product terms involving the orthogonal complement of range(A) vanish in expectation. The resulting identity E[L_split(f_θ, y)] = E[L_sup(f_θ, x)] therefore holds for any linear A, including those with large null spaces. revision: yes
-
Referee: [Definition 1] Definition 1 (equivariance for reconstruction networks): Clarify whether the definition encodes invariance under the null-space components of A. If it is only of the form f_θ(P y) = P f_θ(y), it is not immediate that this commutes with the splitting operator in a manner guaranteeing unbiasedness for every rank-deficient forward model.
Authors: We thank the referee for requesting this clarification. Definition 1 is not restricted to the projector form f_θ(P y) = P f_θ(y). It requires that the network output remain unchanged when an arbitrary component from the null space of A is added to its input, i.e., f_θ(y + N z) = f_θ(y) for any z, where N is a basis for null(A). This stronger invariance is precisely what ensures commutation with the splitting operator: because each split measurement is formed by masking in the range of A, the null-space component is invisible to the loss and is therefore automatically preserved by the equivariant network. We have inserted a short remark immediately after Definition 1 that spells out this commutation and its consequence for arbitrary rank-deficient A. revision: yes
Circularity Check
No significant circularity; derivation relies on explicit new definition and mathematical proof
full rationale
The paper introduces a new definition of equivariance tailored to reconstruction networks and derives that self-supervised splitting losses yield unbiased estimates of the supervised loss when this equivariance holds. This follows directly from the stated definition and the expectation over the incomplete observation model without reducing to a fitted parameter, self-referential assumption, or prior self-citation that bears the load. The abstract and claimed results present the unbiasedness as a consequence of the definition rather than by construction or renaming; experiments on inpainting, MRI, CT, and compressive sensing provide external validation. No load-bearing step collapses to its own inputs.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We introduce a new definition of equivariance in the context of reconstruction networks, and show that the combination of self-supervised splitting losses and equivariant reconstruction networks results in unbiased estimates of the supervised loss.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem 1. ... if the matrix Q_A1 ≜ E_g|A1 {(A T_g)^T A T_g} has full rank ... then f^*(y1,A1) = E_{x|y1,A1}{x}.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.