pith. sign in

arxiv: 2604.15651 · v1 · submitted 2026-04-17 · 💻 cs.CV

SPLIT: Self-supervised Partitioning for Learned Inversion in Nonlinear Tomography

Pith reviewed 2026-05-10 08:20 UTC · model grok-4.3

classification 💻 cs.CV
keywords self-supervisednonlinearsplitimagesreconstructiontomographyground-truthinverse
0
0 comments X

The pith

SPLIT partitions projection data to enforce cross-consistency and measurement fidelity, proving that its self-supervised objective matches supervised training in expectation under mild conditions, with strong results on sparse-view multispectral CT.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Tomography creates images from projections like CT scans, but nonlinear relationships, noise, and missing views make reconstruction hard. Supervised machine learning usually needs many pairs of raw projections and perfect reference images, which are rarely available. SPLIT splits the measured projections into multiple partitions and trains a neural network so that the images reconstructed from each partition are consistent with one another while also matching the original measurements. This cross-partition agreement serves as a built-in training signal. The authors show mathematically that, under mild conditions, this self-supervised loss is equivalent on average to the loss you would get if you had the ground-truth images. They add an automatic stopping rule based on a no-reference image quality measure. Applied to multispectral computed tomography with sparse views, the method produces reconstructions that are more robust to noise and higher quality than classical iterative methods and recent self-supervised baselines.

Core claim

Our main theoretical result shows that, under mild conditions, the proposed self-supervised objective is equivalent to its supervised counterpart in expectation.

Load-bearing premise

The mild conditions under which the self-supervised objective equals the supervised one in expectation, plus the assumption that data partitions provide complementary information without introducing systematic bias or violating measurement fidelity.

Figures

Figures reproduced from arXiv: 2604.15651 by Gyeongha Hwang, Lukas Neumann, Markus Haltmeier, Nadja Gruber.

Figure 1
Figure 1. Figure 1: • Training: For the U-net we use the implementation from https://github.com/milesial/Pytorch-UNet. Training uses Adam with learning rate 10−4 , batch size 1, for up to 15,000 epochs. For early stopping we monitor the sum (27) of the estimated PSNRs across materials and halt training when this sum attains its maximum. We used the same U-net model for all learned methods [PITH_FULL_IMAGE:figures/full_fig_p0… view at source ↗
Figure 2
Figure 2. Figure 2: Multispectral CT reconstructions from Nθ = 145 projection angles. Rows and columns as in [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Multispectral CT reconstructions from Nθ = 29 projection angles. Rows and columns as in [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗
read the original abstract

Machine learning has achieved impressive performance in tomographic reconstruction, but supervised training requires paired measurements and ground-truth images that are often unavailable. This has motivated self-supervised approaches, which have primarily addressed denoising and, more recently, linear inverse problems. We address nonlinear inverse problems and introduce SPLIT (Self-supervised Partitioning for Learned Inversion in Nonlinear Tomography), a self-supervised machine-learning framework for reconstructing images from nonlinear, incomplete, and noisy projection data without any samples of ground-truth images. SPLIT enforces cross-partition consistency and measurement-domain fidelity while exploiting complementary information across multiple partitions. Our main theoretical result shows that, under mild conditions, the proposed self-supervised objective is equivalent to its supervised counterpart in expectation. We regularize training with an automatic stopping rule that halts optimization when a no-reference image-quality surrogate saturates. As a concrete application, we derive SPLIT variants for multispectral computed tomography. Experiments on sparse-view acquisitions demonstrate high reconstruction quality and robustness to noise, surpassing classical iterative reconstruction and recent self-supervised baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces SPLIT, a self-supervised framework for nonlinear tomographic reconstruction that partitions measurements to enforce cross-partition consistency and measurement-domain fidelity without requiring ground-truth images. The central theoretical claim is that, under mild conditions, the resulting self-supervised objective is equivalent in expectation to its supervised counterpart. The approach is instantiated for multispectral CT, includes an automatic stopping rule based on a no-reference image-quality surrogate, and is evaluated on sparse-view acquisitions where it outperforms classical iterative reconstruction and recent self-supervised baselines.

Significance. If the equivalence result holds under the stated conditions, the work would meaningfully advance self-supervised learning for nonlinear inverse problems, where paired data are typically unavailable. The combination of a partition-based consistency loss with fidelity terms and the automatic stopping criterion offers a practical training recipe. The empirical robustness to noise on sparse-view multispectral data is a concrete strength that supports applicability in medical imaging settings.

major comments (2)
  1. [§3 (Theoretical Analysis), Theorem 1] §3 (Theoretical Analysis), Theorem 1 and surrounding derivation: the claim that the self-supervised objective equals the supervised risk in expectation must explicitly cancel bias terms arising from the nonlinear forward operator (e.g., exponential Radon or polychromatic attenuation) interacting with partition-induced measurement inconsistencies. The current sketch does not show how the mild conditions guarantee that cross-partition expectations recover the full supervised loss without residual dependence; a counter-example or expanded proof step addressing nonlinearity would be required.
  2. [§5 (Experiments), Table 2] §5 (Experiments), Table 2 and associated text: the reported superiority over self-supervised baselines lacks an ablation isolating the effect of the nonlinear forward-model term versus the partitioning strategy itself. Without this control, it is unclear whether performance gains are attributable to the claimed equivalence or to favorable partition choices that may not generalize.
minor comments (2)
  1. [Abstract] The abstract states the equivalence holds “under mild conditions” but does not list them; adding a one-sentence enumeration would improve accessibility.
  2. [§2 (Preliminaries)] Notation for the partition operator and the nonlinear measurement model is introduced without a consolidated table; a short notation summary would reduce reader effort.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We address each major point below, clarifying our theoretical claims and strengthening the experimental analysis as requested.

read point-by-point responses
  1. Referee: [§3 (Theoretical Analysis), Theorem 1] §3 (Theoretical Analysis), Theorem 1 and surrounding derivation: the claim that the self-supervised objective equals the supervised risk in expectation must explicitly cancel bias terms arising from the nonlinear forward operator (e.g., exponential Radon or polychromatic attenuation) interacting with partition-induced measurement inconsistencies. The current sketch does not show how the mild conditions guarantee that cross-partition expectations recover the full supervised loss without residual dependence; a counter-example or expanded proof step addressing nonlinearity would be required.

    Authors: We appreciate the referee highlighting the need for greater explicitness in handling nonlinearity. Theorem 1 establishes equivalence under the stated mild conditions (Lipschitz continuity of the nonlinear operator and balanced partition expectations). To address the concern, we will expand the proof in the revised manuscript with an additional derivation step that explicitly expands the expectation over partitions and shows cancellation of any bias terms induced by the nonlinear forward model (e.g., via the law of total expectation applied to the polychromatic attenuation). We will also include a short discussion confirming the absence of residual dependence and, if space permits, a brief illustrative counter-example under violated conditions to underscore the necessity of the assumptions. These additions will appear in an extended §3 and a new appendix. revision: yes

  2. Referee: [§5 (Experiments), Table 2] §5 (Experiments), Table 2 and associated text: the reported superiority over self-supervised baselines lacks an ablation isolating the effect of the nonlinear forward-model term versus the partitioning strategy itself. Without this control, it is unclear whether performance gains are attributable to the claimed equivalence or to favorable partition choices that may not generalize.

    Authors: We agree that an explicit ablation would better isolate the contributions. In the revised manuscript we will augment the experimental section with a new ablation study on the multispectral CT data. This will compare (i) full SPLIT (nonlinear forward model + partitioning), (ii) a linear-approximation variant of SPLIT, and (iii) a partitioning-only baseline without the measurement-fidelity term. Results will be added to an extended Table 2 together with quantitative metrics and a short discussion of generalization across random and structured partition choices. This will clarify that the observed gains arise from the combination of the equivalence result and the nonlinear modeling rather than partition selection alone. revision: yes

Circularity Check

0 steps flagged

No circularity in theoretical equivalence claim

full rationale

The paper's main theoretical result asserts equivalence between the self-supervised objective and its supervised counterpart in expectation under mild conditions. The abstract presents this as a derived result without any indication that the self-supervised loss is constructed by definition from the supervised loss, that parameters are fitted to a subset and then renamed as predictions, or that the proof relies on self-citations or imported uniqueness theorems. No equations or derivation steps are visible in the provided text that reduce the claimed equivalence to an input by construction. The derivation chain is therefore self-contained against external benchmarks, consistent with the default expectation that most papers exhibit no significant circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based on abstract only: the framework rests on the assumption that data partitions yield complementary information and that the self-supervised objective approximates supervised training under unspecified mild conditions. No explicit free parameters or new physical entities are introduced in the abstract.

axioms (1)
  • domain assumption Mild conditions exist under which the self-supervised objective equals the supervised objective in expectation
    Invoked as the basis for the main theoretical result in the abstract.

pith-pipeline@v0.9.0 · 5477 in / 1249 out tokens · 40366 ms · 2026-05-10T08:20:55.445126+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

37 extracted references · 37 canonical work pages · 1 internal anchor

  1. [1]

    Spectral effects on three-dimensional reconstruction from x rays.Medical Physics, 2(6):321–324, 1975

    William D McDavid, Robert G Waggener, William H Payne, and Michael J Dennis. Spectral effects on three-dimensional reconstruction from x rays.Medical Physics, 2(6):321–324, 1975

  2. [2]

    Beam filtration for object-tailored x-ray ct of multi-material cultural heritage objects

    Maximilian B Kiss, Francien G Bossema, Paul JC van Laar, Suzan Meijer, Felix Lucka, Tristan van Leeuwen, and K Joost Batenburg. Beam filtration for object-tailored x-ray ct of multi-material cultural heritage objects. Heritage Science, 11(1):130, 2023

  3. [3]

    Anniversary paper: Devel- opment of x-ray computed tomography: The role of medical physics and aapm from the 1970s to present

    Xiaochuan Pan, Jeffrey Siewerdsen, Patrick J La Riviere, and Willi A Kalender. Anniversary paper: Devel- opment of x-ray computed tomography: The role of medical physics and aapm from the 1970s to present. Medical Physics, 35(8):3728–3739, 2008

  4. [4]

    Correction for beam hardening in computed tomography.Physics in Medicine & Biology, 24(1):81, 1979

    Gabor T Herman. Correction for beam hardening in computed tomography.Physics in Medicine & Biology, 24(1):81, 1979

  5. [5]

    Iterative correction of beam hardening artifacts in ct.Medical Physics, 38(S1):S36–S49, 2011

    Gert Van Gompel, Katrien Van Slambrouck, Michel Defrise, K Joost Batenburg, Johan De Mey, Jan Sijbers, and Johan Nuyts. Iterative correction of beam hardening artifacts in ct.Medical Physics, 38(S1):S36–S49, 2011

  6. [6]

    On analytical solutions to beam-hardening.Sensing and Imaging, 18:1–17, 2017

    G Rigaud. On analytical solutions to beam-hardening.Sensing and Imaging, 18:1–17, 2017

  7. [7]

    Joint image reconstruction method with correlative multi-channel prior for x-ray spectral computed tomography.Inverse Problems, 34(6):064001, 2018

    Daniil Kazantsev, Jakob S Jørgensen, Martin S Andersen, William RB Lionheart, Peter D Lee, and Philip J Withers. Joint image reconstruction method with correlative multi-channel prior for x-ray spectral computed tomography.Inverse Problems, 34(6):064001, 2018

  8. [8]

    Joint reconstruction of multi-channel, spectral ct data via constrained total nuclear variation minimization.Physics in Medicine & Biology, 60(5):1741, 2015

    David S Rigie and Patrick J La Riviere. Joint reconstruction of multi-channel, spectral ct data via constrained total nuclear variation minimization.Physics in Medicine & Biology, 60(5):1741, 2015

  9. [9]

    Nonlinear optimization for mixed attenu- ation polyenergetic image reconstruction.Inverse Problems, 35(6):064004, 2019

    Yunyi Hu, James G Nagy, Jianjun Zhang, and Martin S Andersen. Nonlinear optimization for mixed attenu- ation polyenergetic image reconstruction.Inverse Problems, 35(6):064004, 2019

  10. [10]

    (an overview of) synergistic reconstruc- tion for multimodality/multichannel imaging methods.Philosophical Transactions of the Royal Society A, 379(2200):20200205, 2021

    Simon R Arridge, Matthias J Ehrhardt, and Kris Thielemans. (an overview of) synergistic reconstruc- tion for multimodality/multichannel imaging methods.Philosophical Transactions of the Royal Society A, 379(2200):20200205, 2021. 10

  11. [11]

    Comparison of five one-step reconstruction algorithms for spectral ct.Physics in Medicine & Biology, 63(23):235001, 2018

    Cyril Mory, Bruno Sixou, Salim Si-Mohamed, Loïc Boussel, and Simon Rit. Comparison of five one-step reconstruction algorithms for spectral ct.Physics in Medicine & Biology, 63(23):235001, 2018

  12. [12]

    Quantitative image-based spectral reconstruction for computed tomography

    B Heismann and M Balda. Quantitative image-based spectral reconstruction for computed tomography. Medical Physics, 36(10):4471–4485, 2009

  13. [13]

    Image-based dual energy ct using optimized precor- rection functions: A practical new approach of material decomposition in image domain.Medical Physics, 36(8):3818–3829, 2009

    Clemens Maaß, Matthias Baer, and Marc Kachelrieß. Image-based dual energy ct using optimized precor- rection functions: A practical new approach of material decomposition in image domain.Medical Physics, 36(8):3818–3829, 2009

  14. [14]

    An algorithm for constrained one-step inversion of spectral ct data.Physics in Medicine & Biology, 61(10):3784, 2016

    Rina Foygel Barber, Emil Y Sidky, Taly Gilat Schmidt, and Xiaochuan Pan. An algorithm for constrained one-step inversion of spectral ct data.Physics in Medicine & Biology, 61(10):3784, 2016

  15. [15]

    Regularized material decomposition for k- edge separation in hyperspectral computed tomography

    Francesca Bevilacqua, Yiqiu Dong, and Jakob Sauer Jørgensen. Regularized material decomposition for k- edge separation in hyperspectral computed tomography. InInternational Conference on Scale Space and Variational Methods in Computer Vision, pages 107–119. Springer, 2023

  16. [16]

    Noise2self: Blind denoising by self-supervision

    Joshua Batson and Loic Royer. Noise2self: Blind denoising by self-supervision. InInternational Conference on Machine Learning, pages 524–533. PMLR, 2019

  17. [17]

    Noise2inverse: Self-supervised deep convolutional denoising for tomography.IEEE Transactions on Computational Imaging, 6:1320–1335, 2020

    Allard Adriaan Hendriksen, Daniël Maria Pelt, and K Joost Batenburg. Noise2inverse: Self-supervised deep convolutional denoising for tomography.IEEE Transactions on Computational Imaging, 6:1320–1335, 2020

  18. [18]

    Sparse2inverse: Self-supervised inversion of sparse-view ct data,

    Nadja Gruber, Johannes Schwab, Elke Gizewski, and Markus Haltmeier. Sparse2inverse: Self-supervised inversion of sparse-view ct data.arXiv:2402.16921, 2024

  19. [19]

    Equivariance2Inverse: A Practical Self-Supervised CT Reconstruction Method Benchmarked on Real, Limited-Angle, and Blurred Data

    Dirk Elias Schut, Adriaan Graas, Robert van Liere, and Tristan van Leeuwen. Equivariance2inverse: A practical self-supervised ct reconstruction method benchmarked on real, limited-angle, and blurred data. arXiv preprint arXiv:2510.23317, 2025

  20. [20]

    Proj2proj: self-supervised low-dose ct reconstruction

    Mehmet Ozan Unal, Metin Ertas, and Isa Yildirim. Proj2proj: self-supervised low-dose ct reconstruction. PeerJ Computer Science, 10:e1849, 2024

  21. [21]

    Self-supervised learning of physics-guided reconstruction neural networks without fully sampled reference data.Magnetic resonance in medicine, 84(6):3172–3191, 2020

    Burhaneddin Yaman, Seyed Amir Hossein Hosseini, Steen Moeller, Jutta Ellermann, Kâmil Uğurbil, and Mehmet Akçakaya. Self-supervised learning of physics-guided reconstruction neural networks without fully sampled reference data.Magnetic resonance in medicine, 84(6):3172–3191, 2020

  22. [22]

    Noisier2inverse: Self-supervised learning for image reconstruction with correlated noise.IEEE Access, 13:139445–139459, 2025

    Nadja Gruber, Johannes Schwab, Markus Haltmeier, Ander Biguri, Clemens Dlaska, and Gyeongha Hwang. Noisier2inverse: Self-supervised learning for image reconstruction with correlated noise.IEEE Access, 13:139445–139459, 2025

  23. [23]

    Robust equivariant imaging: a fully unsupervised framework for learning to image from noisy and partial measurements

    Dongdong Chen, Julián Tachella, and Mike E Davies. Robust equivariant imaging: a fully unsupervised framework for learning to image from noisy and partial measurements. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5647–5656, 2022

  24. [24]

    SIAM, 2006

    Timothy A Davis.Direct methods for sparse linear systems. SIAM, 2006

  25. [25]

    Cambridge university press, 2002

    Michael Brin and Garrett Stuck.Introduction to dynamical systems. Cambridge university press, 2002

  26. [26]

    Derivative-free iterative one-step reconstruction for multispectral ct.Journal of Imaging, 10(5):98, 2024

    Thomas Prohaszka, Lukas Neumann, and Markus Haltmeier. Derivative-free iterative one-step reconstruction for multispectral ct.Journal of Imaging, 10(5):98, 2024

  27. [27]

    Multi-energy ct decomposition using convolutional neural networks

    Darin P Clark, Matthew Holbrook, and Cristian T Badea. Multi-energy ct decomposition using convolutional neural networks. InMedical Imaging 2018: Physics of Medical Imaging, volume 10573, pages 415–423. SPIE, 2018

  28. [28]

    Deep- learning-based direct inversion for material decomposition.Medical physics, 47(12):6294–6309, 2020

    Hao Gong, Shengzhen Tao, Kishore Rajendran, Wei Zhou, Cynthia H McCollough, and Shuai Leng. Deep- learning-based direct inversion for material decomposition.Medical physics, 47(12):6294–6309, 2020

  29. [29]

    Material decomposition in spectral ct using deep learning: a sim2real transfer approach.IEEE Access, 9:25632–25647, 2021

    Juan FPJ Abascal, Nicolas Ducros, Valeriya Pronina, Simon Rit, Pierre-Antoine Rodesch, Thomas Broussaud, Suzanne Bussod, Philippe C Douek, Andreas Hauptmann, Simon Arridge, et al. Material decomposition in spectral ct using deep learning: a sim2real transfer approach.IEEE Access, 9:25632–25647, 2021

  30. [30]

    Deep learning based spectral ct imaging

    Weiwen Wu, Dianlin Hu, Chuang Niu, Lieza Vanden Broeke, Anthony PH Butler, Peng Cao, James Atlas, Alexander Chernoglazov, Varut Vardhanabhuti, and Ge Wang. Deep learning based spectral ct imaging. Neural Networks, 144:342–358, 2021. 11

  31. [31]

    Iterative material decomposition for spectral ct using self-supervised noise2noise prior.Physics in medicine & biology, 66(15):155013, 2021

    Wei Fang, Dufan Wu, Kyungsang Kim, Mannudeep K Kalra, Ramandeep Singh, Liang Li, and Quanzheng Li. Iterative material decomposition for spectral ct using self-supervised noise2noise prior.Physics in medicine & biology, 66(15):155013, 2021

  32. [32]

    Improving spectral ct image quality based on channel correlation and self-supervised learning.IEEE Transactions on Computational Imaging, 9:1084–1097, 2023

    Xi Chen, Chaoyang Zhang, Ti Bai, and Shaojie Chang. Improving spectral ct image quality based on channel correlation and self-supervised learning.IEEE Transactions on Computational Imaging, 9:1084–1097, 2023

  33. [33]

    Unsupervised denoising for sparse multi-spectral computed tomography.arXiv preprint arXiv:2211.01159, 2022

    Satu I Inkinen, Mikael AK Brix, Miika T Nieminen, Simon Arridge, and Andreas Hauptmann. Unsupervised denoising for sparse multi-spectral computed tomography.arXiv preprint arXiv:2211.01159, 2022

  34. [34]

    Senas-net: Self-supervised noise and artifact suppression network for material decomposition in spectral ct.IEEE Transactions on Computational Imaging, 10:677–689, 2024

    Xu Ji, Yuchen Lu, Yikun Zhang, Xu Zhuo, Shengqi Kan, Weilong Mao, Gouenou Coatrieux, Jean-Louis Coatrieux, Guotao Quan, Yan Xi, et al. Senas-net: Self-supervised noise and artifact suppression network for material decomposition in spectral ct.IEEE Transactions on Computational Imaging, 10:677–689, 2024

  35. [35]

    Spectral2spectral: Image-spectral similarity assisted spectral ct deep reconstruction without reference.arXiv:2210.01125, 2022

    Peng He, Peng Feng, Xiaodong Guo, Weiwen Wu, Hengyong Yu, et al. Spectral2spectral: Image-spectral similarity assisted spectral ct deep reconstruction without reference.arXiv:2210.01125, 2022

  36. [36]

    Unsupervised denoising in spectral ct: Multi-dimensional u-net for energy channel regularisation.Sensors, 24(20):6654, 2024

    Raziye Kubra Kumrular and Thomas Blumensath. Unsupervised denoising in spectral ct: Multi-dimensional u-net for energy channel regularisation.Sensors, 24(20):6654, 2024

  37. [37]

    U-net: Convolutional networks for biomedical im- age segmentation

    Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical im- age segmentation. InMedical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015. 12