RelativeFlow: Taming Medical Image Denoising Learning with Noisy Reference
Pith reviewed 2026-05-10 09:13 UTC · model grok-4.3
The pith
RelativeFlow decomposes medical image denoising into relative flows between noisy versions to reach a unified high-quality target.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
RelativeFlow reformulates flow matching by decomposing the absolute noise-to-clean mapping into relative noisier-to-noisy mappings, realized through consistent transport (CoT) that constrains the relative flows to compose progressively into a unified absolute flow and through a simulation-based velocity field (SVF) constructed from modality-specific degradation operators. This enables the framework to learn from heterogeneous noisy references and drive inputs from arbitrary quality levels toward a single high-quality target, outperforming existing simulated-supervised and self-supervised methods on CT and MR denoising tasks.
What carries the argument
Consistent transport (CoT), a displacement map that forces relative noisier-to-noisy flows to compose into one unified absolute flow, together with a simulation-based velocity field (SVF) built from modality-specific degradation operators to define the direction and speed of each step.
If this is right
- The method can train directly on whatever noisy references exist without treating them as clean targets, avoiding both suboptimal convergence and reference bias.
- A single model can improve images that start at widely different quality levels because all paths are driven toward the same unified high-quality endpoint.
- Modality-specific degradation operators in the velocity field allow the same framework to handle CT and MR data without separate retraining pipelines.
- By avoiding restrictive assumptions about noise statistics, the approach applies to realistic clinical acquisitions where noise distributions are unknown or mixed.
Where Pith is reading between the lines
- If the composition of relative flows holds, the same decomposition idea could be tested on denoising tasks outside medicine where only noisy references exist, such as astronomical imaging or sensor data.
- Adapting the simulation-based velocity field to additional modalities would allow the method to scale to new imaging types without collecting fresh clean datasets.
- The flow perspective suggests that other generative models could be redesigned to treat noisy references as intermediate states rather than fixed targets.
Load-bearing premise
The assumption that relative mappings between noisy images can be constrained so they reliably add up to one overall mapping toward a clean image, and that the velocity field constructed from degradation operators will work across scan types without additional adjustments.
What would settle it
Apply the trained model to a held-out test set that does contain known clean ground-truth images and check whether the denoised outputs match those ground truths more closely than outputs from prior methods, or verify whether the sequence of relative flow steps actually reaches the expected quality level when composed.
Figures
read the original abstract
Medical image denoising (MID) lacks absolutely clean images for supervision, leading to a noisy reference problem that fundamentally limits denoising performance. Existing simulated-supervised discriminative learning (SimSDL) and simulated-supervised generative learning (SimSGL) treat noisy references as clean targets, causing suboptimal convergence or reference-biased learning, while self-supervised learning (SSL) imposes restrictive noise assumptions that are seldom satisfied in realistic MID scenarios. We propose \textbf{RelativeFlow}, a flow matching framework that learns from heterogeneous noisy references and drives inputs from arbitrary quality levels toward a unified high-quality target. RelativeFlow reformulates flow matching by decomposing the absolute noise-to-clean mapping into relative noisier-to-noisy mappings, and realizes this formulation through two key components: 1) consistent transport (CoT), a displacement map that constrains relative flows to be components of and progressively compose a unified absolute flow, and 2) simulation-based velocity field (SVF), which constructs a learnable velocity field using modality-specific degradation operators to support different medical imaging modalities. Extensive experiments on Computed Tomography (CT) and Magnetic Resonance (MR) denoising demonstrate that RelativeFlow significantly outperforms existing methods, taming MID with noisy references.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes RelativeFlow, a flow matching framework for medical image denoising that reformulates the problem by decomposing absolute noise-to-clean mappings into relative noisier-to-noisy mappings. It introduces Consistent Transport (CoT) to constrain relative flows as components that progressively compose a unified absolute flow, and Simulation-based Velocity Field (SVF) constructed via modality-specific degradation operators. The central claim is that this tames the noisy reference problem in MID, enabling learning from heterogeneous noisy references and significantly outperforming SimSDL, SimSGL, and SSL methods on CT and MR data.
Significance. If the composition property of relative flows holds and the empirical gains are robust, the work offers a principled way to handle the absence of clean references in medical imaging without restrictive noise assumptions, which could improve practical denoising pipelines. The explicit decomposition into relative mappings and the modality-aware SVF construction represent a clear conceptual advance over treating noisy references as clean targets.
major comments (3)
- [Abstract] Abstract: the claim that CoT 'constrains relative flows to be components of and progressively compose a unified absolute flow' is load-bearing for the central contribution, yet no equation or derivation is supplied showing that the learned SVF on heterogeneous pairs yields the same terminal distribution as direct absolute transport or that path consistency is preserved when noise levels are not strictly ordered.
- [Experiments] Experiments: the assertion of significant outperformance on CT and MR denoising rests on high-level statements without visible error bars, ablation tables, or statistical tests, making it impossible to verify whether the reported gains exceed those of ordinary flow matching on noisy pairs.
- [Method] Method: the SVF construction is said to support different modalities via degradation operators, but no analysis or constraint is given to guarantee generalization across CT and MR without modality-specific retraining or additional regularization, which directly affects the cross-modality claims.
minor comments (2)
- [Abstract] Abstract: the inline definitions of CoT and SVF are clear but would benefit from a short enumerated list of the two components to improve scannability.
- [Abstract] Notation: the distinction between 'relative noisier-to-noisy mappings' and the final 'unified high-quality target' could be clarified with a simple diagram or one-line recurrence relation.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on RelativeFlow. The comments identify important areas for clarification and strengthening, particularly around mathematical rigor, experimental validation, and modality handling. We address each major comment point-by-point below, with proposed revisions where the manuscript can be improved without misrepresenting the work.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that CoT 'constrains relative flows to be components of and progressively compose a unified absolute flow' is load-bearing for the central contribution, yet no equation or derivation is supplied showing that the learned SVF on heterogeneous pairs yields the same terminal distribution as direct absolute transport or that path consistency is preserved when noise levels are not strictly ordered.
Authors: We acknowledge that the abstract is high-level and that an explicit derivation for heterogeneous noisy pairs would strengthen the presentation. Section 3.2 defines CoT as a displacement map enforcing composition: for pairs with noise levels satisfying σ_i > σ_j, the relative velocity integrates to the absolute transport map under the SVF. However, the manuscript does not include a full proof for non-monotonic noise ordering or terminal distribution equivalence across arbitrary heterogeneous references. We will add a concise derivation (new paragraph in Section 3.2 and a short appendix) showing that, given consistent degradation operators, the composed relative paths converge to the same high-quality terminal distribution as direct absolute flow matching. Path consistency for non-strictly ordered levels will be addressed by noting that CoT implicitly sorts via pairwise consistency during training. revision: yes
-
Referee: [Experiments] Experiments: the assertion of significant outperformance on CT and MR denoising rests on high-level statements without visible error bars, ablation tables, or statistical tests, making it impossible to verify whether the reported gains exceed those of ordinary flow matching on noisy pairs.
Authors: The referee is correct that the current experimental reporting lacks sufficient statistical detail to allow independent verification of the gains over baselines including standard flow matching on noisy pairs. We will revise the Experiments section to include: (i) error bars showing mean ± std over 5 random seeds for all metrics, (ii) an expanded ablation table directly comparing RelativeFlow against vanilla flow matching trained on the same noisy-reference pairs, and (iii) paired t-test p-values for the reported improvements on both CT and MR datasets. These additions will be placed in the main text and supplementary material. revision: yes
-
Referee: [Method] Method: the SVF construction is said to support different modalities via degradation operators, but no analysis or constraint is given to guarantee generalization across CT and MR without modality-specific retraining or additional regularization, which directly affects the cross-modality claims.
Authors: The manuscript does not claim zero-shot cross-modality generalization. SVF is explicitly constructed with modality-specific degradation operators (Section 3.3), and all reported experiments train separate models for CT and MR. The framework's contribution is that the same relative-flow formulation and CoT mechanism apply once the appropriate operator is supplied. We will add a clarifying paragraph in Section 3.3 and the discussion section stating that modality-specific retraining is required and that no cross-modality transfer without adaptation is demonstrated. This removes any ambiguity about cross-modality claims while preserving the unified methodological advance. revision: partial
Circularity Check
No significant circularity in RelativeFlow derivation
full rationale
The abstract and available description present RelativeFlow as a proposed reformulation of flow matching that introduces CoT and SVF as new components to handle noisy references. No equations or steps are shown that reduce the claimed unified absolute flow or performance gains to a fitted parameter, self-citation chain, or definitional tautology. The decomposition is presented as a modeling choice rather than derived from prior results by the same authors in a load-bearing way. The framework is self-contained against external benchmarks like CT/MR experiments, with no evidence of the central claim collapsing to its inputs by construction.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Relative noisier-to-noisy mappings can be constrained by a displacement map to compose into a single absolute noise-to-clean flow.
- domain assumption Modality-specific degradation operators can be used to construct a learnable velocity field that supports arbitrary medical imaging modalities.
invented entities (2)
-
Consistent Transport (CoT)
no independent evidence
-
Simulation-based Velocity Field (SVF)
no independent evidence
Reference graph
Works this paper leans on
-
[1]
A closer look at memorization in deep networks
Devansh Arpit, Stanislaw Jastrzebski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, and Simon Lacoste-Julien. A closer look at memorization in deep networks. InInternational Conference on Machine Learn- ing, pages 233–242. PMLR, 2017. 2
work page 2017
-
[2]
Noise2self: Blind denoising by self-supervision
Joshua Batson and Loic Royer. Noise2self: Blind denoising by self-supervision. InInternational Conference on Machine Learning, pages 524–533. PMLR, 2019. 2, 5, 13
work page 2019
-
[3]
De-noising of contrast-enhanced mri se- quences by an ensemble of expert deep neural networks
Ariel Benou, Ronel Veksler, Alon Friedman, and Tammy Riklin-Raviv. De-noising of contrast-enhanced mri se- quences by an ensemble of expert deep neural networks. InInternational Workshop on Large-Scale Annotation of Biomedical Data and Expert Label Synthesis, pages 95–110. Springer, 2016. 2
work page 2016
-
[4]
Removing structured noise with self-supervised blind-spot networks
Coleman Broaddus, Alexander Krull, Martin Weigert, Uwe Schmidt, and Gene Myers. Removing structured noise with self-supervised blind-spot networks. InIEEE International Symposium on Biomedical Imaging, pages 159–163. IEEE,
-
[5]
Hu Chen, Yi Zhang, Mannudeep K Kalra, Feng Lin, Yang Chen, Peixi Liao, Jiliu Zhou, and Ge Wang. Low-dose ct with a residual encoder-decoder convolutional neural net- work.IEEE Transactions on Medical Imaging, 36(12):2524– 2535, 2017. 2, 5, 13
work page 2017
-
[6]
Hyungjin Chung, Eunha Lee, and Jong Chul Ye. Mr im- age denoising and super-resolution using regularized reverse diffusion.IEEE Transactions on Medical Imaging, 42(4): 922–934, 2023. 5, 13
work page 2023
-
[7]
Blake E Dewey, Chenyu Zhao, Jacob C Reinhold, Aaron Carass, Kathryn C Fitzgerald, Peter A Calabresi, Pascal Sati, Jiwon Oh, Dzung L Pham, Daniel S Reich, and Jerry L Prince. Deep learning for the harmonization of structural MRI scans: A survey.Magnetic Resonance Imaging, 80: 8–23, 2021. 1
work page 2021
-
[8]
Diffusion models beat gans on image synthesis
Prafulla Dhariwal and Alexander Nichol. Diffusion models beat gans on image synthesis. InAdvances in Neural Infor- mation Processing Systems, pages 8780–8794, 2021. 5, 13
work page 2021
-
[9]
Patch2self: Denoising diffusion mri with self- supervised learning
Shreyas Fadnavis, Joshua Batson, and Eleftherios Gary- fallidis. Patch2self: Denoising diffusion mri with self- supervised learning. InAdvances in Neural Information Pro- cessing Systems, pages 16293–16303, 2020. 2
work page 2020
-
[10]
Qi Gao, Zilong Li, Junping Zhang, Yi Zhang, and Hong- ming Shan. Corediff: Contextual error-modulated general- ized diffusion model for low-dose ct denoising and gener- alization.IEEE Transactions on Medical Imaging, 43(2): 745–759, 2024. 2, 3
work page 2024
-
[11]
Hydranet: a multi-branch convolutional neural network architecture for mri denoising
Stephen Gregory, Hu Cheng, Sharlene D Newman, and Yu Gan. Hydranet: a multi-branch convolutional neural network architecture for mri denoising. InMedical Imaging 2021: Image Processing, page 115962H. SPIE, 2021. 2
work page 2021
-
[12]
The rician distribu- tion of noisy mri data.Magnetic resonance in medicine, 34 (6):910–914, 1995
H ´akon Gudbjartsson and Samuel Patz. The rician distribu- tion of noisy mri data.Magnetic resonance in medicine, 34 (6):910–914, 1995. 1, 2, 5
work page 1995
-
[13]
Toward convolutional blind denoising of real pho- tographs
Shi Guo, Zifei Yan, Kai Zhang, Wangmeng Zuo, and Lei Zhang. Toward convolutional blind denoising of real pho- tographs. InIEEE Conference on Computer Vision and Pat- tern Recognition, pages 1712–1722, 2019. 2
work page 2019
-
[14]
Denoising dif- fusion probabilistic models
Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising dif- fusion probabilistic models. InAdvances in Neural Informa- tion Processing Systems, pages 6840–6851, 2020. 3
work page 2020
-
[15]
Ixi dataset.https://brain- development.org/ixi-dataset/, 2012
Imperial College London. Ixi dataset.https://brain- development.org/ixi-dataset/, 2012. Informa- tion eXtraction from Images. 5
work page 2012
-
[16]
Bowen Jiang, Tao Yue, and Xue-mei Hu. Thermal noise re- moval of magnetic resonance images: A deep learning ap- proach based on an attentive residue multi-dilated network with adaptive filtering and discrete cosine transform. InIEEE International Joint Conference on Neural Networks, pages 1–8. IEEE, 2023. 2
work page 2023
-
[17]
Jie Jing, Wenjun Xia, Mingzheng Hou, Hu Chen, Yan Liu, Jiliu Zhou, and Yi Zhang. Training low dose ct denois- ing network without high quality reference data.Physics in Medicine & Biology, 67(9):095008, 2022. 2
work page 2022
-
[18]
Dong-Wook Kim, Jae Ryun Chung, and Seung-Won Jung. Grdn: Grouped residual dense network for real image denoising and gan-based real-world noise modeling. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 1913–1921, 2019. 2
work page 1913
-
[19]
Noise2void - learning denoising from single noisy images
Alexander Krull, Tim-Oliver Buchholz, and Florian Jug. Noise2void - learning denoising from single noisy images. InIEEE Conference on Computer Vision and Pattern Recog- nition, pages 2129–2137, 2019. 2
work page 2019
-
[20]
Noise2noise: Learning image restoration without clean data
Jaakko Lehtinen, Jacob Munkberg, Jon Hasselgren, Samuli Laine, Tero Karras, Miika Aittala, and Timo Aila. Noise2noise: Learning image restoration without clean data. InInternational Conference on Machine Learning, pages 2965–2974. PMLR, 2018. 2
work page 2018
-
[21]
Meng Li, William Hsu, Xiaodong Xie, Jason Cong, and Wen Gao. Sacnn: Self-attention convolutional neural network for low-dose ct denoising with self-supervised perceptual loss network.IEEE Transactions on Medical Imaging, 39(7): 2289–2301, 2020. 2
work page 2020
-
[22]
Swinir: Image restoration using swin transformer
Jingyun Liang, Jie Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration using swin transformer. InIEEE/CVF International Con- ference on Computer Vision Workshops, pages 1833–1844,
-
[23]
Feiyang Liao, Yufei Tang, Qiang Du, Jiping Wang, Ming Li, and Jian Zheng. Domain progressive low-dose ct imaging using iterative partial diffusion model.IEEE Transactions on Medical Imaging, 43(12):4449–4461, 2024. 2, 3, 5, 13
work page 2024
-
[24]
Flow Matching for Generative Modeling
Yaron Lipman, Ricky TQ Chen, Heli Ben-Hamu, Maximil- ian Nickel, and Matt Le. Flow matching for generative mod- eling.arXiv preprint arXiv:2210.02747, 2022. 2, 3, 5, 8, 13
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[25]
Chuang Niu, Mengzhou Li, Fenglei Fan, Weiwen Wu, Xi- aodong Guo, Qing Lyu, and Ge Wang. Noise suppression with similarity-based self-supervised deep learning.IEEE Transactions on Medical Imaging, 41(11):3142–3152, 2022. 2, 5, 13
work page 2022
-
[26]
Quantification of heterogeneity ob- served in medical images.BMC Medical Imaging, 13(1):7,
JPB O’Connor, CJ Rose, JC Waterton, RAD Carano, GJM Parker, and A Jackson. Quantification of heterogeneity ob- served in medical images.BMC Medical Imaging, 13(1):7,
-
[27]
Pazhou Laboratory (Huangpu). 2024 greater bay area (huangpu) international algorithm competition: Low-dose ct denoising challenge dataset.https : / / iacc . pazhoulab-huangpu.com/, 2024. Third Greater Bay Area International Algorithm Competition. 5
work page 2024
-
[28]
U- net: Convolutional networks for biomedical image segmen- tation
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U- net: Convolutional networks for biomedical image segmen- tation. InInternational Conference on Medical image com- puting and computer-assisted intervention, pages 234–241. Springer, 2015. 5, 13
work page 2015
-
[29]
U- net: Convolutional networks for biomedical image segmen- tation
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U- net: Convolutional networks for biomedical image segmen- tation. InInternational Conference on Medical Image Com- puting and Computer-Assisted Intervention, pages 234–241. Springer, 2015. 2
work page 2015
-
[30]
Hwanjun Song, Minseok Kim, Dongmin Park, Yooju Shin, and Jae-Gil Lee. Learning from noisy labels with deep neural networks: A survey.IEEE Transactions on Neural Networks and Learning Systems, 34(11):8135–8153, 2023. 1, 2
work page 2023
-
[31]
Denois- ing diffusion implicit models
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denois- ing diffusion implicit models. InInternational Conference on Learning Representations, 2021. 5, 8, 13
work page 2021
-
[32]
Score-based generative modeling through stochastic differential equa- tions
Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Ab- hishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equa- tions. InInternational Conference on Learning Represen- tations, 2021. 3
work page 2021
-
[33]
Solv- ing inverse problems in medical imaging with score-based generative models
Yang Song, Liyue Shen, Lei Xing, and Stefano Ermon. Solv- ing inverse problems in medical imaging with score-based generative models. InInternational Conference on Learning Representations, 2022. 2, 3
work page 2022
-
[34]
Utku Soylu and Michael L Oelze. Calibrating data mis- matches in deep learning-based quantitative ultrasound using setting transfer functions.arXiv preprint arXiv:2210.01958,
-
[35]
Mem- net: A persistent memory network for image restoration
Ying Tai, Jian Yang, Xiaoming Liu, and Chunyan Xu. Mem- net: A persistent memory network for image restoration. In IEEE International Conference on Computer Vision, pages 4539–4547, 2017. 2
work page 2017
-
[36]
Qiyuan Tian, Ziyu Li, Qiuyun Fan, Jonathan R Polimeni, Berkin Bilgic ¸, David H Salat, and Susie Y Huang. Sdndti: Self-supervised deep learning-based denoising for diffusion tensor mri.NeuroImage, 253:119033, 2022. 2
work page 2022
-
[37]
Prabhat Chandra Tripathi and Soumya Bag. Cnn-dmri: A convolutional neural network for denoising of magnetic res- onance images.Pattern Recognition Letters, 135:57–63,
-
[38]
Ddm2: Self-supervised diffusion mri denoising with generative diffusion models
Tiange Xiang, Mahmut Yurt, Ali B Syed, Kawin Setsompop, and Akshay Chaudhari. Ddm2: Self-supervised diffusion mri denoising with generative diffusion models. InInternational Conference on Learning Representations, 2023. 2, 3
work page 2023
-
[39]
Deformed2self: Self- supervised denoising for dynamic medical imaging
Junshen Xu and Elfar Adalsteinsson. Deformed2self: Self- supervised denoising for dynamic medical imaging. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 25–35. Springer,
-
[40]
Qingsong Yang, Pingkun Yan, Yanbo Zhang, Hengyong Yu, Yongyi Shi, Xuanqin Mou, Mannudeep K Kalra, Yi Zhang, Ling Sun, and Ge Wang. Low-dose ct image denoising using a generative adversarial network with wasserstein distance and perceptual loss.IEEE Transactions on Medical Imaging, 37(6):1348–1357, 2018. 2
work page 2018
-
[41]
Variational denoising network: Toward blind noise modeling and removal
Zongsheng Yue, Hongwei Yong, Qian Zhao, Lei Zhang, and Deyu Meng. Variational denoising network: Toward blind noise modeling and removal. InAdvances in Neural Infor- mation Processing Systems, pages 1690–1701, 2019. 2
work page 2019
-
[42]
Dong Zeng, Jing Huang, Zhaoying Bian, Shanzhou Niu, Hua Zhang, Qianjin Feng, Zhengrong Liang, and Jianhua Ma. A simple low-dose x-ray ct simulation from high-dose scan.IEEE transactions on nuclear science, 62(5):2226– 2233, 2015. 1, 2, 5
work page 2015
-
[43]
Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising.IEEE Transactions on Image Processing, 26(7):3142–3155, 2017. 1, 2
work page 2017
-
[44]
Ffdnet: Toward a fast and flexible solution for cnn-based image denoising
Kai Zhang, Wangmeng Zuo, and Lei Zhang. Ffdnet: Toward a fast and flexible solution for cnn-based image denoising. IEEE Transactions on Image Processing, 27(9):4608–4622,
-
[45]
Cfnet: Conditional filter learning with dynamic noise estimation for real image denoising
Yifan Zuo, Jiacheng Xie, Yuming Fang, Yan Huang, and Wenhui Jiang. Cfnet: Conditional filter learning with dynamic noise estimation for real image denoising. Knowledge-Based Systems, 258:110004, 2022. 2 Supplementary Materials A. Mathematical Derivations of RelativeFlow Frame- work A.1 CoT Component Property A.2 CoT Composition Property A.3 Simulation-bas...
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.