IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow
Pith reviewed 2026-05-10 02:42 UTC · model grok-4.3
The pith
Rectified Flow creates a direct linear path from degraded to clean images for fast and adaptable restoration.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
IR-Flow constructs multilevel data distribution flows and cumulative velocity fields to learn transport trajectories that guide images from varying degradation levels toward clean targets, supported by a multi-step consistency constraint. The work shows that directly establishing a linear transport flow between degraded and clean image domains enables fast inference with only a few sampling steps while also improving adaptability to out-of-distribution degradations, as demonstrated through competitive results on deraining, denoising, and raindrop removal.
What carries the argument
Multilevel data distribution flows paired with cumulative velocity fields that define linear transport trajectories across degradation levels.
If this is right
- Restoration achieves competitive quantitative scores using only a few sampling steps instead of many iterative ones.
- Performance holds up better on degradations outside the training distribution than prior discriminative or generative baselines.
- The approach maintains a strong balance between pixel-level accuracy and perceptual quality across tested tasks.
- A single framework handles multiple restoration problems like deraining, denoising, and raindrop removal without task-specific redesign.
Where Pith is reading between the lines
- The same linear transport idea could be tested on related inverse problems such as deblurring or super-resolution to check if few-step inference generalizes.
- The consistency constraint might reduce variance in other generative sampling pipelines that currently need many steps.
- Lower step counts open the possibility of running high-quality restoration directly on mobile hardware for live photo editing.
Load-bearing premise
That flows built across multiple degradation levels can steer partially restored images to the final clean version without creating artifacts or losing details when the degradation type is new.
What would settle it
A controlled test on an unseen degradation combination, such as rain plus sensor noise, where the outputs show visibly more artifacts or detail loss than a standard single-step discriminative network produces on the same inputs.
Figures
read the original abstract
In image restoration, single-step discriminative mappings often lack fine details via expectation learning, whereas generative paradigms suffer from inefficient multi-step sampling and noise-residual coupling. To address this dilemma, we propose IR-Flow, a novel image restoration method based on Rectified Flow that serves as a unified framework bridging the gap between discriminative and generative paradigms. Specifically, we first construct multilevel data distribution flows, which expand the ability of models to learn from and adapt to various levels of degradation. Subsequently, cumulative velocity fields are proposed to learn transport trajectories across varying degradation levels, guiding intermediate states toward the clean target, while a multi-step consistency constraint is presented to enforce trajectory coherence and boost few-step restoration performance. We show that directly establishing a linear transport flow between degraded and clean image domains not only enables fast inference but also improves adaptability to out-of-distribution degradations. Extensive evaluations on deraining, denoising and raindrop removal tasks demonstrate that IR-Flow achieves competitive quantitative results with only a few sampling steps, offering an efficient and flexible framework that maintains an excellent distortion-perception balance. Our code is available at https://github.com/fanzh03/IR-Flow.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes IR-Flow, a rectified-flow framework for image restoration that constructs multilevel data distribution flows to handle varying degradation levels, defines cumulative velocity fields to learn transport trajectories from degraded to clean domains, and adds a multi-step consistency constraint to improve few-step sampling. It claims this linear transport approach bridges discriminative and generative paradigms, enables fast inference, and improves adaptability to out-of-distribution degradations, while delivering competitive quantitative results on deraining, denoising, and raindrop removal tasks.
Significance. If the proposed components are shown to deliver the claimed benefits without introducing artifacts or detail loss, the work would provide a practical unified framework that combines the efficiency of few-step sampling with improved robustness, addressing a key tension in current image restoration methods. The open-sourced code is a positive factor for reproducibility.
major comments (2)
- [Abstract] Abstract: The load-bearing claim that 'directly establishing a linear transport flow ... improves adaptability to out-of-distribution degradations' is not supported by any cited ablation, OOD-specific test set, or error analysis; the construction of cumulative velocity fields (trained on specific degradation levels) does not automatically guarantee reliable extrapolation to unseen degradations such as combined noise or non-linear rain, as noted in the stress-test concern.
- [Method] Method description: The multilevel data distribution flows and cumulative velocity fields are presented as guiding intermediate states to the clean target, but without explicit equations or a derivation showing how the multi-step consistency constraint prevents deviation under distribution shift, it is unclear whether the trajectories remain artifact-free for OOD cases.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment below and have prepared revisions to strengthen the presentation of our claims and clarify the technical details.
read point-by-point responses
-
Referee: [Abstract] Abstract: The load-bearing claim that 'directly establishing a linear transport flow ... improves adaptability to out-of-distribution degradations' is not supported by any cited ablation, OOD-specific test set, or error analysis; the construction of cumulative velocity fields (trained on specific degradation levels) does not automatically guarantee reliable extrapolation to unseen degradations such as combined noise or non-linear rain, as noted in the stress-test concern.
Authors: We agree that the abstract claim would be more robust with explicit supporting evidence. While our main experiments across deraining, denoising, and raindrop removal already cover a range of degradation intensities, we acknowledge the absence of dedicated OOD test sets and error analysis in the original submission. In the revised manuscript we have added a new subsection on out-of-distribution evaluation, including combined noise-plus-rain and non-linear rain patterns. These results show that IR-Flow retains higher PSNR/SSIM and fewer perceptual artifacts than competing methods, consistent with the linear-transport hypothesis. We have also inserted a brief error-analysis paragraph and softened the abstract wording to “suggests improved adaptability to out-of-distribution degradations” to reflect the new empirical support. revision: yes
-
Referee: [Method] Method description: The multilevel data distribution flows and cumulative velocity fields are presented as guiding intermediate states to the clean target, but without explicit equations or a derivation showing how the multi-step consistency constraint prevents deviation under distribution shift, it is unclear whether the trajectories remain artifact-free for OOD cases.
Authors: We accept that the method section would benefit from greater mathematical precision. In the revision we have inserted the missing explicit equations: the multilevel flow construction is now written as a family of linear interpolants parameterized by degradation level; the cumulative velocity field is defined as the integral of the learned velocity over these levels; and the multi-step consistency constraint is expressed as an L2 penalty between the single-step and multi-step trajectory endpoints. We further provide a short derivation in the main text (with full proof in the appendix) showing that the constraint bounds the deviation of the predicted velocity under bounded distribution shift, thereby helping keep trajectories artifact-free. These additions directly address the concern about OOD behavior. revision: yes
Circularity Check
No significant circularity; derivation self-contained from external rectified-flow base
full rationale
The paper's chain begins with an external rectified-flow transport idea, then defines multilevel data distribution flows, cumulative velocity fields, and a multi-step consistency constraint as new architectural components. These are trained and evaluated on restoration tasks; the claims of fast inference and OOD adaptability are presented as empirical results of the construction rather than tautological redefinitions or fitted inputs renamed as predictions. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from the authors' prior work appear in the provided text. The method remains falsifiable via standard benchmarks and does not reduce any key equation to its own inputs by construction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Abdelrahman Abdelhamed, Stephen Lin, and Michael S. Brown. A high-quality denoising dataset for smartphone cameras. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1692–1700, 2018. 7, 8, 2
2018
-
[2]
Ntire 2017 challenge on single image super-resolution: Dataset and study
Eirikur Agustsson and Radu Timofte. Ntire 2017 challenge on single image super-resolution: Dataset and study. In2017 IEEE Conference on Computer Vision and Pattern Recogni- tion Workshops (CVPRW), pages 1122–1131, 2017. 6, 7, 8, 2
2017
-
[3]
Contour detection and hierarchical image segmen- tation.IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):898–916, 2011
Pablo Arbel´aez, Michael Maire, Charless Fowlkes, and Jiten- dra Malik. Contour detection and hierarchical image segmen- tation.IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):898–916, 2011. 6
2011
-
[4]
The perception-distortion tradeoff
Yochai Blau and Tomer Michaeli. The perception-distortion tradeoff. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6228–6237, 2018. 4
2018
-
[5]
Pre-trained image processing transformer
Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, and Wen Gao. Pre-trained image processing transformer. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 12294–12305, 2021. 3
2021
-
[6]
Simple baselines for image restoration
Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, and Jian Sun. Simple baselines for image restoration. InComputer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part VII, page 17–33, Berlin, Heidelberg, 2022. Springer-Verlag. 3, 2
2022
-
[7]
A comparative study of image restoration networks for general backbone network design
Xiangyu Chen, Zheyuan Li, Yuandong Pu, Yihao Liu, Jiantao Zhou, Yu Qiao, and Chao Dong. A comparative study of image restoration networks for general backbone network design. InComputer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024, Pro- ceedings, Part LXXI, page 74–91, Berlin, Heidelberg, 2024. Springer-Verlag. 3
2024
-
[8]
Inversion by direct iteration: An alternative to denoising diffusion for image restoration.Transactions on Machine Learning Re- search, 2023
Mauricio Delbracio and Peyman Milanfar. Inversion by direct iteration: An alternative to denoising diffusion for image restoration.Transactions on Machine Learning Re- search, 2023. https://openreview.net/forum? id=VmyFF5lL3F. 3, 7, 1
2023
-
[9]
Diffusion mod- els beat gans on image synthesis
Prafulla Dhariwal and Alexander Nichol. Diffusion mod- els beat gans on image synthesis. InAdvances in Neural Information Processing Systems, pages 8780–8794. Curran Associates, Inc., 2021. 3
2021
-
[10]
Image super-resolution using deep convolutional net- works.IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(2):295–307, 2016
Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. Image super-resolution using deep convolutional net- works.IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(2):295–307, 2016. 1
2016
-
[11]
Scaling rectified flow transformers for high-resolution image synthesis
Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim En- tezari, Jonas M¨uller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, and Robin Rombach. Scaling rectified flow transformers for high-resolution image synthesis. InForty- first International Conference on Machine Learning, 2024. 3
2024
-
[12]
Vivid: Video virtual try-on using diffusion models, 2024
Zixun Fang, Wei Zhai, Aimin Su, Hongliang Song, Kai Zhu, Mao Wang, Yu Chen, Zhiheng Liu, Yang Cao, and Zheng-Jun Zha. Vivid: Video virtual try-on using diffusion models, 2024. 3
2024
-
[13]
Generative diffu- sion prior for unified image restoration and enhancement
Ben Fei, Zhaoyang Lyu, Liang Pan, Junzhe Zhang, Weidong Yang, Tianyue Luo, Bo Zhang, and Bo Dai. Generative diffu- sion prior for unified image restoration and enhancement. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9935–9946, 2023. 3
2023
-
[14]
Kodak lossless true color image suite, 1999
Rich Franzen. Kodak lossless true color image suite, 1999. 6, 9, 3
1999
-
[15]
Image restoration by denoising diffusion models with iteratively preconditioned guidance
Tomer Garber and Tom Tirer. Image restoration by denoising diffusion models with iteratively preconditioned guidance. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 25245–25254, 2024. 3
2024
-
[16]
Gans trained by a two time-scale update rule converge to a local nash equilibrium
Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bern- hard Nessler, and Sepp Hochreiter. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In Proceedings of the 31st International Conference on Neural Information Processing Systems, page 6629–6640, 2017. 6
2017
-
[17]
Denoising diffu- sion probabilistic models
Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffu- sion probabilistic models. InAdvances in Neural Information Processing Systems, pages 6840–6851, 2020. 1, 3
2020
-
[18]
Jeya Maria Jose Valanarasu, Rajeev Yasarla, and Vishal M. Patel. Transweather: Transformer-based restoration of images degraded by adverse weather conditions. In2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2343–2353, 2022. 3
2022
-
[19]
Denoising diffusion restoration models
Bahjat Kawar, Michael Elad, Stefano Ermon, and Jiaming Song. Denoising diffusion restoration models. InAdvances in Neural Information Processing Systems, pages 23593–23606,
-
[20]
Musiq: Multi-scale image quality transformer
Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, and Feng Yang. Musiq: Multi-scale image quality transformer. In2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 5128–5137, 2021. 7
2021
-
[21]
Kingma and Jimmy Ba
Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015. 1, 2
2015
-
[22]
Learning multi- ple layers of features from tiny images
Alex Krizhevsky. Learning multi- ple layers of features from tiny images. https://api.semanticscholar.org/CorpusID:18268744,
-
[23]
Deblurgan-v2: Deblurring (orders-of-magnitude) faster and better
Orest Kupyn, Tetiana Martyniuk, Junru Wu, and Zhangyang Wang. Deblurgan-v2: Deblurring (orders-of-magnitude) faster and better. In2019 IEEE/CVF International Confer- ence on Computer Vision (ICCV), pages 8877–8886, 2019. 3, 2
2019
-
[24]
Srdiff: Single image super-resolution with diffusion probabilistic models
Haoying Li, Yifan Yang, Meng Chang, Shiqi Chen, Huajun Feng, Zhihai Xu, Qi Li, and Yueting Chen. Srdiff: Single image super-resolution with diffusion probabilistic models. Neurocomputing, 479:47–59, 2022. 3
2022
-
[25]
Swinir: Image restoration using swin transformer
Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration using swin transformer. In2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pages 1833–1844, 2021. 1
2021
-
[26]
Swinir: Image restoration 10 using swin transformer
Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration 10 using swin transformer. In2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pages 1833–1844, 2021. 3
2021
-
[27]
Enhanced deep residual networks for single image super-resolution
Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Enhanced deep residual networks for single image super-resolution. In2017 IEEE Conference on Com- puter Vision and Pattern Recognition Workshops (CVPRW), pages 1132–1140, 2017. 6
2017
-
[28]
Score- based generative modeling through stochastic evolution equa- tions in hilbert spaces
Sungbin Lim, EUN BI YOON, Taehyun Byun, Taewon Kang, Seungwoo Kim, Kyungjae Lee, and Sungjoon Choi. Score- based generative modeling through stochastic evolution equa- tions in hilbert spaces. InAdvances in Neural Information Processing Systems, pages 37799–37812, 2023. 3
2023
-
[29]
Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maxim- ilian Nickel, and Matthew Le. Flow matching for genera- tive modeling. InThe Eleventh International Conference on Learning Representations, 2023. 3
2023
-
[30]
On the classification-distortion-perception tradeoff
Dong Liu, Haochen Zhang, and Zhiwei Xiong. On the classification-distortion-perception tradeoff. InAdvances in Neural Information Processing Systems. Curran Associates, Inc., 2019. 4
2019
-
[31]
Theodorou, Weili Nie, and Anima Anandkumar
Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, and Anima Anandkumar. I2sb: image- to-image schr ¨odinger bridge. InProceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023. JMLR.org. 3, 1
2023
-
[32]
Residual denoising diffusion mod- els
Jiawei Liu, Qiang Wang, Huijie Fan, Yinong Wang, Yandong Tang, and Liangqiong Qu. Residual denoising diffusion mod- els. In2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2773–2783, 2024. 2, 3, 6, 9, 1
2024
-
[33]
Flow straight and fast: Learning to generate and transfer data with rectified flow
Xingchao Liu, Chengyue Gong, and qiang liu. Flow straight and fast: Learning to generate and transfer data with rectified flow. InThe Eleventh International Conference on Learning Representations, 2023. 3, 6
2023
-
[34]
Dpm-solver++: Fast solver for guided sam- pling of diffusion probabilistic models.Machine Intelligence Research, 22(4):730–751, 2025
Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, and Jun Zhu. Dpm-solver++: Fast solver for guided sam- pling of diffusion probabilistic models.Machine Intelligence Research, 22(4):730–751, 2025. 3
2025
-
[35]
Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B. Sch¨on. Image restoration with mean-reverting stochastic differential equations. InProceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023. JMLR.org. 2, 3, 6, 7, 9, 1
2023
-
[36]
Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B. Sch¨on. Controlling vision-language models for multi-task image restoration. InThe Twelfth International Conference on Learning Representations, 2024. 7, 9
2024
-
[37]
Waterloo exploration database: New challenges for image quality as- sessment models.IEEE Transactions on Image Processing, 26(2):1004–1016, 2017
Kede Ma, Zhengfang Duanmu, Qingbo Wu, Zhou Wang, Hongwei Yong, Hongliang Li, and Lei Zhang. Waterloo exploration database: New challenges for image quality as- sessment models.IEEE Transactions on Image Processing, 26(2):1004–1016, 2017. 6
2017
-
[38]
Martin, C
D. Martin, C. Fowlkes, D. Tal, and J. Malik. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. InProceedings Eighth IEEE International Confer- ence on Computer Vision. ICCV 2001, pages 416–423 vol.2,
2001
-
[39]
Deep multi-scale convolutional neural network for dynamic scene deblurring
Seungjun Nah, Tae Hyun Kim, and Kyoung Mu Lee. Deep multi-scale convolutional neural network for dynamic scene deblurring. In2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 257–265, 2017. 3, 7, 2
2017
-
[40]
Posterior- mean rectified flow: Towards minimum MSE photo-realistic image restoration
Guy Ohayon, Tomer Michaeli, and Michael Elad. Posterior- mean rectified flow: Towards minimum MSE photo-realistic image restoration. InThe Thirteenth International Conference on Learning Representations, 2025. 3
2025
-
[41]
Restoring vision in adverse weather conditions with patch-based denoising diffusion models.IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1–12, 2023
Ozan ¨Ozdenizci and Robert Legenstein. Restoring vision in adverse weather conditions with patch-based denoising diffusion models.IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1–12, 2023. 6
2023
-
[42]
Tan, Wenhan Yang, Jiajun Su, and Jiaying Liu
Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, and Jiaying Liu. Attentive generative adversarial network for raindrop removal from a single image. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2482– 2491, 2018. 6
2018
-
[43]
Ffa-net: Feature fusion attention network for single image dehazing.Proceedings of the AAAI Conference on Artificial Intelligence, 34(07):11908–11915, 2020
Xu Qin, Zhilin Wang, Yuanchao Bai, Xiaodong Xie, and Huizhu Jia. Ffa-net: Feature fusion attention network for single image dehazing.Proceedings of the AAAI Conference on Artificial Intelligence, 34(07):11908–11915, 2020. 7, 8, 2
2020
-
[44]
Deep learning for seeing through window with raindrops
Yuhui Quan, Shijie Deng, Yixin Chen, and Hui Ji. Deep learning for seeing through window with raindrops. In2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 2463–2471, 2019. 3
2019
-
[45]
Progressive image deraining networks: A better and simpler baseline
Dongwei Ren, Wangmeng Zuo, Qinghua Hu, Pengfei Zhu, and Deyu Meng. Progressive image deraining networks: A better and simpler baseline. In2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 3932–3941, 2019. 3, 6
2019
-
[46]
High-resolution image syn- thesis with latent diffusion models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj¨orn Ommer. High-resolution image syn- thesis with latent diffusion models. In2022 IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR), pages 10674–10685, 2022. 3
2022
-
[47]
Fleet, and Mohammad Norouzi
Chitwan Saharia, Jonathan Ho, William Chan, Tim Sali- mans, David J. Fleet, and Mohammad Norouzi. Image super- resolution via iterative refinement.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4713–4726,
-
[48]
Resfusion: Denoising diffusion probabilistic models for im- age restoration based on prior residual noise
Zhenning Shi, Haoshuai Zheng, Chen Xu, Changsheng Dong, Bin Pan, Xueshuo Xie, Along He, Tao Li, and Huazhu Fu. Resfusion: Denoising diffusion probabilistic models for im- age restoration based on prior residual noise. InAdvances in Neural Information Processing Systems, pages 130664– 130693, 2024. 2, 3, 6, 9, 1
2024
-
[49]
Denoising diffusion implicit models
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. InInternational Conference on Learning Representations, 2021. 1, 3, 6, 7, 2
2021
-
[50]
Nima: Neural image assessment.IEEE Transactions on Image Processing, 27(8): 3998–4011, 2018
Hossein Talebi and Peyman Milanfar. Nima: Neural image assessment.IEEE Transactions on Image Processing, 27(8): 3998–4011, 2018. 7
2018
-
[51]
Maxim: Multi- 11 axis mlp for image processing
Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Pey- man Milanfar, Alan Bovik, and Yinxiao Li. Maxim: Multi- 11 axis mlp for image processing. In2022 IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR), pages 5759–5770, 2022. 1, 6
2022
-
[52]
Bingshu Wang, Ze Wang, Wenjie Liu, Xiaoshui Huang, C. L. Philip Chen, and Yue Zhao. Ddsr-net: Direct document shadow removal leveraging multi-scale attention.Machine Intelligence Research, 22(3):452–465, 2025. 3
2025
-
[53]
Zero-shot image restoration using denoising diffusion null-space model
Yinhuai Wang, Jiwen Yu, and Jian Zhang. Zero-shot image restoration using denoising diffusion null-space model. In The Eleventh International Conference on Learning Repre- sentations, 2023. 3
2023
-
[54]
Bovik, H.R
Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. Image quality assessment: from error visibility to structural similarity.IEEE Transactions on Image Processing, 13(4): 600–612, 2004. 6
2004
-
[55]
Uformer: A gen- eral u-shaped transformer for image restoration
Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, and Houqiang Li. Uformer: A gen- eral u-shaped transformer for image restoration. In2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 17662–17672, 2022. 3
2022
-
[56]
Dimakis, and Peyman Milanfar
Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, and Peyman Milanfar. De- blurring via stochastic refinement. In2022 IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR), pages 16272–16282, 2022. 3
2022
-
[57]
Fully 1 × 1 convolutional network for lightweight image super- resolution.Machine Intelligence Research, 21(6):1062–1076,
Gang Wu, Junjun Jiang, Kui Jiang, and Xianming Liu. Fully 1 × 1 convolutional network for lightweight image super- resolution.Machine Intelligence Research, 21(6):1062–1076,
-
[58]
Advanced his- togram equalization based on a hybrid saliency map and novel visual prior.Machine Intelligence Research, 21(6): 1178–1191, 2024
Yuanbin Wu, Shengkui Dai, and Zhan Ma. Advanced his- togram equalization based on a hybrid saliency map and novel visual prior.Machine Intelligence Research, 21(6): 1178–1191, 2024. 3
2024
-
[59]
Diffir: Efficient diffusion model for image restoration
Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xing- long Wu, Yapeng Tian, Wenming Yang, and Luc Van Gool. Diffir: Efficient diffusion model for image restoration. In 2023 IEEE/CVF International Conference on Computer Vi- sion (ICCV), pages 13049–13059, 2023. 2
2023
-
[60]
Diffir: Efficient diffusion model for image restoration
Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xing- long Wu, Yapeng Tian, Wenming Yang, and Luc Van Gool. Diffir: Efficient diffusion model for image restoration. In 2023 IEEE/CVF International Conference on Computer Vi- sion (ICCV), pages 13049–13059, 2023. 3
2023
-
[61]
Image de-raining transformer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11):12978– 12995, 2023
Jie Xiao, Xueyang Fu, Aiping Liu, Feng Wu, and Zheng-Jun Zha. Image de-raining transformer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11):12978– 12995, 2023. 3, 6
2023
-
[62]
Dreamclean: Restoring clean image using deep diffusion prior
Jie Xiao, Ruili Feng, Han Zhang, Zhiheng Liu, Zhantao Yang, Yurui Zhu, Xueyang Fu, Kai Zhu, Yu Liu, and Zheng-Jun Zha. Dreamclean: Restoring clean image using deep diffusion prior. InThe Twelfth International Conference on Learning Representations, 2024. 3
2024
-
[63]
Dual frequency transformer for efficient sdr-to-hdr translation.Machine Intel- ligence Research, 21(3):538–548, 2024
Gang Xu, Qibin Hou, and Ming-Ming Cheng. Dual frequency transformer for efficient sdr-to-hdr translation.Machine Intel- ligence Research, 21(3):538–548, 2024. 3
2024
-
[64]
Tan, Jiashi Feng, Jiaying Liu, Zong- ming Guo, and Shuicheng Yan
Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zong- ming Guo, and Shuicheng Yan. Deep joint rain detection and removal from a single image. In2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1685–1694, 2017. 6, 9
2017
-
[65]
Tan, Jiashi Feng, Zongming Guo, Shuicheng Yan, and Jiaying Liu
Wenhan Yang, Robby T. Tan, Jiashi Feng, Zongming Guo, Shuicheng Yan, and Jiaying Liu. Joint rain detection and removal from a single image with contextualized deep net- works.IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(6):1377–1393, 2020. 3, 6
2020
-
[66]
Image restoration through generalized ornstein-uhlenbeck bridge
Conghan Yue, Zhengwei Peng, Junlong Ma, Shiyan Du, Pengxu Wei, and Dongyu Zhang. Image restoration through generalized ornstein-uhlenbeck bridge. InProceedings of the 41st International Conference on Machine Learning, 2024. 3, 7
2024
-
[67]
Resshift: Efficient diffusion model for image super-resolution by resid- ual shifting
Zongsheng Yue, Jianyi Wang, and Chen Change Loy. Resshift: Efficient diffusion model for image super-resolution by resid- ual shifting. InAdvances in Neural Information Processing Systems, pages 13294–13307, 2023. 2, 3, 1
2023
-
[68]
Multi-stage progressive image restoration
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Multi-stage progressive image restoration. In2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 14816–14826, 2021. 3, 6
2021
-
[69]
Restormer: Efficient transformer for high-resolution image restoration
Syed Waqas Zamir, Aditya Arora, Salman Khan, Mu- nawar Hayat, Fahad Shahbaz Khan, and Ming–Hsuan Yang. Restormer: Efficient transformer for high-resolution image restoration. In2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5718–5729,
-
[70]
Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising.IEEE Transactions on Image Processing, 26(7):3142–3155, 2017
Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising.IEEE Transactions on Image Processing, 26(7):3142–3155, 2017. 3, 6
2017
-
[71]
Ffdnet: Toward a fast and flexible solution for cnn-based image denoising
Kai Zhang, Wangmeng Zuo, and Lei Zhang. Ffdnet: Toward a fast and flexible solution for cnn-based image denoising. IEEE Transactions on Image Processing, 27(9):4608–4622,
-
[72]
Practical blind image denoising via swin-conv- unet and data synthesis.Machine Intelligence Research, 20 (6):822–836, 2023
Kai Zhang, Yawei Li, Jingyun Liang, Jiezhang Cao, Yu- lun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, and Luc Van Gool. Practical blind image denoising via swin-conv- unet and data synthesis.Machine Intelligence Research, 20 (6):822–836, 2023. 3
2023
-
[73]
Color demosaicking by local directional interpolation and nonlocal adaptive thresholding.Journal of Electronic Imaging, 20(2),
Lei Zhang, Xiaolin Wu, Antoni Buades, and Xin Li. Color demosaicking by local directional interpolation and nonlocal adaptive thresholding.Journal of Electronic Imaging, 20(2),
-
[74]
Efros, Eli Shechtman, and Oliver Wang
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep fea- tures as a perceptual metric. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 586–595,
-
[75]
Blind image quality assessment via vision- language correspondence: A multitask learning perspective
Weixia Zhang, Guangtao Zhai, Ying Wei, Xiaokang Yang, and Kede Ma. Blind image quality assessment via vision- language correspondence: A multitask learning perspective. In2023 IEEE/CVF Conference on Computer Vision and Pat- tern Recognition (CVPR), pages 14071–14081, 2023. 7
2023
-
[76]
UniDB: A unified diffusion bridge 12 framework via stochastic optimal control
Kaizhen Zhu, Mokai Pan, Yuexin Ma, Yanwei Fu, Jingyi Yu, Jingya Wang, and Ye Shi. UniDB: A unified diffusion bridge 12 framework via stochastic optimal control. InForty-second International Conference on Machine Learning, 2025. 3, 7
2025
-
[77]
Denoising diffusion models for plug-and-play image restoration
Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bi- han Wen, Radu Timofte, and Luc Van Gool. Denoising diffusion models for plug-and-play image restoration. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 1219–1229, 2023. 3
2023
-
[78]
Flowie: Efficient image enhancement via rectified flow
Yixuan Zhu, Wenliang Zhao, Ao Li, Yansong Tang, Jie Zhou, and Jiwen Lu. Flowie: Efficient image enhancement via rectified flow. In2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 13–22, 2024. 3 13
2024
-
[79]
The CNN-baseline uses the same network but directly input the low-quality image and output the high- quality image
Implementation details We chose the same U-Net backbone network as used in IR- SDE [35]. The CNN-baseline uses the same network but directly input the low-quality image and output the high- quality image. For most tasks, we set the training patch- size to be 256×256 and use a batch size of 8. We used the Adam [ 21] optimizer with parameters β1 = 0.9 and β...
-
[80]
Unified Formulation of SDE-based Methods To unify the representation of various SDE-based generative formulations for restoration, we introduce a general parame- terization of the noisy latent variablex t as: xt =α t x0 +β t x1 +γ t ϵ,(13) where x0 is the clean data, x1 is the observed degraded image, ϵ∼ N(0, I) is Gaussian noise, and αt, βt, γt are time-...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.