IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow
Pith reviewed 2026-05-10 02:42 UTC · model grok-4.3
The pith
Rectified Flow creates a direct linear path from degraded to clean images for fast and adaptable restoration.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
IR-Flow constructs multilevel data distribution flows and cumulative velocity fields to learn transport trajectories that guide images from varying degradation levels toward clean targets, supported by a multi-step consistency constraint. The work shows that directly establishing a linear transport flow between degraded and clean image domains enables fast inference with only a few sampling steps while also improving adaptability to out-of-distribution degradations, as demonstrated through competitive results on deraining, denoising, and raindrop removal.
What carries the argument
Multilevel data distribution flows paired with cumulative velocity fields that define linear transport trajectories across degradation levels.
If this is right
- Restoration achieves competitive quantitative scores using only a few sampling steps instead of many iterative ones.
- Performance holds up better on degradations outside the training distribution than prior discriminative or generative baselines.
- The approach maintains a strong balance between pixel-level accuracy and perceptual quality across tested tasks.
- A single framework handles multiple restoration problems like deraining, denoising, and raindrop removal without task-specific redesign.
Where Pith is reading between the lines
- The same linear transport idea could be tested on related inverse problems such as deblurring or super-resolution to check if few-step inference generalizes.
- The consistency constraint might reduce variance in other generative sampling pipelines that currently need many steps.
- Lower step counts open the possibility of running high-quality restoration directly on mobile hardware for live photo editing.
Load-bearing premise
That flows built across multiple degradation levels can steer partially restored images to the final clean version without creating artifacts or losing details when the degradation type is new.
What would settle it
A controlled test on an unseen degradation combination, such as rain plus sensor noise, where the outputs show visibly more artifacts or detail loss than a standard single-step discriminative network produces on the same inputs.
Figures
read the original abstract
In image restoration, single-step discriminative mappings often lack fine details via expectation learning, whereas generative paradigms suffer from inefficient multi-step sampling and noise-residual coupling. To address this dilemma, we propose IR-Flow, a novel image restoration method based on Rectified Flow that serves as a unified framework bridging the gap between discriminative and generative paradigms. Specifically, we first construct multilevel data distribution flows, which expand the ability of models to learn from and adapt to various levels of degradation. Subsequently, cumulative velocity fields are proposed to learn transport trajectories across varying degradation levels, guiding intermediate states toward the clean target, while a multi-step consistency constraint is presented to enforce trajectory coherence and boost few-step restoration performance. We show that directly establishing a linear transport flow between degraded and clean image domains not only enables fast inference but also improves adaptability to out-of-distribution degradations. Extensive evaluations on deraining, denoising and raindrop removal tasks demonstrate that IR-Flow achieves competitive quantitative results with only a few sampling steps, offering an efficient and flexible framework that maintains an excellent distortion-perception balance. Our code is available at https://github.com/fanzh03/IR-Flow.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes IR-Flow, a rectified-flow framework for image restoration that constructs multilevel data distribution flows to handle varying degradation levels, defines cumulative velocity fields to learn transport trajectories from degraded to clean domains, and adds a multi-step consistency constraint to improve few-step sampling. It claims this linear transport approach bridges discriminative and generative paradigms, enables fast inference, and improves adaptability to out-of-distribution degradations, while delivering competitive quantitative results on deraining, denoising, and raindrop removal tasks.
Significance. If the proposed components are shown to deliver the claimed benefits without introducing artifacts or detail loss, the work would provide a practical unified framework that combines the efficiency of few-step sampling with improved robustness, addressing a key tension in current image restoration methods. The open-sourced code is a positive factor for reproducibility.
major comments (2)
- [Abstract] Abstract: The load-bearing claim that 'directly establishing a linear transport flow ... improves adaptability to out-of-distribution degradations' is not supported by any cited ablation, OOD-specific test set, or error analysis; the construction of cumulative velocity fields (trained on specific degradation levels) does not automatically guarantee reliable extrapolation to unseen degradations such as combined noise or non-linear rain, as noted in the stress-test concern.
- [Method] Method description: The multilevel data distribution flows and cumulative velocity fields are presented as guiding intermediate states to the clean target, but without explicit equations or a derivation showing how the multi-step consistency constraint prevents deviation under distribution shift, it is unclear whether the trajectories remain artifact-free for OOD cases.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment below and have prepared revisions to strengthen the presentation of our claims and clarify the technical details.
read point-by-point responses
-
Referee: [Abstract] Abstract: The load-bearing claim that 'directly establishing a linear transport flow ... improves adaptability to out-of-distribution degradations' is not supported by any cited ablation, OOD-specific test set, or error analysis; the construction of cumulative velocity fields (trained on specific degradation levels) does not automatically guarantee reliable extrapolation to unseen degradations such as combined noise or non-linear rain, as noted in the stress-test concern.
Authors: We agree that the abstract claim would be more robust with explicit supporting evidence. While our main experiments across deraining, denoising, and raindrop removal already cover a range of degradation intensities, we acknowledge the absence of dedicated OOD test sets and error analysis in the original submission. In the revised manuscript we have added a new subsection on out-of-distribution evaluation, including combined noise-plus-rain and non-linear rain patterns. These results show that IR-Flow retains higher PSNR/SSIM and fewer perceptual artifacts than competing methods, consistent with the linear-transport hypothesis. We have also inserted a brief error-analysis paragraph and softened the abstract wording to “suggests improved adaptability to out-of-distribution degradations” to reflect the new empirical support. revision: yes
-
Referee: [Method] Method description: The multilevel data distribution flows and cumulative velocity fields are presented as guiding intermediate states to the clean target, but without explicit equations or a derivation showing how the multi-step consistency constraint prevents deviation under distribution shift, it is unclear whether the trajectories remain artifact-free for OOD cases.
Authors: We accept that the method section would benefit from greater mathematical precision. In the revision we have inserted the missing explicit equations: the multilevel flow construction is now written as a family of linear interpolants parameterized by degradation level; the cumulative velocity field is defined as the integral of the learned velocity over these levels; and the multi-step consistency constraint is expressed as an L2 penalty between the single-step and multi-step trajectory endpoints. We further provide a short derivation in the main text (with full proof in the appendix) showing that the constraint bounds the deviation of the predicted velocity under bounded distribution shift, thereby helping keep trajectories artifact-free. These additions directly address the concern about OOD behavior. revision: yes
Circularity Check
No significant circularity; derivation self-contained from external rectified-flow base
full rationale
The paper's chain begins with an external rectified-flow transport idea, then defines multilevel data distribution flows, cumulative velocity fields, and a multi-step consistency constraint as new architectural components. These are trained and evaluated on restoration tasks; the claims of fast inference and OOD adaptability are presented as empirical results of the construction rather than tautological redefinitions or fitted inputs renamed as predictions. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from the authors' prior work appear in the provided text. The method remains falsifiable via standard benchmarks and does not reduce any key equation to its own inputs by construction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Abdelrahman Abdelhamed, Stephen Lin, and Michael S. Brown. A high-quality denoising dataset for smartphone cameras. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1692–1700, 2018. 7, 8, 2
work page 2018
-
[2]
Ntire 2017 challenge on single image super-resolution: Dataset and study
Eirikur Agustsson and Radu Timofte. Ntire 2017 challenge on single image super-resolution: Dataset and study. In2017 IEEE Conference on Computer Vision and Pattern Recogni- tion Workshops (CVPRW), pages 1122–1131, 2017. 6, 7, 8, 2
work page 2017
-
[3]
Pablo Arbel´aez, Michael Maire, Charless Fowlkes, and Jiten- dra Malik. Contour detection and hierarchical image segmen- tation.IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(5):898–916, 2011. 6
work page 2011
-
[4]
The perception-distortion tradeoff
Yochai Blau and Tomer Michaeli. The perception-distortion tradeoff. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6228–6237, 2018. 4
work page 2018
-
[5]
Pre-trained image processing transformer
Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, and Wen Gao. Pre-trained image processing transformer. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 12294–12305, 2021. 3
work page 2021
-
[6]
Simple baselines for image restoration
Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, and Jian Sun. Simple baselines for image restoration. InComputer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part VII, page 17–33, Berlin, Heidelberg, 2022. Springer-Verlag. 3, 2
work page 2022
-
[7]
A comparative study of image restoration networks for general backbone network design
Xiangyu Chen, Zheyuan Li, Yuandong Pu, Yihao Liu, Jiantao Zhou, Yu Qiao, and Chao Dong. A comparative study of image restoration networks for general backbone network design. InComputer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024, Pro- ceedings, Part LXXI, page 74–91, Berlin, Heidelberg, 2024. Springer-Verlag. 3
work page 2024
-
[8]
Mauricio Delbracio and Peyman Milanfar. Inversion by direct iteration: An alternative to denoising diffusion for image restoration.Transactions on Machine Learning Re- search, 2023. https://openreview.net/forum? id=VmyFF5lL3F. 3, 7, 1
work page 2023
-
[9]
Diffusion mod- els beat gans on image synthesis
Prafulla Dhariwal and Alexander Nichol. Diffusion mod- els beat gans on image synthesis. InAdvances in Neural Information Processing Systems, pages 8780–8794. Curran Associates, Inc., 2021. 3
work page 2021
-
[10]
Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. Image super-resolution using deep convolutional net- works.IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(2):295–307, 2016. 1
work page 2016
-
[11]
Scaling rectified flow transformers for high-resolution image synthesis
Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim En- tezari, Jonas M¨uller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, and Robin Rombach. Scaling rectified flow transformers for high-resolution image synthesis. InForty- first International Conference on Machine Learning, 2024. 3
work page 2024
-
[12]
Vivid: Video virtual try-on using diffusion models, 2024
Zixun Fang, Wei Zhai, Aimin Su, Hongliang Song, Kai Zhu, Mao Wang, Yu Chen, Zhiheng Liu, Yang Cao, and Zheng-Jun Zha. Vivid: Video virtual try-on using diffusion models, 2024. 3
work page 2024
-
[13]
Generative diffu- sion prior for unified image restoration and enhancement
Ben Fei, Zhaoyang Lyu, Liang Pan, Junzhe Zhang, Weidong Yang, Tianyue Luo, Bo Zhang, and Bo Dai. Generative diffu- sion prior for unified image restoration and enhancement. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9935–9946, 2023. 3
work page 2023
-
[14]
Kodak lossless true color image suite, 1999
Rich Franzen. Kodak lossless true color image suite, 1999. 6, 9, 3
work page 1999
-
[15]
Image restoration by denoising diffusion models with iteratively preconditioned guidance
Tomer Garber and Tom Tirer. Image restoration by denoising diffusion models with iteratively preconditioned guidance. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 25245–25254, 2024. 3
work page 2024
-
[16]
Gans trained by a two time-scale update rule converge to a local nash equilibrium
Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bern- hard Nessler, and Sepp Hochreiter. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In Proceedings of the 31st International Conference on Neural Information Processing Systems, page 6629–6640, 2017. 6
work page 2017
-
[17]
Denoising diffu- sion probabilistic models
Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffu- sion probabilistic models. InAdvances in Neural Information Processing Systems, pages 6840–6851, 2020. 1, 3
work page 2020
-
[18]
Jeya Maria Jose Valanarasu, Rajeev Yasarla, and Vishal M. Patel. Transweather: Transformer-based restoration of images degraded by adverse weather conditions. In2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2343–2353, 2022. 3
work page 2022
-
[19]
Denoising diffusion restoration models
Bahjat Kawar, Michael Elad, Stefano Ermon, and Jiaming Song. Denoising diffusion restoration models. InAdvances in Neural Information Processing Systems, pages 23593–23606,
-
[20]
Musiq: Multi-scale image quality transformer
Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, and Feng Yang. Musiq: Multi-scale image quality transformer. In2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 5128–5137, 2021. 7
work page 2021
-
[21]
Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015. 1, 2
work page 2015
-
[22]
Learning multi- ple layers of features from tiny images
Alex Krizhevsky. Learning multi- ple layers of features from tiny images. https://api.semanticscholar.org/CorpusID:18268744,
-
[23]
Deblurgan-v2: Deblurring (orders-of-magnitude) faster and better
Orest Kupyn, Tetiana Martyniuk, Junru Wu, and Zhangyang Wang. Deblurgan-v2: Deblurring (orders-of-magnitude) faster and better. In2019 IEEE/CVF International Confer- ence on Computer Vision (ICCV), pages 8877–8886, 2019. 3, 2
work page 2019
-
[24]
Srdiff: Single image super-resolution with diffusion probabilistic models
Haoying Li, Yifan Yang, Meng Chang, Shiqi Chen, Huajun Feng, Zhihai Xu, Qi Li, and Yueting Chen. Srdiff: Single image super-resolution with diffusion probabilistic models. Neurocomputing, 479:47–59, 2022. 3
work page 2022
-
[25]
Swinir: Image restoration using swin transformer
Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration using swin transformer. In2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pages 1833–1844, 2021. 1
work page 2021
-
[26]
Swinir: Image restoration 10 using swin transformer
Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration 10 using swin transformer. In2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pages 1833–1844, 2021. 3
work page 2021
-
[27]
Enhanced deep residual networks for single image super-resolution
Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Enhanced deep residual networks for single image super-resolution. In2017 IEEE Conference on Com- puter Vision and Pattern Recognition Workshops (CVPRW), pages 1132–1140, 2017. 6
work page 2017
-
[28]
Score- based generative modeling through stochastic evolution equa- tions in hilbert spaces
Sungbin Lim, EUN BI YOON, Taehyun Byun, Taewon Kang, Seungwoo Kim, Kyungjae Lee, and Sungjoon Choi. Score- based generative modeling through stochastic evolution equa- tions in hilbert spaces. InAdvances in Neural Information Processing Systems, pages 37799–37812, 2023. 3
work page 2023
-
[29]
Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maxim- ilian Nickel, and Matthew Le. Flow matching for genera- tive modeling. InThe Eleventh International Conference on Learning Representations, 2023. 3
work page 2023
-
[30]
On the classification-distortion-perception tradeoff
Dong Liu, Haochen Zhang, and Zhiwei Xiong. On the classification-distortion-perception tradeoff. InAdvances in Neural Information Processing Systems. Curran Associates, Inc., 2019. 4
work page 2019
-
[31]
Theodorou, Weili Nie, and Anima Anandkumar
Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, and Anima Anandkumar. I2sb: image- to-image schr ¨odinger bridge. InProceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023. JMLR.org. 3, 1
work page 2023
-
[32]
Residual denoising diffusion mod- els
Jiawei Liu, Qiang Wang, Huijie Fan, Yinong Wang, Yandong Tang, and Liangqiong Qu. Residual denoising diffusion mod- els. In2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 2773–2783, 2024. 2, 3, 6, 9, 1
work page 2024
-
[33]
Flow straight and fast: Learning to generate and transfer data with rectified flow
Xingchao Liu, Chengyue Gong, and qiang liu. Flow straight and fast: Learning to generate and transfer data with rectified flow. InThe Eleventh International Conference on Learning Representations, 2023. 3, 6
work page 2023
-
[34]
Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, and Jun Zhu. Dpm-solver++: Fast solver for guided sam- pling of diffusion probabilistic models.Machine Intelligence Research, 22(4):730–751, 2025. 3
work page 2025
-
[35]
Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B. Sch¨on. Image restoration with mean-reverting stochastic differential equations. InProceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023. JMLR.org. 2, 3, 6, 7, 9, 1
work page 2023
-
[36]
Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sj¨olund, and Thomas B. Sch¨on. Controlling vision-language models for multi-task image restoration. InThe Twelfth International Conference on Learning Representations, 2024. 7, 9
work page 2024
-
[37]
Kede Ma, Zhengfang Duanmu, Qingbo Wu, Zhou Wang, Hongwei Yong, Hongliang Li, and Lei Zhang. Waterloo exploration database: New challenges for image quality as- sessment models.IEEE Transactions on Image Processing, 26(2):1004–1016, 2017. 6
work page 2017
-
[38]
D. Martin, C. Fowlkes, D. Tal, and J. Malik. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. InProceedings Eighth IEEE International Confer- ence on Computer Vision. ICCV 2001, pages 416–423 vol.2,
work page 2001
-
[39]
Deep multi-scale convolutional neural network for dynamic scene deblurring
Seungjun Nah, Tae Hyun Kim, and Kyoung Mu Lee. Deep multi-scale convolutional neural network for dynamic scene deblurring. In2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 257–265, 2017. 3, 7, 2
work page 2017
-
[40]
Posterior- mean rectified flow: Towards minimum MSE photo-realistic image restoration
Guy Ohayon, Tomer Michaeli, and Michael Elad. Posterior- mean rectified flow: Towards minimum MSE photo-realistic image restoration. InThe Thirteenth International Conference on Learning Representations, 2025. 3
work page 2025
-
[41]
Ozan ¨Ozdenizci and Robert Legenstein. Restoring vision in adverse weather conditions with patch-based denoising diffusion models.IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1–12, 2023. 6
work page 2023
-
[42]
Tan, Wenhan Yang, Jiajun Su, and Jiaying Liu
Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, and Jiaying Liu. Attentive generative adversarial network for raindrop removal from a single image. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2482– 2491, 2018. 6
work page 2018
-
[43]
Xu Qin, Zhilin Wang, Yuanchao Bai, Xiaodong Xie, and Huizhu Jia. Ffa-net: Feature fusion attention network for single image dehazing.Proceedings of the AAAI Conference on Artificial Intelligence, 34(07):11908–11915, 2020. 7, 8, 2
work page 2020
-
[44]
Deep learning for seeing through window with raindrops
Yuhui Quan, Shijie Deng, Yixin Chen, and Hui Ji. Deep learning for seeing through window with raindrops. In2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 2463–2471, 2019. 3
work page 2019
-
[45]
Progressive image deraining networks: A better and simpler baseline
Dongwei Ren, Wangmeng Zuo, Qinghua Hu, Pengfei Zhu, and Deyu Meng. Progressive image deraining networks: A better and simpler baseline. In2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 3932–3941, 2019. 3, 6
work page 2019
-
[46]
High-resolution image syn- thesis with latent diffusion models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj¨orn Ommer. High-resolution image syn- thesis with latent diffusion models. In2022 IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR), pages 10674–10685, 2022. 3
work page 2022
-
[47]
Chitwan Saharia, Jonathan Ho, William Chan, Tim Sali- mans, David J. Fleet, and Mohammad Norouzi. Image super- resolution via iterative refinement.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4713–4726,
-
[48]
Zhenning Shi, Haoshuai Zheng, Chen Xu, Changsheng Dong, Bin Pan, Xueshuo Xie, Along He, Tao Li, and Huazhu Fu. Resfusion: Denoising diffusion probabilistic models for im- age restoration based on prior residual noise. InAdvances in Neural Information Processing Systems, pages 130664– 130693, 2024. 2, 3, 6, 9, 1
work page 2024
-
[49]
Denoising diffusion implicit models
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. InInternational Conference on Learning Representations, 2021. 1, 3, 6, 7, 2
work page 2021
-
[50]
Nima: Neural image assessment.IEEE Transactions on Image Processing, 27(8): 3998–4011, 2018
Hossein Talebi and Peyman Milanfar. Nima: Neural image assessment.IEEE Transactions on Image Processing, 27(8): 3998–4011, 2018. 7
work page 2018
-
[51]
Maxim: Multi- 11 axis mlp for image processing
Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Pey- man Milanfar, Alan Bovik, and Yinxiao Li. Maxim: Multi- 11 axis mlp for image processing. In2022 IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR), pages 5759–5770, 2022. 1, 6
work page 2022
-
[52]
Bingshu Wang, Ze Wang, Wenjie Liu, Xiaoshui Huang, C. L. Philip Chen, and Yue Zhao. Ddsr-net: Direct document shadow removal leveraging multi-scale attention.Machine Intelligence Research, 22(3):452–465, 2025. 3
work page 2025
-
[53]
Zero-shot image restoration using denoising diffusion null-space model
Yinhuai Wang, Jiwen Yu, and Jian Zhang. Zero-shot image restoration using denoising diffusion null-space model. In The Eleventh International Conference on Learning Repre- sentations, 2023. 3
work page 2023
-
[54]
Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. Image quality assessment: from error visibility to structural similarity.IEEE Transactions on Image Processing, 13(4): 600–612, 2004. 6
work page 2004
-
[55]
Uformer: A gen- eral u-shaped transformer for image restoration
Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, and Houqiang Li. Uformer: A gen- eral u-shaped transformer for image restoration. In2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 17662–17672, 2022. 3
work page 2022
-
[56]
Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, and Peyman Milanfar. De- blurring via stochastic refinement. In2022 IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR), pages 16272–16282, 2022. 3
work page 2022
-
[57]
Gang Wu, Junjun Jiang, Kui Jiang, and Xianming Liu. Fully 1 × 1 convolutional network for lightweight image super- resolution.Machine Intelligence Research, 21(6):1062–1076,
-
[58]
Yuanbin Wu, Shengkui Dai, and Zhan Ma. Advanced his- togram equalization based on a hybrid saliency map and novel visual prior.Machine Intelligence Research, 21(6): 1178–1191, 2024. 3
work page 2024
-
[59]
Diffir: Efficient diffusion model for image restoration
Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xing- long Wu, Yapeng Tian, Wenming Yang, and Luc Van Gool. Diffir: Efficient diffusion model for image restoration. In 2023 IEEE/CVF International Conference on Computer Vi- sion (ICCV), pages 13049–13059, 2023. 2
work page 2023
-
[60]
Diffir: Efficient diffusion model for image restoration
Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xing- long Wu, Yapeng Tian, Wenming Yang, and Luc Van Gool. Diffir: Efficient diffusion model for image restoration. In 2023 IEEE/CVF International Conference on Computer Vi- sion (ICCV), pages 13049–13059, 2023. 3
work page 2023
-
[61]
Jie Xiao, Xueyang Fu, Aiping Liu, Feng Wu, and Zheng-Jun Zha. Image de-raining transformer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11):12978– 12995, 2023. 3, 6
work page 2023
-
[62]
Dreamclean: Restoring clean image using deep diffusion prior
Jie Xiao, Ruili Feng, Han Zhang, Zhiheng Liu, Zhantao Yang, Yurui Zhu, Xueyang Fu, Kai Zhu, Yu Liu, and Zheng-Jun Zha. Dreamclean: Restoring clean image using deep diffusion prior. InThe Twelfth International Conference on Learning Representations, 2024. 3
work page 2024
-
[63]
Gang Xu, Qibin Hou, and Ming-Ming Cheng. Dual frequency transformer for efficient sdr-to-hdr translation.Machine Intel- ligence Research, 21(3):538–548, 2024. 3
work page 2024
-
[64]
Tan, Jiashi Feng, Jiaying Liu, Zong- ming Guo, and Shuicheng Yan
Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zong- ming Guo, and Shuicheng Yan. Deep joint rain detection and removal from a single image. In2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1685–1694, 2017. 6, 9
work page 2017
-
[65]
Tan, Jiashi Feng, Zongming Guo, Shuicheng Yan, and Jiaying Liu
Wenhan Yang, Robby T. Tan, Jiashi Feng, Zongming Guo, Shuicheng Yan, and Jiaying Liu. Joint rain detection and removal from a single image with contextualized deep net- works.IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(6):1377–1393, 2020. 3, 6
work page 2020
-
[66]
Image restoration through generalized ornstein-uhlenbeck bridge
Conghan Yue, Zhengwei Peng, Junlong Ma, Shiyan Du, Pengxu Wei, and Dongyu Zhang. Image restoration through generalized ornstein-uhlenbeck bridge. InProceedings of the 41st International Conference on Machine Learning, 2024. 3, 7
work page 2024
-
[67]
Resshift: Efficient diffusion model for image super-resolution by resid- ual shifting
Zongsheng Yue, Jianyi Wang, and Chen Change Loy. Resshift: Efficient diffusion model for image super-resolution by resid- ual shifting. InAdvances in Neural Information Processing Systems, pages 13294–13307, 2023. 2, 3, 1
work page 2023
-
[68]
Multi-stage progressive image restoration
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Multi-stage progressive image restoration. In2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 14816–14826, 2021. 3, 6
work page 2021
-
[69]
Restormer: Efficient transformer for high-resolution image restoration
Syed Waqas Zamir, Aditya Arora, Salman Khan, Mu- nawar Hayat, Fahad Shahbaz Khan, and Ming–Hsuan Yang. Restormer: Efficient transformer for high-resolution image restoration. In2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5718–5729,
-
[70]
Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising.IEEE Transactions on Image Processing, 26(7):3142–3155, 2017. 3, 6
work page 2017
-
[71]
Ffdnet: Toward a fast and flexible solution for cnn-based image denoising
Kai Zhang, Wangmeng Zuo, and Lei Zhang. Ffdnet: Toward a fast and flexible solution for cnn-based image denoising. IEEE Transactions on Image Processing, 27(9):4608–4622,
-
[72]
Kai Zhang, Yawei Li, Jingyun Liang, Jiezhang Cao, Yu- lun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, and Luc Van Gool. Practical blind image denoising via swin-conv- unet and data synthesis.Machine Intelligence Research, 20 (6):822–836, 2023. 3
work page 2023
-
[73]
Lei Zhang, Xiaolin Wu, Antoni Buades, and Xin Li. Color demosaicking by local directional interpolation and nonlocal adaptive thresholding.Journal of Electronic Imaging, 20(2),
-
[74]
Efros, Eli Shechtman, and Oliver Wang
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep fea- tures as a perceptual metric. In2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 586–595,
-
[75]
Blind image quality assessment via vision- language correspondence: A multitask learning perspective
Weixia Zhang, Guangtao Zhai, Ying Wei, Xiaokang Yang, and Kede Ma. Blind image quality assessment via vision- language correspondence: A multitask learning perspective. In2023 IEEE/CVF Conference on Computer Vision and Pat- tern Recognition (CVPR), pages 14071–14081, 2023. 7
work page 2023
-
[76]
UniDB: A unified diffusion bridge 12 framework via stochastic optimal control
Kaizhen Zhu, Mokai Pan, Yuexin Ma, Yanwei Fu, Jingyi Yu, Jingya Wang, and Ye Shi. UniDB: A unified diffusion bridge 12 framework via stochastic optimal control. InForty-second International Conference on Machine Learning, 2025. 3, 7
work page 2025
-
[77]
Denoising diffusion models for plug-and-play image restoration
Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bi- han Wen, Radu Timofte, and Luc Van Gool. Denoising diffusion models for plug-and-play image restoration. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 1219–1229, 2023. 3
work page 2023
-
[78]
Flowie: Efficient image enhancement via rectified flow
Yixuan Zhu, Wenliang Zhao, Ao Li, Yansong Tang, Jie Zhou, and Jiwen Lu. Flowie: Efficient image enhancement via rectified flow. In2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 13–22, 2024. 3 13
work page 2024
-
[79]
Implementation details We chose the same U-Net backbone network as used in IR- SDE [35]. The CNN-baseline uses the same network but directly input the low-quality image and output the high- quality image. For most tasks, we set the training patch- size to be 256×256 and use a batch size of 8. We used the Adam [ 21] optimizer with parameters β1 = 0.9 and β...
-
[80]
Unified Formulation of SDE-based Methods To unify the representation of various SDE-based generative formulations for restoration, we introduce a general parame- terization of the noisy latent variablex t as: xt =α t x0 +β t x1 +γ t ϵ,(13) where x0 is the clean data, x1 is the observed degraded image, ϵ∼ N(0, I) is Gaussian noise, and αt, βt, γt are time-...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.