Recognition: unknown
Beyond Model Design: Data-Centric Training and Self-Ensemble for Gaussian Color Image Denoising
Pith reviewed 2026-05-10 14:55 UTC · model grok-4.3
The pith
Expanding training data and applying self-ensemble to Restormer improves Gaussian color image denoising to 30.762 dB PSNR.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By expanding the standard multi-dataset training recipe with larger and more diverse public image corpora, organizing optimization into two stages, and applying eight-fold geometric self-ensemble at inference while retaining a TLC-style local wrapper, the mature Restormer architecture reaches 30.762 dB PSNR and 0.861 SSIM on the 100-image challenge validation set for Gaussian color denoising at σ = 50, exceeding the public pretrained baseline by up to 3.366 dB. Ablation experiments indicate that the dominant contribution comes from the enlarged corpus and two-stage schedule, with self-ensemble supplying smaller but consistent further gains and the local wrapper adding negligible quantitative
What carries the argument
Two-stage optimization on an expanded multi-dataset corpus combined with eight-fold geometric self-ensemble applied to the fixed Restormer backbone.
If this is right
- Existing restoration backbones still contain substantial unused capacity that can be unlocked by larger and more varied training data.
- Two-stage optimization schedules improve final denoising quality for a fixed noise level without changing model size.
- Geometric self-ensemble delivers reliable though modest metric gains at inference time.
- Local inference wrappers such as TLC contribute little to performance in this high-noise regime.
Where Pith is reading between the lines
- The same data-expansion tactic could be tested on other mature models for tasks such as deblurring or inpainting.
- Challenge results may increasingly reward careful dataset curation and training recipes over novel architecture proposals.
- If the pattern holds, research effort could shift toward systematic collection of large, diverse clean-image corpora.
Load-bearing premise
The measured gains arise mainly from the added training images and two-stage schedule rather than from unreported differences in code, hyperparameters, or the exact makeup of the new datasets.
What would settle it
Retraining the identical Restormer model on only the original smaller corpus while keeping the two-stage schedule and self-ensemble unchanged, then measuring whether PSNR falls below 29.5 dB on the same validation set.
Figures
read the original abstract
This paper presents our solution to the NTIRE 2026 Image Denoising Challenge (Gaussian color image denoising at fixed noise level $\sigma = 50$). Rather than proposing a new restoration backbone, we revisit the performance boundary of the mature Restormer architecture from two complementary directions: stronger data-centric training and more complete Test-Time capability release. Starting from the public Restormer $\sigma\!=\!50$ baseline, we expand the standard multi-dataset training recipe with larger and more diverse public image corpora and organize optimization into two stages. At inference, we apply $\times 8$ geometric self-ensemble to further release model capacity. A TLC-style local inference wrapper is retained for implementation consistency; however, systematic ablation reveals its quantitative contribution to be negligible in this setting. On the challenge validation set of 100 images, our final submission achieves 30.762 dB PSNR and 0.861 SSIM, improving over the public Restormer $\sigma\!=\!50$ pretrained baseline by up to 3.366 dB PSNR. Ablation studies show that the dominant gain originates from the expanded training corpus and the two-stage optimization schedule, and self-ensemble provides marginal but consistent improvement.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a solution for the NTIRE 2026 Gaussian color image denoising challenge at fixed noise level σ=50. Rather than introducing a new backbone, the authors start from the public Restormer σ=50 pretrained model and improve performance via data-centric training (expansion of the standard multi-dataset corpus with larger and more diverse public images) and a two-stage optimization schedule. At inference they add ×8 geometric self-ensemble while retaining a TLC-style local wrapper whose contribution is reported as negligible. On the challenge validation set of 100 images the final submission reaches 30.762 dB PSNR and 0.861 SSIM, improving over the public baseline by up to 3.366 dB PSNR. Ablation studies attribute the dominant gains to the expanded corpus and two-stage schedule, with self-ensemble providing marginal but consistent further improvement.
Significance. If the attribution of gains holds, the work provides concrete empirical support for prioritizing training corpus size and staged optimization over architectural novelty in mature low-level vision models. The reported 3.366 dB lift on a 100-image validation set, together with the explicit ablation isolating data and schedule effects, offers a useful reference point for practitioners working on Gaussian denoising. The decision to release test-time capacity via self-ensemble while quantifying the wrapper's limited impact is a practical contribution. The overall approach is reproducible in principle because it builds on a public baseline and public datasets.
major comments (1)
- Abstract (and the ablation studies referenced therein): the central claim that 'the dominant gain originates from the expanded training corpus and the two-stage optimization schedule' is load-bearing for the paper's contribution. The comparison is made to the 'public Restormer σ=50 pretrained baseline,' yet the manuscript does not state that this baseline was re-trained inside the authors' exact training loop with all other variables (data augmentation, optimizer, learning-rate schedule, mixed precision, random seeds, etc.) frozen. Without that controlled re-implementation, the 3.366 dB PSNR delta cannot be unambiguously attributed to the described changes rather than incidental implementation differences.
minor comments (2)
- The exact composition, sizes, and preprocessing of the 'larger and more diverse public image corpora' added to the training set are not enumerated; a table listing each source and its image count would allow readers to reproduce the data-centric recipe.
- While the abstract states that systematic ablation shows the TLC-style wrapper's contribution is negligible, no quantitative numbers or table for this ablation appear in the provided summary; including those results would strengthen the claim.
Simulated Author's Rebuttal
We thank the referee for the positive overall assessment and for identifying a key point that requires clarification in our experimental design. We address the concern directly below and will revise the manuscript to improve transparency around the baseline comparison.
read point-by-point responses
-
Referee: Abstract (and the ablation studies referenced therein): the central claim that 'the dominant gain originates from the expanded training corpus and the two-stage optimization schedule' is load-bearing for the paper's contribution. The comparison is made to the 'public Restormer σ=50 pretrained baseline,' yet the manuscript does not state that this baseline was re-trained inside the authors' exact training loop with all other variables (data augmentation, optimizer, learning-rate schedule, mixed precision, random seeds, etc.) frozen. Without that controlled re-implementation, the 3.366 dB PSNR delta cannot be unambiguously attributed to the described changes rather than incidental implementation differences.
Authors: We agree that a fully controlled re-implementation of the baseline within our exact training loop would provide the strongest isolation of effects. The manuscript uses the publicly released Restormer σ=50 pretrained weights directly as the reference point, as is standard for NTIRE challenge submissions; these weights were not re-trained from scratch in our environment. Our procedure starts from this public initialization and continues optimization using the expanded corpus and two-stage schedule. The ablation studies isolate the incremental contributions of data expansion and the staged schedule by varying these elements while holding the initialization and other pipeline details fixed. We acknowledge that minor implementation differences (e.g., random seeds, mixed-precision settings) between the original public training and our continuation cannot be ruled out. In the revised manuscript we will explicitly state that the baseline consists of the official public pretrained model without additional training in our loop, and we will qualify the attribution claim to reflect that the reported gains arise from the additional training steps we introduce. This clarification will be added to the abstract and the experimental section. revision: yes
Circularity Check
No circularity: all claims are direct empirical measurements on held-out validation data
full rationale
The paper reports PSNR/SSIM numbers obtained by training the public Restormer architecture on expanded corpora with a two-stage schedule followed by geometric self-ensemble. No equations, first-principles derivations, or fitted parameters are presented that reduce by construction to the inputs. Ablations isolate contributions via standard train/val splits; the central performance delta is measured on an external 100-image challenge validation set and does not rely on self-citation chains, uniqueness theorems, or renaming of known results. The work is self-contained empirical engineering against public baselines.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 9 Pith papers
-
FluxFlow: Conservative Flow-Matching for Astronomical Image Super-Resolution
FluxFlow is a conservative pixel-space flow-matching framework for astronomical super-resolution that incorporates real atmospheric uncertainty and a training-free Wiener correction, outperforming baselines on a new 1...
-
FluxFlow: Conservative Flow-Matching for Astronomical Image Super-Resolution
FluxFlow uses conservative pixel-space flow-matching with uncertainty weights and Wiener test-time correction to outperform baselines on photometric and scientific accuracy for ground-to-space super-resolution, valida...
-
Dehaze-then-Splat: Generative Dehazing with Physics-Informed 3D Gaussian Splatting for Smoke-Free Novel View Synthesis
Dehaze-then-Splat uses per-frame generative dehazing followed by physics-regularized 3D Gaussian Splatting to achieve 20.98 dB PSNR and 0.683 SSIM on the Akikaze scene, a 1.5 dB gain over baseline by mitigating cross-...
-
3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models
A framework that combines MLLM-based image enhancement with a medium-aware 3D Gaussian Splatting model to reconstruct and render smoke scenes.
-
CLIP-Guided Data Augmentation for Night-Time Image Dehazing
CLIP-guided selection of external data plus staged NAFNet training and inference fusion provides an effective pipeline for nighttime image dehazing in the NTIRE 2026 challenge.
-
Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation
A dual-branch training-free ensemble fuses a hybrid attention network with a Mamba-based model via weighted combination to enhance super-resolution PSNR on DIV2K x4.
-
Dual-Branch Remote Sensing Infrared Image Super-Resolution
Dual-branch fusion of HAT-L and MambaIRv2-L with eight-way ensemble and equal-weight averaging outperforms single branches on PSNR, SSIM, and challenge score for infrared super-resolution.
-
SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration
SmokeGS-R uses refined dark channel prior for pseudo-clean supervision to train 3DGS geometry, followed by ensemble-based appearance harmonization, achieving PSNR 15.21 and outperforming baselines on smoke restoration...
-
NTIRE 2026 3D Restoration and Reconstruction in Real-world Adverse Conditions: RealX3D Challenge Results
The NTIRE 2026 challenge reports measurable progress in 3D reconstruction pipelines that handle real-world low-light and smoke degradation via the RealX3D benchmark.
Reference graph
Works this paper leans on
-
[1]
Brown, et al
Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, et al. NTIRE 2020 challenge on real im- age denoising: Dataset, methods and results. InProceedings of the IEEE/CVF Conference on Computer Vision and Pat- tern Recognition Workshops, pages 2077–2088, 2020. 3 6
2020
-
[2]
Noise Flow: Noise modeling with con- ditional normalizing flows
Abdelrahman Abdelhamed, Marcus A Brubaker, and Michael S Brown. Noise Flow: Noise modeling with con- ditional normalizing flows. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3165– 3173, 2019. 3
2019
-
[3]
A high-quality denoising dataset for smartphone cameras
Abdelrahman Abdelhamed, Stephen Lin, and Michael S Brown. A high-quality denoising dataset for smartphone cameras. InProceedings of the IEEE Conference on Com- puter Vision and Pattern Recognition, pages 1692–1700,
-
[4]
Brown, et al
Abdelrahman Abdelhamed, Radu Timofte, Michael S. Brown, et al. NTIRE 2019 challenge on real image denois- ing: Methods and results. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 2197–2210, 2019. 3
2019
-
[5]
NTIRE 2017 chal- lenge on single image super-resolution: Dataset and study
Eirikur Agustsson and Radu Timofte. NTIRE 2017 chal- lenge on single image super-resolution: Dataset and study. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 126–135, 2017. 2, 3, 4
2017
-
[6]
RENOIR–a dataset for real low-light image noise reduction.Journal of Visual Commu- nication and Image Representation, 51:144–154, 2018
Josue Anaya and Adrian Barbu. RENOIR–a dataset for real low-light image noise reduction.Journal of Visual Commu- nication and Image Representation, 51:144–154, 2018. 3
2018
-
[7]
Real image denoising with feature attention
Saeed Anwar and Nick Barnes. Real image denoising with feature attention. InProceedings of the IEEE/CVF Inter- national Conference on Computer Vision, pages 3155–3164,
-
[8]
Contour detection and hierarchical image seg- mentation.IEEE Transactions on Pattern Analysis and Ma- chine Intelligence, 33(5):898–916, 2011
Pablo Arbel ´aez, Michael Maire, Charless Fowlkes, and Ji- tendra Malik. Contour detection and hierarchical image seg- mentation.IEEE Transactions on Pattern Analysis and Ma- chine Intelligence, 33(5):898–916, 2011. 2, 3
2011
-
[9]
Noise2Self: Blind denois- ing by self-supervision
Joshua Batson and Loic Royer. Noise2Self: Blind denois- ing by self-supervision. InProceedings of the International Conference on Machine Learning, pages 524–533. PMLR,
-
[10]
Unprocessing images for learned raw denoising
Tim Brooks, Ben Mildenhall, Tianfan Xue, Jiawen Chen, Dillon Sharlet, and Jonathan T Barron. Unprocessing images for learned raw denoising. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11036–11045, 2019. 3
2019
-
[11]
A non- local algorithm for image denoising
Antoni Buades, Bartomeu Coll, and J-M Morel. A non- local algorithm for image denoising. InProceedings of the 2005 IEEE Computer Society Conference on Computer Vi- sion and Pattern Recognition (CVPR’05), volume 2, pages 60–65. IEEE, 2005. 2
2005
-
[12]
Qida Cao, Xinyuan Hu, Changyue Shi, Jiajun Ding, Zhou Yu, and Jun Yu. GenSmoke-GS: A multi-stage method for novel view synthesis from smoke-degraded images using a generative model.arXiv preprint arXiv:2604.03039, 2026. 3
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[13]
Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation
Gengjia Chang, Xining Ge, Weijun Yuan, Zhan Li, Qiurong Song, Luen Zhu, and Shuhong Liu. Training-free model en- semble for single-image super-resolution via strong-branch compensation.arXiv preprint arXiv:2604.11564, 2026. 2
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[14]
Boss Chen and Hanqing Wang. Dehaze-then-splat: Gen- erative dehazing with physics-informed 3D gaussian splat- ting for smoke-free novel view synthesis.arXiv preprint arXiv:2604.13589, 2026. 3
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[15]
Real-world image denoising with deep boost- ing.IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(12):3071–3087, 2019
Chang Chen, Zhiwei Xiong, Xinmei Tian, Zheng-Jun Zha, and Feng Wu. Real-world image denoising with deep boost- ing.IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(12):3071–3087, 2019. 2
2019
-
[16]
Pre-trained image processing transformer
Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, and Wen Gao. Pre-trained image processing transformer. InPro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12299–12310, 2021. 2
2021
-
[17]
Real-world single image super-resolution: A brief review.Information Fusion, 79:124–145, 2022
Honggang Chen, Xiaohai He, Linbo Qing, Yuanyuan Wu, Chao Ren, Ray E Sheriff, and Ce Zhu. Real-world single image super-resolution: A brief review.Information Fusion, 79:124–145, 2022. 3
2022
-
[18]
Simple baselines for image restoration
Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, and Jian Sun. Simple baselines for image restoration. InProceedings of the European Conference on Computer Vision (ECCV), pages 17–33. Springer, 2022. 2
2022
-
[19]
HiNet: Half instance normalization network for image restoration
Liangyu Chen, Xin Lu, Jie Zhang, Xiaojie Chu, and Cheng- peng Chen. HiNet: Half instance normalization network for image restoration. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 182–192, 2021. 2
2021
-
[20]
Activating more pixels in image super- resolution transformer
Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, and Chao Dong. Activating more pixels in image super- resolution transformer. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22367–22377, 2023. 2
2023
-
[21]
Yunjin Chen and Thomas Pock. Trainable nonlinear reaction diffusion: A flexible framework for fast and effective image restoration.IEEE Transactions on Pattern Analysis and Ma- chine Intelligence, 39(6):1256–1272, 2016. 2
2016
-
[22]
The fourth challenge on image super-resolution (×4) at NTIRE 2026: Bench- mark results and method overview
Zheng Chen, Kai Liu, Jingkai Wang, Xianglong Yan, Jianze Li, Ziqing Zhang, Jue Gong, Jiatong Li, Lei Sun, Xiaoyang Liu, Radu Timofte, Yulun Zhang, et al. The fourth challenge on image super-resolution (×4) at NTIRE 2026: Bench- mark results and method overview. InProceedings of the Computer Vision and Pattern Recognition Conference Work- shops, 2026
2026
-
[23]
Cross aggregation transformer for image restora- tion.Advances in Neural Information Processing Systems, 35:25478–25490, 2022
Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, Xin Yuan, et al. Cross aggregation transformer for image restora- tion.Advances in Neural Information Processing Systems, 35:25478–25490, 2022. 2
2022
-
[24]
Improving image restoration by revisiting global information aggregation
Xiaojie Chu, Liangyu Chen, Chengpeng Chen, and Xin Lu. Improving image restoration by revisiting global information aggregation. InProceedings of the European Conference on Computer Vision (ECCV), pages 53–71. Springer, 2022. 2, 3, 4, 5
2022
-
[25]
ViDeNN: Deep blind video denoising
Michele Claus and Jan Van Gemert. ViDeNN: Deep blind video denoising. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition Work- shops, pages 0–0, 2019. 2
2019
-
[26]
Ziteng Cui, Shuhong Liu, Xiaoyu Dong, Xuangeng Chu, Lin Gu, Ming-Hsuan Yang, and Tatsuya Harada. Unifying color and lightness correction with view-adaptive curve ad- justment for robust 3d novel view synthesis.arXiv preprint arXiv:2602.18322, 2026. 1
-
[27]
Image denoising by sparse 3-d transform- 7 domain collaborative filtering.IEEE Transactions on Image Processing, 16(8):2080–2095, 2007
Kostadin Dabov, Alessandro Foi, Vladimir Katkovnik, and Karen Egiazarian. Image denoising by sparse 3-d transform- 7 domain collaborative filtering.IEEE Transactions on Image Processing, 16(8):2080–2095, 2007. 2
2080
-
[28]
Ren, Chun-Le Guo, and Chongyi Li
Zheng-Peng Duan, Jiawei Zhang, Xin Jin, Ziheng Zhang, Zheng Xiong, Dongqing Zou, Jimmy S. Ren, Chun-Le Guo, and Chongyi Li. NKUSR8K: dataset release in the official DiT4SR project repository. Official project repository, 2025. Repository documentation states that the NKUSR8K dataset is released for training with the DiT4SR project. 3, 4
2025
-
[29]
Image denoising via sparse and redundant representations over learned dictionar- ies.IEEE Transactions on Image Processing, 15(12):3736– 3745, 2006
Michael Elad and Michal Aharon. Image denoising via sparse and redundant representations over learned dictionar- ies.IEEE Transactions on Image Processing, 15(12):3736– 3745, 2006. 2
2006
-
[30]
SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration
Xueming Fu and Lixia Han. SmokeGS-R: Physics- guided pseudo-clean 3DGS for real-world multi-view smoke restoration.arXiv preprint arXiv:2604.05301, 2026. 3
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[31]
Dual-Branch Remote Sensing Infrared Image Super-Resolution
Xining Ge, Gengjia Chang, Weijun Yuan, Zhan Li, Zhanglu Chen, Boyang Yao, Yihang Chen, Yifan Deng, and Shuhong Liu. Dual-branch remote sensing infrared image super- resolution.arXiv preprint arXiv:2604.10112, 2026. 2
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[32]
CLIP-Guided Data Augmentation for Night-Time Image Dehazing
Xining Ge, Weijun Yuan, Gengjia Chang, Xuyang Li, and Shuhong Liu. Clip-guided data augmentation for night-time image dehazing.arXiv preprint arXiv:2604.05500, 2026. 1
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[33]
Deep burst denoising
Cl ´ement Godard, Kevin Matzen, and Matt Uyttendaele. Deep burst denoising. InProceedings of the European Con- ference on Computer Vision (ECCV), pages 538–554, 2018. 2
2018
-
[34]
DIV8K: DI- Verse 8k resolution image dataset
Shuhang Gu, Andreas Lugmayr, Martin Danelljan, Manuel Fritsche, Julien Lamour, and Radu Timofte. DIV8K: DI- Verse 8k resolution image dataset. InProceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pages 3512–3516. IEEE, 2019. 3, 4
2019
-
[35]
MambaIRv2: Atten- tive state space restoration
Hang Guo, Yong Guo, Yaohua Zha, Yulun Zhang, Wenbo Li, Tao Dai, Shu-Tao Xia, and Yawei Li. MambaIRv2: Atten- tive state space restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 28124–28133, 2025. 2
2025
-
[36]
MambaIR: A simple baseline for image restoration with state-space model
Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, and Shu-Tao Xia. MambaIR: A simple baseline for image restoration with state-space model. InProceedings of the European Conference on Computer Vision (ECCV), pages 222–241. Springer, 2024. 2
2024
-
[37]
Reliability-aware staged low-light gaussian splatting.ResearchGate preprint, 2026
Haojie Guo and Ke Xian. Reliability-aware staged low-light gaussian splatting.ResearchGate preprint, 2026. 3
2026
-
[38]
Toward convolutional blind denoising of real pho- tographs
Shi Guo, Zifei Yan, Kai Zhang, Wangmeng Zuo, and Lei Zhang. Toward convolutional blind denoising of real pho- tographs. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1712– 1722, 2019. 2
2019
-
[39]
Neighbor2Neighbor: Self-supervised de- noising from single noisy images
Tao Huang, Songjiang Li, Xu Jia, Huchuan Lu, and Jianzhuang Liu. Neighbor2Neighbor: Self-supervised de- noising from single noisy images. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14781–14790, 2021. 2
2021
-
[40]
Noise2V oid: Learning denoising from single noisy images
Alexander Krull, Tim-Oliver Buchholz, and Florian Jug. Noise2V oid: Learning denoising from single noisy images. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2129–2137, 2019. 2
2019
-
[41]
High-quality self-supervised deep image denoising.Ad- vances in Neural Information Processing Systems, 32, 2019
Samuli Laine, Tero Karras, Jaakko Lehtinen, and Timo Aila. High-quality self-supervised deep image denoising.Ad- vances in Neural Information Processing Systems, 32, 2019. 2
2019
-
[42]
Noise2Noise: Learning image restoration without clean data
Jaakko Lehtinen, Jacob Munkberg, Jon Hasselgren, Samuli Laine, Tero Karras, Miika Aittala, and Timo Aila. Noise2Noise: Learning image restoration without clean data. InProceedings of the 35th International Conference on Machine Learning, volume 80 ofProceedings of Machine Learning Research, pages 2965–2974. PMLR, 2018. 2
2018
-
[43]
Densesplat: Densifying gaussian splatting slam with neural radiance prior.IEEE Transactions on Visualization & Computer Graphics, (01):1–14, 2025
Mingrui Li, Shuhong Liu, Tianchen Deng, and Hongyu Wang. Densesplat: Densifying gaussian splatting slam with neural radiance prior.IEEE Transactions on Visualization & Computer Graphics, (01):1–14, 2025. 1
2025
-
[44]
Sgs-slam: Semantic gaussian splatting for neural dense slam
Mingrui Li, Shuhong Liu, Heng Zhou, Guohao Zhu, Na Cheng, Tianchen Deng, and Hongyu Wang. Sgs-slam: Semantic gaussian splatting for neural dense slam. InEuro- pean Conference on Computer Vision, pages 163–179, 2025. 1
2025
-
[45]
LSDIR: A large-scale dataset for image restoration
Yawei Li, Kai Zhang, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang, Yun Liu, Denis De- mandolx, et al. LSDIR: A large-scale dataset for image restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1775– 1787, 2023. 3, 4
2023
-
[46]
Ntire 2023 challenge on image denois- ing: Methods and results
Yawei Li, Yulun Zhang, Radu Timofte, Luc Van Gool, Zhi- jun Tu, Kunpeng Du, Hailing Wang, Hanting Chen, Wei Li, Xiaofei Wang, et al. Ntire 2023 challenge on image denois- ing: Methods and results. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1905–1921, 2023. 3
2023
-
[47]
SwinIR: Image restoration using Swin Transformer
Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. SwinIR: Image restoration using Swin Transformer. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 1833– 1844, 2021. 1, 2, 3, 4
2021
-
[48]
Flickr2K dataset
Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Flickr2K dataset. Official dataset release accompanying the NTIRE2017/EDSR repository,
-
[49]
Dataset collected by the authors using the Flickr API. 2, 3, 4
-
[50]
Blind image super-resolution: A survey and beyond.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5):5461–5480, 2022
Anran Liu, Yihao Liu, Jinjin Gu, Yu Qiao, and Chao Dong. Blind image super-resolution: A survey and beyond.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5):5461–5480, 2022. 3
2022
-
[51]
LIU4K-v2 dataset
Jiaying Liu, Dong Liu, Wenhan Yang, Sifeng Xia, Xiaoshuai Zhang, and Yuanying Dai. LIU4K-v2 dataset. Official dataset page, 2020. The official LIU4K-v2 page recom- mends citing the accompanying compression artifact reduc- tion benchmark paper. 3, 4
2020
-
[52]
Shuhong Liu, Chenyu Bao, Ziteng Cui, Xuangeng Chu, Bin Ren, Lin Gu, Xiang Chen, Mingrui Li, Long Ma, Marcos V . Conde, Radu Timofte, et al. NTIRE 2026 3D restoration and reconstruction in adverse conditions: RealX3D challenge re- sults.arXiv preprint arXiv:2604.04135, 2026. 1
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[53]
Shuhong Liu, Chenyu Bao, Ziteng Cui, Yun Liu, Xuangeng Chu, Lin Gu, Marcos V Conde, Ryo Umagami, Tomohiro 8 Hashimoto, Zijian Hu, et al. Realx3d: A physically-degraded 3d benchmark for multi-view visual restoration and recon- struction.arXiv preprint arXiv:2512.23437, 2026. 1
-
[54]
Deraings: Gaussian splatting for enhanced scene reconstruction in rainy environments.Proceedings of the AAAI Conference on Artificial Intelligence, 39(5):5558– 5566, 2025
Shuhong Liu, Xiang Chen, Hongming Chen, Quanfeng Xu, and Mingrui Li. Deraings: Gaussian splatting for enhanced scene reconstruction in rainy environments.Proceedings of the AAAI Conference on Artificial Intelligence, 39(5):5558– 5566, 2025. 1
2025
-
[55]
Mg-slam: Structure gaussian splatting slam with manhattan world hy- pothesis.IEEE Transactions on Automation Science and En- gineering, 22:17034–17049, 2025
Shuhong Liu, Tianchen Deng, Heng Zhou, Liuzhuozheng Li, Hongyu Wang, Danwei Wang, and Mingrui Li. Mg-slam: Structure gaussian splatting slam with manhattan world hy- pothesis.IEEE Transactions on Automation Science and En- gineering, 22:17034–17049, 2025. 1
2025
-
[56]
Shuhong Liu, Xining Ge, Ziying Gu, Lin Gu, Ziteng Cui, Xuangeng Chu, Jun Liu, Dong Li, and Tatsuya Harada. De- noising the deep sky: Physics-based ccd noise formation for astronomical imaging.arXiv preprint arXiv:2601.23276,
-
[57]
I2-nerf: Learning neural radiance fields un- der physically-grounded media interactions
Shuhong Liu, Lin Gu, Ziteng Cui, Xuangeng Chu, and Tat- suya Harada. I2-nerf: Learning neural radiance fields un- der physically-grounded media interactions. InAdvances in Neural Information Processing Systems, 2025. 1
2025
-
[58]
Yuhao Liu, Dingju Wang, and Ziyang Zheng. ELoG-GS: Dual-branch gaussian splatting with luminance-guided en- hancement for extreme low-light 3D reconstruction.arXiv preprint arXiv:2604.12592, 2026. 3
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[59]
Swin Transformer: Hierarchical vision transformer using shifted windows
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin Transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 10012–10022, 2021. 2
2021
-
[60]
Waterloo Ex- ploration Database: New challenges for image quality as- sessment models.IEEE Transactions on Image Processing, 26(2):1004–1016, 2017
Kede Ma, Zhengfang Duanmu, Qingbo Wu, Zhou Wang, Hongwei Yong, Hongliang Li, and Lei Zhang. Waterloo Ex- ploration Database: New challenges for image quality as- sessment models.IEEE Transactions on Image Processing, 26(2):1004–1016, 2017. 2, 3
2017
-
[61]
A holistic approach to cross-channel im- age noise modeling and its application to image denoising
Seonghyeon Nam, Youngbae Hwang, Yasuyuki Matsushita, and Seon Joo Kim. A holistic approach to cross-channel im- age noise modeling and its application to image denoising. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1683–1691, 2016. 3
2016
-
[62]
Benchmarking denoising al- gorithms with real photographs
Tobias Plotz and Stefan Roth. Benchmarking denoising al- gorithms with real photographs. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1586–1595, 2017. 3
2017
-
[63]
Self2Self with dropout: Learning self-supervised denoising from single image
Yuhui Quan, Mingqin Chen, Tongyao Pang, and Hui Ji. Self2Self with dropout: Learning self-supervised denoising from single image. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 1890–1898, 2020. 2
2020
-
[64]
The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report
Bin Ren, Hang Guo, Yan Shu, Jiaqi Ma, Ziteng Cui, Shuhong Liu, Guofeng Mei, Lei Sun, Zongwei Wu, Fahad Shahbaz Khan, Salman Khan, Radu Timofte, Yawei Li, et al. The eleventh NTIRE 2026 efficient super-resolution challenge re- port.arXiv preprint arXiv:2604.03198, 2026. 1
work page internal anchor Pith review Pith/arXiv arXiv 2026
-
[65]
The Third Challenge on Image Denoising at NTIRE 2026: Methods and Results
Lei Sun, Hang Guo, Bin Ren, Shaolin Su, Xian Wang, Danda Pani Paudel, Luc Van Gool, Radu Timofte, Yawei Li, et al. The Third Challenge on Image Denoising at NTIRE 2026: Methods and Results. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2026. 2
2026
-
[66]
Mem- Net: A persistent memory network for image restoration
Ying Tai, Jian Yang, Xiaoming Liu, and Chunyan Xu. Mem- Net: A persistent memory network for image restoration. In Proceedings of the IEEE International Conference on Com- puter Vision, pages 4539–4547, 2017. 2
2017
-
[67]
FastDVD- net: Towards real-time deep video denoising without flow estimation
Matias Tassano, Julie Delon, and Thomas Veit. FastDVD- net: Towards real-time deep video denoising without flow estimation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1354– 1363, 2020. 2
2020
-
[68]
MAXIM: Multi-axis MLP for image processing
Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, and Yinxiao Li. MAXIM: Multi-axis MLP for image processing. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5769–5780, 2022. 2
2022
-
[69]
Recovering realistic texture in image super-resolution by deep spatial feature transform
Xintao Wang, Ke Yu, Chao Dong, and Chen Change Loy. Recovering realistic texture in image super-resolution by deep spatial feature transform. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 606–615, 2018. 3, 4
2018
-
[70]
ESRGAN: En- hanced super-resolution generative adversarial networks
Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. ESRGAN: En- hanced super-resolution generative adversarial networks. In Proceedings of the European Conference on Computer Vi- sion Workshops, 2018. 3, 4
2018
-
[71]
Uformer: A general u-shaped transformer for image restoration
Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, and Houqiang Li. Uformer: A general u-shaped transformer for image restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pat- tern Recognition, pages 17683–17693, 2022. 1, 2
2022
-
[72]
Restormer: Efficient transformer for high-resolution image restoration
Syed Waqas Zamir, Aditya Arora, Salman Khan, Mu- nawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang. Restormer: Efficient transformer for high-resolution image restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5728– 5739, 2022. 1, 2, 3, 5
2022
-
[73]
CycleISP: Real image restoration via improved data synthesis
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. CycleISP: Real image restoration via improved data synthesis. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2696– 2705, 2020. 2
2020
-
[74]
Learning enriched features for real image restoration and enhancement
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Learning enriched features for real image restoration and enhancement. InProceedings of the European Confer- ence on Computer Vision (ECCV), pages 492–511. Springer,
-
[75]
Multi-stage progressive image restoration
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Multi-stage progressive image restoration. InPro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14821–14831, 2021. 1, 2
2021
-
[76]
Learning enriched features for fast image restoration and enhancement.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(2):1934–1948, 2022
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling 9 Shao. Learning enriched features for fast image restoration and enhancement.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(2):1934–1948, 2022. 2
1934
-
[77]
Practical blind image denoising via Swin- Conv-UNet and data synthesis.Machine Intelligence Re- search, 20(6):822–836, 2023
Kai Zhang, Yawei Li, Jingyun Liang, Jiezhang Cao, Yu- lun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, and Luc Van Gool. Practical blind image denoising via Swin- Conv-UNet and data synthesis.Machine Intelligence Re- search, 20(6):822–836, 2023. 2
2023
-
[78]
Plug-and-play image restora- tion with deep denoiser prior.IEEE Transactions on Pat- tern Analysis and Machine Intelligence, 44(10):6360–6376,
Kai Zhang, Yawei Li, Wangmeng Zuo, Lei Zhang, Luc Van Gool, and Radu Timofte. Plug-and-play image restora- tion with deep denoiser prior.IEEE Transactions on Pat- tern Analysis and Machine Intelligence, 44(10):6360–6376,
-
[79]
Beyond a gaussian denoiser: Residual learning of deep CNN for image denoising.IEEE Transactions on Image Processing, 26(7):3142–3155, 2017
Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. Beyond a gaussian denoiser: Residual learning of deep CNN for image denoising.IEEE Transactions on Image Processing, 26(7):3142–3155, 2017. 1, 2
2017
-
[80]
FFDNet: Toward a fast and flexible solution for CNN-based im- age denoising.IEEE Transactions on Image Processing, 27(9):4608–4622, 2018
Kai Zhang, Wangmeng Zuo, and Lei Zhang. FFDNet: Toward a fast and flexible solution for CNN-based im- age denoising.IEEE Transactions on Image Processing, 27(9):4608–4622, 2018. 1, 2
2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.