arxiv: 2604.11468 · v2 · submitted 2026-04-13 · 💻 cs.CV


Beyond Model Design: Data-Centric Training and Self-Ensemble for Gaussian Color Image Denoising

Gengjia Chang , Xining Ge , Weijun Yuan , Zhan Li , Qiurong Song , Luen Zhu , Shuhong Liu


Pith reviewed 2026-05-10 14:55 UTC · model grok-4.3

classification 💻 cs.CV

keywords Gaussian color image denoising · Restormer · data-centric training · self-ensemble · NTIRE challenge · PSNR improvement · two-stage optimization


The pith

Expanding training data and applying self-ensemble to Restormer improves Gaussian color image denoising to 30.762 dB PSNR.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper shows that large gains in fixed-noise Gaussian color image denoising are possible without inventing new network architectures. The authors start from the public Restormer model and, instead of changing it, enlarge the training corpus with more public images, switch to a two-stage optimization schedule, and add eight-fold geometric self-ensemble at test time. These changes produce a final validation score of 30.762 dB PSNR and 0.861 SSIM on the NTIRE 2026 challenge set. The reported improvement of up to 3.366 dB over the original Restormer baseline is attributed mainly to the bigger dataset and staged training. A reader would care because the result reframes progress in denoising as a question of data scale and inference tricks rather than model invention.
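To make the noise model and the headline metric concrete, here is a minimal NumPy sketch. It assumes the common benchmark convention that σ = 50 is measured on the 0 to 255 intensity scale and that the noisy image is not clipped; neither detail is stated in this summary. The function names `add_gaussian_noise` and `psnr` are illustrative, not taken from the paper.

```python
import numpy as np

def add_gaussian_noise(clean, sigma=50.0, seed=0):
    """Add i.i.d. Gaussian noise to every pixel and channel (assumed 0-255 scale, no clipping)."""
    rng = np.random.default_rng(seed)
    return clean.astype(np.float64) + rng.normal(0.0, sigma, size=clean.shape)

def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB between a reference image and an estimate."""
    diff = reference.astype(np.float64) - estimate.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))

if __name__ == "__main__":
    clean = np.full((256, 256, 3), 128, dtype=np.uint8)  # flat gray test image
    noisy = add_gaussian_noise(clean, sigma=50.0)
    print(f"PSNR of the noisy input: {psnr(clean, noisy):.2f} dB")
```

Under these assumptions the unclipped noisy input sits near 20·log10(255/50) ≈ 14.15 dB, which gives a sense of how far a 30.762 dB result is from the degraded input.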

Core claim

By expanding the standard multi-dataset training recipe with larger and more diverse public image corpora, organizing optimization into two stages, and applying eight-fold geometric self-ensemble at inference while retaining a TLC-style local wrapper, the mature Restormer architecture reaches 30.762 dB PSNR and 0.861 SSIM on the 100-image challenge validation set for Gaussian color denoising at σ = 50, exceeding the public pretrained baseline by up to 3.366 dB. Ablation experiments indicate that the dominant contribution comes from the enlarged corpus and two-stage schedule, with self-ensemble supplying smaller but consistent further gains and the local wrapper adding negligible quantitative contribution.

What carries the argument

Two-stage optimization on an expanded multi-dataset corpus combined with eight-fold geometric self-ensemble applied to the fixed Restormer backbone.
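The eight-fold geometric self-ensemble can be sketched as follows. This is a minimal NumPy illustration and not the authors' code: `denoise` is a hypothetical callable standing in for the Restormer forward pass, and the eight views are assumed to be the four 90-degree rotations with and without a horizontal flip, which is the usual meaning of ×8 self-ensemble.

```python
import numpy as np

def geometric_transforms():
    """Yield (forward, inverse) pairs for the 8 flip/rotation views of an H x W x C image."""
    for k in range(4):
        for flip in (False, True):
            def forward(x, k=k, flip=flip):
                y = np.flip(x, axis=1) if flip else x      # optional horizontal flip
                return np.rot90(y, k, axes=(0, 1))         # then rotate by k * 90 degrees

            def inverse(y, k=k, flip=flip):
                x = np.rot90(y, -k, axes=(0, 1))           # undo the rotation first
                return np.flip(x, axis=1) if flip else x   # then undo the flip

            yield forward, inverse

def self_ensemble(denoise, noisy):
    """Average the denoiser's outputs over all 8 views, mapped back to the input orientation.

    `denoise` is a placeholder for any image-to-image model (an assumption for illustration).
    """
    outputs = [inverse(denoise(forward(noisy))) for forward, inverse in geometric_transforms()]
    return np.mean(outputs, axis=0)
```

The averaging only helps when the model is not exactly equivariant to flips and rotations; for a perfectly equivariant denoiser all eight outputs coincide and the ensemble returns the single-pass result, at eight times the inference cost.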

If this is right

Existing restoration backbones still contain substantial unused capacity that can be unlocked by larger and more varied training data.
Two-stage optimization schedules improve final denoising quality for a fixed noise level without changing model size.
Geometric self-ensemble delivers reliable though modest metric gains at inference time.
Local inference wrappers such as TLC contribute little to performance in this high-noise regime.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same data-expansion tactic could be tested on other mature models for tasks such as deblurring or inpainting.
Challenge results may increasingly reward careful dataset curation and training recipes over novel architecture proposals.
If the pattern holds, research effort could shift toward systematic collection of large, diverse clean-image corpora.

Load-bearing premise

The measured gains arise mainly from the added training images and two-stage schedule rather than from unreported differences in code, hyperparameters, or the exact makeup of the new datasets.

What would settle it

Retraining the identical Restormer model on only the original smaller corpus while keeping the two-stage schedule and self-ensemble unchanged, then measuring whether PSNR falls below 29.5 dB on the same validation set.

Figures

Figures reproduced from arXiv: 2604.11468 by Gengjia Chang, Luen Zhu, Qiurong Song, Shuhong Liu, Weijun Yuan, Xining Ge, Zhan Li.

**Figure 1.** Overview of the final denoising pipeline. A noisy image is restored by Restormer, the TLC-style wrapper is retained for inference ...

**Figure 2.** Qualitative comparison among the noisy input, single-pass Restormer, Restormer with ...

**Figure 3.** Same-protocol PSNR comparison with the public ...

Original abstract

This paper presents our solution to the NTIRE 2026 Image Denoising Challenge (Gaussian color image denoising at fixed noise level $\sigma = 50$). Rather than proposing a new restoration backbone, we revisit the performance boundary of the mature Restormer architecture from two complementary directions: stronger data-centric training and more complete Test-Time capability release. Starting from the public Restormer $\sigma\!=\!50$ baseline, we expand the standard multi-dataset training recipe with larger and more diverse public image corpora and organize optimization into two stages. At inference, we apply $\times 8$ geometric self-ensemble to further release model capacity. A TLC-style local inference wrapper is retained for implementation consistency; however, systematic ablation reveals its quantitative contribution to be negligible in this setting. On the challenge validation set of 100 images, our final submission achieves 30.762 dB PSNR and 0.861 SSIM, improving over the public Restormer $\sigma\!=\!50$ pretrained baseline by up to 3.366 dB PSNR. Ablation studies show that the dominant gain originates from the expanded training corpus and the two-stage optimization schedule, and self-ensemble provides marginal but consistent improvement.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This gets a 3.4 dB PSNR lift on the NTIRE sigma=50 denoising validation set by expanding data and using two-stage training on Restormer plus self-ensemble, but the gains aren't isolated from possible implementation differences.


The headline result here is a 3.366 dB PSNR improvement on the 100-image validation set for the NTIRE 2026 σ=50 Gaussian color denoising task. They start from the public Restormer baseline and push it further with expanded training corpora, a two-stage training schedule, and ×8 geometric self-ensemble at inference. The final numbers are 30.762 dB PSNR and 0.861 SSIM. What stands out is that they keep the architecture fixed and focus on data-centric and training adjustments instead of proposing a new network. The abstract notes that ablations attribute most of the gain to the larger and more diverse datasets plus the staged optimization, with self-ensemble adding a smaller consistent boost. They also test a TLC-style wrapper but find it negligible here. This approach is straightforward and could help others working with similar transformer-based denoisers on benchmark challenges.

The main soft spot is the lack of a direct control experiment. To be sure the gains come from the described data expansion and schedule, it would help to see the public baseline re-trained in their exact setup with everything else held constant. Without that, differences in data augmentation, learning rate schedules, or even random seeds could explain part of the delta. The paper is based on empirical measurements on a held-out set, so there's no circularity, but the causal story would be stronger with tighter isolation.

This kind of work is aimed at people entering image restoration challenges or tuning Restormer for denoising. It shows what can be achieved with careful data and training choices on an established model. The techniques themselves are not new, but the specific application and reported lift on this track make it a useful data point. I would bring this to a reading group if the group is focused on practical computer vision or denoising methods, as the numbers are concrete.

It doesn't introduce a new framework, so it may not be something I'd cite in my own papers unless I'm directly comparing to Restormer on this benchmark. Still, the empirical result is sharp enough that a serious editor should send it to peer review for a closer look at the training details and ablations.

Referee Report

1 major / 2 minor

Summary. The manuscript presents a solution for the NTIRE 2026 Gaussian color image denoising challenge at fixed noise level σ=50. Rather than introducing a new backbone, the authors start from the public Restormer σ=50 pretrained model and improve performance via data-centric training (expansion of the standard multi-dataset corpus with larger and more diverse public images) and a two-stage optimization schedule. At inference they add ×8 geometric self-ensemble while retaining a TLC-style local wrapper whose contribution is reported as negligible. On the challenge validation set of 100 images the final submission reaches 30.762 dB PSNR and 0.861 SSIM, improving over the public baseline by up to 3.366 dB PSNR. Ablation studies attribute the dominant gains to the expanded corpus and two-stage schedule, with self-ensemble providing marginal but consistent further improvement.

Significance. If the attribution of gains holds, the work provides concrete empirical support for prioritizing training corpus size and staged optimization over architectural novelty in mature low-level vision models. The reported 3.366 dB lift on a 100-image validation set, together with the explicit ablation isolating data and schedule effects, offers a useful reference point for practitioners working on Gaussian denoising. The decision to release test-time capacity via self-ensemble while quantifying the wrapper's limited impact is a practical contribution. The overall approach is reproducible in principle because it builds on a public baseline and public datasets.
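For scale, the reported lift can be translated into mean-squared-error terms. The worked arithmetic below is only indicative: it treats the dB difference as if it applied to a single pair of MSE values with the same peak intensity, whereas challenge PSNR is typically averaged per image, and the paper reports the gain as "up to" 3.366 dB.

```latex
\[
\Delta\mathrm{PSNR}
  = 10\log_{10}\frac{\mathrm{MSE}_{\text{base}}}{\mathrm{MSE}_{\text{new}}}
\quad\Longrightarrow\quad
\frac{\mathrm{MSE}_{\text{base}}}{\mathrm{MSE}_{\text{new}}}
  = 10^{3.366/10} \approx 2.17
\]
\[
\mathrm{PSNR}_{\text{base}} \approx 30.762 - 3.366 = 27.396\ \text{dB}
\quad\text{(if the maximal gain refers to the final submission)}
\]
```

Read this way, the full pipeline roughly halves the residual error energy relative to the public baseline.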

major comments (1)

Abstract (and the ablation studies referenced therein): the central claim that 'the dominant gain originates from the expanded training corpus and the two-stage optimization schedule' is load-bearing for the paper's contribution. The comparison is made to the 'public Restormer σ=50 pretrained baseline,' yet the manuscript does not state that this baseline was re-trained inside the authors' exact training loop with all other variables (data augmentation, optimizer, learning-rate schedule, mixed precision, random seeds, etc.) frozen. Without that controlled re-implementation, the 3.366 dB PSNR delta cannot be unambiguously attributed to the described changes rather than incidental implementation differences.
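The kind of controlled comparison requested here can be laid out as a small factorial grid. The sketch below is purely illustrative: the factor names, the frozen settings, and their placeholder values are hypothetical and are not taken from the manuscript's ablation table.

```python
from itertools import product

# Hypothetical factors that the paper varies; names are placeholders for illustration.
FACTORS = {
    "corpus": ("original", "expanded"),
    "schedule": ("single_stage", "two_stage"),
    "self_ensemble": (False, True),
    "tlc_wrapper": (False, True),
}

# Settings the referee asks to keep frozen across every run (placeholder values).
FROZEN = {
    "seed": 0,
    "optimizer": "same",
    "lr_schedule": "same",
    "augmentation": "same",
    "mixed_precision": "same",
}

def ablation_grid():
    """Yield one run configuration per combination of the varied factors."""
    names = list(FACTORS)
    for values in product(*(FACTORS[name] for name in names)):
        yield {**FROZEN, **dict(zip(names, values))}

def differs_in_one_factor(a, b):
    """True when two runs differ in exactly one varied factor, so their PSNR gap is attributable."""
    return sum(a[name] != b[name] for name in FACTORS) == 1
```

Only pairs of runs for which `differs_in_one_factor` holds support a clean attribution of a PSNR difference to a single change; the comparison against externally trained public weights does not satisfy this condition.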

minor comments (2)

The exact composition, sizes, and preprocessing of the 'larger and more diverse public image corpora' added to the training set are not enumerated; a table listing each source and its image count would allow readers to reproduce the data-centric recipe.
While the abstract states that systematic ablation shows the TLC-style wrapper's contribution is negligible, no quantitative numbers or table for this ablation appear in the provided summary; including those results would strengthen the claim.

Simulated Author's Rebuttal

1 response · 0 unresolved

We thank the referee for the positive overall assessment and for identifying a key point that requires clarification in our experimental design. We address the concern directly below and will revise the manuscript to improve transparency around the baseline comparison.


Referee: Abstract (and the ablation studies referenced therein): the central claim that 'the dominant gain originates from the expanded training corpus and the two-stage optimization schedule' is load-bearing for the paper's contribution. The comparison is made to the 'public Restormer σ=50 pretrained baseline,' yet the manuscript does not state that this baseline was re-trained inside the authors' exact training loop with all other variables (data augmentation, optimizer, learning-rate schedule, mixed precision, random seeds, etc.) frozen. Without that controlled re-implementation, the 3.366 dB PSNR delta cannot be unambiguously attributed to the described changes rather than incidental implementation differences.

Authors: We agree that a fully controlled re-implementation of the baseline within our exact training loop would provide the strongest isolation of effects. The manuscript uses the publicly released Restormer σ=50 pretrained weights directly as the reference point, as is standard for NTIRE challenge submissions; these weights were not re-trained from scratch in our environment. Our procedure starts from this public initialization and continues optimization using the expanded corpus and two-stage schedule. The ablation studies isolate the incremental contributions of data expansion and the staged schedule by varying these elements while holding the initialization and other pipeline details fixed. We acknowledge that minor implementation differences (e.g., random seeds, mixed-precision settings) between the original public training and our continuation cannot be ruled out. In the revised manuscript we will explicitly state that the baseline consists of the official public pretrained model without additional training in our loop, and we will qualify the attribution claim to reflect that the reported gains arise from the additional training steps we introduce. This clarification will be added to the abstract and the experimental section.

Revision: yes

Circularity Check

0 steps flagged

No circularity: all claims are direct empirical measurements on held-out validation data


The paper reports PSNR/SSIM numbers obtained by training the public Restormer architecture on expanded corpora with a two-stage schedule followed by geometric self-ensemble. No equations, first-principles derivations, or fitted parameters are presented that reduce by construction to the inputs. Ablations isolate contributions via standard train/val splits; the central performance delta is measured on an external 100-image challenge validation set and does not rely on self-citation chains, uniqueness theorems, or renaming of known results. The work is self-contained empirical engineering against public baselines.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The work is purely empirical and applies standard supervised training to a public architecture; no new mathematical axioms, free parameters in a derivation sense, or postulated entities are introduced.



Forward citations

Cited by 9 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

FluxFlow: Conservative Flow-Matching for Astronomical Image Super-Resolution
cs.CV 2026-05 unverdicted novelty 7.0

FluxFlow is a conservative pixel-space flow-matching framework for astronomical super-resolution that incorporates real atmospheric uncertainty and a training-free Wiener correction, outperforming baselines on a new 1...

FluxFlow: Conservative Flow-Matching for Astronomical Image Super-Resolution
cs.CV 2026-05 unverdicted novelty 5.0

FluxFlow uses conservative pixel-space flow-matching with uncertainty weights and Wiener test-time correction to outperform baselines on photometric and scientific accuracy for ground-to-space super-resolution, valida...

Dehaze-then-Splat: Generative Dehazing with Physics-Informed 3D Gaussian Splatting for Smoke-Free Novel View Synthesis
cs.CV 2026-04 unverdicted novelty 5.0

Dehaze-then-Splat uses per-frame generative dehazing followed by physics-regularized 3D Gaussian Splatting to achieve 20.98 dB PSNR and 0.683 SSIM on the Akikaze scene, a 1.5 dB gain over baseline by mitigating cross-...

3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models
cs.CV 2026-04 unverdicted novelty 5.0

A framework that combines MLLM-based image enhancement with a medium-aware 3D Gaussian Splatting model to reconstruct and render smoke scenes.

CLIP-Guided Data Augmentation for Night-Time Image Dehazing
cs.CV 2026-04 unverdicted novelty 5.0

CLIP-guided selection of external data plus staged NAFNet training and inference fusion provides an effective pipeline for nighttime image dehazing in the NTIRE 2026 challenge.

Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation
cs.CV 2026-04 unverdicted novelty 4.0

A dual-branch training-free ensemble fuses a hybrid attention network with a Mamba-based model via weighted combination to enhance super-resolution PSNR on DIV2K x4.

Dual-Branch Remote Sensing Infrared Image Super-Resolution
cs.CV 2026-04 unverdicted novelty 4.0

Dual-branch fusion of HAT-L and MambaIRv2-L with eight-way ensemble and equal-weight averaging outperforms single branches on PSNR, SSIM, and challenge score for infrared super-resolution.

SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration
cs.CV 2026-04 conditional novelty 4.0

SmokeGS-R uses refined dark channel prior for pseudo-clean supervision to train 3DGS geometry, followed by ensemble-based appearance harmonization, achieving PSNR 15.21 and outperforming baselines on smoke restoration...

NTIRE 2026 3D Restoration and Reconstruction in Real-world Adverse Conditions: RealX3D Challenge Results
cs.CV 2026-04 unverdicted novelty 2.0

The NTIRE 2026 challenge reports measurable progress in 3D reconstruction pipelines that handle real-world low-light and smoke degradation via the RealX3D benchmark.

Reference graph

Works this paper leans on

85 extracted references · 14 canonical work pages · cited by 8 Pith papers · 11 internal anchors

[1]

Brown, et al

Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, et al. NTIRE 2020 challenge on real im- age denoising: Dataset, methods and results. InProceedings of the IEEE/CVF Conference on Computer Vision and Pat- tern Recognition Workshops, pages 2077–2088, 2020. 3 6

2020
[2]

Noise Flow: Noise modeling with con- ditional normalizing flows

Abdelrahman Abdelhamed, Marcus A Brubaker, and Michael S Brown. Noise Flow: Noise modeling with con- ditional normalizing flows. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3165– 3173, 2019. 3

2019
[3]

A high-quality denoising dataset for smartphone cameras

Abdelrahman Abdelhamed, Stephen Lin, and Michael S Brown. A high-quality denoising dataset for smartphone cameras. InProceedings of the IEEE Conference on Com- puter Vision and Pattern Recognition, pages 1692–1700,
[4]

Brown, et al

Abdelrahman Abdelhamed, Radu Timofte, Michael S. Brown, et al. NTIRE 2019 challenge on real image denois- ing: Methods and results. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 2197–2210, 2019. 3

2019
[5]

NTIRE 2017 chal- lenge on single image super-resolution: Dataset and study

Eirikur Agustsson and Radu Timofte. NTIRE 2017 chal- lenge on single image super-resolution: Dataset and study. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 126–135, 2017. 2, 3, 4

2017
[6]

RENOIR–a dataset for real low-light image noise reduction.Journal of Visual Commu- nication and Image Representation, 51:144–154, 2018

Josue Anaya and Adrian Barbu. RENOIR–a dataset for real low-light image noise reduction.Journal of Visual Commu- nication and Image Representation, 51:144–154, 2018. 3

2018
[7]

Real image denoising with feature attention

Saeed Anwar and Nick Barnes. Real image denoising with feature attention. InProceedings of the IEEE/CVF Inter- national Conference on Computer Vision, pages 3155–3164,
[8]

Contour detection and hierarchical image seg- mentation.IEEE Transactions on Pattern Analysis and Ma- chine Intelligence, 33(5):898–916, 2011

Pablo Arbel ´aez, Michael Maire, Charless Fowlkes, and Ji- tendra Malik. Contour detection and hierarchical image seg- mentation.IEEE Transactions on Pattern Analysis and Ma- chine Intelligence, 33(5):898–916, 2011. 2, 3

2011
[9]

Noise2Self: Blind denois- ing by self-supervision

Joshua Batson and Loic Royer. Noise2Self: Blind denois- ing by self-supervision. InProceedings of the International Conference on Machine Learning, pages 524–533. PMLR,
[10]

Unprocessing images for learned raw denoising

Tim Brooks, Ben Mildenhall, Tianfan Xue, Jiawen Chen, Dillon Sharlet, and Jonathan T Barron. Unprocessing images for learned raw denoising. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11036–11045, 2019. 3

2019
[11]

A non- local algorithm for image denoising

Antoni Buades, Bartomeu Coll, and J-M Morel. A non- local algorithm for image denoising. InProceedings of the 2005 IEEE Computer Society Conference on Computer Vi- sion and Pattern Recognition (CVPR’05), volume 2, pages 60–65. IEEE, 2005. 2

2005
[12]

GenSmoke-GS: A Multi-Stage Method for Novel View Synthesis from Smoke-Degraded Images Using a Generative Model

Qida Cao, Xinyuan Hu, Changyue Shi, Jiajun Ding, Zhou Yu, and Jun Yu. GenSmoke-GS: A multi-stage method for novel view synthesis from smoke-degraded images using a generative model.arXiv preprint arXiv:2604.03039, 2026. 3

work page internal anchor Pith review Pith/arXiv arXiv 2026
[13]

Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation

Gengjia Chang, Xining Ge, Weijun Yuan, Zhan Li, Qiurong Song, Luen Zhu, and Shuhong Liu. Training-free model en- semble for single-image super-resolution via strong-branch compensation.arXiv preprint arXiv:2604.11564, 2026. 2

work page internal anchor Pith review Pith/arXiv arXiv 2026
[14]

Dehaze-then-Splat: Generative Dehazing with Physics-Informed 3D Gaussian Splatting for Smoke-Free Novel View Synthesis

Boss Chen and Hanqing Wang. Dehaze-then-splat: Gen- erative dehazing with physics-informed 3D gaussian splat- ting for smoke-free novel view synthesis.arXiv preprint arXiv:2604.13589, 2026. 3

work page internal anchor Pith review Pith/arXiv arXiv 2026
[15]

Real-world image denoising with deep boost- ing.IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(12):3071–3087, 2019

Chang Chen, Zhiwei Xiong, Xinmei Tian, Zheng-Jun Zha, and Feng Wu. Real-world image denoising with deep boost- ing.IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(12):3071–3087, 2019. 2

2019
[16]

Pre-trained image processing transformer

Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, and Wen Gao. Pre-trained image processing transformer. InPro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12299–12310, 2021. 2

2021
[17]

Real-world single image super-resolution: A brief review.Information Fusion, 79:124–145, 2022

Honggang Chen, Xiaohai He, Linbo Qing, Yuanyuan Wu, Chao Ren, Ray E Sheriff, and Ce Zhu. Real-world single image super-resolution: A brief review.Information Fusion, 79:124–145, 2022. 3

2022
[18]

Simple baselines for image restoration

Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, and Jian Sun. Simple baselines for image restoration. InProceedings of the European Conference on Computer Vision (ECCV), pages 17–33. Springer, 2022. 2

2022
[19]

HiNet: Half instance normalization network for image restoration

Liangyu Chen, Xin Lu, Jie Zhang, Xiaojie Chu, and Cheng- peng Chen. HiNet: Half instance normalization network for image restoration. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 182–192, 2021. 2

2021
[20]

Activating more pixels in image super- resolution transformer

Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, and Chao Dong. Activating more pixels in image super- resolution transformer. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22367–22377, 2023. 2

2023
[21]

Yunjin Chen and Thomas Pock. Trainable nonlinear reaction diffusion: A flexible framework for fast and effective image restoration.IEEE Transactions on Pattern Analysis and Ma- chine Intelligence, 39(6):1256–1272, 2016. 2

2016
[22]

The fourth challenge on image super-resolution (×4) at NTIRE 2026: Bench- mark results and method overview

Zheng Chen, Kai Liu, Jingkai Wang, Xianglong Yan, Jianze Li, Ziqing Zhang, Jue Gong, Jiatong Li, Lei Sun, Xiaoyang Liu, Radu Timofte, Yulun Zhang, et al. The fourth challenge on image super-resolution (×4) at NTIRE 2026: Bench- mark results and method overview. InProceedings of the Computer Vision and Pattern Recognition Conference Work- shops, 2026

2026
[23]

Cross aggregation transformer for image restora- tion.Advances in Neural Information Processing Systems, 35:25478–25490, 2022

Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, Xin Yuan, et al. Cross aggregation transformer for image restora- tion.Advances in Neural Information Processing Systems, 35:25478–25490, 2022. 2

2022
[24]

Improving image restoration by revisiting global information aggregation

Xiaojie Chu, Liangyu Chen, Chengpeng Chen, and Xin Lu. Improving image restoration by revisiting global information aggregation. InProceedings of the European Conference on Computer Vision (ECCV), pages 53–71. Springer, 2022. 2, 3, 4, 5

2022
[25]

ViDeNN: Deep blind video denoising

Michele Claus and Jan Van Gemert. ViDeNN: Deep blind video denoising. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition Work- shops, pages 0–0, 2019. 2

2019
[26]

Unifying color and lightness correction with view-adaptive curve ad- justment for robust 3d novel view synthesis.arXiv preprint arXiv:2602.18322, 2026

Ziteng Cui, Shuhong Liu, Xiaoyu Dong, Xuangeng Chu, Lin Gu, Ming-Hsuan Yang, and Tatsuya Harada. Unifying color and lightness correction with view-adaptive curve ad- justment for robust 3d novel view synthesis.arXiv preprint arXiv:2602.18322, 2026. 1

work page arXiv 2026
[27]

Image denoising by sparse 3-d transform- 7 domain collaborative filtering.IEEE Transactions on Image Processing, 16(8):2080–2095, 2007

Kostadin Dabov, Alessandro Foi, Vladimir Katkovnik, and Karen Egiazarian. Image denoising by sparse 3-d transform- 7 domain collaborative filtering.IEEE Transactions on Image Processing, 16(8):2080–2095, 2007. 2

2080
[28]

Ren, Chun-Le Guo, and Chongyi Li

Zheng-Peng Duan, Jiawei Zhang, Xin Jin, Ziheng Zhang, Zheng Xiong, Dongqing Zou, Jimmy S. Ren, Chun-Le Guo, and Chongyi Li. NKUSR8K: dataset release in the official DiT4SR project repository. Official project repository, 2025. Repository documentation states that the NKUSR8K dataset is released for training with the DiT4SR project. 3, 4

2025
[29]

Image denoising via sparse and redundant representations over learned dictionar- ies.IEEE Transactions on Image Processing, 15(12):3736– 3745, 2006

Michael Elad and Michal Aharon. Image denoising via sparse and redundant representations over learned dictionar- ies.IEEE Transactions on Image Processing, 15(12):3736– 3745, 2006. 2

2006
[30]

SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration

Xueming Fu and Lixia Han. SmokeGS-R: Physics- guided pseudo-clean 3DGS for real-world multi-view smoke restoration.arXiv preprint arXiv:2604.05301, 2026. 3

work page internal anchor Pith review Pith/arXiv arXiv 2026
[31]

Dual-Branch Remote Sensing Infrared Image Super-Resolution

Xining Ge, Gengjia Chang, Weijun Yuan, Zhan Li, Zhanglu Chen, Boyang Yao, Yihang Chen, Yifan Deng, and Shuhong Liu. Dual-branch remote sensing infrared image super- resolution.arXiv preprint arXiv:2604.10112, 2026. 2

work page internal anchor Pith review Pith/arXiv arXiv 2026
[32]

CLIP-Guided Data Augmentation for Night-Time Image Dehazing

Xining Ge, Weijun Yuan, Gengjia Chang, Xuyang Li, and Shuhong Liu. Clip-guided data augmentation for night-time image dehazing.arXiv preprint arXiv:2604.05500, 2026. 1

work page internal anchor Pith review Pith/arXiv arXiv 2026
[33]

Deep burst denoising

Cl ´ement Godard, Kevin Matzen, and Matt Uyttendaele. Deep burst denoising. InProceedings of the European Con- ference on Computer Vision (ECCV), pages 538–554, 2018. 2

2018
[34]

DIV8K: DI- Verse 8k resolution image dataset

Shuhang Gu, Andreas Lugmayr, Martin Danelljan, Manuel Fritsche, Julien Lamour, and Radu Timofte. DIV8K: DI- Verse 8k resolution image dataset. InProceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pages 3512–3516. IEEE, 2019. 3, 4

2019
[35]

MambaIRv2: Atten- tive state space restoration

Hang Guo, Yong Guo, Yaohua Zha, Yulun Zhang, Wenbo Li, Tao Dai, Shu-Tao Xia, and Yawei Li. MambaIRv2: Atten- tive state space restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 28124–28133, 2025. 2

2025
[36]

MambaIR: A simple baseline for image restoration with state-space model

Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, and Shu-Tao Xia. MambaIR: A simple baseline for image restoration with state-space model. InProceedings of the European Conference on Computer Vision (ECCV), pages 222–241. Springer, 2024. 2

2024
[37]

Reliability-aware staged low-light gaussian splatting.ResearchGate preprint, 2026

Haojie Guo and Ke Xian. Reliability-aware staged low-light gaussian splatting.ResearchGate preprint, 2026. 3

2026
[38]

Toward convolutional blind denoising of real pho- tographs

Shi Guo, Zifei Yan, Kai Zhang, Wangmeng Zuo, and Lei Zhang. Toward convolutional blind denoising of real pho- tographs. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1712– 1722, 2019. 2

2019
[39]

Neighbor2Neighbor: Self-supervised de- noising from single noisy images

Tao Huang, Songjiang Li, Xu Jia, Huchuan Lu, and Jianzhuang Liu. Neighbor2Neighbor: Self-supervised de- noising from single noisy images. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14781–14790, 2021. 2

2021
[40]

Noise2V oid: Learning denoising from single noisy images

Alexander Krull, Tim-Oliver Buchholz, and Florian Jug. Noise2V oid: Learning denoising from single noisy images. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2129–2137, 2019. 2

2019
[41]

High-quality self-supervised deep image denoising.Ad- vances in Neural Information Processing Systems, 32, 2019

Samuli Laine, Tero Karras, Jaakko Lehtinen, and Timo Aila. High-quality self-supervised deep image denoising.Ad- vances in Neural Information Processing Systems, 32, 2019. 2

2019
[42]

Noise2Noise: Learning image restoration without clean data

Jaakko Lehtinen, Jacob Munkberg, Jon Hasselgren, Samuli Laine, Tero Karras, Miika Aittala, and Timo Aila. Noise2Noise: Learning image restoration without clean data. InProceedings of the 35th International Conference on Machine Learning, volume 80 ofProceedings of Machine Learning Research, pages 2965–2974. PMLR, 2018. 2

2018
[43]

Densesplat: Densifying gaussian splatting slam with neural radiance prior.IEEE Transactions on Visualization & Computer Graphics, (01):1–14, 2025

Mingrui Li, Shuhong Liu, Tianchen Deng, and Hongyu Wang. Densesplat: Densifying gaussian splatting slam with neural radiance prior.IEEE Transactions on Visualization & Computer Graphics, (01):1–14, 2025. 1

2025
[44]

Mingrui Li, Shuhong Liu, Heng Zhou, Guohao Zhu, Na Cheng, Tianchen Deng, and Hongyu Wang. Sgs-slam: Semantic gaussian splatting for neural dense slam. In European Conference on Computer Vision, pages 163–179, 2025. 1
[45]

Yawei Li, Kai Zhang, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang, Yun Liu, Denis Demandolx, et al. LSDIR: A large-scale dataset for image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1775–1787, 2023. 3, 4
[46]

Yawei Li, Yulun Zhang, Radu Timofte, Luc Van Gool, Zhijun Tu, Kunpeng Du, Hailing Wang, Hanting Chen, Wei Li, Xiaofei Wang, et al. Ntire 2023 challenge on image denoising: Methods and results. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1905–1921, 2023. 3
[47]

Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. SwinIR: Image restoration using Swin Transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 1833–1844, 2021. 1, 2, 3, 4
[48]

Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Flickr2K dataset. Official dataset release accompanying the NTIRE2017/EDSR repository,
[49]

Dataset collected by the authors using the Flickr API. 2, 3, 4
[50]

Anran Liu, Yihao Liu, Jinjin Gu, Yu Qiao, and Chao Dong. Blind image super-resolution: A survey and beyond. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5):5461–5480, 2022. 3
[51]

Jiaying Liu, Dong Liu, Wenhan Yang, Sifeng Xia, Xiaoshuai Zhang, and Yuanying Dai. LIU4K-v2 dataset. Official dataset page, 2020. The official LIU4K-v2 page recommends citing the accompanying compression artifact reduction benchmark paper. 3, 4
[52]

Shuhong Liu, Chenyu Bao, Ziteng Cui, Xuangeng Chu, Bin Ren, Lin Gu, Xiang Chen, Mingrui Li, Long Ma, Marcos V. Conde, Radu Timofte, et al. NTIRE 2026 3D restoration and reconstruction in adverse conditions: RealX3D challenge results. arXiv preprint arXiv:2604.04135, 2026. 1
[53]

Shuhong Liu, Chenyu Bao, Ziteng Cui, Yun Liu, Xuangeng Chu, Lin Gu, Marcos V Conde, Ryo Umagami, Tomohiro Hashimoto, Zijian Hu, et al. Realx3d: A physically-degraded 3d benchmark for multi-view visual restoration and reconstruction. arXiv preprint arXiv:2512.23437, 2025. 1
[54]

Shuhong Liu, Xiang Chen, Hongming Chen, Quanfeng Xu, and Mingrui Li. Deraings: Gaussian splatting for enhanced scene reconstruction in rainy environments. Proceedings of the AAAI Conference on Artificial Intelligence, 39(5):5558–5566, 2025. 1
[55]

Shuhong Liu, Tianchen Deng, Heng Zhou, Liuzhuozheng Li, Hongyu Wang, Danwei Wang, and Mingrui Li. Mg-slam: Structure gaussian splatting slam with manhattan world hypothesis. IEEE Transactions on Automation Science and Engineering, 22:17034–17049, 2025. 1
[56]

Shuhong Liu, Xining Ge, Ziying Gu, Lin Gu, Ziteng Cui, Xuangeng Chu, Jun Liu, Dong Li, and Tatsuya Harada. Denoising the deep sky: Physics-based ccd noise formation for astronomical imaging. arXiv preprint arXiv:2601.23276,
[57]

Shuhong Liu, Lin Gu, Ziteng Cui, Xuangeng Chu, and Tatsuya Harada. I2-nerf: Learning neural radiance fields under physically-grounded media interactions. In Advances in Neural Information Processing Systems, 2025. 1
[58]

Yuhao Liu, Dingju Wang, and Ziyang Zheng. ELoG-GS: Dual-branch gaussian splatting with luminance-guided enhancement for extreme low-light 3D reconstruction. arXiv preprint arXiv:2604.12592, 2026. 3
[59]

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin Transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 10012–10022, 2021. 2
[60]

Kede Ma, Zhengfang Duanmu, Qingbo Wu, Zhou Wang, Hongwei Yong, Hongliang Li, and Lei Zhang. Waterloo Exploration Database: New challenges for image quality assessment models. IEEE Transactions on Image Processing, 26(2):1004–1016, 2017. 2, 3
[61]

Seonghyeon Nam, Youngbae Hwang, Yasuyuki Matsushita, and Seon Joo Kim. A holistic approach to cross-channel image noise modeling and its application to image denoising. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1683–1691, 2016. 3
[62]

Tobias Plotz and Stefan Roth. Benchmarking denoising algorithms with real photographs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1586–1595, 2017. 3
[63]

Yuhui Quan, Mingqin Chen, Tongyao Pang, and Hui Ji. Self2Self with dropout: Learning self-supervised denoising from single image. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1890–1898, 2020. 2
[64]

Bin Ren, Hang Guo, Yan Shu, Jiaqi Ma, Ziteng Cui, Shuhong Liu, Guofeng Mei, Lei Sun, Zongwei Wu, Fahad Shahbaz Khan, Salman Khan, Radu Timofte, Yawei Li, et al. The eleventh NTIRE 2026 efficient super-resolution challenge report. arXiv preprint arXiv:2604.03198, 2026. 1
[65]

Lei Sun, Hang Guo, Bin Ren, Shaolin Su, Xian Wang, Danda Pani Paudel, Luc Van Gool, Radu Timofte, Yawei Li, et al. The Third Challenge on Image Denoising at NTIRE 2026: Methods and Results. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2026. 2
[66]

Ying Tai, Jian Yang, Xiaoming Liu, and Chunyan Xu. MemNet: A persistent memory network for image restoration. In Proceedings of the IEEE International Conference on Computer Vision, pages 4539–4547, 2017. 2
[67]

Matias Tassano, Julie Delon, and Thomas Veit. FastDVDnet: Towards real-time deep video denoising without flow estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1354–1363, 2020. 2
[68]

Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, and Yinxiao Li. MAXIM: Multi-axis MLP for image processing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5769–5780, 2022. 2
[69]

Xintao Wang, Ke Yu, Chao Dong, and Chen Change Loy. Recovering realistic texture in image super-resolution by deep spatial feature transform. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 606–615, 2018. 3, 4
[70]

Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. ESRGAN: Enhanced super-resolution generative adversarial networks. In Proceedings of the European Conference on Computer Vision Workshops, 2018. 3, 4
[71]

Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, and Houqiang Li. Uformer: A general u-shaped transformer for image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17683–17693, 2022. 1, 2
[72]

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang. Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5728–5739, 2022. 1, 2, 3, 5
[73]

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. CycleISP: Real image restoration via improved data synthesis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2696–2705, 2020. 2
[74]

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Learning enriched features for real image restoration and enhancement. In Proceedings of the European Conference on Computer Vision (ECCV), pages 492–511. Springer,
[75]

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Multi-stage progressive image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14821–14831, 2021. 1, 2
[76]

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao. Learning enriched features for fast image restoration and enhancement. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(2):1934–1948, 2022. 2
[77]

Kai Zhang, Yawei Li, Jingyun Liang, Jiezhang Cao, Yulun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, and Luc Van Gool. Practical blind image denoising via Swin-Conv-UNet and data synthesis. Machine Intelligence Research, 20(6):822–836, 2023. 2
[78]

Kai Zhang, Yawei Li, Wangmeng Zuo, Lei Zhang, Luc Van Gool, and Radu Timofte. Plug-and-play image restoration with deep denoiser prior. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(10):6360–6376,
[79]

Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. Beyond a gaussian denoiser: Residual learning of deep CNN for image denoising. IEEE Transactions on Image Processing, 26(7):3142–3155, 2017. 1, 2
[80]

Kai Zhang, Wangmeng Zuo, and Lei Zhang. FFDNet: Toward a fast and flexible solution for CNN-based image denoising. IEEE Transactions on Image Processing, 27(9):4608–4622, 2018. 1, 2

Showing first 80 references.