pith. machine review for the scientific record. sign in

arxiv: 2604.05301 · v2 · submitted 2026-04-07 · 💻 cs.CV

Recognition: 2 theorem links

· Lean Theorem

SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration

Authors on Pith no claims yet

Pith reviewed 2026-05-10 18:40 UTC · model grok-4.3

classification 💻 cs.CV
keywords 3D Gaussian Splattingsmoke restorationmulti-view reconstructiondark channel priorimage harmonizationphysics-guidedNTIRE challengeappearance correction
0
0 comments X

The pith

Geometry-first 3D Gaussian Splatting trained on physics-guided pseudo-clean images followed by post-render harmonization restores real-world multi-view smoke scenes.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces SmokeGS-R as a pipeline that separates geometry recovery from appearance correction to handle smoke degradation in multi-view images. It generates pseudo-clean supervision signals via a refined dark channel prior and guided filtering, uses those to train a clean 3D Gaussian Splatting model that recovers scene structure, and then applies stable harmonization to the rendered outputs using geometric-mean ensemble aggregation, LAB-space Reinhard color transfer, and light Gaussian smoothing. The resulting system reaches PSNR of 15.217 and SSIM of 0.666 on the NTIRE 2026 challenge test set and improves by 3.68 dB PSNR over the strongest baseline on the seven released scenes without any retraining. A sympathetic reader would care because the work demonstrates a concrete, reproducible way to obtain usable 3D models and consistent renderings from smoke-corrupted captures where clean reference data does not exist.

Core claim

We generate physics-guided pseudo-clean supervision with a refined dark channel prior and guided filtering, train a sharp clean-only 3D Gaussian Splatting source model, and then harmonize its renderings with a donor ensemble using geometric-mean reference aggregation, LAB-space Reinhard transfer, and light Gaussian smoothing. On the official challenge testing leaderboard, the final submission achieved PSNR = 15.217 and SSIM = 0.666. After the public release of RealX3D, we re-evaluated the same frozen result on the seven released challenge scenes without retraining and obtained PSNR = 15.209, SSIM = 0.644, and LPIPS = 0.551, outperforming the strongest official baseline average on the same 7.

What carries the argument

The two-stage pipeline that first produces pseudo-clean images for geometry-only 3DGS training and then performs post-render appearance harmonization via ensemble aggregation and color transfer.

If this is right

  • The geometry recovered from the pseudo-clean images remains sharp enough for downstream 3D tasks even when input views contain strong smoke.
  • Post-render harmonization alone is sufficient to restore multi-view appearance consistency once geometry is fixed.
  • A single frozen model trained this way generalizes to additional test scenes released after submission without retraining.
  • The recipe yields a measurable 3.68 dB PSNR gain over the strongest published baseline on the same real-world smoke data.
  • The approach supplies a complete, code-released submission that meets the NTIRE 2026 Track 2 evaluation criteria.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same decoupling of geometry recovery from appearance correction could be tested on other scattering media such as haze or thin fog by substituting an appropriate prior for the dark-channel step.
  • If the dark-channel prior is replaced by a learned or data-driven cleaner, the harmonization stage might become unnecessary or lighter.
  • The pipeline suggests that many atmospheric degradations in 3D vision are best treated by first securing geometry and only afterward matching radiance, rather than attempting joint optimization.
  • Releasing the code allows direct ablation of each harmonization component on new smoke densities or dynamic sequences.

Load-bearing premise

The refined dark channel prior and guided filtering produce pseudo-clean images that accurately recover the underlying scene geometry without introducing artifacts that affect the 3DGS training or require the subsequent harmonization to compensate for errors.

What would settle it

Training the same 3DGS architecture directly on the original smoky images or on an alternative pseudo-clean method, then applying identical harmonization, and finding no PSNR or geometry improvement on the released challenge scenes would falsify the claim that the specific physics-guided pseudo-clean stage is necessary.

Figures

Figures reproduced from arXiv: 2604.05301 by Lixia Han, Xueming Fu.

Figure 1
Figure 1. Figure 1: Overview of SmokeGS-R. A refined DCP inversion pipeline produces physics-guided pseudo-clean supervision for a clean-only [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Scene-wise PSNR on the seven released challenge [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Qualitative comparison on the publicly released smoke scenes. Each row corresponds to one scene from the former testing split. [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
read the original abstract

Real-world smoke simultaneously attenuates scene radiance, adds airlight, and destabilizes multi-view appearance consistency, making robust 3D reconstruction particularly difficult. We present \textbf{SmokeGS-R}, a practical pipeline developed for the NTIRE 2026 3D Restoration and Reconstruction Track 2 challenge. The key idea is to decouple geometry recovery from appearance correction: we generate physics-guided pseudo-clean supervision with a refined dark channel prior and guided filtering, train a sharp clean-only 3D Gaussian Splatting source model, and then harmonize its renderings with a donor ensemble using geometric-mean reference aggregation, LAB-space Reinhard transfer, and light Gaussian smoothing. On the official challenge testing leaderboard, the final submission achieved \mbox{PSNR $=15.217$} and \mbox{SSIM $=0.666$}. After the public release of RealX3D, we re-evaluated the same frozen result on the seven released challenge scenes without retraining and obtained \mbox{PSNR $=15.209$}, \mbox{SSIM $=0.644$}, and \mbox{LPIPS $=0.551$}, outperforming the strongest official baseline average on the same scenes by $+3.68$ dB PSNR. These results suggest that a geometry-first reconstruction strategy combined with stable post-render appearance harmonization is an effective recipe for real-world multi-view smoke restoration. The code is available at https://github.com/windrise/3drr_Track2_SmokeGS-R.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The paper presents SmokeGS-R, a pipeline for real-world multi-view smoke restoration in 3D reconstruction. It decouples geometry recovery from appearance correction by generating physics-guided pseudo-clean images via a refined dark channel prior and guided filtering to supervise training of a clean 3D Gaussian Splatting (3DGS) model, then applies post-render harmonization to the outputs using geometric-mean reference aggregation, LAB-space Reinhard color transfer, and light Gaussian smoothing. On the NTIRE 2026 3D Restoration and Reconstruction Track 2 challenge test set, it reports PSNR=15.217 and SSIM=0.666; re-evaluation on the seven released RealX3D scenes yields PSNR=15.209, SSIM=0.644, LPIPS=0.551, outperforming the strongest baseline by +3.68 dB PSNR. Code is released at the provided GitHub link.

Significance. If the results hold, the work demonstrates a practical geometry-first recipe for handling simultaneous attenuation, airlight, and multi-view inconsistency caused by real-world smoke, a persistent challenge in 3D vision. The concrete leaderboard metrics, re-evaluation on public scenes without retraining, and code release provide verifiable evidence of effectiveness and support adoption or extension in atmospheric degradation tasks.

major comments (2)
  1. [Method (pseudo-clean generation and 3DGS training)] The central claim that a geometry-first strategy is effective rests on the assumption that pseudo-clean images from the refined dark channel prior + guided filtering recover accurate scene geometry for 3DGS training. However, the manuscript provides no direct validation (e.g., depth map comparisons or point-cloud fidelity metrics) that this step avoids low-frequency artifacts in uniform smoke regions, where the dark-channel assumption is often violated; such errors would propagate to incorrect 3D structure and could not be fixed by the post-render harmonization, which operates only on appearance.
  2. [Experiments and results] No ablation study isolates the contribution of the 3DGS geometry recovery step from the subsequent harmonization. Without this, it is unclear whether the reported +3.68 dB PSNR gain on RealX3D scenes is primarily due to accurate geometry supervision or could be achieved by harmonization applied to a weaker baseline reconstruction.
minor comments (3)
  1. The abstract and method description omit explicit parameter values or refinement details for the dark channel prior and guided filter; while code is released, these should be stated in the text for clarity.
  2. [Method (harmonization)] Figure captions and the harmonization pipeline description would benefit from additional equations or pseudocode for the geometric-mean aggregation and LAB Reinhard transfer to improve reproducibility.
  3. A brief discussion of failure cases (e.g., scenes with highly varying smoke density across views) would help contextualize the limitations of the pseudo-clean assumption.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive assessment and constructive feedback on our manuscript. We address each major comment below.

read point-by-point responses
  1. Referee: [Method (pseudo-clean generation and 3DGS training)] The central claim that a geometry-first strategy is effective rests on the assumption that pseudo-clean images from the refined dark channel prior + guided filtering recover accurate scene geometry for 3DGS training. However, the manuscript provides no direct validation (e.g., depth map comparisons or point-cloud fidelity metrics) that this step avoids low-frequency artifacts in uniform smoke regions, where the dark-channel assumption is often violated; such errors would propagate to incorrect 3D structure and could not be fixed by the post-render harmonization, which operates only on appearance.

    Authors: We agree that explicit validation of geometry accuracy (such as depth or point-cloud metrics) would strengthen the presentation. The refined dark-channel prior combined with guided filtering is a physics-motivated choice drawn from established dehazing methods, and the final leaderboard and RealX3D results indicate that the recovered geometry supports high-quality novel-view synthesis. Nevertheless, to directly address the concern, we will add qualitative depth-map visualizations comparing the 3DGS model trained on pseudo-clean images versus a model trained on raw smoky inputs in the revised manuscript. revision: partial

  2. Referee: [Experiments and results] No ablation study isolates the contribution of the 3DGS geometry recovery step from the subsequent harmonization. Without this, it is unclear whether the reported +3.68 dB PSNR gain on RealX3D scenes is primarily due to accurate geometry supervision or could be achieved by harmonization applied to a weaker baseline reconstruction.

    Authors: We recognize that an explicit ablation would help isolate the geometry-recovery contribution. The reported gains are measured against the official challenge baselines, which do not employ our pseudo-clean supervision. To provide the requested isolation, we will add a new experiment in the revision: we train a standard 3DGS model directly on the smoky images, apply the same post-render harmonization pipeline, and report the resulting metrics on the RealX3D scenes for direct comparison with our full method. revision: yes

Circularity Check

0 steps flagged

Empirical pipeline with no self-referential derivations or fitted predictions

full rationale

The paper describes a practical, multi-stage pipeline: refined dark-channel prior plus guided filtering to create pseudo-clean supervision, followed by standard 3D Gaussian Splatting training on those images, then post-render harmonization via geometric-mean aggregation, LAB Reinhard transfer, and Gaussian smoothing. No equations, first-principles derivations, or predictions are presented that reduce to the inputs by construction. Performance is measured on external NTIRE challenge scenes with reported metrics (PSNR/SSIM/LPIPS) that are independent of any internal fitting loop. The central claim is therefore an empirical recipe validated externally rather than a closed mathematical reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach builds on established image processing priors and the 3DGS framework without introducing new free parameters or entities beyond standard tuning in the methods.

axioms (1)
  • domain assumption The dark channel prior applies to clean natural images
    Used to generate pseudo-clean supervision from smoky images.

pith-pipeline@v0.9.0 · 5583 in / 1339 out tokens · 54210 ms · 2026-05-10T18:40:34.025596+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Dehaze-then-Splat: Generative Dehazing with Physics-Informed 3D Gaussian Splatting for Smoke-Free Novel View Synthesis

    cs.CV 2026-04 unverdicted novelty 5.0

    Dehaze-then-Splat uses per-frame generative dehazing followed by physics-regularized 3D Gaussian Splatting to achieve 20.98 dB PSNR and 0.683 SSIM on the Akikaze scene, a 1.5 dB gain over baseline by mitigating cross-...

  2. 3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models

    cs.CV 2026-04 unverdicted novelty 5.0

    A framework that combines MLLM-based image enhancement with a medium-aware 3D Gaussian Splatting model to reconstruct and render smoke scenes.

  3. CLIP-Guided Data Augmentation for Night-Time Image Dehazing

    cs.CV 2026-04 unverdicted novelty 5.0

    CLIP-guided selection of external data plus staged NAFNet training and inference fusion provides an effective pipeline for nighttime image dehazing in the NTIRE 2026 challenge.

  4. Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation

    cs.CV 2026-04 unverdicted novelty 4.0

    A dual-branch training-free ensemble fuses a hybrid attention network with a Mamba-based model via weighted combination to enhance super-resolution PSNR on DIV2K x4.

  5. Dual-Branch Remote Sensing Infrared Image Super-Resolution

    cs.CV 2026-04 unverdicted novelty 4.0

    Dual-branch fusion of HAT-L and MambaIRv2-L with eight-way ensemble and equal-weight averaging outperforms single branches on PSNR, SSIM, and challenge score for infrared super-resolution.

  6. Beyond Model Design: Data-Centric Training and Self-Ensemble for Gaussian Color Image Denoising

    cs.CV 2026-04 unverdicted novelty 3.0

    Expanding training data diversity, adopting two-stage optimization, and applying geometric self-ensemble raises Restormer performance on Gaussian color denoising at sigma=50 by 3.366 dB PSNR on the NTIRE 2026 validation set.

  7. NTIRE 2026 3D Restoration and Reconstruction in Real-world Adverse Conditions: RealX3D Challenge Results

    cs.CV 2026-04 unverdicted novelty 2.0

    The NTIRE 2026 challenge reports measurable progress in 3D reconstruction pipelines that handle real-world low-light and smoke degradation via the RealX3D benchmark.

Reference graph

Works this paper leans on

23 extracted references · 12 canonical work pages · cited by 7 Pith papers · 11 internal anchors

  1. [1]

    GenSmoke-GS: A Multi-Stage Method for Novel View Synthesis from Smoke-Degraded Images Using a Generative Model

    Qida Cao, Xinyuan Hu, Changyue Shi, Jiajun Ding, Zhou Yu, and Jun Yu. Gensmoke-gs: A multi-stage method for novel view synthesis from smoke-degraded images using a generative model.arXiv preprint arXiv:2604.03039, 2026. 2

  2. [2]

    Beyond Model Design: Data-Centric Training and Self-Ensemble for Gaussian Color Image Denoising

    Gengjia Chang, Xining Ge, Weijun Yuan, Zhan Li, Qiurong Song, Luen Zhu, and Shuhong Liu. Beyond model design: Data-centric training and self-ensemble for gaussian color image denoising.arXiv preprint arXiv:2604.11468, 2026. 2

  3. [3]

    Training-Free Model Ensemble for Single-Image Super-Resolution via Strong-Branch Compensation

    Gengjia Chang, Xining Ge, Weijun Yuan, Zhan Li, Qiurong Song, Luen Zhu, and Shuhong Liu. Training-free model en- semble for single-image super-resolution via strong-branch compensation.arXiv preprint arXiv:2604.11564, 2026. 2

  4. [4]

    Dehaze-then-Splat: Generative Dehazing with Physics-Informed 3D Gaussian Splatting for Smoke-Free Novel View Synthesis

    Yuchao Chen and Hanqing Wang. Dehaze-then-splat: Gen- erative dehazing with physics-informed 3d gaussian splat- ting for smoke-free novel view synthesis.arXiv preprint arXiv:2604.13589, 2026. 2

  5. [5]

    Luminance-gs: Adapting 3d gaussian splatting to chal- lenging lighting conditions with view-adaptive curve adjustment

    Ziteng Cui, Xuangeng Chu, and Tatsuya Harada. Luminance-gs: Adapting 3d gaussian splatting to chal- lenging lighting conditions with view-adaptive curve adjustment. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 26472–26482, 2025. 2

  6. [6]

    SmokeGS-R: Physics-Guided Pseudo-Clean 3DGS for Real-World Multi-View Smoke Restoration

    Xueming Fu and Lixia Han. Smokegs-r: Physics-guided pseudo-clean 3dgs for real-world multi-view smoke restora- tion.arXiv preprint arXiv:2604.05301, 2026. 1

  7. [7]

    Dual-Branch Remote Sensing Infrared Image Super-Resolution

    Xining Ge, Gengjia Chang, Weijun Yuan, Zhan Li, Zhanglu Chen, Boyang Yao, Yihang Chen, Yifan Deng, and Shuhong Liu. Dual-branch remote sensing infrared image super- resolution.arXiv preprint arXiv:2604.10112, 2026. 2

  8. [8]

    CLIP-Guided Data Augmentation for Night-Time Image Dehazing

    Xining Ge, Weijun Yuan, Gengjia Chang, Xuyang Li, and Shuhong Liu. Clip-guided data augmentation for night-time image dehazing.arXiv preprint arXiv:2604.05500, 2026. 2

  9. [9]

    Reliability-aware staged low-light gaussian splatting.ResearchGate preprint, 2026

    Haojie Guo and Ke Xian. Reliability-aware staged low-light gaussian splatting.ResearchGate preprint, 2026. 2 6 Figure 3. Qualitative comparison on the publicly released smoke scenes. Each row corresponds to one scene from the former testing split. We show the released reference view, the official 3DGS baseline, the official SeaThru-NeRF baseline, and our ...

  10. [10]

    Single image haze removal using dark channel prior.IEEE transactions on pat- tern analysis and machine intelligence, 33(12):2341–2353,

    Kaiming He, Jian Sun, and Xiaoou Tang. Single image haze removal using dark channel prior.IEEE transactions on pat- tern analysis and machine intelligence, 33(12):2341–2353,

  11. [11]

    Guided image fil- tering.IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(6):1397–1409, 2013

    Kaiming He, Jian Sun, and Xiaoou Tang. Guided image fil- tering.IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(6):1397–1409, 2013. 2, 3

  12. [12]

    3d gaussian splatting for real-time radiance field rendering.ACM Transactions on Graphics, 42 (4):139–1, 2023

    Bernhard Kerbl, Georgios Kopanas, Thomas Leimk ¨uhler, and George Drettakis. 3d gaussian splatting for real-time radiance field rendering.ACM Transactions on Graphics, 42 (4):139–1, 2023. 1, 2, 5

  13. [13]

    Seathru- nerf: Neural radiance fields in scattering media

    Deborah Levy, Amit Peleg, Naama Pearl, Dan Rosenbaum, Derya Akkaynak, Simon Korman, and Tali Treibitz. Seathru- nerf: Neural radiance fields in scattering media. InProceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 56–65, 2023. 2, 5

  14. [14]

    Watersplatting: Fast underwater 3d scene reconstruction using gaussian splatting

    Huapeng Li, Wenxuan Song, Tianao Xu, Alexandre Elsig, and Jonas Kulhanek. Watersplatting: Fast underwater 3d scene reconstruction using gaussian splatting. InInterna- tional Conference on 3D Vision, pages 969–978. IEEE, 2025. 2, 5

  15. [15]

    Realx3d: A physically-degraded 3d benchmark for multi-view visual restoration and recon- struction.arXiv preprint arXiv:2512.23437, 2025

    Shuhong Liu, Chenyu Bao, Ziteng Cui, Yun Liu, Xuangeng Chu, Lin Gu, Marcos V Conde, Ryo Umagami, Tomohiro Hashimoto, Zijian Hu, et al. Realx3d: A physically-degraded 3d benchmark for multi-view visual restoration and recon- struction.arXiv preprint arXiv:2512.23437, 2025. 1, 2, 4, 5

  16. [16]

    I2-nerf: Learning neural radiance fields un- der physically-grounded media interactions

    Shuhong Liu, Lin Gu, Ziteng Cui, Xuangeng Chu, and Tat- suya Harada. I2-nerf: Learning neural radiance fields un- der physically-grounded media interactions. InAdvances in Neural Information Processing Systems (NeurIPS), 2025. 2, 5

  17. [17]

    NTIRE 2026 3D Restoration and Reconstruction in Real-world Adverse Conditions: RealX3D Challenge Results

    Shuhong Liu, Chenyu Bao, Ziteng Cui, Xuangeng Chu, Bin Ren, Lin Gu, Xiang Chen, Mingrui Li, Long Ma, Marcos V Conde, et al. Ntire 2026 3d restoration and reconstruction in real-world adverse conditions: Realx3d challenge results. arXiv preprint arXiv:2604.04135, 2026. 1, 2

  18. [18]

    ELoG-GS: Dual-Branch Gaussian Splatting with Luminance-Guided Enhancement for Extreme Low-light 3D Reconstruction

    Yuhao Liu, Dingju Wang, and Ziyang Zheng. Elog-gs: Dual- branch gaussian splatting with luminance-guided enhance- ment for extreme low-light 3d reconstruction.arXiv preprint arXiv:2604.12592, 2026. 2

  19. [19]

    Color transfer between images

    Erik Reinhard, Michael Ashikhmin, Bruce Gooch, and Peter Shirley. Color transfer between images. InIEEE Computer Graphics and Applications, pages 34–41, 2001. 2, 4

  20. [20]

    Sea- splat: Representing underwater scenes with 3d gaussian splatting and a physically grounded image formation model

    Daniel Yang, John J Leonard, and Yogesh Girdhar. Sea- splat: Representing underwater scenes with 3d gaussian splatting and a physically grounded image formation model. 7 InIEEE International Conference on Robotics and Automa- tion, pages 7632–7638. IEEE, 2025. 2, 5

  21. [21]

    3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models

    Xinye Zheng, Fei Wang, Yiqi Nie, Kun Li, Junjie Chen, Ji- aqi Zhao, Yanyan Wei, and Zhiliang Wu. 3d smoke scene re- construction guided by vision priors from multimodal large language models.arXiv preprint arXiv:2604.05687, 2026. 2

  22. [22]

    Lita-gs: Illumination- agnostic novel view synthesis via reference-free 3d gaus- sian splatting and physical priors

    Han Zhou, Wei Dong, and Jun Chen. Lita-gs: Illumination- agnostic novel view synthesis via reference-free 3d gaus- sian splatting and physical priors. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 21580–21589, 2025. 2

  23. [23]

    Naka-GS: A Bionics-inspired Dual-Branch Naka Correction and Progressive Point Pruning for Low-Light 3DGS

    Runyu Zhu, SiXun Dong, Zhiqiang Zhang, Qingxia Ye, and Zhihua Xu. Naka-gs: A bionics-inspired dual-branch naka correction and progressive point pruning for low-light 3dgs. arXiv preprint arXiv:2604.11142, 2026. 2 8