pith. sign in

arxiv: 2606.06901 · v1 · pith:AGBGEKS2new · submitted 2026-06-05 · 💻 cs.CV

LUCID: Learning Unified Control for Image Deflaring and Exposure Mastery in Nighttime Photography

Pith reviewed 2026-06-27 22:42 UTC · model grok-4.3

classification 💻 cs.CV
keywords nighttime photographyimage deflaringexposure controldiffusion modelsclassifier-free guidanceimage restorationunified frameworkHDR reconstruction
0
0 comments X

The pith

LUCID unifies flare removal and exposure adjustment in nighttime photos through four-mode training and classifier-free guidance.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper argues that flares and photon noise in nighttime scenes are entangled, so handling them separately misses their interactions. It introduces LUCID as a framework that first disentangles flares to recover structure, then uses a diffusion module to generate clean, properly exposed images. A four-mode training strategy adds explicit control, letting users adjust light sources, flares, and exposure levels via classifier-free guidance. If this holds, a single model could replace multiple specialized tools for restoring complex night photographs. The work treats restoration as an adjustable continuous process instead of a fixed output.

Core claim

LUCID decomposes nighttime restoration into a flare disentanglement module that lifts optical artifacts to supply structural guidance and a diffusion-driven module that applies generative priors to reconstruct clean well-exposed imagery. It adds explicit controllability through a novel four-mode training strategy that supports selective steering of light sources, flare and ghosting artifacts, and high dynamic range reconstruction via continuous exposure control through classifier-free guidance.

What carries the argument

The four-mode training strategy that enables selective control via classifier-free guidance over the flare disentanglement module and diffusion-driven reconstruction.

Load-bearing premise

Nighttime degradations from flares and photon noise are entangled in a way that allows one model trained in four modes to provide reliable selective control without new artifacts or loss of scene structure.

What would settle it

Running the model on a real nighttime scene with bright light sources produces either visible residual flares or distorted scene details when compared to a clean reference capture.

Figures

Figures reproduced from arXiv: 2606.06901 by Tingyu Yang, Xiaoyun Yuan, Yuan Cheng.

Figure 1
Figure 1. Figure 1: Unveiling the night with LUCID. We present a unified diffusion framework that jointly addresses severe underexposure and intense lens flares. By [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Comparison with a disjoint baseline. The bottom row illustrates the [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Training framework of LUCID. The pipeline begins with the Flare Disentangle stage (left), where the degraded input [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Prompt-driven light source preservation. By supervising the network [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: Visual comparison on the ExDark dataset. This comparison focuses on evaluating the enhancement performance on authentic night scenes. [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Visual comparison on the Flare7K dataset. The comparison centers on the effectiveness of flare mitigation and preservation of light sources. [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Visual comparison on the SiHDR dataset. The comparison centers on HDR reconstruction. [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: LUCID enables creative image editing. Top: LUCID suppresses existing lens flare, effectively decoupling flare artifacts from the underlying image and [PITH_FULL_IMAGE:figures/full_fig_p008_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Relationship between CFG scales (𝛽) and exposure value changes (ΔEV). Left: Violin plots demonstrate a predictable linear response under a standard sRGB gamma mapping (𝛾 = 2.2). Right: Visual examples verify that adjusting 𝛽 achieves perceptually smooth exposure ramping. Naive Baseline PhotoShop Cascaded Pipeline PhotoShop Commercial GenAI LUCID Continuous Control Deflare + LLIE B ,righter prompt β=0.25 β… view at source ↗
Figure 12
Figure 12. Figure 12: Dual restoration aesthetics. LUCID yields both perceptual realism [PITH_FULL_IMAGE:figures/full_fig_p009_12.png] view at source ↗
read the original abstract

Photography is the art of painting with light, yet nighttime scenes are shaped by competing degradations: intense flares obscure scene structure, while photon-limited regions collapse into noise. Conventional approaches address these factors in isolation, overlooking the fact that these degradations are fundamentally entangled. To bridge this gap, we introduce LUCID, a unified framework that reframes nighttime restoration as a continuous and controllable process rather than a fixed correction. We decompose nighttime restoration into two cooperative components: a flare disentanglement module that lifts the 'curtain' of optical artifacts to provide reliable structural guidance, and a diffusion-driven module that leverages generative priors to reconstruct clean and well-exposed imagery. Crucially, LUCID introduces explicit controllability through a novel four-mode training strategy, enabling users to steer the restoration process via classifier-free guidance (CFG) and allowing selective control over light sources and their associated flare and ghosting artifacts, while also supporting high dynamic range (HDR) reconstruction through continuous exposure control. Extensive experiments demonstrate that LUCID consistently outperforms state-of-the-art methods across diverse real-world nighttime scenarios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper introduces LUCID, a unified framework for nighttime image restoration that decomposes the task into a flare disentanglement module and a diffusion-driven reconstruction module. It proposes a four-mode training strategy to enable controllable restoration via classifier-free guidance, supporting selective control over flares/ghosting and continuous exposure for HDR output, and claims consistent outperformance over state-of-the-art methods on diverse real-world nighttime scenes.

Significance. If the central claims hold with supporting evidence, the work could represent a meaningful advance in controllable low-light restoration by treating entangled degradations in a single framework rather than isolated corrections, with potential utility in computational photography pipelines.

major comments (2)
  1. [Abstract] Abstract: The claim that 'LUCID consistently outperforms state-of-the-art methods across diverse real-world nighttime scenarios' is presented without any quantitative results, comparison tables, ablation studies, or experimental details, making it impossible to evaluate whether the four-mode strategy or CFG-based control delivers the stated gains.
  2. [Abstract] Abstract: No description, equations, or implementation details are supplied for the flare disentanglement module, the diffusion-driven module, the four-mode training strategy, or the classifier-free guidance mechanism, all of which are load-bearing for the controllability and unified-restoration claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their feedback on the abstract. The comments highlight the need for clarity on what belongs in an abstract versus the full manuscript. We address each point below and note that the full paper contains the requested details, tables, and equations.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The claim that 'LUCID consistently outperforms state-of-the-art methods across diverse real-world nighttime scenarios' is presented without any quantitative results, comparison tables, ablation studies, or experimental details, making it impossible to evaluate whether the four-mode strategy or CFG-based control delivers the stated gains.

    Authors: Abstracts are concise summaries and conventionally omit tables or full quantitative results to remain within length limits. The manuscript provides these in Section 4 (Experiments), including comparison tables against state-of-the-art methods on real-world nighttime datasets, ablation studies on the four-mode strategy, and quantitative metrics demonstrating the gains from CFG-based control. The abstract claim is supported by those results. revision: no

  2. Referee: [Abstract] Abstract: No description, equations, or implementation details are supplied for the flare disentanglement module, the diffusion-driven module, the four-mode training strategy, or the classifier-free guidance mechanism, all of which are load-bearing for the controllability and unified-restoration claims.

    Authors: The abstract is a high-level overview. Full technical descriptions, network architectures, loss functions, equations for the flare disentanglement module (Section 3.1), diffusion-driven module (Section 3.2), four-mode training strategy (Section 3.3), and classifier-free guidance mechanism are provided in the main body of the manuscript with accompanying figures and pseudocode. revision: no

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The abstract and available description contain no equations, derivations, or load-bearing self-citations. Claims rest on empirical outperformance and a proposed four-mode training strategy for controllability, with no visible reduction of any prediction or uniqueness result to fitted inputs or prior self-work by construction. Without concrete technical sections or equations supplied for inspection, the derivation chain cannot be shown to collapse internally; the work is treated as self-contained method description.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no mathematical derivations, training objectives, or modeling assumptions are stated. Therefore the ledger cannot be populated with specific free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5723 in / 1169 out tokens · 20760 ms · 2026-06-27T22:42:04.819122+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

61 extracted references · 2 canonical work pages

  1. [1]

    2022 , organization=

    Unsupervised night image enhancement: When layer decomposition meets light-effects suppression , author=. 2022 , organization=

  2. [2]

    Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects , author=

  3. [3]

    From Generation to Suppression: Towards Effective Irregular Glow Removal for Nighttime Visibility Enhancement , author=

  4. [4]

    Flare7K: A Phenomenological Nighttime Flare Removal Dataset , author=

  5. [5]

    Improving Lens Flare Removal with General Purpose Pipeline and Multiple Light Sources Recovery , author=

  6. [6]

    MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal , author=

  7. [7]

    Difflare: Removing Image Lens Flare with Latent Diffusion Model , author=

  8. [8]

    2010 , publisher=

    Thin-film optical filters , author=. 2010 , publisher=

  9. [9]

    Cockpit Displays X , volume=

    Liquid-filled camera for the measurement of high-contrast images , author=. Cockpit Displays X , volume=. 2003 , organization=

  10. [10]

    Kindling the Darkness: a Practical Low-light Image Enhancer , author=

  11. [11]

    Guo, Chunle and Li, Chongyi and Guo, Jichang and Loy, Chen Change and Hou, Junhui and Kwong, Sam and Cong, Runmin , title =

  12. [12]

    Benchmarking Low-Light Image Enhancement and Beyond , author=

  13. [13]

    Diff-retinex: Rethinking low-light image enhancement with a generative diffusion model , author=

  14. [14]

    Deep Retinex Decomposition for Low-Light Enhancement , author=

  15. [15]

    Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement , author=

  16. [16]

    and Garcia, Alvaro and Conde, Marcos V

    Feijoo, Daniel and Benito, Juan C. and Garcia, Alvaro and Conde, Marcos V. , title =

  17. [17]

    Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model , author=

  18. [18]

    Journal of the Optical Society of America , volume=

    Lightness and retinex theory , author=. Journal of the Optical Society of America , volume=. 1971 , doi=

  19. [19]

    Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

    DIFIX3D+: Improving 3D Reconstructions with Single-Step Diffusion Models , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

  20. [20]

    Xinqi Lin and Jingwen He and Ziyan Chen and Zhaoyang Lyu and Bo Dai and Fanghua Yu and Wanli Ouyang and Yu Qiao and Chao Dong , year=

  21. [21]

    Yu, Fanghua and Gu, Jinjin and Li, Zheyuan and Hu, Jinfan and Kong, Xiangtao and Wang, Xintao and He, Jingwen and Qiao, Yu and Dong, Chao , title =

  22. [22]

    Wu, Rongyuan and Yang, Tao and Sun, Lingchen and Zhang, Zhengqiang and Li, Shuai and Zhang, Lei , booktitle=CVPR, year=

  23. [23]

    One-Step Effective Diffusion Network for Real-World Image Super-Resolution , author=

  24. [24]

    arxiv , year =

    Aiping Zhang and Zongsheng Yue and Renjing Pei and Wenqi Ren and Xiaochun Cao , title =. arxiv , year =

  25. [25]

    ArXiv , year =

    Classifier-Free Diffusion Guidance , author =. ArXiv , year =

  26. [26]

    GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR , author=

  27. [27]

    Intrinsic Single-Image HDR Reconstruction , booktitle=ECCV, year=

    Sebastian Dille and Chris Careaga and Ya. Intrinsic Single-Image HDR Reconstruction , booktitle=ECCV, year=

  28. [28]

    LEDiff: Latent Exposure Diffusion for HDR Generation , author =

  29. [29]

    Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography , author=

  30. [30]

    arXiv:2308.16512 , year =

    Shi, Yichun and Wang, Peng and Ye, Jianglong and Mai, Long and Li, Kejie and Yang, Xiao , title =. arXiv:2308.16512 , year =

  31. [31]

    The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , author =

  32. [32]

    arXiv preprint arXiv:2311.17042 , year=

    Adversarial Diffusion Distillation , author=. arXiv preprint arXiv:2311.17042 , year=

  33. [33]

    IEEE Transactions on Image Processing , volume=

    No-reference image quality assessment in the spatial domain , author=. IEEE Transactions on Image Processing , volume=

  34. [34]

    Journal of Visual Communication and Image Representation , volume=

    R2rnet: Low-light image enhancement via real-low to real-normal network , author=. Journal of Visual Communication and Image Representation , volume=

  35. [35]

    Aakerberg, Andreas and Moeslund, Thomas B and Nasrollahi, Kamal , booktitle=NIPS, year=

  36. [36]

    IEEE Transactions on Image Processing , volume=

    Learning a Deep Single Image Contrast Enhancer from Multi-Exposure Images , author=. IEEE Transactions on Image Processing , volume=

  37. [37]

    Learning to See in the Dark , author=

  38. [38]

    Computer Vision and Image Understanding , volume =

    Getting to Know Low-light Images with The Exclusively Dark Dataset , author =. Computer Vision and Image Understanding , volume =. 2019 , doi =

  39. [39]

    SI-HDR - dataset for comparison of single-image high dynamic range reconstruction methods , url=

    Hanji, Param and Mantiuk, Rafal and Eilertsen, Gabriel and Hajisharif, Saghi and Unger, Jonas , year=. SI-HDR - dataset for comparison of single-image high dynamic range reconstruction methods , url=. doi:10.17863/CAM.87333 , publisher=

  40. [40]

    MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment , author=

  41. [41]

    Wang, Jianyi and Chan, Kelvin CK and Loy, Chen Change , title =

  42. [42]

    Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective , author=

  43. [43]

    MUSIQ: Multi-scale Image Quality Transformer , author=

  44. [44]

    Proceedings of the ACM SIGGRAPH Conference , year =

    Param Hanji and Rafal Mantiuk and Gabriel Eilertsen and Saghi Hajisharif and Jonas Unger , title =. Proceedings of the ACM SIGGRAPH Conference , year =

  45. [45]

    NIMA: Neural Image Assessment , year=

    Talebi, Hossein and Milanfar, Peyman , journal=. NIMA: Neural Image Assessment , year=. doi:10.1109/TIP.2018.2831899 , pages=

  46. [46]

    Removing diffraction image artifacts in under-display camera via dynamic skip connection network , author=

  47. [47]

    Under-display camera image restoration with scattering effect , author=

  48. [48]

    UDC-VIT: A Real-World Video Dataset for Under-Display Cameras , author=

  49. [49]

    Nighttime smartphone reflective flare removal using optical center symmetry prior , author=

  50. [50]

    AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion , author=

  51. [51]

    Controlling Vision-Language Models for Multi-Task Image Restoration , author=

  52. [52]

    UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations , author=

  53. [53]

    Neural Networks , year =

    Jiahuan Ren and Zhao Zhang and Suiyi Zhao and Jicong Fan and Zhongqiu Zhao and Yang Zhao and Richang Hong and Meng Wang , title =. Neural Networks , year =

  54. [54]

    Tan , title =

    Yeying Jin and Beibei Lin and Wending Yan and Yuan Yuan and Wei Ye and Robby T. Tan , title =

  55. [55]

    Bernabel and Sos S

    Sherwin A. Bernabel and Sos S. Agaian , title =

  56. [56]

    Journal of King Saud University Computer and Information Sciences , year =

    Yiqiang Zhou and Xindan Gao and Jifeng Guo and Guang Li and Lu Wang and Jing Liu , title =. Journal of King Saud University Computer and Information Sciences , year =

  57. [57]

    Simoncelli , title =

    Keyan Ding and Kede Ma and Shiqi Wang and Eero P. Simoncelli , title =. IEEE Transactions on Pattern Analysis and Machine Intelligence , year =

  58. [58]

    Bovik , title =

    Anish Mittal and Rajiv Soundararajan and Alan C. Bovik , title =. IEEE Signal Processing Letters , volume =

  59. [59]

    Venkatanath and D

    N. Venkatanath and D. M. Praneeth and M. S. C. Murthy and S. S. Channappayya and A. C. Bovik , title =. 2015 Twenty First National Conference on Communications (NCC) , year =

  60. [60]

    2021 , publisher =

    How to Train Neural Networks for Flare Removal , author =. 2021 , publisher =

  61. [61]

    2023 , publisher =

    Semi-Supervised Learning for Low-Light Image Restoration Through Quality Assisted Pseudo-Labeling , author =. 2023 , publisher =