There and Back Again: A Flexible-Frame Transformer for Multi-Exposure Fusion

Hui Zeng; Jie Liang; Jufeng Yang; Lei Zhang; Lishen Qu; Shihao Zhou; Yao Liu

arxiv: 2606.27905 · v2 · pith:Y5RRIBJEnew · submitted 2026-06-26 · 💻 cs.CV

There and Back Again: A Flexible-Frame Transformer for Multi-Exposure Fusion

Lishen Qu , Yao Liu , Shihao Zhou , Jie Liang , Hui Zeng , Lei Zhang , Jufeng Yang This is my paper

Pith reviewed 2026-06-30 09:54 UTC · model grok-4.3

classification 💻 cs.CV

keywords multi-exposure fusionflexible frametransformerrecurrent state spacehigh dynamic rangeattention mechanismimage restoration

0 comments

The pith

A new transformer fuses any number of exposure frames for high-dynamic-range imaging without retraining or model changes.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Conventional multi-exposure fusion methods require separate models for each fixed number of input frames, which creates deployment overhead when scene luminance varies. The paper introduces FreeMEF, a transformer architecture that accepts input sequences of arbitrary length by processing them sequentially through recurrent state-space modeling followed by global-feature-guided restoration. This removes the need to maintain multiple trained networks while preserving fusion quality across benchmarks. A sympathetic reader would care because it converts a practical engineering constraint into a single deployable system.

Core claim

FreeMEF is the first flexible-frame transformer for multi-exposure fusion. It uses a recurrent state space module to fuse features from arbitrary-length exposure sequences via adaptive alignment and state-space recurrence, supplying global guidance, and a global feature guided block that combines extremity-aware hybrid attention with an affine-injection feed-forward network to resolve similarity paradoxes while regulating contrast and brightness.

What carries the argument

Recurrent state space module (RSSM) that sequentially fuses arbitrary-length feature sequences, paired with global feature guided block (GFGB) containing extremity-aware hybrid attention (EAHA) and affine-injection feed-forward network (AFFN).

If this is right

Deployment systems no longer need to store and switch between multiple fixed-frame fusion models.
A single trained network can process scenes captured with two, three, four, or more exposures.
The same architecture works for both short and long exposure stacks without architectural modification.
Global information guidance from the recurrent module improves fusion consistency across varying input counts.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could extend to other variable-length multi-frame tasks such as burst denoising or video deblurring if the state-space recurrence generalizes.
Real-time mobile HDR pipelines could adopt a single lightweight model instead of frame-count-specific variants.
Training data collection can focus on diverse exposure combinations rather than balancing separate datasets per frame count.

Load-bearing premise

The recurrent state space module and extremity-aware hybrid attention maintain performance for any input sequence length without retraining or quality loss.

What would settle it

Quantitative results on the three benchmark datasets showing measurable drops in PSNR, SSIM, or perceptual scores when the number of input frames differs from the training distribution.

Figures

Figures reproduced from arXiv: 2606.27905 by Hui Zeng, Jie Liang, Jufeng Yang, Lei Zhang, Lishen Qu, Shihao Zhou, Yao Liu.

**Figure 1.** Figure 1: Comparison of feature fusion pipelines. (a) Fusion → Self-Attention: Features from three frames are fused prior to self-attention. (b) Cross-Attention → Fusion: Reference frames interact with the base frame via cross-attention before fusion. (c) Recurrent Fusion → Cross-Attention (Ours): Features are fused sequentially, and the aggregated feature is then incorporated into the base frame via cross-attenti… view at source ↗

**Figure 2.** Figure 2: Overview of our FreeMEF pipeline. Unlike previous methods [29, 35, 53] that process multiple frames in parallel, we design a recurrent fusion structure based on the ASE [18], which can adaptively exclude redundant features. Additionally, our model can be transferred to a different number of exposure levels without modification or retraining. The fused global features are then incorporated as auxiliary info… view at source ↗

**Figure 3.** Figure 3: The visualization of the query Qbase from the base feature and the query Qref from the reference feature. In the over-exposed red-boxed region, the values of Qbase are small, which leads to low attention map values when querying K in this region. As a result, Qref is required to provide supplementary information. linear computational complexity. Inspired by this, we incorporate an attentive state-space equ… view at source ↗

**Figure 4.** Figure 4: Illustration of GFGB. It comprises an extremity-aware hybrid attention (EAHA) and an affine-injection feed-forward network (AFFN). 4.2 Global Feature Guided Block The Similarity Paradox. Although the global feature HT provides information about the reference frames, utilizing it to restore the base frame I0 introduces a fundamental conflict. The primary objective of HDR imaging is to hallucinate details… view at source ↗

**Figure 5.** Figure 5: Visual comparisons on Kalantari et al.’s dataset [27] (left) and Real-HDRV dataset [51] (right). Our method achieves robust performance under both underexposed and over-exposed conditions. converted to PNG format for use in the MEF task. In addition, we conduct crossdataset evaluation by training on Kalantari et al.’s dataset [27] and testing on the SICE dataset [5]. Specifically, since SICE includes mul… view at source ↗

**Figure 6.** Figure 6: Visual comparisons between models trained on Kalantari et al.’s dataset [27] and tested on the SICE dataset [5]. Under-exposed and over-exposed regions are marked with yellow and red boxes, respectively. Our method achieves a higher dynamic range and consistently delivers superior visual quality. Zoom in for a better view. Note that Kalantari et al.’s training dataset [27] contains fewer exposure levels th… view at source ↗

**Figure 7.** Figure 7: Feature fusion strategy comparison. (a) The parallel fusion strategy treats all frame features equally, which can easily introduce ghosting artifacts. (b) Our method decouples the base frame from the reference features and employs a cross-attention mechanism for adaptive fusion. Unlike prior parallel fusion methods, our proposed "there and back again" mechanism effectively suppresses ghosting artifacts. Fu… view at source ↗

**Figure 8.** Figure 8: Ablation study of EAHA and AFFN. Their synergy improves restoration in over-exposed regions and the performance of global brightness control. Comparison of Fusion Mechanism. The key difference between our method and previous approaches lies in the fusion mechanism. We adopt a recurrent fusion strategy based on RSSM, whereas prior methods typically rely on parallel fusion. We analyze the differences betwee… view at source ↗

**Figure 9.** Figure 9: Visual comparison of fusion orders. (II) may introduce some noise. Base frame FreeMEF Ground Truth [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗

**Figure 10.** Figure 10: Example of color shift after multi-exposure fusion. 6 Conclusion In this paper, we propose FreeMEF, a flexible and robust transformer framework to tackle the inherent limitations of fixed-input architectures and the similarity paradox in multi-exposure fusion (MEF). Specifically, to break the constraint of fixed exposure counts, we developed a recurrent state space module (RSSM) that progressively aggrega… view at source ↗

read the original abstract

Multi-exposure fusion (MEF) brings the dynamic range of conventional cameras closer to that of human vision, producing images with rich scene content. Given the large variability in scene luminance, exposure strategies often require different numbers of frames to capture the full radiance range faithfully. However, conventional MEF techniques are typically designed for a fixed number of inputs, forcing deployment systems to maintain separate models for different frame-count requirements, which undermines deployment efficiency. To address this limitation, we propose FreeMEF, the first flexible-frame transformer for MEF that seamlessly accommodates varying numbers of input exposures without retraining or architectural changes. The proposed approach consists of two key modules. First, we introduce a recurrent state space module (RSSM) that sequentially fuses features from arbitrary sequences via adaptive alignment and state-space recurrent modeling, thereby providing global information guidance for the subsequent restoration. Second, we devise a global feature guided block (GFGB) incorporating an extremity-aware hybrid attention (EAHA) and an affine-injection feed-forward network (AFFN), which effectively resolves the similarity paradox while simultaneously optimizing contrast and brightness regulation. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our method, which performs favorably against state-of-the-art methods both quantitatively and qualitatively.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

FreeMEF claims the first flexible-frame MEF transformer via RSSM and GFGB, but the abstract supplies no numbers or training details to check whether it actually holds for arbitrary lengths.

read the letter

The main point is that this paper targets a real deployment headache in multi-exposure fusion: most methods lock to a fixed number of input frames, so you end up shipping separate models. FreeMEF tries to fix that with a recurrent state space module that fuses features sequentially through adaptive alignment, plus a global feature guided block that adds extremity-aware hybrid attention and an affine-injection feed-forward network.

The design is new in its application. It extends existing state-space and transformer pieces to the variable-length constraint without requiring retraining or architecture swaps. That framing of the problem is useful for anyone building imaging pipelines that must handle different exposure strategies.

The soft spots are the missing evidence. The abstract states favorable results on three benchmarks but reports no metrics, no ablations on frame count, and no sign that training deliberately varied the number of inputs or measured degradation at extremes. The stress-test concern about RSSM accumulating alignment or state errors outside the training distribution therefore lands, because nothing contradicts it. Recurrent models often show exactly that behavior when sequence length shifts.

The modules are described at a conceptual level and build directly on prior work, so the contribution sits in the MEF-specific combination rather than new primitives. No circularity appears.

This is for computational photography groups that care about flexible deployment in MEF systems. A reader already working on variable-input fusion might pick up the module ideas, but without the experiments it is hard to judge whether the claims hold. I would not bring it to a reading group on current information and would not cite it. It does not yet look ready for serious peer review because the central flexibility claim rests on an unverified assertion.

Referee Report

2 major / 1 minor

Summary. The paper proposes FreeMEF, the first flexible-frame transformer for multi-exposure fusion (MEF). It claims to handle arbitrary numbers of input exposures without retraining or architectural modification via a recurrent state space module (RSSM) for sequential fusion with adaptive alignment and a global feature guided block (GFGB) containing extremity-aware hybrid attention (EAHA) and affine-injection feed-forward network (AFFN). The method is evaluated on three benchmark datasets and reported to outperform state-of-the-art approaches both quantitatively and qualitatively.

Significance. If the flexibility claim is substantiated, the work would address a practical deployment limitation in MEF by removing the need for multiple fixed-frame models. The application of recurrent state-space modeling to sequential fusion in this domain is a timely direction given recent SSM advances, and the extremity-aware attention mechanism targets a known fusion challenge.

major comments (2)

[Abstract] Abstract: the central claim that RSSM enables seamless accommodation of arbitrary input sequence lengths without retraining or degradation is load-bearing, yet the abstract supplies no quantitative results, ablation studies on frame-count variation, or details on whether training explicitly varied the number of exposures; this leaves the flexibility assertion unevaluated.
[Method (RSSM module)] The description of RSSM sequential fusion via adaptive alignment and state-space recurrent modeling does not address potential progressive state dilution or misalignment for lengths far from the training distribution (e.g., 2 vs. 9+ frames), which directly undermines the 'arbitrary sequences' guarantee.

minor comments (1)

[Abstract] The abstract states 'extensive experiments' but provides no metrics, tables, or implementation details; the full manuscript should include these to allow assessment of the quantitative claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment point by point below.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that RSSM enables seamless accommodation of arbitrary input sequence lengths without retraining or degradation is load-bearing, yet the abstract supplies no quantitative results, ablation studies on frame-count variation, or details on whether training explicitly varied the number of exposures; this leaves the flexibility assertion unevaluated.

Authors: The abstract provides a concise overview of the contribution and method. Quantitative evaluations and ablations on varying input frame counts are presented in the experimental section of the manuscript. To strengthen the abstract's support for the flexibility claim, we will revise it to include a brief reference to these results and the training procedure regarding exposure variation. revision: yes
Referee: [Method (RSSM module)] The description of RSSM sequential fusion via adaptive alignment and state-space recurrent modeling does not address potential progressive state dilution or misalignment for lengths far from the training distribution (e.g., 2 vs. 9+ frames), which directly undermines the 'arbitrary sequences' guarantee.

Authors: The RSSM employs state-space modeling specifically to support stable long-range dependencies with reduced dilution relative to standard recurrent architectures, combined with adaptive alignment for handling variable inputs. We agree that the current method description does not explicitly discuss or analyze edge cases of extreme sequence lengths. We will add a dedicated paragraph in the method section addressing these considerations along with supporting analysis or experiments on generalization. revision: yes

Circularity Check

0 steps flagged

No circularity: novel architecture with independent design choices

full rationale

The paper presents FreeMEF as a new transformer architecture consisting of explicitly introduced modules (RSSM for sequential fusion via adaptive alignment and state-space modeling; GFGB with EAHA and AFFN). No equations, fitted parameters, or predictions are shown that reduce by construction to inputs. No self-citations are invoked as load-bearing uniqueness theorems or ansatzes. The central claim of flexibility for arbitrary frame counts rests on the proposed design rather than renaming or re-deriving prior fitted results. This is a standard case of a self-contained architectural proposal.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no information on free parameters, axioms, or invented entities can be extracted.

pith-pipeline@v0.9.1-grok · 5763 in / 985 out tokens · 29710 ms · 2026-06-30T09:54:44.698279+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

63 extracted references · 5 canonical work pages · 3 internal anchors

[1]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Afifi, M., Derpanis, K.G., Ommer, B., Brown, M.S.: Learning multi-scale photo exposure correction. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 9157–9167 (2021)

2021
[2]

ACM Transactions on Graphics (TOG).26(3), 38–es (2007)

Akyüz, A.O., Fleming, R., Riecke, B.E., Reinhard, E., Bülthoff, H.H.: Do HDR displays support LDR content? A psychophysical evaluation. ACM Transactions on Graphics (TOG).26(3), 38–es (2007)

2007
[3]

Layer Normalization

Ba, J.L., Kiros, J.R., Hinton, G.E.: Layer normalization. arXiv preprint arXiv:1607.06450 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[4]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Bhat, G., Danelljan, M., Van Gool, L., Timofte, R.: Deep burst super-resolution. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 9209–9218 (2021)

2021
[5]

IEEE Transactions on Image Processing (TIP).27(4), 2049–2062 (2018)

Cai, J., Gu, S., Zhang, L.: Learning a deep single image contrast enhancer from multi-exposure images. IEEE Transactions on Image Processing (TIP).27(4), 2049–2062 (2018)

2049
[6]

IEEE Transactions on Image Processing (TIP).31, 2661–2672 (2022)

Chen, J., Yang, Z., Chan, T.N., Li, H., Hou, J., Chau, L.P.: Attention-guided progressive neural texture fusion for high dynamic range image restoration. IEEE Transactions on Image Processing (TIP).31, 2661–2672 (2022)

2022
[7]

In: AAAI Conference on Artificial Intelligence (AAAI)

Chen, R., Zheng, B., Zhang, H., Chen, Q., Yan, C., Slabaugh, G., Yuan, S.: Im- proving dynamic hdr imaging with fusion transformer. In: AAAI Conference on Artificial Intelligence (AAAI). pp. 340–349 (2023)

2023
[8]

Chen, Z., Wang, Y., Cai, X., You, Z., Lu, Z., Zhang, F., Guo, S., Xue, T.: Ultrafu- sion:Ultrahighdynamicimagingusingexposurefusion.In:IEEE/CVFConference on Computer Vision and Pattern Recognition (CVPR). pp. 16111–16121 (2025)

2025
[9]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., Wei, Y.: Deformable convo- lutional networks. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 764–773 (2017)

2017
[10]

Towards In-Context Tone Style Transfer with A Large-Scale Triplet Dataset

Deng, Y., She, H., Shen, W., Li, M., Wu, R., Yuan, L., Li, X.: Towards in- context tone style transfer with a large-scale triplet dataset. arXiv preprint arXiv:2604.16114 (2026)

work page internal anchor Pith review Pith/arXiv arXiv 2026
[11]

In: European Conference on Computer Vision (ECCV)

Dong, C., Loy, C.C., He, K., Tang, X.: Learning a deep convolutional network for image super-resolution. In: European Conference on Computer Vision (ECCV). pp. 184–199 (2014)

2014
[12]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Dudhane, A., Zamir, S.W., Khan, S., Khan, F.S., Yang, M.H.: Burstormer: Burst image restoration and enhancement transformer. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 5703–5712 (2023)

2023
[13]

In: ACM SIGGRAPH Annual Conference (SIGGRAPH)

Durand, F., Dorsey, J.: Fast bilateral filtering for the display of high-dynamic- range images. In: ACM SIGGRAPH Annual Conference (SIGGRAPH). pp. 257– 266 (2002)

2002
[14]

IEEE Micro18(03), 8–15 (1998)

Fossum, E.R.: Digital camera system on a chip. IEEE Micro18(03), 8–15 (1998)

1998
[15]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Gao, T., Guo, L., Zhao, S., Xu, P., Yang, Y., Liu, X., Wang, S., Zhu, S., Zhou, D.: Quantnas: Quantization-aware neural architecture search for efficient deployment on mobile device. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1704–1713 (2024)

2024
[16]

In: Advances in Neural Information Processing Systems (NeurIPS)

Ghasemabadi, A., Janjua, M.K., Salameh, M., Niu, D.: Learning truncated causal history model for video restoration. In: Advances in Neural Information Processing Systems (NeurIPS). pp. 27584–27615 (2024)

2024
[17]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Girshick, R.: Fast r-cnn. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 1440–1448 (2015) There and Back Again 17

2015
[18]

In: Conference on Language Modeling (COLM) (2024)

Gu, A., Dao, T.: Mamba: Linear-time sequence modeling with selective state spaces. In: Conference on Language Modeling (COLM) (2024)

2024
[19]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Guo, H.,Guo,Y.,Zha,Y.,Zhang,Y.,Li,W.,Dai,T., Xia,S.T.,Li,Y.:Mambairv2: Attentive state space restoration. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 28124–28133 (2025)

2025
[20]

In: European Conference on Computer Vision (ECCV)

Guo, H., Li, J., Dai, T., Ouyang, Z., Ren, X., Xia, S.T.: Mambair: A simple base- line for image restoration with state-space model. In: European Conference on Computer Vision (ECCV). pp. 222–241 (2024)

2024
[21]

ACM Transactions on Graphics (TOG).35(6), 1–12 (2016)

Hasinoff, S.W., Sharlet, D., Geiss, R., Adams, A., Barron, J.T., Kainz, F., Chen, J., Levoy, M.: Burst photography for high dynamic range and low-light imaging on mobile cameras. ACM Transactions on Graphics (TOG).35(6), 1–12 (2016)

2016
[22]

Hu, J., Gallo, O., Pulli, K., Sun, X.: HDR deghosting: How to deal with saturation? In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1163–1170 (2013)

2013
[23]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Huang, D.A., Ramanathan, V., Mahajan, D., Torresani, L., Paluri, M., Fei-Fei, L., Niebles, J.C.: What makes a video a video: Analyzing temporal information in video understanding models and datasets. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 7366–7375 (2018)

2018
[24]

In: IEEE/CVF International Conference on Computer Vi- sion (ICCV)

Huang, X., Belongie, S.: Arbitrary style transfer in real-time with adaptive in- stance normalization. In: IEEE/CVF International Conference on Computer Vi- sion (ICCV). pp. 1501–1510 (2017)

2017
[25]

The Visual Computer30(5), 507–517 (2014)

Huo, Y., Yang, F., Dong, L., Brost, V.: Physiological inverse tone mapping based on retina response. The Visual Computer30(5), 507–517 (2014)

2014
[26]

Categorical Reparameterization with Gumbel-Softmax

Jang, E., Gu, S., Poole, B.: Categorical reparameterization with gumbel-softmax. arXiv preprint arXiv:1611.01144 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[27]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Jiang, T., Wang, C., Li, X., Li, R., Fan, H., Liu, S.: Meflut: Unsupervised 1d lookup tables for multi-exposure image fusion. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 10542–10551 (2023)

2023
[28]

ACM Transactions on Graphics (TOG).36(4), 144–155 (2017)

Kalantari, N.K., Ramamoorthi, R., et al.: Deep high dynamic range imaging of dynamic scenes. ACM Transactions on Graphics (TOG).36(4), 144–155 (2017)

2017
[29]

In: European Conference on Computer Vision (ECCV)

Kim, T.H., Sajjadi, M.S., Hirsch, M., Scholkopf, B.: Spatio-temporal transformer network for video restoration. In: European Conference on Computer Vision (ECCV). pp. 106–122 (2018)

2018
[30]

In: European Conference on Computer Vision (ECCV)

Kong, L., Li, B., Xiong, Y., Zhang, H., Gu, H., Chen, J.: Safnet: Selective alignment fusion network for efficient hdr imaging. In: European Conference on Computer Vision (ECCV). pp. 256–273 (2024)

2024
[31]

In: Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI)

Kovaleski, R.P., Oliveira, M.M.: High-quality reverse tone mapping for a wide range of exposures. In: Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI). pp. 49–56 (2014)

2014
[32]

In: Advances in Neural Information Processing Systems (NeurIPS)

Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep con- volutional neural networks. In: Advances in Neural Information Processing Systems (NeurIPS). (2012)

2012
[33]

Nature.521(7553), 436–444 (2015)

LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature.521(7553), 436–444 (2015)

2015
[34]

IEEE Transactions on Image Processing (TIP)

Li,H.,Ma,K.,Yong,H.,Zhang,L.:Fastmulti-scalestructuralpatchdecomposition for multi-exposure image fusion. IEEE Transactions on Image Processing (TIP). 29, 5805–5816 (2020)

2020
[35]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Li, K., Wang, Y., He, Y., Li, Y., Wang, Y., Wang, L., Qiao, Y.: Uniformerv2: Unlocking the potential of image vits for video understanding. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 1632–1643 (2023) 18 L. Qu et al

2023
[36]

In: IEEE/CVF International Confer- ence on Computer Vision (ICCV)

Li, X., Ni, Z., Yang, W.: Afunet: Cross-iterative alignment-fusion synergy for hdr reconstruction via deep unfolding paradigm. In: IEEE/CVF International Confer- ence on Computer Vision (ICCV). pp. 10666–10675 (2025)

2025
[37]

In: Advances in Neural Information Processing Systems (NeurIPS)

Li, Y., Fang, C., Yang, J., Wang, Z., Lu, X., Yang, M.H.: Universal style transfer via feature transforms. In: Advances in Neural Information Processing Systems (NeurIPS). (2017)

2017
[38]

arXiv preprint arXiv:2410.12274 (2024)

Liang, P., Jiang, J., Ma, Q., Liu, X., Ma, J.: Fusion from decomposition: A self- supervised approach for image fusion and beyond. arXiv preprint arXiv:2410.12274 (2024)

work page arXiv 2024
[39]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Liu, S., Zhang, X., Sun, L., Liang, Z., Zeng, H., Zhang, L.: Joint hdr denoising and fusion: A real-world mobile hdr image dataset. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 13966–13975 (2023)

2023
[40]

arXiv preprint arXiv:2403.10376 (2024)

Liu, X., Li, A., Wu, Z., Du, Y., Zhang, L., Zhang, Y., Timofte, R., Zhu, C.: Pasta: Towards flexible and efficient hdr imaging via progressively aggregated spatio- temporal alignment. arXiv preprint arXiv:2403.10376 (2024)

work page arXiv 2024
[41]

In: Advances in Neural Information Processing Systems (NeurIPS)

Liu, Y., Tian, Y., Zhao, Y., Yu, H., Xie, L., Wang, Y., Ye, Q., Jiao, J., Liu, Y.: Vmamba: Visual state space model. In: Advances in Neural Information Processing Systems (NeurIPS). pp. 103031–103063 (2024)

2024
[42]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., Guo, B.: Swin transformer: Hierarchical vision transformer using shifted windows. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 10012–10022 (2021)

2021
[43]

In: European Conference on Computer Vision (ECCV)

Liu, Z., Wang, Y., Zeng, B., Liu, S.: Ghost-free high dynamic range imaging with context-aware transformer. In: European Conference on Computer Vision (ECCV). pp. 344–360 (2022)

2022
[44]

In: IEEE/CVF Conference on Computer Vision and Pattern Recog- nition (CVPR)

Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE/CVF Conference on Computer Vision and Pattern Recog- nition (CVPR). pp. 3431–3440 (2015)

2015
[45]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Luo, Z., Li, Y., Cheng, S., Yu, L., Wu, Q., Wen, Z., Fan, H., Sun, J., Liu, S.: Bsrt: Improving burst super-resolution with swin transformer and flow-guided de- formable alignment. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 998–1008 (2022)

2022
[46]

IEEE Transactions on Image Processing (TIP).26(5), 2519–2532 (2017)

Ma, K., Li, H., Yong, H., Wang, Z., Meng, D., Zhang, L.: Robust multi-exposure image fusion: a structural patch decomposition approach. IEEE Transactions on Image Processing (TIP).26(5), 2519–2532 (2017)

2017
[47]

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).37(6), 1219–1232 (2014)

Oh, T.H., Lee, J.Y., Tai, Y.W., Kweon, I.S.: Robust high dynamic range imag- ing by rank minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).37(6), 1219–1232 (2014)

2014
[48]

In: IEEE International Conference on Image Processing (ICIP)

Pourreza-Shahri,R.,Kehtarnavaz,N.:Exposurebracketingviaautomaticexposure selection. In: IEEE International Conference on Image Processing (ICIP). pp. 320– 323 (2015)

2015
[49]

IEEE Transactions on Information Theory (TIT).57(3), 1675–1691 (2011)

Rachlin, Y., Negi, R., Khosla, P.K.: The sensing capacity of sensor networks. IEEE Transactions on Information Theory (TIT).57(3), 1675–1691 (2011)

2011
[50]

ACM Transactions on Graphics (TOG).21(3), 267—-276 (2002)

Reinhard, E., Stark, M., Shirley, P., Ferwerda, J.: Photographic tone reproduction for digital images. ACM Transactions on Graphics (TOG).21(3), 267—-276 (2002)

2002
[51]

ACM Transactions on Graphics (TOG).31(6), 203–1 (2012)

Sen, P., Kalantari, N.K., Yaesoubi, M., Darabi, S., Goldman, D.B., Shechtman, E.: Robust patch-based hdr reconstruction of dynamic scenes. ACM Transactions on Graphics (TOG).31(6), 203–1 (2012)

2012
[52]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Shu, Y., Shen, L., Hu, X., Li, M., Zhou, Z.: Towards real-world hdr video recon- struction: A large-scale benchmark dataset and a two-stage alignment network. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 2879–2888 (2024) There and Back Again 19

2024
[53]

In: European Conference on Computer Vision (ECCV)

Song, J.W., Park, Y.I., Kong, K., Kwak, J., Kang, S.J.: Selective transhdr: Transformer-based selective hdr imaging using ghost region mask. In: European Conference on Computer Vision (ECCV). pp. 288–304 (2022)

2022
[54]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Tel, S., Wu, Z., Zhang, Y., Heyrman, B., Demonceaux, C., Timofte, R., Gin- hac, D.: Alignment-free hdr deghosting with semantics consistent transformer. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 12836– 12845 (2023)

2023
[55]

In: Advances in Neural Information Processing Systems (NeurIPS)

Vaswani,A.,Shazeer,N.,Parmar,N.,Uszkoreit,J.,Jones,L.,Gomez,A.N.,Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems (NeurIPS). (2017)

2017
[56]

In: IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR)

Wang, J., Chen, D., Luo, C., He, B., Yuan, L., Wu, Z., Jiang, Y.G.: Omnivid: A generative framework for universal video understanding. In: IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR). pp. 18209–18220 (2024)

2024
[57]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Wang, X., Chan, K.C., Yu, K., Dong, C., Change Loy, C.: Edvr: Video restoration with enhanced deformable convolutional networks. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 1954–1963 (2019)

1954
[58]

In: European Conference on Computer Vision (ECCV)

Wu, S., Xu, J., Tai, Y.W., Tang, C.K.: Deep high dynamic range imaging with large foreground motions. In: European Conference on Computer Vision (ECCV). pp. 117–132 (2018)

2018
[59]

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).44(1), 502–518 (2020)

Xu, H., Ma, J., Jiang, J., Guo, X., Ling, H.: U2fusion: A unified unsupervised image fusion network. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).44(1), 502–518 (2020)

2020
[60]

In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR)

Yan, Q., Gong, D., Shi, Q., Hengel, A.v.d., Shen, C., Reid, I., Zhang, Y.: Attention- guided network for ghost-free high dynamic range imaging. In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR). pp. 1751–1760 (2019)

2019
[61]

In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR)

Zamir, S.W., Arora, A., Khan, S., Hayat, M., Khan, F.S., Yang, M.H.: Restormer: Efficient transformer for high-resolution image restoration. In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR). pp. 5728–5739 (2022)

2022
[62]

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).47(11), 10344–10360 (2025)

Zhou,S.,Pan,J.,Yang,J.:Learninganadaptivesparsetransformerforefficientim- age restoration. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).47(11), 10344–10360 (2025)

2025
[63]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Zhu, P., Sun, Y., Cao, B., Hu, Q.: Task-customized mixture of adapters for gen- eral image fufsion. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 7099–7108 (2024)

2024

[1] [1]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Afifi, M., Derpanis, K.G., Ommer, B., Brown, M.S.: Learning multi-scale photo exposure correction. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 9157–9167 (2021)

2021

[2] [2]

ACM Transactions on Graphics (TOG).26(3), 38–es (2007)

Akyüz, A.O., Fleming, R., Riecke, B.E., Reinhard, E., Bülthoff, H.H.: Do HDR displays support LDR content? A psychophysical evaluation. ACM Transactions on Graphics (TOG).26(3), 38–es (2007)

2007

[3] [3]

Layer Normalization

Ba, J.L., Kiros, J.R., Hinton, G.E.: Layer normalization. arXiv preprint arXiv:1607.06450 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016

[4] [4]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Bhat, G., Danelljan, M., Van Gool, L., Timofte, R.: Deep burst super-resolution. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 9209–9218 (2021)

2021

[5] [5]

IEEE Transactions on Image Processing (TIP).27(4), 2049–2062 (2018)

Cai, J., Gu, S., Zhang, L.: Learning a deep single image contrast enhancer from multi-exposure images. IEEE Transactions on Image Processing (TIP).27(4), 2049–2062 (2018)

2049

[6] [6]

IEEE Transactions on Image Processing (TIP).31, 2661–2672 (2022)

Chen, J., Yang, Z., Chan, T.N., Li, H., Hou, J., Chau, L.P.: Attention-guided progressive neural texture fusion for high dynamic range image restoration. IEEE Transactions on Image Processing (TIP).31, 2661–2672 (2022)

2022

[7] [7]

In: AAAI Conference on Artificial Intelligence (AAAI)

Chen, R., Zheng, B., Zhang, H., Chen, Q., Yan, C., Slabaugh, G., Yuan, S.: Im- proving dynamic hdr imaging with fusion transformer. In: AAAI Conference on Artificial Intelligence (AAAI). pp. 340–349 (2023)

2023

[8] [8]

Chen, Z., Wang, Y., Cai, X., You, Z., Lu, Z., Zhang, F., Guo, S., Xue, T.: Ultrafu- sion:Ultrahighdynamicimagingusingexposurefusion.In:IEEE/CVFConference on Computer Vision and Pattern Recognition (CVPR). pp. 16111–16121 (2025)

2025

[9] [9]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., Wei, Y.: Deformable convo- lutional networks. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 764–773 (2017)

2017

[10] [10]

Towards In-Context Tone Style Transfer with A Large-Scale Triplet Dataset

Deng, Y., She, H., Shen, W., Li, M., Wu, R., Yuan, L., Li, X.: Towards in- context tone style transfer with a large-scale triplet dataset. arXiv preprint arXiv:2604.16114 (2026)

work page internal anchor Pith review Pith/arXiv arXiv 2026

[11] [11]

In: European Conference on Computer Vision (ECCV)

Dong, C., Loy, C.C., He, K., Tang, X.: Learning a deep convolutional network for image super-resolution. In: European Conference on Computer Vision (ECCV). pp. 184–199 (2014)

2014

[12] [12]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Dudhane, A., Zamir, S.W., Khan, S., Khan, F.S., Yang, M.H.: Burstormer: Burst image restoration and enhancement transformer. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 5703–5712 (2023)

2023

[13] [13]

In: ACM SIGGRAPH Annual Conference (SIGGRAPH)

Durand, F., Dorsey, J.: Fast bilateral filtering for the display of high-dynamic- range images. In: ACM SIGGRAPH Annual Conference (SIGGRAPH). pp. 257– 266 (2002)

2002

[14] [14]

IEEE Micro18(03), 8–15 (1998)

Fossum, E.R.: Digital camera system on a chip. IEEE Micro18(03), 8–15 (1998)

1998

[15] [15]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Gao, T., Guo, L., Zhao, S., Xu, P., Yang, Y., Liu, X., Wang, S., Zhu, S., Zhou, D.: Quantnas: Quantization-aware neural architecture search for efficient deployment on mobile device. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1704–1713 (2024)

2024

[16] [16]

In: Advances in Neural Information Processing Systems (NeurIPS)

Ghasemabadi, A., Janjua, M.K., Salameh, M., Niu, D.: Learning truncated causal history model for video restoration. In: Advances in Neural Information Processing Systems (NeurIPS). pp. 27584–27615 (2024)

2024

[17] [17]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Girshick, R.: Fast r-cnn. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 1440–1448 (2015) There and Back Again 17

2015

[18] [18]

In: Conference on Language Modeling (COLM) (2024)

Gu, A., Dao, T.: Mamba: Linear-time sequence modeling with selective state spaces. In: Conference on Language Modeling (COLM) (2024)

2024

[19] [19]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Guo, H.,Guo,Y.,Zha,Y.,Zhang,Y.,Li,W.,Dai,T., Xia,S.T.,Li,Y.:Mambairv2: Attentive state space restoration. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 28124–28133 (2025)

2025

[20] [20]

In: European Conference on Computer Vision (ECCV)

Guo, H., Li, J., Dai, T., Ouyang, Z., Ren, X., Xia, S.T.: Mambair: A simple base- line for image restoration with state-space model. In: European Conference on Computer Vision (ECCV). pp. 222–241 (2024)

2024

[21] [21]

ACM Transactions on Graphics (TOG).35(6), 1–12 (2016)

Hasinoff, S.W., Sharlet, D., Geiss, R., Adams, A., Barron, J.T., Kainz, F., Chen, J., Levoy, M.: Burst photography for high dynamic range and low-light imaging on mobile cameras. ACM Transactions on Graphics (TOG).35(6), 1–12 (2016)

2016

[22] [22]

Hu, J., Gallo, O., Pulli, K., Sun, X.: HDR deghosting: How to deal with saturation? In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1163–1170 (2013)

2013

[23] [23]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Huang, D.A., Ramanathan, V., Mahajan, D., Torresani, L., Paluri, M., Fei-Fei, L., Niebles, J.C.: What makes a video a video: Analyzing temporal information in video understanding models and datasets. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 7366–7375 (2018)

2018

[24] [24]

In: IEEE/CVF International Conference on Computer Vi- sion (ICCV)

Huang, X., Belongie, S.: Arbitrary style transfer in real-time with adaptive in- stance normalization. In: IEEE/CVF International Conference on Computer Vi- sion (ICCV). pp. 1501–1510 (2017)

2017

[25] [25]

The Visual Computer30(5), 507–517 (2014)

Huo, Y., Yang, F., Dong, L., Brost, V.: Physiological inverse tone mapping based on retina response. The Visual Computer30(5), 507–517 (2014)

2014

[26] [26]

Categorical Reparameterization with Gumbel-Softmax

Jang, E., Gu, S., Poole, B.: Categorical reparameterization with gumbel-softmax. arXiv preprint arXiv:1611.01144 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016

[27] [27]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Jiang, T., Wang, C., Li, X., Li, R., Fan, H., Liu, S.: Meflut: Unsupervised 1d lookup tables for multi-exposure image fusion. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 10542–10551 (2023)

2023

[28] [28]

ACM Transactions on Graphics (TOG).36(4), 144–155 (2017)

Kalantari, N.K., Ramamoorthi, R., et al.: Deep high dynamic range imaging of dynamic scenes. ACM Transactions on Graphics (TOG).36(4), 144–155 (2017)

2017

[29] [29]

In: European Conference on Computer Vision (ECCV)

Kim, T.H., Sajjadi, M.S., Hirsch, M., Scholkopf, B.: Spatio-temporal transformer network for video restoration. In: European Conference on Computer Vision (ECCV). pp. 106–122 (2018)

2018

[30] [30]

In: European Conference on Computer Vision (ECCV)

Kong, L., Li, B., Xiong, Y., Zhang, H., Gu, H., Chen, J.: Safnet: Selective alignment fusion network for efficient hdr imaging. In: European Conference on Computer Vision (ECCV). pp. 256–273 (2024)

2024

[31] [31]

In: Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI)

Kovaleski, R.P., Oliveira, M.M.: High-quality reverse tone mapping for a wide range of exposures. In: Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI). pp. 49–56 (2014)

2014

[32] [32]

In: Advances in Neural Information Processing Systems (NeurIPS)

Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep con- volutional neural networks. In: Advances in Neural Information Processing Systems (NeurIPS). (2012)

2012

[33] [33]

Nature.521(7553), 436–444 (2015)

LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature.521(7553), 436–444 (2015)

2015

[34] [34]

IEEE Transactions on Image Processing (TIP)

Li,H.,Ma,K.,Yong,H.,Zhang,L.:Fastmulti-scalestructuralpatchdecomposition for multi-exposure image fusion. IEEE Transactions on Image Processing (TIP). 29, 5805–5816 (2020)

2020

[35] [35]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Li, K., Wang, Y., He, Y., Li, Y., Wang, Y., Wang, L., Qiao, Y.: Uniformerv2: Unlocking the potential of image vits for video understanding. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 1632–1643 (2023) 18 L. Qu et al

2023

[36] [36]

In: IEEE/CVF International Confer- ence on Computer Vision (ICCV)

Li, X., Ni, Z., Yang, W.: Afunet: Cross-iterative alignment-fusion synergy for hdr reconstruction via deep unfolding paradigm. In: IEEE/CVF International Confer- ence on Computer Vision (ICCV). pp. 10666–10675 (2025)

2025

[37] [37]

In: Advances in Neural Information Processing Systems (NeurIPS)

Li, Y., Fang, C., Yang, J., Wang, Z., Lu, X., Yang, M.H.: Universal style transfer via feature transforms. In: Advances in Neural Information Processing Systems (NeurIPS). (2017)

2017

[38] [38]

arXiv preprint arXiv:2410.12274 (2024)

Liang, P., Jiang, J., Ma, Q., Liu, X., Ma, J.: Fusion from decomposition: A self- supervised approach for image fusion and beyond. arXiv preprint arXiv:2410.12274 (2024)

work page arXiv 2024

[39] [39]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Liu, S., Zhang, X., Sun, L., Liang, Z., Zeng, H., Zhang, L.: Joint hdr denoising and fusion: A real-world mobile hdr image dataset. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 13966–13975 (2023)

2023

[40] [40]

arXiv preprint arXiv:2403.10376 (2024)

Liu, X., Li, A., Wu, Z., Du, Y., Zhang, L., Zhang, Y., Timofte, R., Zhu, C.: Pasta: Towards flexible and efficient hdr imaging via progressively aggregated spatio- temporal alignment. arXiv preprint arXiv:2403.10376 (2024)

work page arXiv 2024

[41] [41]

In: Advances in Neural Information Processing Systems (NeurIPS)

Liu, Y., Tian, Y., Zhao, Y., Yu, H., Xie, L., Wang, Y., Ye, Q., Jiao, J., Liu, Y.: Vmamba: Visual state space model. In: Advances in Neural Information Processing Systems (NeurIPS). pp. 103031–103063 (2024)

2024

[42] [42]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., Guo, B.: Swin transformer: Hierarchical vision transformer using shifted windows. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 10012–10022 (2021)

2021

[43] [43]

In: European Conference on Computer Vision (ECCV)

Liu, Z., Wang, Y., Zeng, B., Liu, S.: Ghost-free high dynamic range imaging with context-aware transformer. In: European Conference on Computer Vision (ECCV). pp. 344–360 (2022)

2022

[44] [44]

In: IEEE/CVF Conference on Computer Vision and Pattern Recog- nition (CVPR)

Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE/CVF Conference on Computer Vision and Pattern Recog- nition (CVPR). pp. 3431–3440 (2015)

2015

[45] [45]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Luo, Z., Li, Y., Cheng, S., Yu, L., Wu, Q., Wen, Z., Fan, H., Sun, J., Liu, S.: Bsrt: Improving burst super-resolution with swin transformer and flow-guided de- formable alignment. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 998–1008 (2022)

2022

[46] [46]

IEEE Transactions on Image Processing (TIP).26(5), 2519–2532 (2017)

Ma, K., Li, H., Yong, H., Wang, Z., Meng, D., Zhang, L.: Robust multi-exposure image fusion: a structural patch decomposition approach. IEEE Transactions on Image Processing (TIP).26(5), 2519–2532 (2017)

2017

[47] [47]

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).37(6), 1219–1232 (2014)

Oh, T.H., Lee, J.Y., Tai, Y.W., Kweon, I.S.: Robust high dynamic range imag- ing by rank minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).37(6), 1219–1232 (2014)

2014

[48] [48]

In: IEEE International Conference on Image Processing (ICIP)

Pourreza-Shahri,R.,Kehtarnavaz,N.:Exposurebracketingviaautomaticexposure selection. In: IEEE International Conference on Image Processing (ICIP). pp. 320– 323 (2015)

2015

[49] [49]

IEEE Transactions on Information Theory (TIT).57(3), 1675–1691 (2011)

Rachlin, Y., Negi, R., Khosla, P.K.: The sensing capacity of sensor networks. IEEE Transactions on Information Theory (TIT).57(3), 1675–1691 (2011)

2011

[50] [50]

ACM Transactions on Graphics (TOG).21(3), 267—-276 (2002)

Reinhard, E., Stark, M., Shirley, P., Ferwerda, J.: Photographic tone reproduction for digital images. ACM Transactions on Graphics (TOG).21(3), 267—-276 (2002)

2002

[51] [51]

ACM Transactions on Graphics (TOG).31(6), 203–1 (2012)

Sen, P., Kalantari, N.K., Yaesoubi, M., Darabi, S., Goldman, D.B., Shechtman, E.: Robust patch-based hdr reconstruction of dynamic scenes. ACM Transactions on Graphics (TOG).31(6), 203–1 (2012)

2012

[52] [52]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Shu, Y., Shen, L., Hu, X., Li, M., Zhou, Z.: Towards real-world hdr video recon- struction: A large-scale benchmark dataset and a two-stage alignment network. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 2879–2888 (2024) There and Back Again 19

2024

[53] [53]

In: European Conference on Computer Vision (ECCV)

Song, J.W., Park, Y.I., Kong, K., Kwak, J., Kang, S.J.: Selective transhdr: Transformer-based selective hdr imaging using ghost region mask. In: European Conference on Computer Vision (ECCV). pp. 288–304 (2022)

2022

[54] [54]

In: IEEE/CVF International Conference on Computer Vision (ICCV)

Tel, S., Wu, Z., Zhang, Y., Heyrman, B., Demonceaux, C., Timofte, R., Gin- hac, D.: Alignment-free hdr deghosting with semantics consistent transformer. In: IEEE/CVF International Conference on Computer Vision (ICCV). pp. 12836– 12845 (2023)

2023

[55] [55]

In: Advances in Neural Information Processing Systems (NeurIPS)

Vaswani,A.,Shazeer,N.,Parmar,N.,Uszkoreit,J.,Jones,L.,Gomez,A.N.,Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems (NeurIPS). (2017)

2017

[56] [56]

In: IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR)

Wang, J., Chen, D., Luo, C., He, B., Yuan, L., Wu, Z., Jiang, Y.G.: Omnivid: A generative framework for universal video understanding. In: IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR). pp. 18209–18220 (2024)

2024

[57] [57]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Wang, X., Chan, K.C., Yu, K., Dong, C., Change Loy, C.: Edvr: Video restoration with enhanced deformable convolutional networks. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). pp. 1954–1963 (2019)

1954

[58] [58]

In: European Conference on Computer Vision (ECCV)

Wu, S., Xu, J., Tai, Y.W., Tang, C.K.: Deep high dynamic range imaging with large foreground motions. In: European Conference on Computer Vision (ECCV). pp. 117–132 (2018)

2018

[59] [59]

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).44(1), 502–518 (2020)

Xu, H., Ma, J., Jiang, J., Guo, X., Ling, H.: U2fusion: A unified unsupervised image fusion network. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).44(1), 502–518 (2020)

2020

[60] [60]

In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR)

Yan, Q., Gong, D., Shi, Q., Hengel, A.v.d., Shen, C., Reid, I., Zhang, Y.: Attention- guided network for ghost-free high dynamic range imaging. In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR). pp. 1751–1760 (2019)

2019

[61] [61]

In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR)

Zamir, S.W., Arora, A., Khan, S., Hayat, M., Khan, F.S., Yang, M.H.: Restormer: Efficient transformer for high-resolution image restoration. In: IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition (CVPR). pp. 5728–5739 (2022)

2022

[62] [62]

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).47(11), 10344–10360 (2025)

Zhou,S.,Pan,J.,Yang,J.:Learninganadaptivesparsetransformerforefficientim- age restoration. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).47(11), 10344–10360 (2025)

2025

[63] [63]

In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Zhu, P., Sun, Y., Cao, B., Hu, Q.: Task-customized mixture of adapters for gen- eral image fufsion. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 7099–7108 (2024)

2024