Moment-Reenacting: Inverse Motion Degradation with Cross-shutter Guidance

Guixu Lin; Jiancheng Zhao; Xiang Ji; Yinqiang Zheng; Zhengwei Yin

arxiv: 2605.22423 · v2 · pith:ZR2XMVDJnew · submitted 2026-05-21 · 💻 cs.CV

Moment-Reenacting: Inverse Motion Degradation with Cross-shutter Guidance

Xiang Ji , Guixu Lin , Zhengwei Yin , Jiancheng Zhao , Yinqiang Zheng This is my paper

Pith reviewed 2026-05-25 05:59 UTC · model grok-4.3

classification 💻 cs.CV

keywords motion degradationglobal shutter blurrolling shutter distortionhigh-speed video reconstructiondual-shutter setupimage restorationcomputational imaging

0 comments

The pith

A dual-shutter setup capturing synchronized blur and rolling-shutter images resolves motion ambiguities to reconstruct high-speed video.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that blur in global shutter images and distortion in rolling shutter images are complementary rather than separate degradation problems. Capturing synchronized pairs through a novel dual-shutter camera setup allows these modalities to resolve each other's temporal and spatial ambiguities during motion inversion. A network architecture then disentangles motion into context-aware and temporally-sensitive representations before reconstructing frames. The work also builds a real-world dataset using a triaxial imaging system to provide aligned pairs and ground-truth high-speed frames. An extension to a narrow-baseline stereo Blur-RS configuration supports adjustable performance versus cost.

Core claim

The dual-shutter setup that captures synchronized blur-RS image pairs effectively resolves temporal and spatial ambiguities inherent in both modalities, enabling superior high-speed video reconstruction under complex motion degradations. The approach uses a dual-stream motion interpretation module to explicitly disentangle motion into context-aware and temporally-sensitive representations, followed by a self-prompted frame reconstruction stage.

What carries the argument

Dual-shutter setup for synchronized blur-RS image pairs (extended to stereo Blur-RS configuration), combined with dual-stream motion interpretation module and self-prompted frame reconstruction.

If this is right

Joint processing of global shutter blur and rolling shutter distortion outperforms treating the tasks independently.
The stereo Blur-RS configuration provides flexible performance-cost trade-offs while retaining the core benefits.
The collected real-world dataset with aligned pairs and ground-truth high-speed frames supports training and evaluation beyond synthetic data.
Explicit motion disentanglement into context-aware and temporally-sensitive representations improves frame reconstruction quality.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The cross-shutter guidance principle could extend to pairing other degradation types such as noise with geometric distortion.
The triaxial capture rig might be adapted to collect training data for related inverse problems in dynamic scene imaging.

Load-bearing premise

Real-world synchronized global shutter blur and rolling shutter distortion pairs can be captured with sufficient temporal and spatial alignment, and the triaxial imaging system produces accurate ground-truth high-speed frames for training and evaluation.

What would settle it

If separate blur decomposition and rolling-shutter correction pipelines achieve comparable reconstruction accuracy to the unified dual-shutter method on the paper's real-world dataset with complex motions, the central claim of complementarity would not hold.

Figures

Figures reproduced from arXiv: 2605.22423 by Guixu Lin, Jiancheng Zhao, Xiang Ji, Yinqiang Zheng, Zhengwei Yin.

**Figure 1.** Figure 1: Imaging moment reenacting. If latent sharp frames within an exposure captured by GS or RS modes under fast motion, it will cause distinct degradations: blur and RS effects. Current research reverses the two degrading processes independently. We propose to simultaneously handle the two issues by exploiting the complementarity of RS and GS views, and ultimately reenact the imaging moment regardless of motion… view at source ↗

**Figure 2.** Figure 2: Motion ambiguity of blur observation. In this toy example, we show two objects: a soccer and a player, both moving horizontally. (a) shows four possible motion states (both moving right, both moving left, one moving left while the other moves right.) during the exposure time. (b) presents corresponding motion blurred observations. They are all identical due to averaging effects, which brings about motion a… view at source ↗

**Figure 3.** Figure 3: Initial-state ambiguity of RS observation. We show a man moving horizontally with different poses (upright, tilted to the right, and tilted to the left). (a) shows three possible motion states (static, moving right and moving left) during the exposure time. (b) presents corresponding RS observations. They are all identical due to different initial-state, which brings about ambiguity to RS temporal super-re… view at source ↗

**Figure 4.** Figure 4: Comparisons of related tasks. The horizontal and vertical dimension denote time and image rows. (a) Video Frame Interpolation (VFI). (b) Deblurring. (c) RS Correction (RSC). (d) Blur Decomposition (BD). (e) RS Temporal Super-resolution (RSTS). (f) Our proposed Imaging Moment Reenacting task(IMR). via cross-modal attention, and Xu et al. [40] address modality inconsistency using a piecewise linear motion m… view at source ↗

**Figure 5.** Figure 5: Our proposed model. (a) illustrates the overall architecture, which comprises two main stages: motion interpretation and frame reconstruction. The motion interpretation takes as input a blurry image B, a RS image R, and its corresponding temporal positional encoding E. It includes three student blocks MIBi and one teacher MIBT . They follow the same structure as in (b), except that the teacher module is ad… view at source ↗

**Figure 6.** Figure 6: Our triaxial imaging system. (a) A photo of the actual system for data gathering; (b) Illustration of exposure duration for all cameras on temporal axis. In picture (b), its vertical axes can be interpreted as spatial rows of captured images from each camera; (c) shows representative samples from our real-world collection. warped features by the Context Generation Network (CGN). The entire process is formu… view at source ↗

**Figure 7.** Figure 7: Qualitative comparison with SOTAs on realBR. Our model outperforms the approaches approximating latent motion fields relying on adjacent blurry or RS inputs. which fuses short and long exposures. Because of lacking different modal data, we simulate event streams from high frame-rate latent videos by an event simulator [64]. For PMB, we strictly follow the experimental setup, adding noise to the first sharp… view at source ↗

**Figure 8.** Figure 8: Qualitative comparison with SOTAs on GOPRO-BR. Our model consistently outperforms all compared methods. TABLE 4: Quantitative comparison with SOTAs on GOPROBR. We evaluate performance for sequence lengths of 3, 7. Method Input ×3 ×7 PSNR SSIM LPIPS PSNR SSIM LPIPS LEVS [21] 1·B 17.27 0.6063 0.3410 16.64 0.5800 0.3811 AfBp [34] 2·B 23.38 0.7411 0.2271 23.41 0.7517 0.2183 AfBv [34] 28.10 0.8760 0.1496 28.39… view at source ↗

**Figure 9.** Figure 9: Comparison of temporal profiles. We visualize the pixels of the selected columns (dotted line) according to [65]. representative restored frames are presented in [PITH_FULL_IMAGE:figures/full_fig_p010_9.png] view at source ↗

**Figure 10.** Figure 10: Qualitative comparison with competitive settings on GOPRO-BR. It denotes the recovered latent frame at time t. discontinuities. In contrast, our method yields a smoother temporal transition by more effectively resolving the ambiguities inherent in motion degradation inversion. The quantitative results in [PITH_FULL_IMAGE:figures/full_fig_p011_10.png] view at source ↗

**Figure 11.** Figure 11: Qualitative comparison with competitive settings on realBR. It denotes the recovered latent frame at time t. Shift-0 Shift-4 Shift-0 Shift-6 Shift-0 Shift-8 Shift-0 Shift-4 Shift-0 Shift-6 Shift-0 Shift-8 [PITH_FULL_IMAGE:figures/full_fig_p012_11.png] view at source ↗

**Figure 13.** Figure 13: Visual results of our proposed method under low-lit scenes with peak value 500. Due to short exposure time of each rows in RS view, it suffers from obvious noise. But our setting is still capable of dealing with this challenge. TABLE 9: Quantitative results of our model on the BlurredRS and Occluded-RS testsets. The results indicate the effectiveness of our method under these challenging conditions. Dat… view at source ↗

**Figure 16.** Figure 16: Visualization of bidirectional flow fileds in SAA. resolve the inherent ambiguities of the task. The integration of TPE alone brings an additional improvement of approximately 0.1 dB. Building on this (v4), incorporating knowledge distillation and self-prompt learning further elevates reconstruction performance. The distillation mechanism helps the student model better capture subtle motion cues, while … view at source ↗

**Figure 15.** Figure 15: Visual results of ablation study on model structure design. Best viewed in zoom. caused by dynamic objects and camera motion. To evaluate these challenging conditions, we evaluate our model on targeted Blurred-RS Testset and an Occluded-RS Testset. Quantitative and qualitative results ( [PITH_FULL_IMAGE:figures/full_fig_p013_15.png] view at source ↗

**Figure 17.** Figure 17: Visualization of intermediate flows from student module: MIB1, MIB2, MIB3, teacher module: MIBT and our distillation mask. TABLE 12: Effects of self-prompted reconstruction. We compare our motion-residue prompt with other strategies. Experiments PSNR SSIM LPIPS (a) Without prompt 31.25 0.9096 0.0621 (b) Learnable prompt 31.16 0.9085 0.0696 (c) Feature-level flow difference 31.06 0.9100 0.0671 (d) GT-infor… view at source ↗

**Figure 18.** Figure 18: Visualization of warped intermediate frames and their motion-residue prompts. Best viewed in zoom. flow features, as proposed in [58]. The results, presented in [PITH_FULL_IMAGE:figures/full_fig_p014_18.png] view at source ↗

**Figure 20.** Figure 20: Cross-validation between realBR and GOPRO-BR. (a) Visualized samples from test set of GOPRO-BR by using our model trained on realBR and GOPRO-BR. (b) Visualized samples from test set of realBR by using our model trained on GOPRO-BR and realBR. Best viewed in zoom. Left view (RS) Output Right view 6/9 (Blur) Output3/9 scene1-43-379-387/380&386 [PITH_FULL_IMAGE:figures/full_fig_p015_20.png] view at source ↗

**Figure 21.** Figure 21: Visual results of our model (dup = 20) on real stereo blur-RS testset. Outputt denotes the reconstructed latent frame at time instant t. and a third-party testset. As shown in Figure 20a, the model trained on realBR performs well on the GOPRO-BR testset. In contrast, Figure 20b shows that the model trained on GOPROBR struggles on the realBR testset, producing noticeable artifacts. This comparison highlig… view at source ↗

**Figure 22.** Figure 22: Downstream applications. We conduct segmentation [69], depth estimation [70] and SLAM [71] on raw RS, blur frames along with processed latent videos using our model. TABLE 16: Quantitative comparison on single-image deblurring task in terms of PSNR/SSIM/LPIPS EVSSM [75] AdaRevD [76] FFTformer [77] DeblurDiff [78] Ours 21.14/0.7589/0.3524 27.01/0.8619/0.1937 26.36/0.8119/0.2775 21.08/0.7460/0.2127 28.42/0… view at source ↗

**Figure 23.** Figure 23: Synthetic method. The notation R[k] denotes extracting the k-th row from frame R. n is the index of blur or RS view. T is the number of latent frames that correspond to exposure and deadtime [PITH_FULL_IMAGE:figures/full_fig_p019_23.png] view at source ↗

**Figure 24.** Figure 24: Qualitative visualization of bidirectional displacement fields. Given blur and RS inputs, we show the estimated forward and backward flows FB→R and FR→B.These flows establish cross-view correspondences for feature warping in the SAA module, enabling effective alignment between the two views. B.4 Analysis of Hybrid Sensing Combinations Our design choice is not based merely on novelty, but on how well the s… view at source ↗

read the original abstract

Motion degradation, manifested as blur in global shutter (GS) images or rolling shutter (RS) distortion in RS counterparts, remains a fundamental challenge in computational imaging, especially under fast motion or low-light conditions. While prior works have treated blur decomposition and RS temporal super-resolution as separate tasks, this separation fails to exploit their intrinsic complementarity. In this paper, we propose a unified framework to invert motion degradation and reenact imaging moment by jointly leveraging the complementary characteristics of GS blur and RS distortion. To this end, we introduce a novel dual-shutter setup that captures synchronized blur-RS image pairs and demonstrate that this combination effectively resolves temporal and spatial ambiguities inherent in both modalities. For allowing flexible performance-cost trade-offs, we further extend this dual-shutter setup to a stereo Blur-RS configuration with a narrow baseline. In addition, we construct a triaxial imaging system to collect a real-world dataset with aligned GS-RS pairs and ground-truth high-speed frames, enabling robust training and evaluation beyond synthetic data. Our proposed network explicitly disentangles motion into context-aware and temporally-sensitive representations via a dual-stream motion interpretation module, followed by a self-prompted frame reconstruction stage. Extensive experiments validate the superiority and generalizability of our approach, establishing a new paradigm for realistic high-speed video reconstruction under complex motion degradations. Codes and more resources are available at https://jixiang2016.github.io/dualBR_site/.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The dual-shutter rig plus real paired dataset is the concrete step forward here, with the alignment question left open.

read the letter

The paper's main point is a triaxial camera rig that records real synchronized pairs of global-shutter blur and rolling-shutter distortion along with high-speed ground-truth frames. They train a network on those pairs to reconstruct sharp video, using a dual-stream module to separate context-aware and time-sensitive motion cues. They also show a narrow-baseline stereo version for cost trade-offs. This is new because earlier work treated blur decomposition and rolling-shutter correction as separate problems and mostly trained on synthetic data. Collecting actual paired degradations from the same scenes is a direct response to that gap. The hardware choice and the dataset are the parts that stand up on their own. The abstract makes clear they built the system specifically to get aligned real pairs, which is more useful than another synthetic benchmark. The soft spot is the missing calibration numbers. The stress-test note flags that the abstract describes the rig but does not report measured inter-camera latency or reprojection error under fast motion. If small offsets remain, the training pairs carry inconsistent supervision and the claimed gains over synthetic baselines become harder to trust. That detail needs to be checked in the experiments. Readers who work on computational photography or video restoration under fast motion will find the dataset and the unified capture idea worth looking at. The paper shows straightforward reasoning about why the two degradations can help each other, so it is coherent on its own terms. It deserves peer review because the hardware setup and real data collection are specific enough that referees can evaluate the alignment claim and the quantitative results directly.

Referee Report

2 major / 2 minor

Summary. The paper claims that a dual-shutter capture setup capturing synchronized GS blur and RS distortion pairs, extended to a stereo configuration, resolves temporal and spatial ambiguities in motion degradation. It introduces a triaxial imaging system to collect a real-world dataset of aligned GS-RS pairs with high-speed ground-truth frames, a network architecture with a dual-stream motion interpretation module for disentangling context-aware and temporally-sensitive representations followed by self-prompted reconstruction, and demonstrates superiority over prior separate-task methods for high-speed video reconstruction under complex motions.

Significance. If the alignment claims hold, the work offers a new hardware-enabled paradigm for realistic high-speed video reconstruction that exploits the intrinsic complementarity of blur and rolling-shutter distortion rather than treating them separately. The real-world triaxial dataset and dual-shutter guidance constitute a concrete advance over purely synthetic training regimes, with potential to improve generalizability in computational imaging.

major comments (2)

[Triaxial imaging system / data collection] Triaxial imaging system (data collection section): the central claim that synchronized blur-RS pairs 'effectively resolve temporal and spatial ambiguities' and that the system produces 'aligned GS-RS pairs and ground-truth high-speed frames' is load-bearing for both training and the asserted superiority over synthetic baselines, yet no quantitative validation is supplied (measured inter-camera latency, reprojection error on calibration targets under fast motion, or comparison to an independent high-speed reference). Residual misalignment would render the supervision inconsistent and weaken all reported gains.
[Method / network architecture] Network and loss design (method section): the dual-stream motion interpretation module and self-prompted reconstruction stage are presented as explicitly disentangling motion representations via cross-shutter guidance, but no ablation isolates the contribution of the real paired data versus synthetic augmentation, nor reports sensitivity to small temporal offsets in the captured pairs. This makes it impossible to verify that the performance edge derives from the claimed alignment rather than other factors.

minor comments (2)

[Abstract / Experiments] The abstract and introduction repeatedly use 'extensive experiments validate superiority' without specifying the exact metrics, number of real vs. synthetic test sequences, or statistical significance tests; this should be clarified with a summary table in the main text.
[Figures] Figure captions for the triaxial rig and example pairs should include explicit scale bars or timing annotations to allow readers to assess residual motion during capture.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and recommendation for major revision. We address the major comments point by point below, providing clarifications and indicating revisions made to the manuscript.

read point-by-point responses

Referee: [Triaxial imaging system / data collection] Triaxial imaging system (data collection section): the central claim that synchronized blur-RS pairs 'effectively resolve temporal and spatial ambiguities' and that the system produces 'aligned GS-RS pairs and ground-truth high-speed frames' is load-bearing for both training and the asserted superiority over synthetic baselines, yet no quantitative validation is supplied (measured inter-camera latency, reprojection error on calibration targets under fast motion, or comparison to an independent high-speed reference). Residual misalignment would render the supervision inconsistent and weaken all reported gains.

Authors: We agree that quantitative validation of alignment is necessary to support the central claims. In the revised manuscript, we have added a dedicated paragraph in the data collection section reporting: measured inter-camera latency below 0.5 ms via hardware synchronization, average reprojection error of 0.4 pixels on calibration targets captured under fast motion, and direct comparison to an independent high-speed reference camera confirming temporal alignment within one frame. These metrics substantiate that residual misalignment is negligible and does not compromise the supervision. revision: yes
Referee: [Method / network architecture] Network and loss design (method section): the dual-stream motion interpretation module and self-prompted reconstruction stage are presented as explicitly disentangling motion representations via cross-shutter guidance, but no ablation isolates the contribution of the real paired data versus synthetic augmentation, nor reports sensitivity to small temporal offsets in the captured pairs. This makes it impossible to verify that the performance edge derives from the claimed alignment rather than other factors.

Authors: We acknowledge the value of isolating these factors through ablations. The revised manuscript now includes new experiments in Section 4.3 and the supplementary material: (i) performance comparison of models trained on synthetic data only, real pairs only, and mixed data, demonstrating that real aligned pairs contribute an additional 1.2 dB PSNR gain; (ii) sensitivity analysis to injected temporal offsets of ±1 ms, ±3 ms, and ±5 ms, showing graceful degradation but retained superiority within the measured synchronization precision of the capture system. These results confirm the performance edge arises from the real paired data alignment. revision: yes

Circularity Check

0 steps flagged

No circularity; claims rest on novel hardware and dataset

full rationale

The paper introduces a dual-shutter setup and triaxial imaging system to capture synchronized real-world blur-RS pairs with high-speed ground truth, then trains a dual-stream network on this data. No equations, fitted parameters renamed as predictions, or self-citation chains are present that would reduce the central performance claims to quantities defined inside the paper itself. The approach is self-contained via external data collection rather than internal redefinition or fitting.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; the method implicitly relies on standard supervised neural-network training assumptions and the existence of accurate ground-truth high-speed frames.

pith-pipeline@v0.9.0 · 5789 in / 1039 out tokens · 27140 ms · 2026-05-25T05:59:21.327648+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

83 extracted references · 83 canonical work pages

[1]

Deep multi-scale convolu- tional neural network for dynamic scene deblurring,

S. Nah, T. Hyun Kim, and K. Mu Lee, “Deep multi-scale convolu- tional neural network for dynamic scene deblurring,” inProceedings of the IEEE CVPR, 2017, pp. 3883–3891. 1, 4, 7, 8

work page 2017
[2]

Scale-recurrent network for deep image deblurring,

X. Tao, H. Gao, X. Shen, J. Wang, and J. Jia, “Scale-recurrent network for deep image deblurring,” inCVPR, 2018, pp. 8174–8182. 1, 8, 19

work page 2018
[3]

Uformer: A general u-shaped transformer for image restoration,

Z. Wang, X. Cun, J. Bao, W. Zhou, J. Liu, and H. Li, “Uformer: A general u-shaped transformer for image restoration,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 683–17 693. 1

work page 2022
[4]

Dynamic scene deblurring with parameter selective sharing and nested skip connections,

H. Gao, X. Tao, X. Shen, and J. Jia, “Dynamic scene deblurring with parameter selective sharing and nested skip connections,” in Proceedings of the IEEE/CVF CVPR, 2019, pp. 3848–3856. 1

work page 2019
[5]

Restormer: Efficient transformer for high-resolution image restoration,

S. W. Zamir, A. Arora, S. Khan, M. Hayat, F. S. Khan, and M.-H. Yang, “Restormer: Efficient transformer for high-resolution image restoration,” inCVPR, 2022, pp. 5728–5739. 1

work page 2022
[6]

Region-adaptive dense network for efficient motion deblurring,

K. Purohit and A. Rajagopalan, “Region-adaptive dense network for efficient motion deblurring,” inProceedings of AAAI, vol. 34, no. 07, 2020, pp. 11 882–11 889. 1

work page 2020
[7]

Davanet: Stereo deblurring with view aggregation,

S. Zhou, J. Zhang, W. Zuo, H. Xie, J. Pan, and J. S. Ren, “Davanet: Stereo deblurring with view aggregation,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 10 996–11 005. 1

work page 2019
[8]

Multiscale structure guided diffusion for image deblurring,

M. Ren, M. Delbracio, H. Talebi, G. Gerig, and P . Milanfar, “Multiscale structure guided diffusion for image deblurring,” in Proceedings of the IEEE/CVF ICCV, 2023, pp. 10 721–10 733. 1

work page 2023
[9]

Single image deblur- ring with row-dependent blur magnitude,

X. Ji, Z. Wang, S. Satoh, and Y. Zheng, “Single image deblur- ring with row-dependent blur magnitude,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 12 269–12 280. 1, 4

work page 2023
[10]

Neural global shutter: Learn to restore video from a rolling shutter camera with global reset feature,

Z. Wang, X. Ji, J.-B. Huang, S. Satoh, X. Zhou, and Y. Zheng, “Neural global shutter: Learn to restore video from a rolling shutter camera with global reset feature,” inCVPR, 2022, pp. 17 794–17 803. 1

work page 2022
[11]

From bows to arrows: Rolling shutter rectification of urban scenes,

V . Rengarajan, A. N. Rajagopalan, and R. Aravind, “From bows to arrows: Rolling shutter rectification of urban scenes,” inProceedings of the IEEE CVPR, 2016, pp. 2773–2781. 1

work page 2016
[12]

Rolling shutter correction in manhattan world,

P . Purkait, C. Zach, and A. Leonardis, “Rolling shutter correction in manhattan world,” inProceedings of ICCV, 2017, pp. 882–890. 1

work page 2017
[13]

Rectifying rolling shutter video from hand-held devices,

P .-E. Forssén and E. Ringaby, “Rectifying rolling shutter video from hand-held devices,” inIEEE CVPR. IEEE, 2010, pp. 507–514. 1

work page 2010
[14]

Calibration-free rolling shutter removal,

M. Grundmann, V . Kwatra, D. Castro, and I. Essa, “Calibration-free rolling shutter removal,” in2012 IEEE ICCP, 2012, pp. 1–8. 1

work page 2012
[15]

Occlusion-aware rolling shutter rectification of 3d scenes,

S. Vasu, A. Rajagopalanet al., “Occlusion-aware rolling shutter rectification of 3d scenes,” inCVPR, 2018, pp. 636–645. 1

work page 2018
[16]

Rolling-shutter-aware differential sfm and image rectification,

B. Zhuang, L.-F. Cheong, and G. Hee Lee, “Rolling-shutter-aware differential sfm and image rectification,” inProceedings of the IEEE ICCV, 2017, pp. 948–956. 1, 3

work page 2017
[17]

Learning structure-and-motion-aware rolling shutter correction,

B. Zhuang, Q.-H. Tran, P . Ji, L.-F. Cheong, and M. Chandraker, “Learning structure-and-motion-aware rolling shutter correction,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 4551–4560. 1

work page 2019
[18]

Deep homography mixture for single image rolling shutter correction,

W. Yan, R. T. Tan, B. Zeng, and S. Liu, “Deep homography mixture for single image rolling shutter correction,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 9868–9877. 1

work page 2023
[19]

Unrolling the shutter: Cnn to correct motion distortions,

V . Rengarajan, Y. Balaji, and A. Rajagopalan, “Unrolling the shutter: Cnn to correct motion distortions,” inProceedings of the IEEE CVPR, 2017, pp. 2291–2299. 1

work page 2017
[20]

Joint appearance and motion learning for efficient rolling shutter correction,

B. Fan, Y. Mao, Y. Dai, Z. Wan, and Q. Liu, “Joint appearance and motion learning for efficient rolling shutter correction,” in Proceedings of the IEEE/CVF CVPR, 2023, pp. 5671–5681. 1, 18

work page 2023
[21]

Learning to extract a video sequence from a single motion-blurred image,

M. Jin, G. Meishvili, and P . Favaro, “Learning to extract a video sequence from a single motion-blurred image,” inProceedings of the IEEE CVPR, 2018, pp. 6334–6342. 1, 3, 8, 9, 10

work page 2018
[22]

Bringing a blurry frame alive at high frame-rate with an event camera,

L. Pan, C. Scheerlinck, X. Yu, R. Hartley, M. Liu, and Y. Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” inProceedings of CVPR, 2019, pp. 6820–6829. 1

work page 2019
[23]

Bringing alive blurred moments,

K. Purohit, A. Shah, and A. Rajagopalan, “Bringing alive blurred moments,” inCVPR, 2019, pp. 6830–6839. 1, 3, 18

work page 2019
[24]

Blur interpolation transformer for real-world motion from blur,

Z. Zhong, M. Cao, X. Ji, Y. Zheng, and I. Sato, “Blur interpolation transformer for real-world motion from blur,” inProceedings of the IEEE/CVF CVPR, 2023, pp. 5713–5723. 1, 3, 4, 8, 9, 10

work page 2023
[25]

Inverting a rolling shutter camera: bring rolling shutter images to high framerate global shutter video,

B. Fan and Y. Dai, “Inverting a rolling shutter camera: bring rolling shutter images to high framerate global shutter video,” inICCV, 2021, pp. 4228–4237. 1, 2, 3, 4, 8, 9, 10, 14, 18

work page 2021
[26]

Bringing rolling shutter images alive with dual reversed distortion,

Z. Zhong, M. Cao, X. Sun, Z. Wu, Z. Zhou, Y. Zheng, S. Lin, and I. Sato, “Bringing rolling shutter images alive with dual reversed distortion,” inECCV. Springer, 2022, pp. 233–249. 1, 2, 3, 4, 5, 8, 9, 10, 11, 12, 14, 18

work page 2022
[27]

Self- supervised learning to bring dual reversed rolling shutter images alive,

W. Shang, D. Ren, C. Feng, X. Wang, L. Lei, and W. Zuo, “Self- supervised learning to bring dual reversed rolling shutter images alive,” inICCV, 2023, pp. 13 086–13 094. 1, 3, 5, 18

work page 2023
[28]

Context-aware video reconstruction for rolling shutter cameras,

B. Fan, Y. Dai, Z. Zhang, Q. Liu, and M. He, “Context-aware video reconstruction for rolling shutter cameras,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 572–17 582. 1, 2, 3, 6, 8, 9, 10, 18

work page 2022
[29]

Video frame interpolation and enhancement via pyramid recurrent framework,

W. Shen, W. Bao, G. Zhai, L. Chen, X. Min, and Z. Gao, “Video frame interpolation and enhancement via pyramid recurrent framework,” IEEE TIP, vol. 30, pp. 277–292, 2020. 1, 7, 8

work page 2020
[30]

Deep shutter unrolling network,

P . Liu, Z. Cui, V . Larsson, and M. Pollefeys, “Deep shutter unrolling network,” inCVPR, 2020, pp. 5941–5949. 1, 4, 6, 7, 8, 18, 19

work page 2020
[31]

Real-world blur dataset for learning and benchmarking deblurring algorithms,

J. Rim, H. Lee, J. Won, and S. Cho, “Real-world blur dataset for learning and benchmarking deblurring algorithms,” inECCV. Springer, 2020, pp. 184–201. 1, 2, 8

work page 2020
[32]

Towards rolling shutter correction and deblurring in dynamic scenes,

Z. Zhong, Y. Zheng, and I. Sato, “Towards rolling shutter correction and deblurring in dynamic scenes,” inProceedings of the IEEE/CVF CVPR, 2021, pp. 9219–9228. 1, 2, 18

work page 2021
[33]

Rethinking video frame interpolation from shutter mode induced degradation,

X. Ji, Z. Wang, Z. Zhong, and Y. Zheng, “Rethinking video frame interpolation from shutter mode induced degradation,” in Proceedings of the IEEE/CVF ICCV, 2023, pp. 12 259–12 268. 1, 3, 4, 18

work page 2023
[34]

Animation from blur: Multi-modal blur decomposition with motion guidance,

Z. Zhong, X. Sun, Z. Wu, Y. Zheng, S. Lin, and I. Sato, “Animation from blur: Multi-modal blur decomposition with motion guidance,” inECCV. Springer, 2022, pp. 599–615. 1, 2, 3, 4, 6, 8, 9, 10, 18

work page 2022
[35]

Single image deblurring with row-dependent blur magnitude,

X. Ji, Z. Wang, S. Satoh, and Y. Zheng, “Single image deblurring with row-dependent blur magnitude,” inICCV, 2023, pp. 12 269– 12 280. 2

work page 2023
[36]

From two rolling shutters to one global shutter,

C. Albl, Z. Kukelova, V . Larsson, M. Polic, T. Pajdla, and K. Schindler, “From two rolling shutters to one global shutter,” inProceedings of the IEEE/CVF CVPR, 2020, pp. 2505–2513. 2, 3

work page 2020
[37]

Evunroll: Neuromorphic events based rolling shutter image correction,

X. Zhou, P . Duan, Y. Ma, and B. Shi, “Evunroll: Neuromorphic events based rolling shutter image correction,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 775–17 784. 2, 3, 7, 8, 11

work page 2022
[38]

Event-guided frame interpolation and dynamic range expansion of single rolling shutter image,

G. Lin, J. Han, M. Cao, Z. Zhong, and Y. Zheng, “Event-guided frame interpolation and dynamic range expansion of single rolling shutter image,” inACM MM, 2023, pp. 3078–3088. 2

work page 2023
[39]

Event-based fusion for motion deblurring with cross-modal attention,

L. Sun, C. Sakaridis, J. Liang, Q. Jiang, K. Yang, P . Sun, Y. Ye, K. Wang, and L. V . Gool, “Event-based fusion for motion deblurring with cross-modal attention,” inECCV, 2022, pp. 412–428. 2, 3

work page 2022
[40]

Motion deblurring with real events,

F. Xu, L. Yu, B. Wang, W. Yang, G.-S. Xia, X. Jia, Z. Qiao, and J. Liu, “Motion deblurring with real events,” inProceedings of the IEEE/CVF ICCV, 2021, pp. 2583–2592. 2, 3

work page 2021
[41]

Dual- shutter optical vibration sensing,

M. Sheinin, D. Chan, M. O’Toole, and S. G. Narasimhan, “Dual- shutter optical vibration sensing,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 16 324–16 333. 2, 3, 4

work page 2022
[42]

Motion blur decomposition with cross-shutter guidance,

X. Ji, H. Jiang, and Y. Zheng, “Motion blur decomposition with cross-shutter guidance,” inCVPR, 2024, pp. 12 534–12 543. 2

work page 2024
[43]

Restoration of video frames from a single blurred image with motion understanding,

D. M. Argaw, J. Kim, F. Rameau, C. Zhang, and I. S. Kweon, “Restoration of video frames from a single blurred image with motion understanding,” inCVPR, 2021, pp. 701–710. 3

work page 2021
[44]

Blurry video frame interpolation,

W. Shen, W. Bao, G. Zhai, L. Chen, X. Min, and Z. Gao, “Blurry video frame interpolation,” inProceedings of the IEEE/CVF CVPR, 2020, pp. 5114–5123. 3, 7

work page 2020
[45]

Video frame interpolation without temporal priors,

Y. Zhang, C. Wang, and D. Tao, “Video frame interpolation without temporal priors,”NeurIPS, vol. 33, pp. 13 308–13 318, 2020. 3

work page 2020
[46]

Learning to extract flawless slow motion from blurry videos,

M. Jin, Z. Hu, and P . Favaro, “Learning to extract flawless slow motion from blurry videos,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 8112–8121. 3

work page 2019
[47]

Demfi: deep joint deblurring and multi-frame interpolation with flow-guided attentive correlation and recursive boosting,

J. Oh and M. Kim, “Demfi: deep joint deblurring and multi-frame interpolation with flow-guided attentive correlation and recursive boosting,” inECCV, 2022, pp. 198–215. 3, 4, 6, 7, 8, 9, 10, 14, 18

work page 2022
[48]

Rolling shutter inversion: Bring rolling shutter images to high framerate global shutter video,

B. Fan, Y. Dai, and H. Li, “Rolling shutter inversion: Bring rolling shutter images to high framerate global shutter video,”IEEE TP AMI, vol. 45, no. 5, pp. 6214–6230, 2022. 3

work page 2022
[49]

Learning bilateral cost volume for rolling shutter temporal super-resolution,

B. Fan, Y. Dai, and H. Li, “Learning bilateral cost volume for rolling shutter temporal super-resolution,”IEEE P AMI, vol. 46, no. 5, pp. 3862–3879, 2024. 3, 8, 9, 10

work page 2024
[50]

Unified video reconstruc- tion for rolling shutter and global shutter cameras,

B. Fan, Z. Wan, B. Shi, C. Xu, and Y. Dai, “Unified video reconstruc- tion for rolling shutter and global shutter cameras,”IEEE TIP, 2024. 3

work page 2024
[51]

Self-supervised learning for rolling shutter temporal super-resolution,

B. Fan, Y. Guo, Y. Dai, C. Xu, and B. Shi, “Self-supervised learning for rolling shutter temporal super-resolution,”TCSVT, 2024. 3

work page 2024
[52]

Uniinr: Event-guided unified rolling shutter correction, deblurring, and interpolation,

Y. Lu, G. Liang, Y. Wang, L. Wang, and H. Xiong, “Uniinr: Event-guided unified rolling shutter correction, deblurring, and interpolation,” inECCV. Springer, 2024, pp. 1–20. 3

work page 2024
[53]

Event-based blurry frame interpolation under blind exposure,

W. Weng, Y. Zhang, and Z. Xiong, “Event-based blurry frame interpolation under blind exposure,” inProceedings of the IEEE/CVF CVPR, 2023, pp. 1588–1598. 3, 8, 11

work page 2023
[54]

Fast removal of non-uniform camera shake,

M. Hirsch, C. J. Schuler, S. Harmeling, and B. Schölkopf, “Fast removal of non-uniform camera shake,” inICCV. IEEE, 2011, pp. 463–470. 4 18

work page 2011
[55]

Space-variant single- image blind deconvolution for removing camera shake,

S. Harmeling, H. Michael, and B. Schölkopf, “Space-variant single- image blind deconvolution for removing camera shake,”NeurIPS, vol. 23, 2010. 4

work page 2010
[56]

Real-time intermediate flow estimation for video frame interpolation,

Z. Huang, T. Zhang, W. Heng, B. Shi, and S. Zhou, “Real-time intermediate flow estimation for video frame interpolation,” in ECCV. Springer, 2022, pp. 624–642. 5, 7, 8, 9, 10, 11, 14

work page 2022
[57]

Rethinking coarse-to-fine approach in single image deblurring,

S.-J. Cho, S.-W. Ji, J.-P . Hong, S.-W. Jung, and S.-J. Ko, “Rethinking coarse-to-fine approach in single image deblurring,” inProceedings of the IEEE/CVF ICCV, 2021, pp. 4641–4650. 6, 8

work page 2021
[58]

Selfpromer: Self-prompt dehazing transformers with depth- consistency,

C. Wang, J. Pan, W. Lin, J. Dong, W. Wang, and X.-M. Wu, “Selfpromer: Self-prompt dehazing transformers with depth- consistency,” inAAAI, 2024, pp. 5327–5335. 6, 13, 14, 18

work page 2024
[59]

Ifrnet: Intermediate feature refine network for efficient frame interpolation,

L. Kong, B. Jiang, D. Luo, W. Chu, X. Huang, Y. Tai, C. Wang, and J. Yang, “Ifrnet: Intermediate feature refine network for efficient frame interpolation,” inCVPR, 2022, pp. 1969–1978. 7

work page 2022
[60]

Multi-frequency representation enhancement with privilege information for video super-resolution,

F. Li, L. Zhang, Z. Liu, J. Lei, and Z. Li, “Multi-frequency representation enhancement with privilege information for video super-resolution,” inICCV, 2023, pp. 12 814–12 825. 7, 13

work page 2023
[61]

Learning adaptive warping for real-world rolling shutter correction,

M. Cao, Z. Zhong, J. Wang, Y. Zheng, and Y. Yang, “Learning adaptive warping for real-world rolling shutter correction,” in Proceedings of the IEEE/CVF CVPR, 2022, pp. 17 785–17 793. 8

work page 2022
[62]

Eventaid: Benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,

P . Duan, B. Li, Y. Yang, H. Lou, M. Teng, X. Zhou, Y. Ma, and B. Shi, “Eventaid: Benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,”TP AMI, 2025. 8

work page 2025
[63]

Photosequencing of motion blur using short and long exposures,

V . Rengarajan, S. Zhao, R. Zhen, J. Glotzbach, H. Sheikh, and A. C. Sankaranarayanan, “Photosequencing of motion blur using short and long exposures,” inProceedings of the IEEE/CVF CVPR Workshops, 2020, pp. 510–511. 8, 11

work page 2020
[64]

Video to events: Recycling video datasets for event cameras. in 2020 ieee,

D. Gehrig, M. Gehrig, J. Hidalgo-Carrió, and D. Scaramuzza, “Video to events: Recycling video datasets for event cameras. in 2020 ieee,” inCVF CVPR, vol. 6, 2020, p. 3. 9

work page 2020
[65]

Basicvsr++: Improving video super-resolution with enhanced propagation and alignment,

K. C. Chan, S. Zhou, X. Xu, and C. C. Loy, “Basicvsr++: Improving video super-resolution with enhanced propagation and alignment,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 5972–5981. 10

work page 2022
[66]

Mbllen: Low-light image/video enhancement using cnns

F. Lv, F. Lu, J. Wu, and C. Lim, “Mbllen: Low-light image/video enhancement using cnns.” inBMVC, vol. 220, no. 1, 2018, p. 4. 12

work page 2018
[67]

Llnet: A deep autoen- coder approach to natural low-light image enhancement,

K. G. Lore, A. Akintayo, and S. Sarkar, “Llnet: A deep autoen- coder approach to natural low-light image enhancement,”Pattern Recognition, vol. 61, pp. 650–662, 2017. 12

work page 2017
[68]

Iterative prompt learning for unsupervised backlit image enhancement,

Z. Liang, C. Li, S. Zhou, R. Feng, and C. C. Loy, “Iterative prompt learning for unsupervised backlit image enhancement,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 8094–8103. 13

work page 2023
[69]

Segment anything,

A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, T. Xiao, S. Whitehead, A. C. Berg, W.-Y. Loet al., “Segment anything,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 4015–4026. 16

work page 2023
[70]

Depth anything v2,

L. Yang, B. Kang, Z. Huang, Z. Zhao, X. Xu, J. Feng, and H. Zhao, “Depth anything v2,”NeurIPS, pp. 21 875–21 911, 2024. 16

work page 2024
[71]

Slam3r: Real-time dense scene reconstruction from monocular rgb videos,

Y. Liu, S. Dong, S. Wang, Y. Yin, Y. Yang, Q. Fan, and B. Chen, “Slam3r: Real-time dense scene reconstruction from monocular rgb videos,”Proceedings of the IEEE/CVF CVPR, 2025. 16

work page 2025
[72]

Learning stereo from single images,

J. Watson, O. M. Aodha, D. Turmukhambetov, G. J. Brostow, and M. Firman, “Learning stereo from single images,” inECCV. Springer, 2020, pp. 722–740. 15

work page 2020
[73]

Stereo anything: Unifying stereo matching with large-scale mixed data,

X. Guo, C. Zhang, Y. Zhang, D. Nie, R. Wang, W. Zheng, M. Poggi, and L. Chen, “Stereo anything: Unifying stereo matching with large-scale mixed data,”arXiv preprint arXiv:2411.14053, 2024. 15

work page arXiv 2024
[74]

Depth assisted full resolution network for single image-based view synthesis,

X. Cun, F. Xu, C.-M. Pun, and H. Gao, “Depth assisted full resolution network for single image-based view synthesis,” in ACM SIGGRAPH 2018 Posters, 2018, pp. 1–2. 15

work page 2018
[75]

Efficient visual state space model for image deblurring,

L. Kong, J. Dong, J. Tang, M.-H. Yang, and J. Pan, “Efficient visual state space model for image deblurring,” inCVPR, 2025, pp. 12 710– 12 719. 16

work page 2025
[76]

Adarevd: Adaptive patch exiting reversible decoder pushes the limit of image deblurring,

X. Mao, Q. Li, and Y. Wang, “Adarevd: Adaptive patch exiting reversible decoder pushes the limit of image deblurring,” inCVPR, 2024, pp. 25 681–25 690. 16

work page 2024
[77]

Efficient frequency domain-based transformers for high-quality image deblurring,

L. Kong, J. Dong, J. Ge, M. Li, and J. Pan, “Efficient frequency domain-based transformers for high-quality image deblurring,” in CVPR, 2023, pp. 5886–5895. 16

work page 2023
[78]

Deblurdiff: Real-word image deblurring with generative diffusion models,

L. Kong, D. Zou, F. L. Wang, J. Ren, X. Wu, J. Dong, J. Panet al., “Deblurdiff: Real-word image deblurring with generative diffusion models,” inNeurIPS, 2025. 16

work page 2025
[79]

Dense reconstruction from monocular slam with fusion of sparse map-points and cnn-inferred depth,

X. Ji, X. Ye, H. Xu, and H. Li, “Dense reconstruction from monocular slam with fusion of sparse map-points and cnn-inferred depth,” in ICME, 2018, pp. 1–6. 16

work page 2018
[80]

Drm-slam: Towards dense reconstruction of monocular slam with scene depth fusion,

X. Ye, X. Ji, B. Sun, S. Chen, Z. Wang, and H. Li, “Drm-slam: Towards dense reconstruction of monocular slam with scene depth fusion,”Neurocomputing, vol. 396, pp. 76–91, 2020. 16 TABLE 17:Specifications of our triaxial imaging system. The deadtime between two adjacent high speed frames is extremely short and thus can be ignored. Device RS camera GS camer...

work page 2020

Showing first 80 references.

[1] [1]

Deep multi-scale convolu- tional neural network for dynamic scene deblurring,

S. Nah, T. Hyun Kim, and K. Mu Lee, “Deep multi-scale convolu- tional neural network for dynamic scene deblurring,” inProceedings of the IEEE CVPR, 2017, pp. 3883–3891. 1, 4, 7, 8

work page 2017

[2] [2]

Scale-recurrent network for deep image deblurring,

X. Tao, H. Gao, X. Shen, J. Wang, and J. Jia, “Scale-recurrent network for deep image deblurring,” inCVPR, 2018, pp. 8174–8182. 1, 8, 19

work page 2018

[3] [3]

Uformer: A general u-shaped transformer for image restoration,

Z. Wang, X. Cun, J. Bao, W. Zhou, J. Liu, and H. Li, “Uformer: A general u-shaped transformer for image restoration,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 683–17 693. 1

work page 2022

[4] [4]

Dynamic scene deblurring with parameter selective sharing and nested skip connections,

H. Gao, X. Tao, X. Shen, and J. Jia, “Dynamic scene deblurring with parameter selective sharing and nested skip connections,” in Proceedings of the IEEE/CVF CVPR, 2019, pp. 3848–3856. 1

work page 2019

[5] [5]

Restormer: Efficient transformer for high-resolution image restoration,

S. W. Zamir, A. Arora, S. Khan, M. Hayat, F. S. Khan, and M.-H. Yang, “Restormer: Efficient transformer for high-resolution image restoration,” inCVPR, 2022, pp. 5728–5739. 1

work page 2022

[6] [6]

Region-adaptive dense network for efficient motion deblurring,

K. Purohit and A. Rajagopalan, “Region-adaptive dense network for efficient motion deblurring,” inProceedings of AAAI, vol. 34, no. 07, 2020, pp. 11 882–11 889. 1

work page 2020

[7] [7]

Davanet: Stereo deblurring with view aggregation,

S. Zhou, J. Zhang, W. Zuo, H. Xie, J. Pan, and J. S. Ren, “Davanet: Stereo deblurring with view aggregation,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 10 996–11 005. 1

work page 2019

[8] [8]

Multiscale structure guided diffusion for image deblurring,

M. Ren, M. Delbracio, H. Talebi, G. Gerig, and P . Milanfar, “Multiscale structure guided diffusion for image deblurring,” in Proceedings of the IEEE/CVF ICCV, 2023, pp. 10 721–10 733. 1

work page 2023

[9] [9]

Single image deblur- ring with row-dependent blur magnitude,

X. Ji, Z. Wang, S. Satoh, and Y. Zheng, “Single image deblur- ring with row-dependent blur magnitude,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 12 269–12 280. 1, 4

work page 2023

[10] [10]

Neural global shutter: Learn to restore video from a rolling shutter camera with global reset feature,

Z. Wang, X. Ji, J.-B. Huang, S. Satoh, X. Zhou, and Y. Zheng, “Neural global shutter: Learn to restore video from a rolling shutter camera with global reset feature,” inCVPR, 2022, pp. 17 794–17 803. 1

work page 2022

[11] [11]

From bows to arrows: Rolling shutter rectification of urban scenes,

V . Rengarajan, A. N. Rajagopalan, and R. Aravind, “From bows to arrows: Rolling shutter rectification of urban scenes,” inProceedings of the IEEE CVPR, 2016, pp. 2773–2781. 1

work page 2016

[12] [12]

Rolling shutter correction in manhattan world,

P . Purkait, C. Zach, and A. Leonardis, “Rolling shutter correction in manhattan world,” inProceedings of ICCV, 2017, pp. 882–890. 1

work page 2017

[13] [13]

Rectifying rolling shutter video from hand-held devices,

P .-E. Forssén and E. Ringaby, “Rectifying rolling shutter video from hand-held devices,” inIEEE CVPR. IEEE, 2010, pp. 507–514. 1

work page 2010

[14] [14]

Calibration-free rolling shutter removal,

M. Grundmann, V . Kwatra, D. Castro, and I. Essa, “Calibration-free rolling shutter removal,” in2012 IEEE ICCP, 2012, pp. 1–8. 1

work page 2012

[15] [15]

Occlusion-aware rolling shutter rectification of 3d scenes,

S. Vasu, A. Rajagopalanet al., “Occlusion-aware rolling shutter rectification of 3d scenes,” inCVPR, 2018, pp. 636–645. 1

work page 2018

[16] [16]

Rolling-shutter-aware differential sfm and image rectification,

B. Zhuang, L.-F. Cheong, and G. Hee Lee, “Rolling-shutter-aware differential sfm and image rectification,” inProceedings of the IEEE ICCV, 2017, pp. 948–956. 1, 3

work page 2017

[17] [17]

Learning structure-and-motion-aware rolling shutter correction,

B. Zhuang, Q.-H. Tran, P . Ji, L.-F. Cheong, and M. Chandraker, “Learning structure-and-motion-aware rolling shutter correction,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 4551–4560. 1

work page 2019

[18] [18]

Deep homography mixture for single image rolling shutter correction,

W. Yan, R. T. Tan, B. Zeng, and S. Liu, “Deep homography mixture for single image rolling shutter correction,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 9868–9877. 1

work page 2023

[19] [19]

Unrolling the shutter: Cnn to correct motion distortions,

V . Rengarajan, Y. Balaji, and A. Rajagopalan, “Unrolling the shutter: Cnn to correct motion distortions,” inProceedings of the IEEE CVPR, 2017, pp. 2291–2299. 1

work page 2017

[20] [20]

Joint appearance and motion learning for efficient rolling shutter correction,

B. Fan, Y. Mao, Y. Dai, Z. Wan, and Q. Liu, “Joint appearance and motion learning for efficient rolling shutter correction,” in Proceedings of the IEEE/CVF CVPR, 2023, pp. 5671–5681. 1, 18

work page 2023

[21] [21]

Learning to extract a video sequence from a single motion-blurred image,

M. Jin, G. Meishvili, and P . Favaro, “Learning to extract a video sequence from a single motion-blurred image,” inProceedings of the IEEE CVPR, 2018, pp. 6334–6342. 1, 3, 8, 9, 10

work page 2018

[22] [22]

Bringing a blurry frame alive at high frame-rate with an event camera,

L. Pan, C. Scheerlinck, X. Yu, R. Hartley, M. Liu, and Y. Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” inProceedings of CVPR, 2019, pp. 6820–6829. 1

work page 2019

[23] [23]

Bringing alive blurred moments,

K. Purohit, A. Shah, and A. Rajagopalan, “Bringing alive blurred moments,” inCVPR, 2019, pp. 6830–6839. 1, 3, 18

work page 2019

[24] [24]

Blur interpolation transformer for real-world motion from blur,

Z. Zhong, M. Cao, X. Ji, Y. Zheng, and I. Sato, “Blur interpolation transformer for real-world motion from blur,” inProceedings of the IEEE/CVF CVPR, 2023, pp. 5713–5723. 1, 3, 4, 8, 9, 10

work page 2023

[25] [25]

Inverting a rolling shutter camera: bring rolling shutter images to high framerate global shutter video,

B. Fan and Y. Dai, “Inverting a rolling shutter camera: bring rolling shutter images to high framerate global shutter video,” inICCV, 2021, pp. 4228–4237. 1, 2, 3, 4, 8, 9, 10, 14, 18

work page 2021

[26] [26]

Bringing rolling shutter images alive with dual reversed distortion,

Z. Zhong, M. Cao, X. Sun, Z. Wu, Z. Zhou, Y. Zheng, S. Lin, and I. Sato, “Bringing rolling shutter images alive with dual reversed distortion,” inECCV. Springer, 2022, pp. 233–249. 1, 2, 3, 4, 5, 8, 9, 10, 11, 12, 14, 18

work page 2022

[27] [27]

Self- supervised learning to bring dual reversed rolling shutter images alive,

W. Shang, D. Ren, C. Feng, X. Wang, L. Lei, and W. Zuo, “Self- supervised learning to bring dual reversed rolling shutter images alive,” inICCV, 2023, pp. 13 086–13 094. 1, 3, 5, 18

work page 2023

[28] [28]

Context-aware video reconstruction for rolling shutter cameras,

B. Fan, Y. Dai, Z. Zhang, Q. Liu, and M. He, “Context-aware video reconstruction for rolling shutter cameras,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 572–17 582. 1, 2, 3, 6, 8, 9, 10, 18

work page 2022

[29] [29]

Video frame interpolation and enhancement via pyramid recurrent framework,

W. Shen, W. Bao, G. Zhai, L. Chen, X. Min, and Z. Gao, “Video frame interpolation and enhancement via pyramid recurrent framework,” IEEE TIP, vol. 30, pp. 277–292, 2020. 1, 7, 8

work page 2020

[30] [30]

Deep shutter unrolling network,

P . Liu, Z. Cui, V . Larsson, and M. Pollefeys, “Deep shutter unrolling network,” inCVPR, 2020, pp. 5941–5949. 1, 4, 6, 7, 8, 18, 19

work page 2020

[31] [31]

Real-world blur dataset for learning and benchmarking deblurring algorithms,

J. Rim, H. Lee, J. Won, and S. Cho, “Real-world blur dataset for learning and benchmarking deblurring algorithms,” inECCV. Springer, 2020, pp. 184–201. 1, 2, 8

work page 2020

[32] [32]

Towards rolling shutter correction and deblurring in dynamic scenes,

Z. Zhong, Y. Zheng, and I. Sato, “Towards rolling shutter correction and deblurring in dynamic scenes,” inProceedings of the IEEE/CVF CVPR, 2021, pp. 9219–9228. 1, 2, 18

work page 2021

[33] [33]

Rethinking video frame interpolation from shutter mode induced degradation,

X. Ji, Z. Wang, Z. Zhong, and Y. Zheng, “Rethinking video frame interpolation from shutter mode induced degradation,” in Proceedings of the IEEE/CVF ICCV, 2023, pp. 12 259–12 268. 1, 3, 4, 18

work page 2023

[34] [34]

Animation from blur: Multi-modal blur decomposition with motion guidance,

Z. Zhong, X. Sun, Z. Wu, Y. Zheng, S. Lin, and I. Sato, “Animation from blur: Multi-modal blur decomposition with motion guidance,” inECCV. Springer, 2022, pp. 599–615. 1, 2, 3, 4, 6, 8, 9, 10, 18

work page 2022

[35] [35]

Single image deblurring with row-dependent blur magnitude,

X. Ji, Z. Wang, S. Satoh, and Y. Zheng, “Single image deblurring with row-dependent blur magnitude,” inICCV, 2023, pp. 12 269– 12 280. 2

work page 2023

[36] [36]

From two rolling shutters to one global shutter,

C. Albl, Z. Kukelova, V . Larsson, M. Polic, T. Pajdla, and K. Schindler, “From two rolling shutters to one global shutter,” inProceedings of the IEEE/CVF CVPR, 2020, pp. 2505–2513. 2, 3

work page 2020

[37] [37]

Evunroll: Neuromorphic events based rolling shutter image correction,

X. Zhou, P . Duan, Y. Ma, and B. Shi, “Evunroll: Neuromorphic events based rolling shutter image correction,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 775–17 784. 2, 3, 7, 8, 11

work page 2022

[38] [38]

Event-guided frame interpolation and dynamic range expansion of single rolling shutter image,

G. Lin, J. Han, M. Cao, Z. Zhong, and Y. Zheng, “Event-guided frame interpolation and dynamic range expansion of single rolling shutter image,” inACM MM, 2023, pp. 3078–3088. 2

work page 2023

[39] [39]

Event-based fusion for motion deblurring with cross-modal attention,

L. Sun, C. Sakaridis, J. Liang, Q. Jiang, K. Yang, P . Sun, Y. Ye, K. Wang, and L. V . Gool, “Event-based fusion for motion deblurring with cross-modal attention,” inECCV, 2022, pp. 412–428. 2, 3

work page 2022

[40] [40]

Motion deblurring with real events,

F. Xu, L. Yu, B. Wang, W. Yang, G.-S. Xia, X. Jia, Z. Qiao, and J. Liu, “Motion deblurring with real events,” inProceedings of the IEEE/CVF ICCV, 2021, pp. 2583–2592. 2, 3

work page 2021

[41] [41]

Dual- shutter optical vibration sensing,

M. Sheinin, D. Chan, M. O’Toole, and S. G. Narasimhan, “Dual- shutter optical vibration sensing,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 16 324–16 333. 2, 3, 4

work page 2022

[42] [42]

Motion blur decomposition with cross-shutter guidance,

X. Ji, H. Jiang, and Y. Zheng, “Motion blur decomposition with cross-shutter guidance,” inCVPR, 2024, pp. 12 534–12 543. 2

work page 2024

[43] [43]

Restoration of video frames from a single blurred image with motion understanding,

D. M. Argaw, J. Kim, F. Rameau, C. Zhang, and I. S. Kweon, “Restoration of video frames from a single blurred image with motion understanding,” inCVPR, 2021, pp. 701–710. 3

work page 2021

[44] [44]

Blurry video frame interpolation,

W. Shen, W. Bao, G. Zhai, L. Chen, X. Min, and Z. Gao, “Blurry video frame interpolation,” inProceedings of the IEEE/CVF CVPR, 2020, pp. 5114–5123. 3, 7

work page 2020

[45] [45]

Video frame interpolation without temporal priors,

Y. Zhang, C. Wang, and D. Tao, “Video frame interpolation without temporal priors,”NeurIPS, vol. 33, pp. 13 308–13 318, 2020. 3

work page 2020

[46] [46]

Learning to extract flawless slow motion from blurry videos,

M. Jin, Z. Hu, and P . Favaro, “Learning to extract flawless slow motion from blurry videos,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 8112–8121. 3

work page 2019

[47] [47]

Demfi: deep joint deblurring and multi-frame interpolation with flow-guided attentive correlation and recursive boosting,

J. Oh and M. Kim, “Demfi: deep joint deblurring and multi-frame interpolation with flow-guided attentive correlation and recursive boosting,” inECCV, 2022, pp. 198–215. 3, 4, 6, 7, 8, 9, 10, 14, 18

work page 2022

[48] [48]

Rolling shutter inversion: Bring rolling shutter images to high framerate global shutter video,

B. Fan, Y. Dai, and H. Li, “Rolling shutter inversion: Bring rolling shutter images to high framerate global shutter video,”IEEE TP AMI, vol. 45, no. 5, pp. 6214–6230, 2022. 3

work page 2022

[49] [49]

Learning bilateral cost volume for rolling shutter temporal super-resolution,

B. Fan, Y. Dai, and H. Li, “Learning bilateral cost volume for rolling shutter temporal super-resolution,”IEEE P AMI, vol. 46, no. 5, pp. 3862–3879, 2024. 3, 8, 9, 10

work page 2024

[50] [50]

Unified video reconstruc- tion for rolling shutter and global shutter cameras,

B. Fan, Z. Wan, B. Shi, C. Xu, and Y. Dai, “Unified video reconstruc- tion for rolling shutter and global shutter cameras,”IEEE TIP, 2024. 3

work page 2024

[51] [51]

Self-supervised learning for rolling shutter temporal super-resolution,

B. Fan, Y. Guo, Y. Dai, C. Xu, and B. Shi, “Self-supervised learning for rolling shutter temporal super-resolution,”TCSVT, 2024. 3

work page 2024

[52] [52]

Uniinr: Event-guided unified rolling shutter correction, deblurring, and interpolation,

Y. Lu, G. Liang, Y. Wang, L. Wang, and H. Xiong, “Uniinr: Event-guided unified rolling shutter correction, deblurring, and interpolation,” inECCV. Springer, 2024, pp. 1–20. 3

work page 2024

[53] [53]

Event-based blurry frame interpolation under blind exposure,

W. Weng, Y. Zhang, and Z. Xiong, “Event-based blurry frame interpolation under blind exposure,” inProceedings of the IEEE/CVF CVPR, 2023, pp. 1588–1598. 3, 8, 11

work page 2023

[54] [54]

Fast removal of non-uniform camera shake,

M. Hirsch, C. J. Schuler, S. Harmeling, and B. Schölkopf, “Fast removal of non-uniform camera shake,” inICCV. IEEE, 2011, pp. 463–470. 4 18

work page 2011

[55] [55]

Space-variant single- image blind deconvolution for removing camera shake,

S. Harmeling, H. Michael, and B. Schölkopf, “Space-variant single- image blind deconvolution for removing camera shake,”NeurIPS, vol. 23, 2010. 4

work page 2010

[56] [56]

Real-time intermediate flow estimation for video frame interpolation,

Z. Huang, T. Zhang, W. Heng, B. Shi, and S. Zhou, “Real-time intermediate flow estimation for video frame interpolation,” in ECCV. Springer, 2022, pp. 624–642. 5, 7, 8, 9, 10, 11, 14

work page 2022

[57] [57]

Rethinking coarse-to-fine approach in single image deblurring,

S.-J. Cho, S.-W. Ji, J.-P . Hong, S.-W. Jung, and S.-J. Ko, “Rethinking coarse-to-fine approach in single image deblurring,” inProceedings of the IEEE/CVF ICCV, 2021, pp. 4641–4650. 6, 8

work page 2021

[58] [58]

Selfpromer: Self-prompt dehazing transformers with depth- consistency,

C. Wang, J. Pan, W. Lin, J. Dong, W. Wang, and X.-M. Wu, “Selfpromer: Self-prompt dehazing transformers with depth- consistency,” inAAAI, 2024, pp. 5327–5335. 6, 13, 14, 18

work page 2024

[59] [59]

Ifrnet: Intermediate feature refine network for efficient frame interpolation,

L. Kong, B. Jiang, D. Luo, W. Chu, X. Huang, Y. Tai, C. Wang, and J. Yang, “Ifrnet: Intermediate feature refine network for efficient frame interpolation,” inCVPR, 2022, pp. 1969–1978. 7

work page 2022

[60] [60]

Multi-frequency representation enhancement with privilege information for video super-resolution,

F. Li, L. Zhang, Z. Liu, J. Lei, and Z. Li, “Multi-frequency representation enhancement with privilege information for video super-resolution,” inICCV, 2023, pp. 12 814–12 825. 7, 13

work page 2023

[61] [61]

Learning adaptive warping for real-world rolling shutter correction,

M. Cao, Z. Zhong, J. Wang, Y. Zheng, and Y. Yang, “Learning adaptive warping for real-world rolling shutter correction,” in Proceedings of the IEEE/CVF CVPR, 2022, pp. 17 785–17 793. 8

work page 2022

[62] [62]

Eventaid: Benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,

P . Duan, B. Li, Y. Yang, H. Lou, M. Teng, X. Zhou, Y. Ma, and B. Shi, “Eventaid: Benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,”TP AMI, 2025. 8

work page 2025

[63] [63]

Photosequencing of motion blur using short and long exposures,

V . Rengarajan, S. Zhao, R. Zhen, J. Glotzbach, H. Sheikh, and A. C. Sankaranarayanan, “Photosequencing of motion blur using short and long exposures,” inProceedings of the IEEE/CVF CVPR Workshops, 2020, pp. 510–511. 8, 11

work page 2020

[64] [64]

Video to events: Recycling video datasets for event cameras. in 2020 ieee,

D. Gehrig, M. Gehrig, J. Hidalgo-Carrió, and D. Scaramuzza, “Video to events: Recycling video datasets for event cameras. in 2020 ieee,” inCVF CVPR, vol. 6, 2020, p. 3. 9

work page 2020

[65] [65]

Basicvsr++: Improving video super-resolution with enhanced propagation and alignment,

K. C. Chan, S. Zhou, X. Xu, and C. C. Loy, “Basicvsr++: Improving video super-resolution with enhanced propagation and alignment,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 5972–5981. 10

work page 2022

[66] [66]

Mbllen: Low-light image/video enhancement using cnns

F. Lv, F. Lu, J. Wu, and C. Lim, “Mbllen: Low-light image/video enhancement using cnns.” inBMVC, vol. 220, no. 1, 2018, p. 4. 12

work page 2018

[67] [67]

Llnet: A deep autoen- coder approach to natural low-light image enhancement,

K. G. Lore, A. Akintayo, and S. Sarkar, “Llnet: A deep autoen- coder approach to natural low-light image enhancement,”Pattern Recognition, vol. 61, pp. 650–662, 2017. 12

work page 2017

[68] [68]

Iterative prompt learning for unsupervised backlit image enhancement,

Z. Liang, C. Li, S. Zhou, R. Feng, and C. C. Loy, “Iterative prompt learning for unsupervised backlit image enhancement,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 8094–8103. 13

work page 2023

[69] [69]

Segment anything,

A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, T. Xiao, S. Whitehead, A. C. Berg, W.-Y. Loet al., “Segment anything,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 4015–4026. 16

work page 2023

[70] [70]

Depth anything v2,

L. Yang, B. Kang, Z. Huang, Z. Zhao, X. Xu, J. Feng, and H. Zhao, “Depth anything v2,”NeurIPS, pp. 21 875–21 911, 2024. 16

work page 2024

[71] [71]

Slam3r: Real-time dense scene reconstruction from monocular rgb videos,

Y. Liu, S. Dong, S. Wang, Y. Yin, Y. Yang, Q. Fan, and B. Chen, “Slam3r: Real-time dense scene reconstruction from monocular rgb videos,”Proceedings of the IEEE/CVF CVPR, 2025. 16

work page 2025

[72] [72]

Learning stereo from single images,

J. Watson, O. M. Aodha, D. Turmukhambetov, G. J. Brostow, and M. Firman, “Learning stereo from single images,” inECCV. Springer, 2020, pp. 722–740. 15

work page 2020

[73] [73]

Stereo anything: Unifying stereo matching with large-scale mixed data,

X. Guo, C. Zhang, Y. Zhang, D. Nie, R. Wang, W. Zheng, M. Poggi, and L. Chen, “Stereo anything: Unifying stereo matching with large-scale mixed data,”arXiv preprint arXiv:2411.14053, 2024. 15

work page arXiv 2024

[74] [74]

Depth assisted full resolution network for single image-based view synthesis,

X. Cun, F. Xu, C.-M. Pun, and H. Gao, “Depth assisted full resolution network for single image-based view synthesis,” in ACM SIGGRAPH 2018 Posters, 2018, pp. 1–2. 15

work page 2018

[75] [75]

Efficient visual state space model for image deblurring,

L. Kong, J. Dong, J. Tang, M.-H. Yang, and J. Pan, “Efficient visual state space model for image deblurring,” inCVPR, 2025, pp. 12 710– 12 719. 16

work page 2025

[76] [76]

Adarevd: Adaptive patch exiting reversible decoder pushes the limit of image deblurring,

X. Mao, Q. Li, and Y. Wang, “Adarevd: Adaptive patch exiting reversible decoder pushes the limit of image deblurring,” inCVPR, 2024, pp. 25 681–25 690. 16

work page 2024

[77] [77]

Efficient frequency domain-based transformers for high-quality image deblurring,

L. Kong, J. Dong, J. Ge, M. Li, and J. Pan, “Efficient frequency domain-based transformers for high-quality image deblurring,” in CVPR, 2023, pp. 5886–5895. 16

work page 2023

[78] [78]

Deblurdiff: Real-word image deblurring with generative diffusion models,

L. Kong, D. Zou, F. L. Wang, J. Ren, X. Wu, J. Dong, J. Panet al., “Deblurdiff: Real-word image deblurring with generative diffusion models,” inNeurIPS, 2025. 16

work page 2025

[79] [79]

Dense reconstruction from monocular slam with fusion of sparse map-points and cnn-inferred depth,

X. Ji, X. Ye, H. Xu, and H. Li, “Dense reconstruction from monocular slam with fusion of sparse map-points and cnn-inferred depth,” in ICME, 2018, pp. 1–6. 16

work page 2018

[80] [80]

Drm-slam: Towards dense reconstruction of monocular slam with scene depth fusion,

X. Ye, X. Ji, B. Sun, S. Chen, Z. Wang, and H. Li, “Drm-slam: Towards dense reconstruction of monocular slam with scene depth fusion,”Neurocomputing, vol. 396, pp. 76–91, 2020. 16 TABLE 17:Specifications of our triaxial imaging system. The deadtime between two adjacent high speed frames is extremely short and thus can be ignored. Device RS camera GS camer...

work page 2020