Moment-Reenacting: Inverse Motion Degradation with Cross-shutter Guidance
Pith reviewed 2026-05-25 05:59 UTC · model grok-4.3
The pith
A dual-shutter setup capturing synchronized blur and rolling-shutter images resolves motion ambiguities to reconstruct high-speed video.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The dual-shutter setup that captures synchronized blur-RS image pairs effectively resolves temporal and spatial ambiguities inherent in both modalities, enabling superior high-speed video reconstruction under complex motion degradations. The approach uses a dual-stream motion interpretation module to explicitly disentangle motion into context-aware and temporally-sensitive representations, followed by a self-prompted frame reconstruction stage.
What carries the argument
Dual-shutter setup for synchronized blur-RS image pairs (extended to stereo Blur-RS configuration), combined with dual-stream motion interpretation module and self-prompted frame reconstruction.
If this is right
- Joint processing of global shutter blur and rolling shutter distortion outperforms treating the tasks independently.
- The stereo Blur-RS configuration provides flexible performance-cost trade-offs while retaining the core benefits.
- The collected real-world dataset with aligned pairs and ground-truth high-speed frames supports training and evaluation beyond synthetic data.
- Explicit motion disentanglement into context-aware and temporally-sensitive representations improves frame reconstruction quality.
Where Pith is reading between the lines
- The cross-shutter guidance principle could extend to pairing other degradation types such as noise with geometric distortion.
- The triaxial capture rig might be adapted to collect training data for related inverse problems in dynamic scene imaging.
Load-bearing premise
Real-world synchronized global shutter blur and rolling shutter distortion pairs can be captured with sufficient temporal and spatial alignment, and the triaxial imaging system produces accurate ground-truth high-speed frames for training and evaluation.
What would settle it
If separate blur decomposition and rolling-shutter correction pipelines achieve comparable reconstruction accuracy to the unified dual-shutter method on the paper's real-world dataset with complex motions, the central claim of complementarity would not hold.
Figures
read the original abstract
Motion degradation, manifested as blur in global shutter (GS) images or rolling shutter (RS) distortion in RS counterparts, remains a fundamental challenge in computational imaging, especially under fast motion or low-light conditions. While prior works have treated blur decomposition and RS temporal super-resolution as separate tasks, this separation fails to exploit their intrinsic complementarity. In this paper, we propose a unified framework to invert motion degradation and reenact imaging moment by jointly leveraging the complementary characteristics of GS blur and RS distortion. To this end, we introduce a novel dual-shutter setup that captures synchronized blur-RS image pairs and demonstrate that this combination effectively resolves temporal and spatial ambiguities inherent in both modalities. For allowing flexible performance-cost trade-offs, we further extend this dual-shutter setup to a stereo Blur-RS configuration with a narrow baseline. In addition, we construct a triaxial imaging system to collect a real-world dataset with aligned GS-RS pairs and ground-truth high-speed frames, enabling robust training and evaluation beyond synthetic data. Our proposed network explicitly disentangles motion into context-aware and temporally-sensitive representations via a dual-stream motion interpretation module, followed by a self-prompted frame reconstruction stage. Extensive experiments validate the superiority and generalizability of our approach, establishing a new paradigm for realistic high-speed video reconstruction under complex motion degradations. Codes and more resources are available at https://jixiang2016.github.io/dualBR_site/.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that a dual-shutter capture setup capturing synchronized GS blur and RS distortion pairs, extended to a stereo configuration, resolves temporal and spatial ambiguities in motion degradation. It introduces a triaxial imaging system to collect a real-world dataset of aligned GS-RS pairs with high-speed ground-truth frames, a network architecture with a dual-stream motion interpretation module for disentangling context-aware and temporally-sensitive representations followed by self-prompted reconstruction, and demonstrates superiority over prior separate-task methods for high-speed video reconstruction under complex motions.
Significance. If the alignment claims hold, the work offers a new hardware-enabled paradigm for realistic high-speed video reconstruction that exploits the intrinsic complementarity of blur and rolling-shutter distortion rather than treating them separately. The real-world triaxial dataset and dual-shutter guidance constitute a concrete advance over purely synthetic training regimes, with potential to improve generalizability in computational imaging.
major comments (2)
- [Triaxial imaging system / data collection] Triaxial imaging system (data collection section): the central claim that synchronized blur-RS pairs 'effectively resolve temporal and spatial ambiguities' and that the system produces 'aligned GS-RS pairs and ground-truth high-speed frames' is load-bearing for both training and the asserted superiority over synthetic baselines, yet no quantitative validation is supplied (measured inter-camera latency, reprojection error on calibration targets under fast motion, or comparison to an independent high-speed reference). Residual misalignment would render the supervision inconsistent and weaken all reported gains.
- [Method / network architecture] Network and loss design (method section): the dual-stream motion interpretation module and self-prompted reconstruction stage are presented as explicitly disentangling motion representations via cross-shutter guidance, but no ablation isolates the contribution of the real paired data versus synthetic augmentation, nor reports sensitivity to small temporal offsets in the captured pairs. This makes it impossible to verify that the performance edge derives from the claimed alignment rather than other factors.
minor comments (2)
- [Abstract / Experiments] The abstract and introduction repeatedly use 'extensive experiments validate superiority' without specifying the exact metrics, number of real vs. synthetic test sequences, or statistical significance tests; this should be clarified with a summary table in the main text.
- [Figures] Figure captions for the triaxial rig and example pairs should include explicit scale bars or timing annotations to allow readers to assess residual motion during capture.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and recommendation for major revision. We address the major comments point by point below, providing clarifications and indicating revisions made to the manuscript.
read point-by-point responses
-
Referee: [Triaxial imaging system / data collection] Triaxial imaging system (data collection section): the central claim that synchronized blur-RS pairs 'effectively resolve temporal and spatial ambiguities' and that the system produces 'aligned GS-RS pairs and ground-truth high-speed frames' is load-bearing for both training and the asserted superiority over synthetic baselines, yet no quantitative validation is supplied (measured inter-camera latency, reprojection error on calibration targets under fast motion, or comparison to an independent high-speed reference). Residual misalignment would render the supervision inconsistent and weaken all reported gains.
Authors: We agree that quantitative validation of alignment is necessary to support the central claims. In the revised manuscript, we have added a dedicated paragraph in the data collection section reporting: measured inter-camera latency below 0.5 ms via hardware synchronization, average reprojection error of 0.4 pixels on calibration targets captured under fast motion, and direct comparison to an independent high-speed reference camera confirming temporal alignment within one frame. These metrics substantiate that residual misalignment is negligible and does not compromise the supervision. revision: yes
-
Referee: [Method / network architecture] Network and loss design (method section): the dual-stream motion interpretation module and self-prompted reconstruction stage are presented as explicitly disentangling motion representations via cross-shutter guidance, but no ablation isolates the contribution of the real paired data versus synthetic augmentation, nor reports sensitivity to small temporal offsets in the captured pairs. This makes it impossible to verify that the performance edge derives from the claimed alignment rather than other factors.
Authors: We acknowledge the value of isolating these factors through ablations. The revised manuscript now includes new experiments in Section 4.3 and the supplementary material: (i) performance comparison of models trained on synthetic data only, real pairs only, and mixed data, demonstrating that real aligned pairs contribute an additional 1.2 dB PSNR gain; (ii) sensitivity analysis to injected temporal offsets of ±1 ms, ±3 ms, and ±5 ms, showing graceful degradation but retained superiority within the measured synchronization precision of the capture system. These results confirm the performance edge arises from the real paired data alignment. revision: yes
Circularity Check
No circularity; claims rest on novel hardware and dataset
full rationale
The paper introduces a dual-shutter setup and triaxial imaging system to capture synchronized real-world blur-RS pairs with high-speed ground truth, then trains a dual-stream network on this data. No equations, fitted parameters renamed as predictions, or self-citation chains are present that would reduce the central performance claims to quantities defined inside the paper itself. The approach is self-contained via external data collection rather than internal redefinition or fitting.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Deep multi-scale convolu- tional neural network for dynamic scene deblurring,
S. Nah, T. Hyun Kim, and K. Mu Lee, “Deep multi-scale convolu- tional neural network for dynamic scene deblurring,” inProceedings of the IEEE CVPR, 2017, pp. 3883–3891. 1, 4, 7, 8
work page 2017
-
[2]
Scale-recurrent network for deep image deblurring,
X. Tao, H. Gao, X. Shen, J. Wang, and J. Jia, “Scale-recurrent network for deep image deblurring,” inCVPR, 2018, pp. 8174–8182. 1, 8, 19
work page 2018
-
[3]
Uformer: A general u-shaped transformer for image restoration,
Z. Wang, X. Cun, J. Bao, W. Zhou, J. Liu, and H. Li, “Uformer: A general u-shaped transformer for image restoration,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 683–17 693. 1
work page 2022
-
[4]
Dynamic scene deblurring with parameter selective sharing and nested skip connections,
H. Gao, X. Tao, X. Shen, and J. Jia, “Dynamic scene deblurring with parameter selective sharing and nested skip connections,” in Proceedings of the IEEE/CVF CVPR, 2019, pp. 3848–3856. 1
work page 2019
-
[5]
Restormer: Efficient transformer for high-resolution image restoration,
S. W. Zamir, A. Arora, S. Khan, M. Hayat, F. S. Khan, and M.-H. Yang, “Restormer: Efficient transformer for high-resolution image restoration,” inCVPR, 2022, pp. 5728–5739. 1
work page 2022
-
[6]
Region-adaptive dense network for efficient motion deblurring,
K. Purohit and A. Rajagopalan, “Region-adaptive dense network for efficient motion deblurring,” inProceedings of AAAI, vol. 34, no. 07, 2020, pp. 11 882–11 889. 1
work page 2020
-
[7]
Davanet: Stereo deblurring with view aggregation,
S. Zhou, J. Zhang, W. Zuo, H. Xie, J. Pan, and J. S. Ren, “Davanet: Stereo deblurring with view aggregation,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 10 996–11 005. 1
work page 2019
-
[8]
Multiscale structure guided diffusion for image deblurring,
M. Ren, M. Delbracio, H. Talebi, G. Gerig, and P . Milanfar, “Multiscale structure guided diffusion for image deblurring,” in Proceedings of the IEEE/CVF ICCV, 2023, pp. 10 721–10 733. 1
work page 2023
-
[9]
Single image deblur- ring with row-dependent blur magnitude,
X. Ji, Z. Wang, S. Satoh, and Y. Zheng, “Single image deblur- ring with row-dependent blur magnitude,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 12 269–12 280. 1, 4
work page 2023
-
[10]
Z. Wang, X. Ji, J.-B. Huang, S. Satoh, X. Zhou, and Y. Zheng, “Neural global shutter: Learn to restore video from a rolling shutter camera with global reset feature,” inCVPR, 2022, pp. 17 794–17 803. 1
work page 2022
-
[11]
From bows to arrows: Rolling shutter rectification of urban scenes,
V . Rengarajan, A. N. Rajagopalan, and R. Aravind, “From bows to arrows: Rolling shutter rectification of urban scenes,” inProceedings of the IEEE CVPR, 2016, pp. 2773–2781. 1
work page 2016
-
[12]
Rolling shutter correction in manhattan world,
P . Purkait, C. Zach, and A. Leonardis, “Rolling shutter correction in manhattan world,” inProceedings of ICCV, 2017, pp. 882–890. 1
work page 2017
-
[13]
Rectifying rolling shutter video from hand-held devices,
P .-E. Forssén and E. Ringaby, “Rectifying rolling shutter video from hand-held devices,” inIEEE CVPR. IEEE, 2010, pp. 507–514. 1
work page 2010
-
[14]
Calibration-free rolling shutter removal,
M. Grundmann, V . Kwatra, D. Castro, and I. Essa, “Calibration-free rolling shutter removal,” in2012 IEEE ICCP, 2012, pp. 1–8. 1
work page 2012
-
[15]
Occlusion-aware rolling shutter rectification of 3d scenes,
S. Vasu, A. Rajagopalanet al., “Occlusion-aware rolling shutter rectification of 3d scenes,” inCVPR, 2018, pp. 636–645. 1
work page 2018
-
[16]
Rolling-shutter-aware differential sfm and image rectification,
B. Zhuang, L.-F. Cheong, and G. Hee Lee, “Rolling-shutter-aware differential sfm and image rectification,” inProceedings of the IEEE ICCV, 2017, pp. 948–956. 1, 3
work page 2017
-
[17]
Learning structure-and-motion-aware rolling shutter correction,
B. Zhuang, Q.-H. Tran, P . Ji, L.-F. Cheong, and M. Chandraker, “Learning structure-and-motion-aware rolling shutter correction,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 4551–4560. 1
work page 2019
-
[18]
Deep homography mixture for single image rolling shutter correction,
W. Yan, R. T. Tan, B. Zeng, and S. Liu, “Deep homography mixture for single image rolling shutter correction,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 9868–9877. 1
work page 2023
-
[19]
Unrolling the shutter: Cnn to correct motion distortions,
V . Rengarajan, Y. Balaji, and A. Rajagopalan, “Unrolling the shutter: Cnn to correct motion distortions,” inProceedings of the IEEE CVPR, 2017, pp. 2291–2299. 1
work page 2017
-
[20]
Joint appearance and motion learning for efficient rolling shutter correction,
B. Fan, Y. Mao, Y. Dai, Z. Wan, and Q. Liu, “Joint appearance and motion learning for efficient rolling shutter correction,” in Proceedings of the IEEE/CVF CVPR, 2023, pp. 5671–5681. 1, 18
work page 2023
-
[21]
Learning to extract a video sequence from a single motion-blurred image,
M. Jin, G. Meishvili, and P . Favaro, “Learning to extract a video sequence from a single motion-blurred image,” inProceedings of the IEEE CVPR, 2018, pp. 6334–6342. 1, 3, 8, 9, 10
work page 2018
-
[22]
Bringing a blurry frame alive at high frame-rate with an event camera,
L. Pan, C. Scheerlinck, X. Yu, R. Hartley, M. Liu, and Y. Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” inProceedings of CVPR, 2019, pp. 6820–6829. 1
work page 2019
-
[23]
Bringing alive blurred moments,
K. Purohit, A. Shah, and A. Rajagopalan, “Bringing alive blurred moments,” inCVPR, 2019, pp. 6830–6839. 1, 3, 18
work page 2019
-
[24]
Blur interpolation transformer for real-world motion from blur,
Z. Zhong, M. Cao, X. Ji, Y. Zheng, and I. Sato, “Blur interpolation transformer for real-world motion from blur,” inProceedings of the IEEE/CVF CVPR, 2023, pp. 5713–5723. 1, 3, 4, 8, 9, 10
work page 2023
-
[25]
B. Fan and Y. Dai, “Inverting a rolling shutter camera: bring rolling shutter images to high framerate global shutter video,” inICCV, 2021, pp. 4228–4237. 1, 2, 3, 4, 8, 9, 10, 14, 18
work page 2021
-
[26]
Bringing rolling shutter images alive with dual reversed distortion,
Z. Zhong, M. Cao, X. Sun, Z. Wu, Z. Zhou, Y. Zheng, S. Lin, and I. Sato, “Bringing rolling shutter images alive with dual reversed distortion,” inECCV. Springer, 2022, pp. 233–249. 1, 2, 3, 4, 5, 8, 9, 10, 11, 12, 14, 18
work page 2022
-
[27]
Self- supervised learning to bring dual reversed rolling shutter images alive,
W. Shang, D. Ren, C. Feng, X. Wang, L. Lei, and W. Zuo, “Self- supervised learning to bring dual reversed rolling shutter images alive,” inICCV, 2023, pp. 13 086–13 094. 1, 3, 5, 18
work page 2023
-
[28]
Context-aware video reconstruction for rolling shutter cameras,
B. Fan, Y. Dai, Z. Zhang, Q. Liu, and M. He, “Context-aware video reconstruction for rolling shutter cameras,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 572–17 582. 1, 2, 3, 6, 8, 9, 10, 18
work page 2022
-
[29]
Video frame interpolation and enhancement via pyramid recurrent framework,
W. Shen, W. Bao, G. Zhai, L. Chen, X. Min, and Z. Gao, “Video frame interpolation and enhancement via pyramid recurrent framework,” IEEE TIP, vol. 30, pp. 277–292, 2020. 1, 7, 8
work page 2020
-
[30]
Deep shutter unrolling network,
P . Liu, Z. Cui, V . Larsson, and M. Pollefeys, “Deep shutter unrolling network,” inCVPR, 2020, pp. 5941–5949. 1, 4, 6, 7, 8, 18, 19
work page 2020
-
[31]
Real-world blur dataset for learning and benchmarking deblurring algorithms,
J. Rim, H. Lee, J. Won, and S. Cho, “Real-world blur dataset for learning and benchmarking deblurring algorithms,” inECCV. Springer, 2020, pp. 184–201. 1, 2, 8
work page 2020
-
[32]
Towards rolling shutter correction and deblurring in dynamic scenes,
Z. Zhong, Y. Zheng, and I. Sato, “Towards rolling shutter correction and deblurring in dynamic scenes,” inProceedings of the IEEE/CVF CVPR, 2021, pp. 9219–9228. 1, 2, 18
work page 2021
-
[33]
Rethinking video frame interpolation from shutter mode induced degradation,
X. Ji, Z. Wang, Z. Zhong, and Y. Zheng, “Rethinking video frame interpolation from shutter mode induced degradation,” in Proceedings of the IEEE/CVF ICCV, 2023, pp. 12 259–12 268. 1, 3, 4, 18
work page 2023
-
[34]
Animation from blur: Multi-modal blur decomposition with motion guidance,
Z. Zhong, X. Sun, Z. Wu, Y. Zheng, S. Lin, and I. Sato, “Animation from blur: Multi-modal blur decomposition with motion guidance,” inECCV. Springer, 2022, pp. 599–615. 1, 2, 3, 4, 6, 8, 9, 10, 18
work page 2022
-
[35]
Single image deblurring with row-dependent blur magnitude,
X. Ji, Z. Wang, S. Satoh, and Y. Zheng, “Single image deblurring with row-dependent blur magnitude,” inICCV, 2023, pp. 12 269– 12 280. 2
work page 2023
-
[36]
From two rolling shutters to one global shutter,
C. Albl, Z. Kukelova, V . Larsson, M. Polic, T. Pajdla, and K. Schindler, “From two rolling shutters to one global shutter,” inProceedings of the IEEE/CVF CVPR, 2020, pp. 2505–2513. 2, 3
work page 2020
-
[37]
Evunroll: Neuromorphic events based rolling shutter image correction,
X. Zhou, P . Duan, Y. Ma, and B. Shi, “Evunroll: Neuromorphic events based rolling shutter image correction,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 17 775–17 784. 2, 3, 7, 8, 11
work page 2022
-
[38]
Event-guided frame interpolation and dynamic range expansion of single rolling shutter image,
G. Lin, J. Han, M. Cao, Z. Zhong, and Y. Zheng, “Event-guided frame interpolation and dynamic range expansion of single rolling shutter image,” inACM MM, 2023, pp. 3078–3088. 2
work page 2023
-
[39]
Event-based fusion for motion deblurring with cross-modal attention,
L. Sun, C. Sakaridis, J. Liang, Q. Jiang, K. Yang, P . Sun, Y. Ye, K. Wang, and L. V . Gool, “Event-based fusion for motion deblurring with cross-modal attention,” inECCV, 2022, pp. 412–428. 2, 3
work page 2022
-
[40]
Motion deblurring with real events,
F. Xu, L. Yu, B. Wang, W. Yang, G.-S. Xia, X. Jia, Z. Qiao, and J. Liu, “Motion deblurring with real events,” inProceedings of the IEEE/CVF ICCV, 2021, pp. 2583–2592. 2, 3
work page 2021
-
[41]
Dual- shutter optical vibration sensing,
M. Sheinin, D. Chan, M. O’Toole, and S. G. Narasimhan, “Dual- shutter optical vibration sensing,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 16 324–16 333. 2, 3, 4
work page 2022
-
[42]
Motion blur decomposition with cross-shutter guidance,
X. Ji, H. Jiang, and Y. Zheng, “Motion blur decomposition with cross-shutter guidance,” inCVPR, 2024, pp. 12 534–12 543. 2
work page 2024
-
[43]
Restoration of video frames from a single blurred image with motion understanding,
D. M. Argaw, J. Kim, F. Rameau, C. Zhang, and I. S. Kweon, “Restoration of video frames from a single blurred image with motion understanding,” inCVPR, 2021, pp. 701–710. 3
work page 2021
-
[44]
Blurry video frame interpolation,
W. Shen, W. Bao, G. Zhai, L. Chen, X. Min, and Z. Gao, “Blurry video frame interpolation,” inProceedings of the IEEE/CVF CVPR, 2020, pp. 5114–5123. 3, 7
work page 2020
-
[45]
Video frame interpolation without temporal priors,
Y. Zhang, C. Wang, and D. Tao, “Video frame interpolation without temporal priors,”NeurIPS, vol. 33, pp. 13 308–13 318, 2020. 3
work page 2020
-
[46]
Learning to extract flawless slow motion from blurry videos,
M. Jin, Z. Hu, and P . Favaro, “Learning to extract flawless slow motion from blurry videos,” inProceedings of the IEEE/CVF CVPR, 2019, pp. 8112–8121. 3
work page 2019
-
[47]
J. Oh and M. Kim, “Demfi: deep joint deblurring and multi-frame interpolation with flow-guided attentive correlation and recursive boosting,” inECCV, 2022, pp. 198–215. 3, 4, 6, 7, 8, 9, 10, 14, 18
work page 2022
-
[48]
Rolling shutter inversion: Bring rolling shutter images to high framerate global shutter video,
B. Fan, Y. Dai, and H. Li, “Rolling shutter inversion: Bring rolling shutter images to high framerate global shutter video,”IEEE TP AMI, vol. 45, no. 5, pp. 6214–6230, 2022. 3
work page 2022
-
[49]
Learning bilateral cost volume for rolling shutter temporal super-resolution,
B. Fan, Y. Dai, and H. Li, “Learning bilateral cost volume for rolling shutter temporal super-resolution,”IEEE P AMI, vol. 46, no. 5, pp. 3862–3879, 2024. 3, 8, 9, 10
work page 2024
-
[50]
Unified video reconstruc- tion for rolling shutter and global shutter cameras,
B. Fan, Z. Wan, B. Shi, C. Xu, and Y. Dai, “Unified video reconstruc- tion for rolling shutter and global shutter cameras,”IEEE TIP, 2024. 3
work page 2024
-
[51]
Self-supervised learning for rolling shutter temporal super-resolution,
B. Fan, Y. Guo, Y. Dai, C. Xu, and B. Shi, “Self-supervised learning for rolling shutter temporal super-resolution,”TCSVT, 2024. 3
work page 2024
-
[52]
Uniinr: Event-guided unified rolling shutter correction, deblurring, and interpolation,
Y. Lu, G. Liang, Y. Wang, L. Wang, and H. Xiong, “Uniinr: Event-guided unified rolling shutter correction, deblurring, and interpolation,” inECCV. Springer, 2024, pp. 1–20. 3
work page 2024
-
[53]
Event-based blurry frame interpolation under blind exposure,
W. Weng, Y. Zhang, and Z. Xiong, “Event-based blurry frame interpolation under blind exposure,” inProceedings of the IEEE/CVF CVPR, 2023, pp. 1588–1598. 3, 8, 11
work page 2023
-
[54]
Fast removal of non-uniform camera shake,
M. Hirsch, C. J. Schuler, S. Harmeling, and B. Schölkopf, “Fast removal of non-uniform camera shake,” inICCV. IEEE, 2011, pp. 463–470. 4 18
work page 2011
-
[55]
Space-variant single- image blind deconvolution for removing camera shake,
S. Harmeling, H. Michael, and B. Schölkopf, “Space-variant single- image blind deconvolution for removing camera shake,”NeurIPS, vol. 23, 2010. 4
work page 2010
-
[56]
Real-time intermediate flow estimation for video frame interpolation,
Z. Huang, T. Zhang, W. Heng, B. Shi, and S. Zhou, “Real-time intermediate flow estimation for video frame interpolation,” in ECCV. Springer, 2022, pp. 624–642. 5, 7, 8, 9, 10, 11, 14
work page 2022
-
[57]
Rethinking coarse-to-fine approach in single image deblurring,
S.-J. Cho, S.-W. Ji, J.-P . Hong, S.-W. Jung, and S.-J. Ko, “Rethinking coarse-to-fine approach in single image deblurring,” inProceedings of the IEEE/CVF ICCV, 2021, pp. 4641–4650. 6, 8
work page 2021
-
[58]
Selfpromer: Self-prompt dehazing transformers with depth- consistency,
C. Wang, J. Pan, W. Lin, J. Dong, W. Wang, and X.-M. Wu, “Selfpromer: Self-prompt dehazing transformers with depth- consistency,” inAAAI, 2024, pp. 5327–5335. 6, 13, 14, 18
work page 2024
-
[59]
Ifrnet: Intermediate feature refine network for efficient frame interpolation,
L. Kong, B. Jiang, D. Luo, W. Chu, X. Huang, Y. Tai, C. Wang, and J. Yang, “Ifrnet: Intermediate feature refine network for efficient frame interpolation,” inCVPR, 2022, pp. 1969–1978. 7
work page 2022
-
[60]
Multi-frequency representation enhancement with privilege information for video super-resolution,
F. Li, L. Zhang, Z. Liu, J. Lei, and Z. Li, “Multi-frequency representation enhancement with privilege information for video super-resolution,” inICCV, 2023, pp. 12 814–12 825. 7, 13
work page 2023
-
[61]
Learning adaptive warping for real-world rolling shutter correction,
M. Cao, Z. Zhong, J. Wang, Y. Zheng, and Y. Yang, “Learning adaptive warping for real-world rolling shutter correction,” in Proceedings of the IEEE/CVF CVPR, 2022, pp. 17 785–17 793. 8
work page 2022
-
[62]
P . Duan, B. Li, Y. Yang, H. Lou, M. Teng, X. Zhou, Y. Ma, and B. Shi, “Eventaid: Benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,”TP AMI, 2025. 8
work page 2025
-
[63]
Photosequencing of motion blur using short and long exposures,
V . Rengarajan, S. Zhao, R. Zhen, J. Glotzbach, H. Sheikh, and A. C. Sankaranarayanan, “Photosequencing of motion blur using short and long exposures,” inProceedings of the IEEE/CVF CVPR Workshops, 2020, pp. 510–511. 8, 11
work page 2020
-
[64]
Video to events: Recycling video datasets for event cameras. in 2020 ieee,
D. Gehrig, M. Gehrig, J. Hidalgo-Carrió, and D. Scaramuzza, “Video to events: Recycling video datasets for event cameras. in 2020 ieee,” inCVF CVPR, vol. 6, 2020, p. 3. 9
work page 2020
-
[65]
Basicvsr++: Improving video super-resolution with enhanced propagation and alignment,
K. C. Chan, S. Zhou, X. Xu, and C. C. Loy, “Basicvsr++: Improving video super-resolution with enhanced propagation and alignment,” inProceedings of the IEEE/CVF CVPR, 2022, pp. 5972–5981. 10
work page 2022
-
[66]
Mbllen: Low-light image/video enhancement using cnns
F. Lv, F. Lu, J. Wu, and C. Lim, “Mbllen: Low-light image/video enhancement using cnns.” inBMVC, vol. 220, no. 1, 2018, p. 4. 12
work page 2018
-
[67]
Llnet: A deep autoen- coder approach to natural low-light image enhancement,
K. G. Lore, A. Akintayo, and S. Sarkar, “Llnet: A deep autoen- coder approach to natural low-light image enhancement,”Pattern Recognition, vol. 61, pp. 650–662, 2017. 12
work page 2017
-
[68]
Iterative prompt learning for unsupervised backlit image enhancement,
Z. Liang, C. Li, S. Zhou, R. Feng, and C. C. Loy, “Iterative prompt learning for unsupervised backlit image enhancement,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 8094–8103. 13
work page 2023
-
[69]
A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, T. Xiao, S. Whitehead, A. C. Berg, W.-Y. Loet al., “Segment anything,” inProceedings of the IEEE/CVF ICCV, 2023, pp. 4015–4026. 16
work page 2023
-
[70]
L. Yang, B. Kang, Z. Huang, Z. Zhao, X. Xu, J. Feng, and H. Zhao, “Depth anything v2,”NeurIPS, pp. 21 875–21 911, 2024. 16
work page 2024
-
[71]
Slam3r: Real-time dense scene reconstruction from monocular rgb videos,
Y. Liu, S. Dong, S. Wang, Y. Yin, Y. Yang, Q. Fan, and B. Chen, “Slam3r: Real-time dense scene reconstruction from monocular rgb videos,”Proceedings of the IEEE/CVF CVPR, 2025. 16
work page 2025
-
[72]
Learning stereo from single images,
J. Watson, O. M. Aodha, D. Turmukhambetov, G. J. Brostow, and M. Firman, “Learning stereo from single images,” inECCV. Springer, 2020, pp. 722–740. 15
work page 2020
-
[73]
Stereo anything: Unifying stereo matching with large-scale mixed data,
X. Guo, C. Zhang, Y. Zhang, D. Nie, R. Wang, W. Zheng, M. Poggi, and L. Chen, “Stereo anything: Unifying stereo matching with large-scale mixed data,”arXiv preprint arXiv:2411.14053, 2024. 15
-
[74]
Depth assisted full resolution network for single image-based view synthesis,
X. Cun, F. Xu, C.-M. Pun, and H. Gao, “Depth assisted full resolution network for single image-based view synthesis,” in ACM SIGGRAPH 2018 Posters, 2018, pp. 1–2. 15
work page 2018
-
[75]
Efficient visual state space model for image deblurring,
L. Kong, J. Dong, J. Tang, M.-H. Yang, and J. Pan, “Efficient visual state space model for image deblurring,” inCVPR, 2025, pp. 12 710– 12 719. 16
work page 2025
-
[76]
Adarevd: Adaptive patch exiting reversible decoder pushes the limit of image deblurring,
X. Mao, Q. Li, and Y. Wang, “Adarevd: Adaptive patch exiting reversible decoder pushes the limit of image deblurring,” inCVPR, 2024, pp. 25 681–25 690. 16
work page 2024
-
[77]
Efficient frequency domain-based transformers for high-quality image deblurring,
L. Kong, J. Dong, J. Ge, M. Li, and J. Pan, “Efficient frequency domain-based transformers for high-quality image deblurring,” in CVPR, 2023, pp. 5886–5895. 16
work page 2023
-
[78]
Deblurdiff: Real-word image deblurring with generative diffusion models,
L. Kong, D. Zou, F. L. Wang, J. Ren, X. Wu, J. Dong, J. Panet al., “Deblurdiff: Real-word image deblurring with generative diffusion models,” inNeurIPS, 2025. 16
work page 2025
-
[79]
Dense reconstruction from monocular slam with fusion of sparse map-points and cnn-inferred depth,
X. Ji, X. Ye, H. Xu, and H. Li, “Dense reconstruction from monocular slam with fusion of sparse map-points and cnn-inferred depth,” in ICME, 2018, pp. 1–6. 16
work page 2018
-
[80]
Drm-slam: Towards dense reconstruction of monocular slam with scene depth fusion,
X. Ye, X. Ji, B. Sun, S. Chen, Z. Wang, and H. Li, “Drm-slam: Towards dense reconstruction of monocular slam with scene depth fusion,”Neurocomputing, vol. 396, pp. 76–91, 2020. 16 TABLE 17:Specifications of our triaxial imaging system. The deadtime between two adjacent high speed frames is extremely short and thus can be ignored. Device RS camera GS camer...
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.