A recurrent E2F architecture with selective fusion and lightweight attention reports competitive reconstruction quality at lower model complexity on standard benchmarks.
UniE2F: A Unified Diffusion Framework for Event-to-Frame Reconstruction with Video Foundation Models
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Event cameras excel at high-speed, low-power, and high-dynamic-range scene perception. However, as they fundamentally record only relative intensity changes rather than absolute intensity, the resulting data streams suffer from a significant loss of spatial information and static texture details. In this paper, we address this limitation by leveraging the generative prior of a pre-trained video diffusion model to reconstruct high-fidelity video frames from sparse event data. Specifically, we first establish a baseline model by directly applying event data as a condition to synthesize videos. Then, based on the physical correlation between the event stream and video frames, we further introduce the event-based inter-frame residual guidance to enhance the accuracy of video frame reconstruction. Furthermore, we extend our method to video frame interpolation and prediction in a zero-shot manner by modulating the reverse diffusion sampling process, thereby creating a unified event-to-frame reconstruction framework. Experimental results on real-world and synthetic datasets demonstrate that our method significantly outperforms previous approaches both quantitatively and qualitatively. We also refer the reviewers to the video demo contained in the supplementary material for video results. The code will be publicly available at https://github.com/CS-GangXu/UniE2F.
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Computation-Aware Event-to-Frame Reconstruction via Selective Attention
A recurrent E2F architecture with selective fusion and lightweight attention reports competitive reconstruction quality at lower model complexity on standard benchmarks.