E2GS: Event Enhanced Gaussian Splatting
Pith reviewed 2026-05-24 00:26 UTC · model grok-4.3
The pith
Fusing event camera data into Gaussian Splatting corrects blur and enables high-quality novel view synthesis.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
E2GS incorporates event data into Gaussian Splatting to effectively utilize both blurry images and event data, significantly improving image deblurring and producing high-quality novel view synthesis with faster training and rendering speed of 140 FPS, as shown on synthetic and real-world datasets.
What carries the argument
E2GS, the integration of event data directly into the Gaussian Splatting optimization pipeline for blur correction.
If this is right
- Blurry images can be used for high-quality 3D reconstruction when paired with event data.
- Novel view synthesis achieves better visual quality than using images alone.
- Training and rendering speeds stay high at around 140 FPS.
- The method applies to both synthetic and real datasets with visually appealing results.
Where Pith is reading between the lines
- Extending the fusion to other event-based signals could further enhance reconstruction in low-light or high-speed scenarios.
- Real-time robotics applications might adopt this for on-the-fly 3D mapping from moving cameras.
- Investigating the exact mechanism of how events correct Gaussian parameters could lead to even simpler integration methods.
Load-bearing premise
Event data from the camera can be directly and effectively incorporated into the Gaussian Splatting process without needing extensive new models to bridge the different data types.
What would settle it
Experiments showing that E2GS produces equivalent or worse deblurring and view synthesis quality compared to standard Gaussian Splatting on the same blurry inputs would falsify the central claim.
read the original abstract
Event cameras, known for their high dynamic range, absence of motion blur, and low energy usage, have recently found a wide range of applications thanks to these attributes. In the past few years, the field of event-based 3D reconstruction saw remarkable progress, with the Neural Radiance Field (NeRF) based approach demonstrating photorealistic view synthesis results. However, the volume rendering paradigm of NeRF necessitates extensive training and rendering times. In this paper, we introduce Event Enhanced Gaussian Splatting (E2GS), a novel method that incorporates event data into Gaussian Splatting, which has recently made significant advances in the field of novel view synthesis. Our E2GS effectively utilizes both blurry images and event data, significantly improving image deblurring and producing high-quality novel view synthesis. Our comprehensive experiments on both synthetic and real-world datasets demonstrate our E2GS can generate visually appealing renderings while offering faster training and rendering speed (140 FPS). Our code is available at https://github.com/deguchihiroyuki/E2GS.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces Event Enhanced Gaussian Splatting (E2GS), which augments 3D Gaussian Splatting with event camera data to jointly perform image deblurring and novel view synthesis from blurry inputs. It claims effective fusion of event streams and blurry images yields visually appealing renderings at 140 FPS, with supporting experiments on synthetic and real-world datasets and public code release.
Significance. If the fusion approach proves robust, the work would offer a practical, high-speed alternative to slower NeRF-based event reconstruction pipelines by exploiting Gaussian Splatting's efficiency while leveraging event cameras' high dynamic range and motion-blur immunity. The code availability supports reproducibility.
major comments (2)
- [§3] §3 (Method): the event-image fusion step into the Gaussian optimization pipeline does not appear to include an explicit event rendering model, noise model, or calibration term to reconcile the integration mismatch between blurry-image intensity accumulation and asynchronous log-intensity event changes; without this, the auxiliary event loss risks producing inconsistent gradients, undermining the central claim of effective direct utilization.
- [§4] §4 (Experiments): quantitative tables comparing deblurring PSNR/SSIM and NVS metrics against baselines are needed to establish that reported gains are statistically meaningful and not dataset-specific; if the event contribution is marginal once domain mismatch is controlled, the general applicability of the pipeline is weakened.
minor comments (2)
- The abstract states 'significantly improving image deblurring' but the precise weighting between image and event losses should be stated explicitly in the method to allow reproduction.
- Figure captions could more clearly annotate the contribution of the event branch versus the baseline Gaussian Splatting run on blurry images alone.
Simulated Author's Rebuttal
We thank the referee for the detailed review and constructive comments on our paper. We address each major comment below and outline the revisions we plan to make.
read point-by-point responses
-
Referee: [§3] §3 (Method): the event-image fusion step into the Gaussian optimization pipeline does not appear to include an explicit event rendering model, noise model, or calibration term to reconcile the integration mismatch between blurry-image intensity accumulation and asynchronous log-intensity event changes; without this, the auxiliary event loss risks producing inconsistent gradients, undermining the central claim of effective direct utilization.
Authors: We appreciate this observation. Our current formulation uses an auxiliary event loss to guide the optimization, but we acknowledge the lack of an explicit rendering model and calibration term in the description. To address this, we will revise §3 to include a detailed derivation of the event loss, incorporating an event rendering approximation based on the log-intensity changes and a simple calibration to handle domain differences. This will clarify the gradient flow and support the claim of effective fusion. We believe this addition will resolve the concern without requiring changes to the implementation. revision: yes
-
Referee: [§4] §4 (Experiments): quantitative tables comparing deblurring PSNR/SSIM and NVS metrics against baselines are needed to establish that reported gains are statistically meaningful and not dataset-specific; if the event contribution is marginal once domain mismatch is controlled, the general applicability of the pipeline is weakened.
Authors: We agree that quantitative metrics are crucial for validating the improvements. Although the manuscript emphasizes visual quality and speed, we will add quantitative tables in the revised version, reporting PSNR and SSIM for both deblurring and novel view synthesis tasks. These will include comparisons with baselines on the synthetic and real-world datasets used in our experiments. This will allow readers to assess the statistical significance and general applicability. revision: yes
Circularity Check
No circularity detected; method description contains no derivation chain or self-referential fitting
full rationale
The abstract and available text describe E2GS as a fusion of event data into an existing Gaussian Splatting pipeline for deblurring and view synthesis, with claims of effective utilization and faster performance. No equations, optimization procedures, parameter fitting steps, or self-citations are presented that would allow any claimed prediction or result to reduce by construction to its own inputs. The central claim rests on empirical demonstration rather than a closed mathematical derivation, making the work self-contained against external benchmarks with no load-bearing circular steps.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 1 Pith paper
-
AsyncEvGS: Asynchronous Event-Assisted Gaussian Splatting for Handheld Motion-Blurred Scenes
AsyncEvGS reconstructs high-fidelity 3D scenes from motion-blurred images by first deblurring via event data then using VGGT-based pose estimation and structure-driven losses inside Gaussian Splatting.
Reference graph
Works this paper leans on
-
[1]
INTRODUCTION In the task of 3D scene reconstruction and novel view syn- thesis, we witnessed tremendous progress over the past few years. Especially, after the NeRF (Neural Radiance Field) [1] marked a significant milestone, leading to the active develop- ment of various neural rendering techniques for 3D scene re- construction [2, 3]. Among these, 3D Gau...
-
[2]
E2GS: Event Enhanced Gaussian Splatting
RELA TED WORKS 2.1. 3D Scene Reconstruction 3D scene reconstruction is one of the fundamental functional- ity of computer vision. Recent advancements in 3D scene re- construction have gained more attention after the emergence of NeRF [1]. While several methods emerged to strengthen the NeRF-based approach [2, 3], there is one research direc- tion to accel...
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[3]
METHOD The overview of our method is illustrated in Fig. 2. The input of our method is a set of blurry images and event stream of a static scene. In our E2GS framework, we first perform pre- processing using the correspondence between event data and blurred images. Then, we use two types of loss functions to train the Gaussian Splatting considering the bl...
-
[4]
(11) Finally, we combine two loss functionLblur and Levent by using a weight parameter wevent to obtain the following loss L = Lblur + weventLevent, (12)
-
[5]
EXPERIMENTS 4.1. Experimental Setup We evaluated our E2GS on two different tasks: Image deblur- ring and novel view synthesis. For the image deblurring task, we evaluate the rendering results from the perspective of the blurry image set. for the novel view synthesis task, we evalu- ate the rendering results from the perspective not used in the blurry imag...
-
[6]
dataset. Synthetic data: Synthetic set contains six synthetic scenes (chair, ficus, hotdog, lego, materials, and mic), and it uses the Camera Shakify plugin in Blender to simulate camera shake. The event data are simulated by V2E[23]. Each scene has 100 views of blurry images estimated by 17 different cam- era poses from the Camera Shakify plugin, its cor...
-
[7]
Image Deblurletter lego camera toys plant Avg
and the average of the five scenes. Image Deblurletter lego camera toys plant Avg. GS 40.68 39.52 21.76 43.66 38.26 36.78 E2NeRF 44.33 34.09 28.89 43.41 32.23 36.59 E2GS (Ours) 37.62 35.2 19.93 38.87 30.87 32.50 Table 4: Quantitative evaluation of the novel view synthe- sis task. Showing the BRISQUE results of five scenes from E2NeRF [7] and the average o...
-
[8]
CONCLUSION In this paper, we propose Event Enhanced Gaussian Splat- ting (E2GS), the novel framework that effectively utilizes event data into Gaussian Splatting to reconstruct sharp scenes from blurry RGB frames. Comprehensive experiments using the synthetic dataset and the real-world dataset demonstrate that our E2GS achieves visually appealing renderin...
-
[9]
Nerf: Representing scenes as neural radiance fields for view synthesis,
Ben Mildenhall, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Nge, “Nerf: Representing scenes as neural radiance fields for view synthesis,” Commun. ACM, vol. 65, no. 1, pp. 99– 106, 2021
work page 2021
-
[10]
Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields,
Jonathan T Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, and Pratul P Srinivasan, “Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields,” in ICCV, 2021, pp. 5855–5864
work page 2021
-
[11]
D-nerf: Neural radiance fields for dynamic scenes,
Albert Pumarola, Enric Corona, Gerard Pons-Moll, and Francesc Moreno-Noguer, “D-nerf: Neural radiance fields for dynamic scenes,” in CVPR, 2021, pp. 10318– 10327
work page 2021
-
[12]
3d gaussian splatting for real-time radiance field rendering,
Kerbl Bernhard, Kopanas Georgios, Leimk ¨uhler Thomas, and Drettakis George, “3d gaussian splatting for real-time radiance field rendering,” TOG, vol. 42, no. 4, July 2023
work page 2023
-
[13]
E-raft: Dense optical flow from event cameras,
Mathias Gehrig, Mario Millh ¨ausler, Daniel Gehrig, and Davide Scaramuzza, “E-raft: Dense optical flow from event cameras,” in 3DV, 2021, pp. 197–206
work page 2021
-
[14]
Learning event-driven video deblurring and inter- polation,
Songnan Lin, Jiawei Zhang, Jinshan Pan, Zhe Jiang, Dongqing Zou, Yongtian Wang, Jing Chen, and Jimmy Ren, “Learning event-driven video deblurring and inter- polation,” in ECCV, 2020, pp. 695–710
work page 2020
-
[15]
E2nerf: Event enhanced neural radiance fields from blurry im- ages,
Yunshan Qi, Lin Zhu, Yu Zhang, and Jia Li, “E2nerf: Event enhanced neural radiance fields from blurry im- ages,” in ICCV, 2023, p. 837–847
work page 2023
-
[16]
Eventnerf: Neural radiance fields from a single colour event camera,
Viktor Rudnev, Mohamed Elgharib, Christian Theobalt, and Vladislav Golyanik, “Eventnerf: Neural radiance fields from a single colour event camera,” in CVPR, 2023
work page 2023
-
[17]
E-nerf: Neural radiance fields from a moving event camera,
Simon Klenk, Lukas Koestler, Davide Scaramuzza, and Daniel Cremers, “E-nerf: Neural radiance fields from a moving event camera,” RAL, 2023
work page 2023
-
[18]
Instant neural graphics primitives with a multiresolution hash encoding,
Thomas M ¨uller, Alex Evans, Christoph Schied, and Alexander Keller, “Instant neural graphics primitives with a multiresolution hash encoding,”ToG, vol. 41, no. 4, pp. 1–15, 2022
work page 2022
-
[19]
Deblur-nerf: Neural radiance fields from blurry images,
Li Ma, Xiaoyu Li, Jing Liao, Qi Zhang, Xuan Wang, Jue Wang, and Pedro V Sander, “Deblur-nerf: Neural radiance fields from blurry images,” in CVPR, 2022, p. 12861–12870
work page 2022
-
[20]
Deblurring 3d gaussian splat- ting,
Byeonghyeon Lee, Howoong Lee, Xiangyu Sun, Usman Ali, and Eunbyung Park, “Deblurring 3d gaussian splat- ting,” arXiv preprint arXiv:2401.00834, 2024
-
[21]
A 128× 128 120 db 15 µs latency asynchronous temporal contrast vision sensor,
Patrick Lichtsteiner, Christoph Posch, and Tobi Del- bruck, “A 128× 128 120 db 15 µs latency asynchronous temporal contrast vision sensor,” IEEE J. Solid-State Circuits, vol. 43, no. 2, pp. 566–576, 2008
work page 2008
-
[22]
Real-time high speed motion prediction using fast aperture-robust event-driven visual flow,
Himanshu Akolkar, Sio-Hoi Ieng, , and Ryad Benos- man, “Real-time high speed motion prediction using fast aperture-robust event-driven visual flow,” TPAMI, vol. 44, no. 1, pp. 361–372, 2020
work page 2020
-
[23]
Event-based bispectral pho- tometry using temporally modulated illumination,
Tsuyoshi Takatani, Yuzuha Ito, Ayaka Ebisu, Yinqiang Zheng, and Takahito Aoto, “Event-based bispectral pho- tometry using temporally modulated illumination,” in CVPR, 2021, p. 15638–15647
work page 2021
-
[24]
Spiking transformers for event-based single object tracking,
Jiqing Zhang, Bo Dong, Haiwei Zhang, Jianchuan Ding, Felix Heide, Baocai Yin, and Xin Yang, “Spiking transformers for event-based single object tracking,” in CVPR, 2022, p. 8801–8810
work page 2022
-
[25]
Ev- nerf: Event based neural radiance field,
Inwoo Hwang, Junho Kim, and Young Min Kim, “Ev- nerf: Event based neural radiance field,” inCVPR, 2023, p. 837–847
work page 2023
-
[26]
Matthias Zwicker, Hanspeter Pfister, Jeroen Van Baar, and Markus Gross, “Ewa volume splatting,” in Pro- ceedings Visualization. IEEE, 2001, pp. 29–538
work page 2001
-
[27]
Bringing a blurry frame alive at high frame-rate with an event camera,
Liyuan Pan, Cedric Scheerlinck, Xin Yu, Richard Hart- ley, Miaomiao Liu, and Yuchao Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” in CVPR, 2019, pp. 6820–6829
work page 2019
-
[28]
Structure-from-motion revisited,
Johannes L Schonberger and Jan-Michael Frahm, “Structure-from-motion revisited,” in CVPR, 2016, pp. 4104–4113
work page 2016
-
[29]
The unreasonable ef- fectiveness of deep features as a perceptual metric,
Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang, “The unreasonable ef- fectiveness of deep features as a perceptual metric,” in CVPR, 2018, pp. 586–595
work page 2018
-
[30]
No-reference image quality assessment in the spatial domain,
Anish Mittal, Anush Krishna Moorthy, and Alan Conrad Bovik, “No-reference image quality assessment in the spatial domain,” TIP, vol. 21, no. 12, pp. 4695–4708, 2012
work page 2012
-
[31]
v2e: From video frames to realistic dvs events,
Yuhuang Hu, Shih-Chii Liu, and Tobi Delbruck, “v2e: From video frames to realistic dvs events,” in CVPR, 2021, pp. 1312–1321
work page 2021
-
[32]
A 240× 180 130 db 3 µs latency global shutter spatiotemporal vision sen- sor,
Christian Brandli, Raphael Berner, Minhao Yang, Shih- Chii Liu, and Tobi Delbruck, “A 240× 180 130 db 3 µs latency global shutter spatiotemporal vision sen- sor,” IEEE J. Solid-State Circuits , vol. 49, no. 10, pp. 2333–2341, 2014
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.