Enhancing Event-based Object Detection with Monocular Normal Maps

Chuang Zhu; Hanqing Liu; Luoping Cui; Mingjie Liu

arxiv: 2508.02127 · v2 · pith:HY5TVU3M · submitted 2025-08-04 · 💻 cs.CV

classification 💻 cs.CV

keywords detection, event, fusion, geometric, maps, normal, priors, appearance

Object detection in autonomous driving is frequently compromised by complex illumination. While event cameras offer a robust solution, they are susceptible to sudden contrast changes, such as reflections, which often trigger dense, misleading event signals. To overcome this, we leverage RGB-derived surface normal maps as explicit geometric constraints. Crucially, even when RGB degrades, these normal maps preserve low-frequency structural priors that effectively assist event-based detection. Consequently, we present NRE-Net, a trimodal framework that integrates structural priors from surface Normal maps, appearance context from RGB images, and high-frequency dynamics from Events. The Adaptive Dual-stream Fusion Module (ADFM) first aligns geometric and appearance cues, followed by the Event-modality Aware Fusion Module (EAFM), which selectively integrates event dynamics. Extensive evaluations on DSEC-Det-sub and PKU-DAVIS-SOD demonstrate that incorporating geometric priors yields an additional 3.0% AP50 gain over dual-modal baselines, while our approach consistently outperforms fusion methods such as SFNet (+2.7%) and SODFormer (+7.1%).
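
The two-stage ordering described in the abstract (geometry and appearance fused first, events integrated second) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the internals of ADFM or EAFM, so `gated_blend` and `fuse_trimodal` are hypothetical names, the features are plain lists of floats, and a parameter-free sigmoid gate stands in for whatever learned weighting the real modules use.

```python
import math
from typing import List

Vector = List[float]


def sigmoid(x: float) -> float:
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_blend(a: Vector, b: Vector) -> Vector:
    """Blend two equal-length feature vectors element by element.

    ASSUMPTION: the gate sigmoid(a_i - b_i) is a parameter-free stand-in
    for a learned fusion weight; it is not taken from the paper.
    """
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    out = []
    for x, y in zip(a, b):
        g = sigmoid(x - y)                 # gate in (0, 1)
        out.append(g * x + (1.0 - g) * y)  # convex combination of x and y
    return out


def fuse_trimodal(normal_feat: Vector, rgb_feat: Vector, event_feat: Vector) -> Vector:
    """Hypothetical two-stage fusion mirroring only the ordering in the abstract."""
    # Stage 1 (the role ADFM plays): combine geometric and appearance cues.
    frame_feat = gated_blend(normal_feat, rgb_feat)
    # Stage 2 (the role EAFM plays): integrate event features into the result.
    return gated_blend(frame_feat, event_feat)


if __name__ == "__main__":
    fused = fuse_trimodal([0.0, 2.0], [1.0, 0.0], [0.5, 4.0])
    print(fused)
```

Because each stage is a convex combination, every fused value stays between the smallest and largest input at that position. The sketch shows only the order of operations, not the detection pipeline or any learned component.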

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding
cs.CV 2026-05 unverdicted novelty 6.0

RE-VLM is the first dual-stream VLM combining RGB and event data with a graph-based pipeline to generate training captions and QA pairs, showing gains over RGB-only and event-only models on new datasets for challengin...

Sparse Hypergraph-Enhanced Frame-Event Object Detection with Fine-Grained MoE
cs.CV 2026-04 unverdicted novelty 6.0

Hyper-FEOD fuses RGB and event data via sparse hypergraph cross-modal fusion and region-specialized MoE experts to improve accuracy-efficiency in object detection.