pith. sign in

arxiv: 2504.15371 · v6 · pith:3K7DACZ2new · submitted 2025-04-21 · 💻 cs.CV · cs.NE

Event2Vec: Processing Neuromorphic Events Directly by Representations in Vector Space

classification 💻 cs.CV cs.NE
keywords event2veceventsasynchronousdirectlyneuromorphicsparsecamerasdata
0
0 comments X
read the original abstract

Neuromorphic event cameras possess superior temporal resolution, power efficiency, and dynamic range compared to traditional cameras. However, their asynchronous and sparse data format poses a significant challenge for conventional deep learning methods. Most existing methods either densify events into frames, sacrificing their sparse asynchronous nature, or use irregular models that are less compatible with GPU acceleration. Inspired by word-to-vector models, we propose event2vec, a novel representation that allows Transformers to process events directly. We demonstrate the effectiveness of event2vec on the DVS Gesture, ASL-DVS, and DVS-Lip benchmarks, showing that event2vec is remarkably parameter-efficient, features high throughput and low latency, and achieves high accuracy even with an extremely low number of events or low spatial resolutions. These results show that sparse asynchronous event data can be directly integrated into high-throughput Transformer architectures, offering an efficient paradigm for real-time neuromorphic vision. The code is provided at https://github.com/Intelligent-Computing-Lab-Panda/event2vec.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Neural Events: Discrete Asynchronous Autoencoders for Event-Based Vision

    cs.CV 2026-06 unverdicted novelty 7.0

    Neural events compress event camera streams into fewer informative tokens via discrete asynchronous autoencoders, achieving on-par or better performance on detection and classification with 2x lower event rate.