Eventclip: Adapting clip for event- based object recognition

Ziyi Wu, Xudong Liu, Igor Gilitschenski · 2023 · arXiv 2306.06354

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

EventPrune: Cascaded Event-Assisted Token Pruning for Efficient First-Person Dynamic Spatial Reasoning

cs.CV · 2026-05-19 · unverdicted · novelty 7.0

EventPrune prunes 80% of visual tokens in Video-LLMs using event camera motion cues, yielding 1.89x speedup, 52% fewer GFLOPs, and slightly higher accuracy than full-token baselines on first-person dynamic spatial reasoning.

EventFace: Event-Based Face Recognition via Structure-Driven Spatiotemporal Modeling

cs.CV · 2026-04-08 · unverdicted · novelty 6.0

EventFace achieves 94.19% Rank-1 accuracy and 5.35% EER on a new small event-based face dataset by transferring facial structure priors via LoRA and fusing them with temporal motion features.

Generative Event Pretraining with Foundation Model Alignment

cs.CV · 2026-03-24 · unverdicted · novelty 6.0

GEP transfers semantic knowledge from image foundation models to event data via alignment and generative pretraining on mixed sequences to create transferable event-based visual models.

RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding

cs.CV · 2026-05-19 · unverdicted · novelty 5.0 · 2 refs

RE-VLM fuses RGB and event data in a dual-stream VLM with a graph-based pipeline for generating training captions and QA pairs, plus two new datasets, showing gains over RGB-only and event-only baselines especially in challenging conditions.

citing papers explorer

Showing 4 of 4 citing papers.

EventPrune: Cascaded Event-Assisted Token Pruning for Efficient First-Person Dynamic Spatial Reasoning cs.CV · 2026-05-19 · unverdicted · none · ref 39
EventPrune prunes 80% of visual tokens in Video-LLMs using event camera motion cues, yielding 1.89x speedup, 52% fewer GFLOPs, and slightly higher accuracy than full-token baselines on first-person dynamic spatial reasoning.
EventFace: Event-Based Face Recognition via Structure-Driven Spatiotemporal Modeling cs.CV · 2026-04-08 · unverdicted · none · ref 53
EventFace achieves 94.19% Rank-1 accuracy and 5.35% EER on a new small event-based face dataset by transferring facial structure priors via LoRA and fusing them with temporal motion features.
Generative Event Pretraining with Foundation Model Alignment cs.CV · 2026-03-24 · unverdicted · none · ref 55
GEP transfers semantic knowledge from image foundation models to event data via alignment and generative pretraining on mixed sequences to create transferable event-based visual models.
RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding cs.CV · 2026-05-19 · unverdicted · none · ref 29 · 2 links
RE-VLM fuses RGB and event data in a dual-stream VLM with a graph-based pipeline for generating training captions and QA pairs, plus two new datasets, showing gains over RGB-only and event-only baselines especially in challenging conditions.

Eventclip: Adapting clip for event- based object recognition

fields

years

verdicts

representative citing papers

citing papers explorer