← back to paper
arxiv: 2605.19329 · 2 revisions
RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding