Joint Multimedia Event Extraction from Video and Article

Chen, Brian; Lin, Xudong; Thomas, Christopher; Li, Manling; Yoshida, Shoya; Chum, Lovish · 2021 · DOI 10.18653/v1/2021.findings-emnlp.8

2 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Evaluation Pitfalls and Challenges in Multimedia Event Extraction

cs.CL · 2026-06-25 · unverdicted · novelty 7.0

A systematic analysis of evaluation practices in multimedia event extraction reveals that minor methodological choices cause large performance swings and overestimation of cross-modal grounding ability.

NEST: Narrative Event Structures in Time for Long Video Understanding

cs.CV · 2026-06-18 · unverdicted · novelty 7.0

NEST is a new benchmark dataset for narrative event structures in long videos, with baselines reporting ETD below 8%, EL under 6%, EAE below 11%, and ERE at 35-44% F1.
