VideoDetective improves long-video question answering by constructing a visual-temporal affinity graph and running a Hypothesis-Verification-Refinement loop to propagate query relevance across segments, yielding up to 7.5% accuracy gains on VideoMME-long.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
VideoDetective improves long-video question answering by constructing a visual-temporal affinity graph and running a Hypothesis-Verification-Refinement loop to propagate query relevance across segments, yielding up to 7.5% accuracy gains on VideoMME-long.