pith. sign in

Proactivevideoqa: A comprehensive benchmark evaluating proactive interactions in video large language models

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

baseline 1 dataset 1

citation-polarity summary

fields

cs.CV 4 cs.AI 1

years

2026 5

representative citing papers

Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction

cs.CV · 2026-05-17 · conditional · novelty 7.0

Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.

citing papers explorer

Showing 5 of 5 citing papers.