Prompt for ambiguity check: You are a QA evaluation assistant tasked with filtering incorrect or low-quality question-answer pairs based on video and audio context

Compare against provided answer:
- If orders match → Output "[YES]"
- If orders differ → Output "[Corrected]" with proper order, explanation
Output Format: [Validating]<4-stage anal

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

JointAVBench: A Benchmark for Joint Audio-Visual Reasoning Evaluation

cs.MM · 2025-12-14 · conditional · novelty 7.0

JointAVBench is a benchmark for joint audio-visual reasoning that shows leading Omni-LLMs reach only 65.3% accuracy, with particular weakness in cross-scene tasks.

citing papers explorer

JointAVBench: A Benchmark for Joint Audio-Visual Reasoning Evaluation cs.MM · 2025-12-14 · conditional · none · ref 31