VLMs exhibit sharply higher counterfactual hallucination rates in Arabic and dialects despite high true-statement accuracy, revealed by the new M²CQA benchmark and CFHR metric.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Once Correct, Still Wrong: Counterfactual Hallucination in Multilingual Vision-Language Models
VLMs exhibit sharply higher counterfactual hallucination rates in Arabic and dialects despite high true-statement accuracy, revealed by the new M²CQA benchmark and CFHR metric.