Peng Qi, Zehong Yan, Wynne Hsu, and Mong Li Lee

Judge Anything: MLLM as a Judge Across Any Modality · 2025 · arXiv 2503.17489

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

What's Left Unsaid? Detecting and Correcting Misleading Omissions in Multimodal News Previews

cs.CV · 2026-01-09 · unverdicted · novelty 7.0

OMGuard combines interpretation-aware fine-tuning and rationale-guided headline rewriting to detect and correct omission-based misleadingness in multimodal news previews, raising an 8B model's performance to match a 235B LVLM.

Evian: Towards Explainable Visual Instruction-tuning Data Auditing

cs.CV · 2026-04-22 · unverdicted · novelty 6.0

EVian decomposes vision-language model responses into three cognitive components and audits them along consistency, coherence, and accuracy axes, showing that a small curated subset outperforms much larger training sets.

citing papers explorer

Showing 2 of 2 citing papers.

What's Left Unsaid? Detecting and Correcting Misleading Omissions in Multimodal News Previews
cs.CV · 2026-01-09 · unverdicted · none · ref 3
Evian: Towards Explainable Visual Instruction-tuning Data Auditing
cs.CV · 2026-04-22 · unverdicted · none · ref 5
