VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Li, Lei; Xie, Zhihui; Li, Mukai; Chen, Shunian; Wang, Peiyi; Chen, Liang · 2024 · DOI 10.18653/v1/2024.emnlp-main.358

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Jury Duty: Calibration and Orientation Failures in MLLM-as-a-Judge Under Cultural Ambiguity

cs.CV · 2026-06-12 · unverdicted · novelty 7.0

The VOIR DIRE benchmark shows that MLLM-as-a-Judge failures on culturally contested items decompose into a positivity-floor calibration failure and an orientation failure, with persona prompting recovering only the former.

