QQJ is an evaluation framework that anchors LLM judges in expert rubrics and calibrates them on small high-quality annotation sets to improve alignment with human judgment on generative tasks.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
QQJ: Quantifying Qualitative Judgment for Scalable and Human-Aligned Evaluation of Generative AI