JudgmentBench supplies the first public paired rubric and preference annotations from legal experts on the same LLM outputs, showing comparative judgments outperform rubrics in recovering quality orderings.
Aligning Large Language Models through Synthetic Feedback
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
fields
cs.CL 3representative citing papers
Olmo 3 delivers fully open 7B and 32B language models with complete training artifacts, positioning the 32B variant as the strongest open thinking model released to date.
A comprehensive survey of knowledge distillation for LLMs structured around algorithms, skill enhancement, and vertical applications, highlighting data augmentation as a key enabler.
citing papers explorer
-
Olmo 3
Olmo 3 delivers fully open 7B and 32B language models with complete training artifacts, positioning the 32B variant as the strongest open thinking model released to date.