MÖVE presents a new German-language benchmark evaluating 39 LLMs on performance and governance criteria using ten public-administration datasets.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3verdicts
UNVERDICTED 3representative citing papers
ToolRec introduces dual-level calibration of click data and weighted KTO alignment to improve tool-invoking query recommendations in on-device assistants, reporting CTR gains in large-scale A/B tests.
IDO uses channel-wise reweighting, Gaussian modeling of factual uncertainty, and incongruity contrastive learning to achieve SOTA multimodal fake news detection.
citing papers explorer
-
ToolRec: Calibrated Preference Alignment for Query Recommendation in On-Device Assistants
ToolRec introduces dual-level calibration of click data and weighted KTO alignment to improve tool-invoking query recommendations in on-device assistants, reporting CTR gains in large-scale A/B tests.