Automatic evaluation tools for literary translations correlate poorly with expert human judgments on creativity and exhibit bias favoring machine-translated texts.
Coling 2008, 22nd International Conference on Computational Linguistics , address=
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CL 3years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Small open-source LLMs achieve competitive system-level correlations with human judgments in machine translation quality estimation, outperforming traditional neural metrics and fine-tuned models via single-pass multi-output prompting.
A narrative review of AI language technologies in multilingual healthcare identifies performance gaps in safety and equity and proposes seven grand challenges centered on reliability, human oversight, and cross-disciplinary collaboration.
citing papers explorer
-
Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations
Automatic evaluation tools for literary translations correlate poorly with expert human judgments on creativity and exhibit bias favoring machine-translated texts.
-
CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs
Small open-source LLMs achieve competitive system-level correlations with human judgments in machine translation quality estimation, outperforming traditional neural metrics and fine-tuned models via single-pass multi-output prompting.
-
Artificial intelligence language technologies in multilingual healthcare: Grand challenges ahead
A narrative review of AI language technologies in multilingual healthcare identifies performance gaps in safety and equity and proposes seven grand challenges centered on reliability, human oversight, and cross-disciplinary collaboration.