RLSR trains source rewriters via RL with translation-quality improvement as the reward, outperforming prompt baselines at 4B scale while matching larger models.
Findings of the 2022 Conference on Machine Translation ( WMT 22)
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Ouvia is a user-centered evaluation framework for speech translation usability in real-world scenarios, showing limited usability rates and the superiority of QA-based metrics.
citing papers explorer
-
Rewrite to Translate, Translate to Reward: Reinforcement Learning for Source Rewriting in Machine Translation
RLSR trains source rewriters via RL with translation-quality improvement as the reward, outperforming prompt baselines at 4B scale while matching larger models.
-
Ouvia: A User-centered Framework for Measuring Usability of Speech Translation in Real-World Communication Scenarios
Ouvia is a user-centered evaluation framework for speech translation usability in real-world scenarios, showing limited usability rates and the superiority of QA-based metrics.