MERIT: Multilingual Expert-Reward Informed Tuning for Chinese-Centric Low-Resource Machine Translation

· 2026 · cs.CL · arXiv 2604.04839

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Neural machine translation (NMT) from Chinese to low-resource Southeast Asian languages remains severely constrained by the extreme scarcity of clean parallel corpora and the pervasive noise in existing mined data. This chronic shortage not only impedes effective model training but also sustains a large performance gap with high-resource directions, leaving millions of speakers of languages such as Lao, Burmese, and Tagalog with persistently low-quality translation systems despite recent advances in large multilingual models. We introduce \textbf{M}ultilingual \textbf{E}xpert-\textbf{R}eward \textbf{I}nformed \textbf{T}uning (\textbf{MERIT}), a unified translation framework that transforms the traditional English-centric ALT benchmark into a Chinese-centric evaluation suite for five Southeast Asian low-resource languages (LRLs). Our framework combines language-specific token prefixing (LTP) with supervised fine-tuning (SFT) and a novel group relative policy optimization (GRPO) guided by the semantic alignment reward (SAR). These results confirm that, in LRL{\textrightarrow}Chinese translation, targeted data curation and reward-guided optimization dramatically outperform mere model scaling.

representative citing papers

Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening

cs.CV · 2026-04-07 · unverdicted · novelty 4.0

STGR framework integrates LLaMA-3-V and MedSAM via text-to-vision distillation and graph reasoning, achieving 81.5% DSC on LIDC-IDRI with under 1% parameter updates and high cross-fold stability.

citing papers explorer

Showing 1 of 1 citing paper.

Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening cs.CV · 2026-04-07 · unverdicted · none · ref 13 · internal anchor
STGR framework integrates LLaMA-3-V and MedSAM via text-to-vision distillation and graph reasoning, achieving 81.5% DSC on LIDC-IDRI with under 1% parameter updates and high cross-fold stability.

MERIT: Multilingual Expert-Reward Informed Tuning for Chinese-Centric Low-Resource Machine Translation

fields

years

verdicts

representative citing papers

citing papers explorer