Post-Training Speech Enhancement Language Models with Perceptual Rewards

2026 · cs.LG · arXiv 2606.21458

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

Speech enhancement language models achieve strong results when trained on discrete audio tokens, but their optimization relies on token-level cross-entropy rather than the perceptual metrics used for evaluation. We introduce a post-training stage for autoregressive speech enhancement language models using Group Sequence Policy Optimization (GSPO) with multi-metric perceptual rewards. Our method directly optimizes non-differentiable quality metrics (DNSMOS, WER, and UTMOS) as reward signals, without learned surrogates or offline preference pairs. Applied to two autoregressive base models, UniSE and GenSE, our approach achieves state-of-the-art results on the DNS2020 benchmark. A human evaluation ablation further shows that the composite multi-metric reward is preferred over any single-metric variant, confirming that multi-reward optimization avoids the reward hacking observed with single-metric training.
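
The post-training recipe described in the abstract can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the abstract does not give the reward formula, so `composite_reward`, its equal weights, the MOS rescaling, and the `clip_eps` value are all assumptions made here. The length-normalized sequence-level importance ratio and the group-normalized advantages follow the general GSPO formulation; metric scores are taken as given numbers rather than computed from audio.

```python
import math
from statistics import mean, pstdev


def composite_reward(dnsmos, utmos, wer, weights=(1.0, 1.0, 1.0)):
    """Hypothetical composite reward in [0, 1] (weights are an assumption).

    DNSMOS and UTMOS are MOS-style scores (higher is better, roughly 1 to 5);
    WER is an error rate (lower is better).
    """
    w_dns, w_utmos, w_wer = weights
    dns_term = (dnsmos - 1.0) / 4.0            # rescale MOS range [1, 5] to [0, 1]
    utmos_term = (utmos - 1.0) / 4.0
    wer_term = 1.0 - min(max(wer, 0.0), 1.0)   # WER can exceed 1, so clip first
    total = w_dns + w_utmos + w_wer
    return (w_dns * dns_term + w_utmos * utmos_term + w_wer * wer_term) / total


def group_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of samples drawn for the same input."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def gspo_objective(logp_new, logp_old, lengths, advantages, clip_eps=0.2):
    """Clipped sequence-level surrogate objective (to be maximized).

    logp_new / logp_old: summed token log-probabilities of each sampled
    sequence under the current and the sampling policy.
    clip_eps is an illustrative placeholder, not a value from the paper.
    """
    terms = []
    for ln, lo, n, adv in zip(logp_new, logp_old, lengths, advantages):
        ratio = math.exp((ln - lo) / n)        # length-normalized sequence ratio
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        terms.append(min(ratio * adv, clipped * adv))
    return mean(terms)


# Usage: four enhanced candidates for one noisy utterance (made-up scores).
scores = [(3.4, 3.9, 0.05), (3.1, 3.5, 0.12), (3.6, 4.0, 0.30), (2.8, 3.0, 0.08)]
rewards = [composite_reward(d, u, w) for d, u, w in scores]
advantages = group_advantages(rewards)
objective = gspo_objective(
    logp_new=[-120.0, -131.0, -118.0, -140.0],
    logp_old=[-121.0, -130.0, -119.0, -139.0],
    lengths=[200, 210, 195, 205],
    advantages=advantages,
)
```

Because the three metrics enter a single scalar, a candidate that inflates one score while degrading another (for example, a high DNSMOS with a high WER) receives a lower reward than a balanced one, which is the intuition behind the abstract's claim about reward hacking.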

representative citing papers

Post-Training Speech Enhancement Language Models with Perceptual Rewards

cs.LG · 2026-06-19 · unverdicted · novelty 6.0

Post-training autoregressive speech enhancement LMs via GSPO with composite perceptual rewards from DNSMOS, WER, and UTMOS reaches SOTA on DNS2020 and outperforms single-metric variants in human evaluation.

citing papers explorer

Showing 1 of 1 citing paper.

Post-Training Speech Enhancement Language Models with Perceptual Rewards

cs.LG · 2026-06-19 · unverdicted · none · ref 1 · internal anchor
