Hierarchical Policy Optimization post-trains LLMs for simultaneous speech translation on imperfect data, yielding over +7 COMET and +1.25 MetricX improvements at 1.5-second latency on English-to-Chinese/German/Japanese tasks.
Finally, we enforce monotonicity on the alignment and group target words that correspond to the same speech chunk
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech
Hierarchical Policy Optimization post-trains LLMs for simultaneous speech translation on imperfect data, yielding over +7 COMET and +1.25 MetricX improvements at 1.5-second latency on English-to-Chinese/German/Japanese tasks.