Preprint, arXiv:2512.17648

Simulstream: Open-source toolkit for evaluation and demonstration of streaming speech-to-text translation systems · arXiv 2512.17648

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

AlignAtt4LLM: Fast AlignAtt for Decoder-Only LLMs at IWSLT 2026 Simultaneous Speech Translation Task

cs.CL · 2026-06-02 · unverdicted · novelty 6.0

AlignAtt4LLM adapts AlignAtt to decoder-only LLMs via prompt layout, head selection, and attention replay, outperforming IWSLT 2026 baselines for En-De and En-It at ~2s and <4s latency.

Benchmarking Speech-to-Speech Translation Models

cs.CL · 2026-06-02 · unverdicted · novelty 6.0

COMPASS is a new reproducible benchmarking framework for S2ST that deploys 46 metrics on 1248 configurations, shows single-metric rankings mislead, reduces to 10 metrics per direction, and finds domain-specific metrics better match human judgments than standalone MOS predictors.

A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026

cs.CL · 2026-06-02 · unverdicted · novelty 2.0

A 1B-parameter multilingual offline model is adapted with AlignAtt policy for simultaneous speech translation and submitted to IWSLT 2026 for three language pairs.

citing papers explorer

Showing 3 of 3 citing papers.

AlignAtt4LLM: Fast AlignAtt for Decoder-Only LLMs at IWSLT 2026 Simultaneous Speech Translation Task

cs.CL · 2026-06-02 · unverdicted · none · ref 2

AlignAtt4LLM adapts AlignAtt to decoder-only LLMs via prompt layout, head selection, and attention replay, outperforming IWSLT 2026 baselines for En-De and En-It at ~2s and <4s latency.

Benchmarking Speech-to-Speech Translation Models

cs.CL · 2026-06-02 · unverdicted · none · ref 45

COMPASS is a new reproducible benchmarking framework for S2ST that deploys 46 metrics on 1248 configurations, shows single-metric rankings mislead, reduces to 10 metrics per direction, and finds domain-specific metrics better match human judgments than standalone MOS predictors.

A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026

cs.CL · 2026-06-02 · unverdicted · none · ref 5

A 1B-parameter multilingual offline model is adapted with AlignAtt policy for simultaneous speech translation and submitted to IWSLT 2026 for three language pairs.
