AlignAtt: Using attention-based audio-translation alignments as a guide for simultaneous speech translation. arXiv preprint arXiv:2305.11408.
2 Pith papers cite this work.
Citing papers by year: 2026 (2). Verdicts: unverdicted (2).
citing papers explorer
- NPUsper: Eliminating Redundant Computation for Real-Time Whisper on Mobile NPUs
NPUsper reduces per-word latency, TTFT, and power for Whisper on mobile NPUs via online hallucination detection and K-step chunk graphs while preserving accuracy.
- A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026
A 1B-parameter multilingual offline model is adapted with AlignAtt policy for simultaneous speech translation and submitted to IWSLT 2026 for three language pairs.