However, this accuracy comes at a cost: models with hundreds of millions of parameters are difficult to deploy on edge devices, embedded systems, or latency-sensitive applications

INTRODUCTION Automatic speech recognition (ASR) has advanced rapidly with large-scale Transformer models such as Whisper-small, which deliver state-of-the-art transcription accurac

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Quantizing Whisper-small: How design choices affect ASR performance

eess.AS · 2025-11-11 · unverdicted · novelty 4.0

Dynamic int8 quantization via Quanto on Whisper-small reduces size by 57% and improves WER on LibriSpeech test sets compared to the unquantized baseline.
