K-means parameter clustering enables data-free training-free pruning of HuBERT and Whisper models with reported WER gains over magnitude pruning on LibriSpeech at 50% and 10% sparsity.
DistillW2V2: A small and streaming wav2vec 2.0 based ASR model,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.SD 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Proposes OPC and dual-mode LN to improve dual-mode SSL speech models, reducing WER gap at 160 ms latency on LibriSpeech from 3.65% to 3.40% (test-clean).
citing papers explorer
-
Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering
K-means parameter clustering enables data-free training-free pruning of HuBERT and Whisper models with reported WER gains over magnitude pruning on LibriSpeech at 50% and 10% sparsity.
-
Online Predictive Coding for Dual-Mode Self-Supervised Speech Model
Proposes OPC and dual-mode LN to improve dual-mode SSL speech models, reducing WER gap at 160 ms latency on LibriSpeech from 3.65% to 3.40% (test-clean).