DistillW2V2: A small and streaming wav2vec 2.0 based ASR model,

· 2023 · arXiv 2303.09278

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering

cs.SD · 2026-06-10 · unverdicted · novelty 4.0

K-means parameter clustering enables data-free training-free pruning of HuBERT and Whisper models with reported WER gains over magnitude pruning on LibriSpeech at 50% and 10% sparsity.

Online Predictive Coding for Dual-Mode Self-Supervised Speech Model

cs.SD · 2026-06-19 · unverdicted · novelty 3.0

Proposes OPC and dual-mode LN to improve dual-mode SSL speech models, reducing WER gap at 160 ms latency on LibriSpeech from 3.65% to 3.40% (test-clean).

citing papers explorer

Showing 2 of 2 citing papers.

Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering cs.SD · 2026-06-10 · unverdicted · none · ref 27
K-means parameter clustering enables data-free training-free pruning of HuBERT and Whisper models with reported WER gains over magnitude pruning on LibriSpeech at 50% and 10% sparsity.
Online Predictive Coding for Dual-Mode Self-Supervised Speech Model cs.SD · 2026-06-19 · unverdicted · none · ref 17
Proposes OPC and dual-mode LN to improve dual-mode SSL speech models, reducing WER gap at 160 ms latency on LibriSpeech from 3.65% to 3.40% (test-clean).

DistillW2V2: A small and streaming wav2vec 2.0 based ASR model,

fields

years

verdicts

representative citing papers

citing papers explorer