TF-MoE: Time-Frequency Mixture-of-Experts for Efficient Speech Separation

· 2026 · cs.SD · arXiv 2606.29575

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Recent advances in speech separation (SS) have led to compact front-end models with small parameter sizes, yet their high computational cost remains a major barrier for deployment on edge devices. To address this, we propose TF-MoE, a sparse Mixture-of-Experts (MoE) framework that enhances model capacity with almost no increase in inference cost. Our method introduces dynamic expert specialization in time and frequency dimensions through alternating time-wise and frequency-wise MoE modules, each dynamically selecting experts per frame or mel band. Built upon a mel-band-splitting Conformer backbone, TF-MoE achieves strong performance on SS tasks under low-compute settings. Experimental results demonstrate that TF-MoE consistently improves separation performance under computation cost constraints, outperforming BSRNN by +3.8 dB SDR on Libri2Mix with comparable 4.1 GMACs/s inference cost. This positions TF-MoE as a promising candidate for edge-device deployment.

representative citing papers

TF-MoE: Time-Frequency Mixture-of-Experts for Efficient Speech Separation

cs.SD · 2026-06-28 · unverdicted · novelty 5.0 · 2 refs

TF-MoE uses dynamic per-frame and per-mel-band expert selection in time and frequency dimensions to improve speech separation performance at comparable compute cost to prior models.

citing papers explorer

Showing 1 of 1 citing paper after filters.

TF-MoE: Time-Frequency Mixture-of-Experts for Efficient Speech Separation cs.SD · 2026-06-28 · unverdicted · none · ref 4 · 2 links · internal anchor
TF-MoE uses dynamic per-frame and per-mel-band expert selection in time and frequency dimensions to improve speech separation performance at comparable compute cost to prior models.

TF-MoE: Time-Frequency Mixture-of-Experts for Efficient Speech Separation

fields

years

verdicts

representative citing papers

citing papers explorer