A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting

· 2019 · cs.SD · arXiv 1906.08415

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Robustness against noise is critical for keyword spotting (KWS) in real-world environments. To improve the robustness, a speech enhancement front-end is involved. Instead of treating the speech enhancement as a separated preprocessing before the KWS system, in this study, a pre-trained speech enhancement front-end and a convolutional neural networks (CNNs) based KWS system are concatenated, where a feature transformation block is used to transform the output from the enhancement front-end into the KWS system's input. The whole model is trained jointly, thus the linguistic and other useful information from the KWS system can be back-propagated to the enhancement front-end to improve its performance. To fit the small-footprint device, a novel convolution recurrent network is proposed, which needs fewer parameters and computation and does not degrade performance. Furthermore, by changing the input features from the power spectrogram to Mel-spectrogram, less computation and better performance are obtained. our experimental results demonstrate that the proposed method significantly improves the KWS system with respect to noise robustness.

representative citing papers

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting

cs.SD · 2019-06-20 · unverdicted · novelty 4.0

Joint training of speech enhancement and KWS with a novel CRN and Mel features improves noise robustness for small-footprint devices.

citing papers explorer

Showing 1 of 1 citing paper.

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting cs.SD · 2019-06-20 · unverdicted · none · ref 2 · internal anchor
Joint training of speech enhancement and KWS with a novel CRN and Mel features improves noise robustness for small-footprint devices.

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting

fields

years

verdicts

representative citing papers

citing papers explorer