pith. sign in

arxiv: 1710.06554 · v2 · pith:4BWR45LMnew · submitted 2017-10-18 · 💻 cs.CL

Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting

classification 💻 cs.CL
keywords keywordreimplementationspottingconvolutionalhonknetworksneuralpytorch
0
0 comments X
read the original abstract

We describe Honk, an open-source PyTorch reimplementation of convolutional neural networks for keyword spotting that are included as examples in TensorFlow. These models are useful for recognizing "command triggers" in speech-based interfaces (e.g., "Hey Siri"), which serve as explicit cues for audio recordings of utterances that are sent to the cloud for full speech recognition. Evaluation on Google's recently released Speech Commands Dataset shows that our reimplementation is comparable in accuracy and provides a starting point for future work on the keyword spotting task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Multimodal Uncertainty Reduction for Intention Recognition in Human-Robot Interaction

    cs.RO 2019-07 unverdicted novelty 4.0

    Multimodal intention classifiers fused with Independent Opinion Pool reduce uncertainty and improve accuracy over single-modality baselines in a 7-DoF robot arm collaboration task.

  2. A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting

    cs.SD 2019-06 unverdicted novelty 4.0

    Joint training of speech enhancement and KWS with a novel CRN and Mel features improves noise robustness for small-footprint devices.