High-frame-rate feature extraction at 200-400 fps improves end-to-end ASR word error rates on WSJ and CHiME-5, with relative reductions up to 24.1% when combined with speed perturbation.
When high-frame-rate feature extraction at 200 and 400 frames/second is used, feature vectors are extracted every 5 and 2.5 ms, respectively.
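The frame-shift figures quoted above follow directly from the frame rate (shift in ms = 1000 / frames per second). A minimal sketch of that arithmetic is below; the function names, the 100 frames/second baseline, and the 16 kHz sample rate are illustrative assumptions, not details taken from the paper.

```python
def frame_shift_ms(frames_per_second: float) -> float:
    """Time between consecutive feature vectors, in milliseconds."""
    return 1000.0 / frames_per_second


def hop_samples(frames_per_second: float, sample_rate_hz: int = 16000) -> int:
    """Hop size in audio samples (16 kHz sample rate is an assumption)."""
    return round(sample_rate_hz / frames_per_second)


# 100 fps is included only as an assumed conventional baseline for comparison.
for fps in (100, 200, 400):
    print(f"{fps} fps: shift = {frame_shift_ms(fps)} ms, hop = {hop_samples(fps)} samples")
```

Under these assumptions, doubling the frame rate halves the hop size, so the encoder sees twice as many feature vectors for the same utterance.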
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: eess.AS (1)
Years: 2019 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
- End-to-End Speech Recognition with High-Frame-Rate Features Extraction