pith. sign in

arxiv: 1807.02465 · v1 · pith:KJAFF267new · submitted 2018-07-06 · 📡 eess.AS · cs.SD

Tone Recognition Using Lifters and CTC

classification 📡 eess.AS cs.SD
keywords methodspeechnetworksequencetonetonesaishell-1available
0
0 comments X
read the original abstract

In this paper, we present a new method for recognizing tones in continuous speech for tonal languages. The method works by converting the speech signal to a cepstrogram, extracting a sequence of cepstral features using a convolutional neural network, and predicting the underlying sequence of tones using a connectionist temporal classification (CTC) network. The performance of the proposed method is evaluated on a freely available Mandarin Chinese speech corpus, AISHELL-1, and is shown to outperform the existing techniques in the literature in terms of tone error rate (TER).

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.