pith. sign in

arxiv: cs/9811032 · v1 · pith:CNECDJJXnew · submitted 1998-11-24 · 💻 cs.NE · cs.HC

Text-To-Speech Conversion with Neural Networks: A Recurrent TDNN Approach

classification 💻 cs.NE cs.HC
keywords neuralnetworkrecurrentspeechsystemtext-to-speechapproacharchitecture
0
0 comments X
read the original abstract

This paper describes the design of a neural network that performs the phonetic-to-acoustic mapping in a speech synthesis system. The use of a time-domain neural network architecture limits discontinuities that occur at phone boundaries. Recurrent data input also helps smooth the output parameter tracks. Independent testing has demonstrated that the voice quality produced by this system compares favorably with speech from existing commercial text-to-speech systems.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.