pith. sign in

arxiv: 1803.08991 · v2 · pith:P7I6MNV6new · submitted 2018-03-23 · 💻 cs.CL

Leveraging translations for speech transcription in low-resource settings

classification 💻 cs.CL
keywords languagelow-resourcetranscriptiontranslationscollectmodelmulti-sourcesettings
0
0 comments X
read the original abstract

Recently proposed data collection frameworks for endangered language documentation aim not only to collect speech in the language of interest, but also to collect translations into a high-resource language that will render the collected resource interpretable. We focus on this scenario and explore whether we can improve transcription quality under these extremely low-resource settings with the assistance of text translations. We present a neural multi-source model and evaluate several variations of it on three low-resource datasets. We find that our multi-source model with shared attention outperforms the baselines, reducing transcription character error rate by up to 12.3%.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.