Low-Resource Contextual Topic Identification on Speech

Chunxi Liu; Craig Harman; Jan Trmal; Matthew Wiesner; Najim Dehak; Sanjeev Khudanpur; Shinji Watanabe

arxiv: 1807.06204 · v2 · pith:3ARCM7IXnew · submitted 2018-07-17 · 💻 cs.CL

Low-Resource Contextual Topic Identification on Speech

Chunxi Liu , Matthew Wiesner , Shinji Watanabe , Craig Harman , Jan Trmal , Najim Dehak , Sanjeev Khudanpur This is my paper

classification 💻 cs.CL

keywords topiccontextualsegmentsattention-basedaudiofirstidentificationindependently

0 comments

read the original abstract

In topic identification (topic ID) on real-world unstructured audio, an audio instance of variable topic shifts is first broken into sequential segments, and each segment is independently classified. We first present a general purpose method for topic ID on spoken segments in low-resource languages, using a cascade of universal acoustic modeling, translation lexicons to English, and English-language topic classification. Next, instead of classifying each segment independently, we demonstrate that exploring the contextual dependencies across sequential segments can provide large improvements. In particular, we propose an attention-based contextual model which is able to leverage the contexts in a selective manner. We test both our contextual and non-contextual models on four LORELEI languages, and on all but one our attention-based contextual model significantly outperforms the context-independent models.

This paper has not been read by Pith yet.

Low-Resource Contextual Topic Identification on Speech

discussion (0)