arxiv: cs/0206014 · v1 · submitted 2002-06-09 · 💻 cs.CL

A Method for Open-Vocabulary Speech-Driven Text Retrieval

classification 💻 cs.CL
keywords recognition, retrieval, speech, terms, transcription, words, collection, method

While recent retrieval techniques do not limit the number of index terms, out-of-vocabulary (OOV) words remain a critical problem in speech recognition. Aiming to retrieve information with spoken queries, we bridge the gap in vocabulary size between speech recognition and text retrieval. Given a spoken query, we generate a transcription and detect OOV words through speech recognition. We then map the detected OOV words to terms indexed in a target collection to complete the transcription, and search the collection for documents relevant to the completed transcription. We demonstrate the effectiveness of our method through experiments.
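
The pipeline described in the abstract (transcribe, detect OOV words, map them to index terms, then search) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the paper: the recognizer output is mocked as `(text, is_oov)` pairs, OOV segments are matched to index terms by character-level similarity using the standard library's `difflib.SequenceMatcher`, and retrieval is a simple term-overlap ranking. The function names, the `threshold` value, and the toy collection are all hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Character-level similarity in [0, 1] (a stand-in for acoustic matching)."""
    return SequenceMatcher(None, a, b).ratio()


def complete_transcription(tokens, index_terms, threshold=0.6):
    """Replace each detected OOV token with its closest index term.

    tokens: list of (text, is_oov) pairs, a mocked recognizer output.
    OOV tokens with no index term at or above the threshold are dropped.
    """
    completed = []
    for text, is_oov in tokens:
        if not is_oov:
            completed.append(text)
            continue
        best = max(index_terms, key=lambda t: similarity(text, t), default=None)
        if best is not None and similarity(text, best) >= threshold:
            completed.append(best)
    return completed


def search(query_terms, documents):
    """Rank documents by how many distinct query terms they contain."""
    wanted = set(query_terms)
    scores = {}
    for doc_id, text in documents.items():
        overlap = len(wanted & set(text.lower().split()))
        if overlap:
            scores[doc_id] = overlap
    # Highest overlap first; ties broken by document id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Toy collection and a mocked transcription with one detected OOV segment.
documents = {
    "d1": "tokamak plasma confinement experiments",
    "d2": "speech recognition with large vocabularies",
    "d3": "plasma television displays",
}
index_terms = sorted({w for text in documents.values() for w in text.split()})
tokens = [("plasma", False), ("tokamac", True), ("confinement", False)]

completed = complete_transcription(tokens, index_terms)
ranking = search(completed, documents)
```

In this sketch the index of the target collection acts as the open vocabulary: the recognizer only needs to flag that a segment is out of vocabulary, and the collection supplies the candidate terms used to fill it in.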
