arxiv: 1707.04538 · v1 · pith:THXJUA6Xnew · submitted 2017-07-14 · 💻 cs.CL

Cross-genre Document Retrieval: Matching between Conversational and Formal Writings

classification 💻 cs.CL
keywords document, retrieval, conversational, cross-genre, episode, formal, reranking, structure

This paper addresses a cross-genre document retrieval task, where the queries are in formal writing and the target documents are in conversational writing. In this task, a query is a sentence extracted from either a summary or a plot of an episode in a TV show, and the target document consists of transcripts from the corresponding episode. To establish a strong baseline, we employ a current state-of-the-art search engine to perform document retrieval on the dataset collected for this work. We then introduce a structure reranking approach that improves the initial ranking by using syntactic and semantic structures generated by NLP tools. Our evaluation shows an improvement of more than 4% when the structure reranking is applied, which is very promising.
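The two-stage setup described in the abstract (an initial ranking from a search engine, followed by a reranking step) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the paper's method: `first_pass_scores` is a toy TF-IDF overlap standing in for the search engine, `structure_score` is a hypothetical callable standing in for whatever syntactic and semantic structure matching the authors use, and the linear combination with `weight` is one plausible way to merge the two scores.

```python
import math
from collections import Counter

PUNCT = ".,!?;:\"'()"

def tokenize(text):
    """Lowercase whitespace tokens with surrounding punctuation stripped."""
    tokens = (t.strip(PUNCT).lower() for t in text.split())
    return [t for t in tokens if t]

def first_pass_scores(query, docs):
    """Toy TF-IDF overlap score; a stand-in for a real search engine."""
    n = len(docs)
    doc_counts = {d: Counter(tokenize(text)) for d, text in docs.items()}
    df = Counter()
    for counts in doc_counts.values():
        df.update(counts.keys())
    query_terms = set(tokenize(query))
    scores = {}
    for d, counts in doc_counts.items():
        scores[d] = sum(
            counts[t] * math.log(1 + n / df[t])
            for t in query_terms if counts[t]
        )
    return scores

def rerank(query, docs, structure_score, top_k=10, weight=0.5):
    """Rerank the top_k first-pass candidates.

    structure_score(query, doc_text) is hypothetical: it represents any
    structure-based similarity. The linear combination is an assumption.
    """
    base = first_pass_scores(query, docs)
    candidates = sorted(base, key=base.get, reverse=True)[:top_k]
    combined = {
        d: base[d] + weight * structure_score(query, docs[d])
        for d in candidates
    }
    return sorted(candidates, key=combined.get, reverse=True)

# Conversational "transcripts" as documents, a formal sentence as the query.
docs = {
    "e1": "A: did you see the dog? B: the dog ran away",
    "e2": "A: I lost my keys. B: check the car",
}
query = "The dog runs away from home."
ranking = rerank(query, docs, structure_score=lambda q, d: 0.0)
```

With a structure score of zero the result is just the first-pass ranking, so the sketch makes it easy to see how much the reranking signal alone changes the order.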
