A strong baseline for question relevancy ranking

Ana V. Gonz\'alez-Gardu\~no; Anders S{\o}gaard; Isabelle Augenstein

arxiv: 1808.08836 · v1 · pith:X3YTD2UYnew · submitted 2018-08-27 · 💻 cs.CL · cs.IR· cs.LG

A strong baseline for question relevancy ranking

Ana V. Gonz\'alez-Gardu\~no , Isabelle Augenstein , Anders S{\o}gaard This is my paper

classification 💻 cs.CL cs.IRcs.LG

keywords questionbaselinerankingrelevancytaskbestsharedstrong

0 comments

read the original abstract

The best systems at the SemEval-16 and SemEval-17 community question answering shared tasks -- a task that amounts to question relevancy ranking -- involve complex pipelines and manual feature engineering. Despite this, many of these still fail at beating the IR baseline, i.e., the rankings provided by Google's search engine. We present a strong baseline for question relevancy ranking by training a simple multi-task feed forward network on a bag of 14 distance measures for the input question pair. This baseline model, which is fast to train and uses only language-independent features, outperforms the best shared task systems on the task of retrieving relevant previously asked questions.

This paper has not been read by Pith yet.

A strong baseline for question relevancy ranking

discussion (0)