arxiv: 1606.07188 · v1 · pith:QJDZVENBnew · submitted 2016-06-23 · 💻 cs.IR

Selective Term Proximity Scoring Via BP-ANN

classification 💻 cs.IR
keywords: document, ranking, beneficial, model, proximity, queries, query, term

Abstract

When two terms occur together in a document, the probability of a close relationship between them and the document itself is greater if they occur in nearby positions. However, ranking functions that include term proximity (TP) require larger indexes than traditional document-level indexing, which slows down query processing. Previous studies also show that this technique is not effective for all types of queries. Here we propose a document ranking model that decides, based on a set of query features, for which queries a proximity-based ranking would be beneficial. We use a machine learning approach to determine whether using TP will be beneficial. Experiments show that the proposed model returns improved rankings while also reducing the overhead incurred by using TP statistics.
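
The selective idea in the abstract can be illustrated with a minimal sketch: a per-query gate that routes each query to either a proximity-aware ranker or a plain document-level ranker. Everything named here is an assumption made for illustration. The abstract does not list the query features, so `query_features` uses two hypothetical ones (term count and average term length), and a single logistic unit with hand-set weights stands in for the trained network that the title refers to as BP-ANN. The two rankers are passed in as callables and are not implemented.

```python
import math
from typing import Callable, List, Sequence


def query_features(query_terms: Sequence[str]) -> List[float]:
    """Hypothetical features; the paper's actual feature set is not given here."""
    n = len(query_terms)
    avg_len = sum(len(t) for t in query_terms) / n if n else 0.0
    return [float(n), avg_len]


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def use_proximity(features: Sequence[float], weights: Sequence[float],
                  bias: float, threshold: float = 0.5) -> bool:
    """Stand-in for the trained classifier: one logistic unit, not a full network."""
    z = sum(w * f for w, f in zip(weights, features)) + bias
    return sigmoid(z) >= threshold


def selective_rank(query_terms: Sequence[str],
                   baseline_ranker: Callable[[Sequence[str]], List[str]],
                   proximity_ranker: Callable[[Sequence[str]], List[str]],
                   weights: Sequence[float], bias: float) -> List[str]:
    """Use the proximity ranker (and its larger index) only when the gate predicts a benefit."""
    if use_proximity(query_features(query_terms), weights, bias):
        return proximity_ranker(query_terms)
    return baseline_ranker(query_terms)
```

With the illustrative weights `[1.0, 0.0]` and bias `-1.5`, the gate rejects single-term queries, for which proximity between query terms is undefined, and accepts queries of two or more terms. In a real system the weights would come from training on queries labeled by whether TP improved their ranking.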
