pith. sign in

arxiv: 1102.3865 · v1 · pith:AFT47ZXUnew · submitted 2011-02-18 · 💻 cs.HC · cs.IR

Probability Based Clustering for Document and User Properties

classification 💻 cs.HC cs.IR
keywords documentretrievaluserfeaturesinformationmodelsystemsapplied
0
0 comments X
read the original abstract

Information Retrieval systems can be improved by exploiting context information such as user and document features. This article presents a model based on overlapping probabilistic or fuzzy clusters for such features. The model is applied within a fusion method which linearly combines several retrieval systems. The fusion is based on weights for the different retrieval systems which are learned by exploiting relevance feedback information. This calculation can be improved by maintaining a model for each document and user cluster. That way, the optimal retrieval system for each document or user type can be identified and applied. The extension presented in this article allows overlapping, probabilistic clusters of features to further refine the process.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.