Probability Based Clustering for Document and User Properties

Christa Womser-Hacker; Thomas Mandl

arxiv: 1102.3865 · v1 · pith:AFT47ZXUnew · submitted 2011-02-18 · 💻 cs.HC · cs.IR

Probability Based Clustering for Document and User Properties

Thomas Mandl , Christa Womser-Hacker This is my paper

classification 💻 cs.HC cs.IR

keywords documentretrievaluserfeaturesinformationmodelsystemsapplied

0 comments

read the original abstract

Information Retrieval systems can be improved by exploiting context information such as user and document features. This article presents a model based on overlapping probabilistic or fuzzy clusters for such features. The model is applied within a fusion method which linearly combines several retrieval systems. The fusion is based on weights for the different retrieval systems which are learned by exploiting relevance feedback information. This calculation can be improved by maintaining a model for each document and user cluster. That way, the optimal retrieval system for each document or user type can be identified and applied. The extension presented in this article allows overlapping, probabilistic clusters of features to further refine the process.

This paper has not been read by Pith yet.

Probability Based Clustering for Document and User Properties

discussion (0)