pith. sign in

arxiv: 1505.01306 · v1 · pith:NIMY7H5Cnew · submitted 2015-05-06 · 💻 cs.IR

Understanding Graph Structure of Wikipedia for Query Expansion

classification 💻 cs.IR
keywords knowledgeamountexpansionquerysourceswikipediabasescategories
0
0 comments X
read the original abstract

Knowledge bases are very good sources for knowledge extraction, the ability to create knowledge from structured and unstructured sources and use it to improve automatic processes as query expansion. However, extracting knowledge from unstructured sources is still an open challenge. In this respect, understanding the structure of knowledge bases can provide significant benefits for the effectiveness of such purpose. In particular, Wikipedia has become a very popular knowledge base in the last years because it is a general encyclopedia that has a large amount of information and thus, covers a large amount of different topics. In this piece of work, we analyze how articles and categories of Wikipedia relate to each other and how these relationships can support a query expansion technique. In particular, we show that the structures in the form of dense cycles with a minimum amount of categories tend to identify the most relevant information.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.