
arXiv: 1007.1361 · v2 · submitted 2010-07-08 · cs.DS · cs.IR


Top-K Color Queries for Document Retrieval

keywords: color, colors, data, optimal, structure, time, array, document

In this paper we describe a new efficient (in fact, optimal) data structure for the top-$K$ color problem. Each element of an array $A$ is assigned a color $c$ with priority $p(c)$. Given a query range $[a,b]$ and a value $K$, the task is to report the $K$ colors with the highest priorities among all colors that occur in $A[a..b]$, sorted in decreasing order of priority. We show that such queries can be answered in $O(K)$ time using an $O(N\log\sigma)$-bit data structure, where $N$ is the number of elements in the array and $\sigma$ is the number of colors. Our data structure is therefore asymptotically optimal with respect to both worst-case query time and space. As an immediate application of our results, we obtain optimal-time solutions for several document retrieval problems. The method of the paper could also be of independent interest.
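
To make the query definition concrete, here is a minimal brute-force sketch in Python. It is not the data structure described in the paper: it rescans the range on every query and so takes time proportional to $b - a + 1$ rather than $O(K)$. The function name `top_k_colors_naive`, the sample array, and the priority table are all hypothetical, chosen only for illustration, and the range is treated as 0-indexed and inclusive.

```python
import heapq

def top_k_colors_naive(A, priority, a, b, K):
    """Naive baseline: report up to K distinct colors occurring in A[a..b]
    (0-indexed, inclusive), ordered from highest to lowest priority.

    This scans the whole range on each query; it only illustrates what a
    query should return, not how to answer it in O(K) time.
    """
    distinct = set(A[a:b + 1])  # colors that occur in the range
    return heapq.nlargest(K, distinct, key=lambda c: priority[c])

# Hypothetical sample data: colors are strings, priorities are distinct ints.
A = ["red", "blue", "red", "green", "blue", "yellow"]
priority = {"red": 3, "blue": 1, "green": 4, "yellow": 2}

# A[1..4] contains blue, red, green; the two highest priorities are green, red.
print(top_k_colors_naive(A, priority, 1, 4, 2))
```

If fewer than $K$ distinct colors occur in the range, this sketch simply returns all of them; how the paper treats that case is not stated in the abstract.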
