pith. sign in

arxiv: 1302.5302 · v1 · pith:TXH5BTQ4new · submitted 2013-02-21 · 💻 cs.IR · cs.DB

Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search

classification 💻 cs.IR cs.DB
keywords memoryallocationpostingsdynamicleadspolicyreal-timesearch
0
0 comments X
read the original abstract

We explore a real-time Twitter search application where tweets are arriving at a rate of several thousands per second. Real-time search demands that they be indexed and searchable immediately, which leads to a number of implementation challenges. In this paper, we focus on one aspect: dynamic postings allocation policies for index structures that are completely held in main memory. The core issue can be characterized as a "Goldilocks Problem". Because memory remains today a scare resource, an allocation policy that is too aggressive leads to inefficient utilization, while a policy that is too conservative is slow and leads to fragmented postings lists. We present a dynamic postings allocation policy that allocates memory in increasingly-larger "slices" from a small number of large, fixed pools of memory. Through analytical models and experiments, we explore different settings that balance time (query evaluation speed) and space (memory utilization).

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.