Giving Text Analytics a Boost

Christoph Hagleitner; Eva Sitaridi; Frederick R. Reiss; H. Peter Hofstee; Huaiyu Zhu; Kubilay Atasu; Laura Chiticariu; Raphael Polig

arxiv: 1806.01103 · v1 · pith:X7RPGGHZnew · submitted 2018-04-25 · 💻 cs.DC · cs.IR

Giving Text Analytics a Boost

Raphael Polig , Kubilay Atasu , Laura Chiticariu , Christoph Hagleitner , H. Peter Hofstee , Frederick R. Reiss , Eva Sitaridi , Huaiyu Zhu This is my paper

classification 💻 cs.DC cs.IR

keywords datasystemtacceleratoranalyticsbandwidthinformationinterfacesystem

0 comments

read the original abstract

The amount of textual data has reached a new scale and continues to grow at an unprecedented rate. IBM's SystemT software is a powerful text analytics system, which offers a query-based interface to reveal the valuable information that lies within these mounds of data. However, traditional server architectures are not capable of analyzing the so-called "Big Data" in an efficient way, despite the high memory bandwidth that is available. We show that by using a streaming hardware accelerator implemented in reconfigurable logic, the throughput rates of the SystemT's information extraction queries can be improved by an order of magnitude. We present how such a system can be deployed by extending SystemT's existing compilation flow and by using a multi-threaded communication interface that can efficiently use the bandwidth of the accelerator.

This paper has not been read by Pith yet.

Giving Text Analytics a Boost

discussion (0)