pith. machine review for the scientific record. sign in

arxiv: 1511.06493 · v1 · submitted 2015-11-20 · 💻 cs.DC

Recognition: unknown

Embarrassingly Parallel Time Series Analysis for Large Scale Weak Memory Systems

Authors on Pith no claims yet
classification 💻 cs.DC
keywords datamemoryanalysistimeweakcomputationspatternseries
0
0 comments X
read the original abstract

Second order stationary models in time series analysis are based on the analysis of essential statistics whose computations follow a common pattern. In particular, with a map-reduce nomenclature, most of these operations can be modeled as mapping a kernel that only depends on short windows of consecutive data and reducing the results produced by each computation. This computational pattern stems from the ergodicity of the model under consideration and is often referred to as weak or short memory when it comes to data indexed with respect to time. In the following we will show how studying weak memory systems can be done in a scalable manner thanks to a framework relying on specifically designed overlapping distributed data structures that enable fragmentation and replication of the data across many machines as well as parallelism in computations. This scheme has been implemented for Apache Spark but is certainly not system specific. Indeed we prove it is also adapted to leveraging high bandwidth fragmented memory blocks on GPUs.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.