Serverless Data Analytics with Flint

· 2018 · cs.DC · arXiv 1803.06354

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Serverless architectures organized around loosely-coupled function invocations represent an emerging design for many applications. Recent work mostly focuses on user-facing products and event-driven processing pipelines. In this paper, we explore a completely different part of the application space and examine the feasibility of analytical processing on big data using a serverless architecture. We present Flint, a prototype Spark execution engine that takes advantage of AWS Lambda to provide a pure pay-as-you-go cost model. With Flint, a developer uses PySpark exactly as before, but without needing an actual Spark cluster. We describe the design, implementation, and performance of Flint, along with the challenges associated with serverless analytics.

representative citing papers

ServerMix: Tradeoffs and Challenges of Serverless Data Analytics

cs.DC · 2019-07-26 · unverdicted · novelty 4.0

Serverless computing for data analytics involves trade-offs in disaggregation, isolation, and scheduling that push most workloads toward hybrid Servermix architectures.

citing papers explorer

Showing 1 of 1 citing paper.

ServerMix: Tradeoffs and Challenges of Serverless Data Analytics cs.DC · 2019-07-26 · unverdicted · none · ref 24 · internal anchor
Serverless computing for data analytics involves trade-offs in disaggregation, isolation, and scheduling that push most workloads toward hybrid Servermix architectures.

Serverless Data Analytics with Flint

fields

years

verdicts

representative citing papers

citing papers explorer