Progressive Analytics: A Computation Paradigm for Exploratory Data Analysis
read the original abstract
Exploring data requires a fast feedback loop from the analyst to the system, with a latency below about 10 seconds because of human cognitive limitations. When data becomes large or analysis becomes complex, sequential computations can no longer be completed in a few seconds and data exploration is severely hampered. This article describes a novel computation paradigm called Progressive Computation for Data Analysis or more concisely Progressive Analytics, that brings at the programming language level a low-latency guarantee by performing computations in a progressive fashion. Moving this progressive computation at the language level relieves the programmer of exploratory data analysis systems from implementing the whole analytics pipeline in a progressive way from scratch, streamlining the implementation of scalable exploratory data analysis systems. This article describes the new paradigm through a prototype implementation called ProgressiVis, and explains the requirements it implies through examples.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Hillview: A trillion-cell spreadsheet for big data
Hillview implements a distributed spreadsheet using vizketches to support interactive visualization of trillion-cell datasets on clusters of eight servers.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.