pith. sign in

arxiv: 1801.00371 · v2 · pith:DNKU25JNnew · submitted 2017-12-31 · 📊 stat.OT

Data Science vs. Statistics: Two Cultures?

classification 📊 stat.OT
keywords datastatisticsanalysissciencebroaderbusinesscommunicationlearning
0
0 comments X
read the original abstract

Data science is the business of learning from data, which is traditionally the business of statistics. Data science, however, is often understood as a broader, task-driven and computationally-oriented version of statistics. Both the term data science and the broader idea it conveys have origins in statistics and are a reaction to a narrower view of data analysis. Expanding upon the views of a number of statisticians, this paper encourages a big-tent view of data analysis. We examine how evolving approaches to modern data analysis relate to the existing discipline of statistics (e.g. exploratory analysis, machine learning, reproducibility, computation, communication and the role of theory). Finally, we discuss what these trends mean for the future of statistics by highlighting promising directions for communication, education and research.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.