pith. sign in

arxiv: 1405.6676 · v2 · pith:ZZIH2VE7new · submitted 2014-05-26 · 📊 stat.OT · cs.LG· math.ST· stat.TH

Statistique et Big Data Analytics; Volum\'etrie, L'Attaque des Clones

classification 📊 stat.OT cs.LGmath.STstat.TH
keywords availabledatalearningskillsstatisticianacquireacquiredadapted
0
0 comments X
read the original abstract

This article assumes acquired the skills and expertise of a statistician in unsupervised (NMF, k-means, SVD) and supervised learning (regression, CART, random forest). What skills and knowledge do a statistician must acquire to reach the "Volume" scale of big data? After a quick overview of the different strategies available and especially of those imposed by Hadoop, the algorithms of some available learning methods are outlined in order to understand how they are adapted to the strong stresses of the Map-Reduce functionalities

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.