arxiv: 1803.10311 · v2 · pith:334B7QVX · submitted 2018-03-27 · cs.LG · cs.DB · cs.HC · stat.ML

How Developers Iterate on Machine Learning Workflows -- A Survey of the Applied Machine Learning Literature

Doris Xin, Litian Ma, Shuchen Song, Aditya Parameswaran

classification cs.LG · cs.DB · cs.HC · stat.ML

keywords learning · machine · development · workflow · applied · benchmark · domains · human-in-the-loop


abstract

Machine learning workflow development is anecdotally regarded as an iterative process of trial and error with humans in the loop. However, we are not aware of quantitative evidence corroborating this popular belief. A quantitative characterization of iteration can serve as a benchmark for machine learning workflow development in practice, and can aid the development of human-in-the-loop machine learning systems. To this end, we conduct a small-scale survey of the applied machine learning literature from five distinct application domains. We collect and distill statistics on the role of iteration within machine learning workflow development, and report preliminary trends and insights from our investigation as a starting point towards this benchmark. Finally, based on our findings, we describe desiderata for effective and versatile human-in-the-loop machine learning systems that can cater to users in diverse domains.
