
arxiv: 1803.10311 · v2 · pith:334B7QVX · submitted 2018-03-27 · cs.LG · cs.DB · cs.HC · stat.ML

How Developers Iterate on Machine Learning Workflows -- A Survey of the Applied Machine Learning Literature

classification cs.LG · cs.DB · cs.HC · stat.ML
keywords learning · machine · development · workflow · applied · benchmark · domains · human-in-the-loop

abstract

Machine learning workflow development is anecdotally regarded to be an iterative process of trial-and-error with humans-in-the-loop. However, we are not aware of quantitative evidence corroborating this popular belief. A quantitative characterization of iteration can serve as a benchmark for machine learning workflow development in practice, and can aid the development of human-in-the-loop machine learning systems. To this end, we conduct a small-scale survey of the applied machine learning literature from five distinct application domains. We collect and distill statistics on the role of iteration within machine learning workflow development, and report preliminary trends and insights from our investigation, as a starting point towards this benchmark. Based on our findings, we finally describe desiderata for effective and versatile human-in-the-loop machine learning systems that can cater to users in diverse domains.

