Visus: An Interactive System for Automatic Machine Learning Model Building and Curation
Pith reviewed 2026-05-25 02:17 UTC · model grok-4.3
The pith
Visus is an interactive system that supports domain experts in building and curating AutoML-generated machine learning pipelines.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Visus is a system designed to support the model building process and curation of ML data processing pipelines generated by AutoML systems. The work describes the framework used to ground design choices, illustrates a usage scenario enabled by the system, and discusses feedback received in user testing sessions with domain experts.
What carries the argument
Visus, the interactive interface that guides users through inspection, modification, and refinement of AutoML pipelines.
Load-bearing premise
Domain experts lack machine-learning expertise and therefore need dedicated interactive interfaces to curate AutoML outputs effectively.
What would settle it
A controlled comparison in which domain experts using Visus produce no measurable improvement in pipeline quality or usability over experts working without the system.
Figures
read the original abstract
While the demand for machine learning (ML) applications is booming, there is a scarcity of data scientists capable of building such models. Automatic machine learning (AutoML) approaches have been proposed that help with this problem by synthesizing end-to-end ML data processing pipelines. However, these follow a best-effort approach and a user in the loop is necessary to curate and refine the derived pipelines. Since domain experts often have little or no expertise in machine learning, easy-to-use interactive interfaces that guide them throughout the model building process are necessary. In this paper, we present Visus, a system designed to support the model building process and curation of ML data processing pipelines generated by AutoML systems. We describe the framework used to ground our design choices and a usage scenario enabled by Visus. Finally, we discuss the feedback received in user testing sessions with domain experts.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents Visus, an interactive system to support domain experts (with limited ML expertise) in curating and refining end-to-end ML data processing pipelines generated by AutoML systems. It grounds the system design in a stated framework, describes a usage scenario, and reports qualitative feedback from user testing sessions with domain experts.
Significance. If the system and its design choices function as described, the work addresses a practical gap in making AutoML outputs usable by non-experts through interactive curation interfaces. The explicit design framework and reported user sessions provide concrete examples of interface features for pipeline inspection and refinement, which could inform future HCI-for-AutoML efforts. The contribution is primarily descriptive and system-oriented rather than a new algorithmic or theoretical result.
major comments (1)
- [user testing / evaluation] User testing section: The validation rests entirely on qualitative feedback from domain-expert sessions, with no reported participant count, task protocol, success metrics, or comparison to a baseline interface. This leaves the central claim that Visus 'supports the model building process and curation' supported only by narrative description rather than observable outcomes.
minor comments (2)
- [abstract] Abstract: The motivation sentence on domain experts having 'little or no expertise in machine learning' is repeated from the introduction without additional grounding; a brief citation to prior studies on AutoML user barriers would strengthen it.
- [usage scenario] The usage scenario is presented narratively; adding a short table or figure summarizing the sequence of user actions and system responses would improve clarity and reproducibility of the demonstrated workflow.
Simulated Author's Rebuttal
We thank the referee for the detailed review and constructive suggestion regarding the user testing section. We agree that additional specifics will strengthen the manuscript and will revise accordingly.
read point-by-point responses
-
Referee: [user testing / evaluation] User testing section: The validation rests entirely on qualitative feedback from domain-expert sessions, with no reported participant count, task protocol, success metrics, or comparison to a baseline interface. This leaves the central claim that Visus 'supports the model building process and curation' supported only by narrative description rather than observable outcomes.
Authors: We acknowledge that the current user testing description is primarily narrative. In the revised version we will explicitly report the number of domain-expert participants, outline the session protocol (including tasks performed and questions asked), and provide more concrete examples of the feedback received and how it informed design decisions. Because the study was designed as a qualitative validation of the proposed design framework rather than a controlled experiment, we did not collect quantitative success metrics or run a baseline comparison; we will make this scope explicit so readers understand the nature of the evidence. revision: partial
Circularity Check
No significant circularity: system description with independent user validation
full rationale
The paper presents a descriptive system (Visus) for curating AutoML pipelines, grounded in an explicitly stated design framework, illustrated via a usage scenario, and evaluated through reported user testing sessions with domain experts. No equations, fitted parameters, predictions, or derivations exist that could reduce to inputs by construction. No self-citation chains are invoked as load-bearing uniqueness theorems or ansatzes. The central claims are self-contained in the paper's own construction and external user feedback, qualifying for the default non-circularity outcome.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Domain experts require guided interactive interfaces because they lack ML expertise
Reference graph
Works this paper leans on
-
[1]
A User-based Visual Analytics Workflow for Exploratory Model Analysis
Dylan Cashman, Shah Rukh Humayoun, Florian Heimerl, Kendall Park, Subhajit Das, John Thompson, Bahador Saket, Abigail Mosca, John T. Stasko, Alex Endert, Michael Gleicher, and Remco Chang. 2018. Visual Analytics for Automated Model Discovery. CoRR abs/1809.10782 (2018). arXiv:1809.10782 http://arxiv.org/abs/ 1809.10782
work page internal anchor Pith review Pith/arXiv arXiv 2018
- [2]
-
[3]
Louis Columbus. [n. d.]. IBM Predicts Demand For Data Scientists Will Soar 28% By 2020. https://www.forbes.com/sites/louiscolumbus/2017/05/13/ ibm-predicts-demand-for-data-scientists-will-soar-28-by-2020/
work page 2020
-
[4]
Iddo Drori, Yamuna Krishnamurthy, Remi Rampin, Raoni Lourenço, Jorge Ono, Kyunghyun Cho, Claudio Silva, and Juliana Freire. 2018. AlphaD3M: Machine Learning Pipeline Synthesis. In Proceedings of Machine Learning Research, ICML 2018 AutoML Workshop
work page 2018
-
[5]
Matthias Feurer, Aaron Klein, Katharina Eggensperger, Jost Springenberg, Manuel Blum, and Frank Hutter. 2015. Efficient and robust automated machine learning. In Advances in neural information processing systems . 2962–2970
work page 2015
-
[6]
Yolanda Gil, James Honaker, Shikhar Gupta, Yibo Ma, Vito D’Orazio, Daniel Garijo, Shruti Gadewar, Qifan Yang, and Neda Jahanshad. 2019. Towards Human- guided Machine Learning. In Proceedings of the 24th International Conference on Intelligent User Interfaces (IUI ’19). ACM, New York, NY, USA, 614–624. https: //doi.org/10.1145/3301275.3302324
-
[7]
Yolanda Gil, Ke-Thia Yao, Varun Ratnakar, Daniel Garijo, Greg Ver Steeg, Rob Brekelmans, Mayank Kejriwal, Fanghao Luo, and I-De Huang. 2018. P4ML: A Phased Performance-Based Pipeline Planner for Automated Machine Learning. In Proceedings of Machine Learning Research, ICML 2018 AutoML Workshop
work page 2018
-
[8]
Moritz Hardt, Eric Price, , and Nati Srebro. 2016. Equality of Opportu- nity in Supervised Learning. In Advances in Neural Information Processing Systems 29 , D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Gar- nett (Eds.). Curran Associates, Inc., 3315–3323. http://papers.nips.cc/paper/ 6374-equality-of-opportunity-in-supervised-learning.pdf
work page 2016
- [9]
-
[10]
James Honaker and Vito D’Orazio. 2014. Statistical Modeling by Gesture: A graph- ical, Browser-based Statistical Interface for Data Repositories.. In HT (Doctoral Consortium/Late-breaking Results/Workshops)
work page 2014
-
[11]
Frank Hutter, Lars Kotthoff, and Joaquin Vanschoren. 2019. Automatic machine learning: methods, systems, challenges. Challenges in Machine Learning (2019)
work page 2019
-
[12]
J J Thomas, K A Cook, Institute Electrical, and Electronics Engineers. 2005. Illuminating the path: The research and development agenda for visual analytics
work page 2005
-
[13]
Fei-Fei Li and Jia Li. [n. d.]. Cloud AutoML: Making AI accessi- ble to every business. https://www.blog.google/products/google-cloud/ cloud-automl-making-ai-accessible-every-business/
-
[14]
Microsoft. [n. d.]. Microsoft Azure Machine Learning Studio. https://studio. azureml.net/
-
[15]
Y. Ming, H. Qu, and E. Bertini. 2019. RuleMatrix: Visualizing and Understanding Classifiers with Rules. IEEE Transactions on Visualization and Computer Graphics 25, 1 (Jan 2019), 342–352. https://doi.org/10.1109/TVCG.2018.2864812
-
[16]
nyc-vision-zero [n. d.]. NYC Vision Zero. https://www1.nyc.gov/site/visionzero/ index.page
-
[17]
Randal S. Olson, Nathan Bartley, Ryan J. Urbanowicz, and Jason H. Moore. 2016. Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science. In Proceedings of the Genetic and Evolutionary Computation Conference 2016 (GECCO ’16). ACM, New York, NY, USA, 485–492. https://doi.org/10.1145/ 2908812.2908918
-
[18]
Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P. Gummadi. 2017. Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification Without Disparate Mistreatment. In Proceedings of the 26th International Conference on World Wide Web (WWW ’17). International World Wide Web Conferences Steering Committee, Republic...
-
[19]
Indre Zliobaite. 2015. A survey on measuring indirect discrimination in machine learning. CoRR abs/1511.00148 (2015). arXiv:1511.00148 http://arxiv.org/abs/ 1511.00148
work page internal anchor Pith review Pith/arXiv arXiv 2015
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.