The Algorithmic Automation Problem: Prediction, Triage, and Human Effort

Maithra Raghu , Katy Blumer , Greg Corrado , Jon Kleinberg , Ziad Obermeyer , Sendhil Mullainathan

Authors on Pith no claims yet

classification 💻 cs.CV cs.AIcs.LG

keywords automationhumanalgorithmictaskperformancepredictionproblemdecision

read the original abstract

In a wide array of areas, algorithms are matching and surpassing the performance of human experts, leading to consideration of the roles of human judgment and algorithmic prediction in these domains. The discussion around these developments, however, has implicitly equated the specific task of prediction with the general task of automation. We argue here that automation is broader than just a comparison of human versus algorithmic performance on a task; it also involves the decision of which instances of the task to give to the algorithm in the first place. We develop a general framework that poses this latter decision as an optimization problem, and we show how basic heuristics for this optimization problem can lead to performance gains even on heavily-studied applications of AI in medicine. Our framework also serves to highlight how effective automation depends crucially on estimating both algorithmic and human error on an instance-by-instance basis, and our results show how improvements in these error estimation problems can yield significant gains for automation as well.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma Screening and Diagnosis
cs.AI 2026-05 unverdicted novelty 7.0

MPD²-Router is a dual-head deferral router that uses mask-aware Gumbel-sigmoid gating, asymmetric cost-sensitive training, and rank-majorization regularization to lower clinical cost and raise MCC versus AI-only basel...
Flexible Routing via Uncertainty Decomposition
cs.LG 2026-05 unverdicted novelty 7.0

A router that decomposes uncertainty to flexibly route queries between cheap models and oracles while providing regret bounds and supporting abstention in classification tasks with multiple annotations.
Calibrating conditional risk
cs.LG 2026-04 unverdicted novelty 6.0

Conditional risk calibration reduces to standard regression and is distinct from probability calibration.
Hybrid Decision Making via Conformal VLM-generated Guidance
cs.AI 2026-04 unverdicted novelty 6.0

ConfGuide uses conformal risk control to generate targeted guidance sets in a learning-to-guide hybrid decision framework and demonstrates it on multi-label medical diagnosis.
L2D-Clinical: Learning to Defer for Adaptive Model Selection in Clinical Text Classification
cs.CL 2026-04 unverdicted novelty 6.0

L2D-Clinical improves F1 by 1.7 points on ADE detection and 9.3 points on MIMIC treatment classification by deferring 7-17% of cases from BERT to LLM.
Optimized Deferral for Imbalanced Settings
cs.LG 2026-04 unverdicted novelty 5.0

MILD reformulates two-stage learning to defer as cost-sensitive learning over the input-expert domain and derives new margin-based losses with guarantees, yielding better performance than baselines on image classifica...