Is Supervised Learning Really That Different from Unsupervised?

Oskar Allerbo; Thomas B. Sch\"on

arxiv: 2505.11006 · v6 · pith:MVZYBKXAnew · submitted 2025-05-16 · 📊 stat.ML · cs.LG

Is Supervised Learning Really That Different from Unsupervised?

Oskar Allerbo , Thomas B. Sch\"on This is my paper

classification 📊 stat.ML cs.LG

keywords learningmodelsupervisedunsupervisedwithoutaccessasymptoticdemonstrate

0 comments

read the original abstract

We demonstrate how supervised learning can be decomposed into a two-stage procedure, where (1) all model parameters are selected in an unsupervised manner, and (2) the outputs y are added to the model, without changing the parameter values. This is achieved by a new model selection criterion that - in contrast to cross-validation - can be used also without access to y. For linear ridge regression, we bound the asymptotic out-of-sample risk of our method in terms of the optimal asymptotic risk. We also demonstrate that versions of linear and kernel ridge regression, smoothing splines, k-nearest neighbors, random forests, and neural networks, trained without access to y, perform similarly to their standard y-based counterparts. Hence, our results suggest that the difference between supervised and unsupervised learning is less fundamental than it may appear.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Rigorous, Tractable Measure of Model Complexity
stat.ML 2026-05 unverdicted novelty 5.0

A gradient-similarity complexity measure that generalizes polynomial degree, kernel length scale, neighbor count, tree splits, and forest size while offering insights into double descent.