pith. sign in

arxiv: 1111.6410 · v2 · pith:OOG6ZDXUnew · submitted 2011-11-28 · 🧮 math.ST · stat.ML· stat.TH

Adaptive Semisupervised Inference

classification 🧮 math.ST stat.MLstat.TH
keywords densitylearnersemisupervisedassumptionsfunctionregressiondistancemetric
0
0 comments X p. Extension
pith:OOG6ZDXU Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{OOG6ZDXU}

Prints a linked pith:OOG6ZDXU badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Semisupervised methods inevitably invoke some assumption that links the marginal distribution of the features to the regression function of the label. Most commonly, the cluster or manifold assumptions are used which imply that the regression function is smooth over high-density clusters or manifolds supporting the data. A generalization of these assumptions is that the regression function is smooth with respect to some density sensitive distance. This motivates the use of a density based metric for semisupervised learning. We analyze this setting and make the following contributions - (a) we propose a semi-supervised learner that uses a density-sensitive kernel and show that it provides better performance than any supervised learner if the density support set has a small condition number and (b) we show that it is possible to adapt to the degree of semi-supervisedness using data-dependent choice of a parameter that controls sensitivity of the distance metric to the density. This ensures that the semisupervised learner never performs worse than a supervised learner even if the assumptions fail to hold.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.