pith. sign in

arxiv: 1204.1685 · v2 · pith:RU72BQTWnew · submitted 2012-04-07 · 🧮 math.ST · cs.LG· stat.ML· stat.TH

Density-sensitive semisupervised inference

classification 🧮 math.ST cs.LGstat.MLstat.TH
keywords methodssemisuperviseddataalphadistributionldotsad-hocadapt
0
0 comments X p. Extension
pith:RU72BQTW Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{RU72BQTW}

Prints a linked pith:RU72BQTW badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Semisupervised methods are techniques for using labeled data $(X_1,Y_1),\ldots,(X_n,Y_n)$ together with unlabeled data $X_{n+1},\ldots,X_N$ to make predictions. These methods invoke some assumptions that link the marginal distribution $P_X$ of X to the regression function f(x). For example, it is common to assume that f is very smooth over high density regions of $P_X$. Many of the methods are ad-hoc and have been shown to work in specific examples but are lacking a theoretical foundation. We provide a minimax framework for analyzing semisupervised methods. In particular, we study methods based on metrics that are sensitive to the distribution $P_X$. Our model includes a parameter $\alpha$ that controls the strength of the semisupervised assumption. We then use the data to adapt to $\alpha$.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.