Feature Selection For High-Dimensional Clustering

arxiv: 1406.2240 · v1 · pith:AKPQYZQE · submitted 2014-06-09 · math.ST · stat.ML · stat.TH

Larry Wasserman, Martin Azizyan, Aarti Singh

keywords: clustering, features, bounds, error, high-dimensional, method, mode, addition

We present a nonparametric method for selecting informative features in high-dimensional clustering problems. We start with a screening step that uses a test for multimodality. Then we apply kernel density estimation and mode clustering to the selected features. The output of the method consists of a list of relevant features and the cluster assignments. We provide explicit bounds on the error rate of the resulting clustering. In addition, we provide the first error bounds on mode-based clustering.
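
The two-stage pipeline described in the abstract (screen features for multimodality, then run mode clustering on a kernel density estimate of the survivors) can be sketched as below. This is a rough illustration under stated assumptions, not the authors' procedure: the abstract does not say which multimodality test is used, so the sketch substitutes a simple heuristic that counts local maxima of a one-dimensional Gaussian KDE with a normal-reference bandwidth, and it uses mean shift as one common way to carry out mode clustering. The function names (`screen_features`, `mode_cluster`, and so on), the 10% peak threshold, and the bandwidth choices are all hypothetical.

```python
import numpy as np


def kde_1d(x, grid, h):
    """Gaussian kernel density estimate of the sample x, evaluated on grid."""
    z = (grid[:, None] - x[None, :]) / h
    return np.exp(-0.5 * z ** 2).sum(axis=1) / (len(x) * h * np.sqrt(2 * np.pi))


def count_modes(x, h, grid_size=200, min_height=0.1):
    """Count local maxima of the 1-D KDE (a heuristic, not a formal test)."""
    grid = np.linspace(x.min() - 3 * h, x.max() + 3 * h, grid_size)
    f = kde_1d(x, grid, h)
    mid = f[1:-1]
    # Strict rise on the left, non-strict on the right, so a flat two-point
    # top is counted once.
    peaks = (mid > f[:-2]) & (mid >= f[2:])
    # Ignore bumps below an (assumed) fraction of the highest density value.
    peaks &= mid > min_height * f.max()
    return int(peaks.sum())


def screen_features(X, h_factor=1.0):
    """Keep the features whose marginal KDE shows at least two modes."""
    n = X.shape[0]
    selected = []
    for j in range(X.shape[1]):
        x = X[:, j]
        # Normal-reference (rule-of-thumb) bandwidth; an assumption here.
        h = h_factor * 1.06 * x.std() * n ** (-0.2)
        if count_modes(x, h) >= 2:
            selected.append(j)
    return selected


def mean_shift(X, h, n_iter=200, tol=1e-6):
    """Move every point uphill on the Gaussian KDE of X until it stops."""
    Z = X.astype(float).copy()
    for _ in range(n_iter):
        d2 = ((Z[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
        W = np.exp(-0.5 * d2 / h ** 2)
        Z_new = (W @ X) / W.sum(axis=1, keepdims=True)
        shift = np.abs(Z_new - Z).max()
        Z = Z_new
        if shift < tol:
            break
    return Z


def mode_cluster(X, h):
    """Assign each point the index of the mode its mean-shift path reaches."""
    Z = mean_shift(X, h)
    modes = []
    labels = np.empty(len(Z), dtype=int)
    for i, z in enumerate(Z):
        for k, m in enumerate(modes):
            # End points closer than h are treated as the same mode.
            if np.linalg.norm(z - m) < h:
                labels[i] = k
                break
        else:
            modes.append(z)
            labels[i] = len(modes) - 1
    return labels
```

A typical call would be `selected = screen_features(X)` followed by `labels = mode_cluster(X[:, selected], h=0.5)`, where `X` is an `n × d` NumPy array and the clustering bandwidth `h` is chosen by the user. Screening each feature separately keeps the first stage cheap when `d` is large, at the cost of missing features that are informative only jointly.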