Feature Selection For High-Dimensional Clustering
classification
🧮 math.ST
stat.MLstat.TH
keywords
clusteringfeaturesboundserrorhigh-dimensionalmethodmodeaddition
pith:AKPQYZQE Add to your LaTeX paper
What is a Pith Number?\usepackage{pith}
\pithnumber{AKPQYZQE}
Prints a linked pith:AKPQYZQE badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more
read the original abstract
We present a nonparametric method for selecting informative features in high-dimensional clustering problems. We start with a screening step that uses a test for multimodality. Then we apply kernel density estimation and mode clustering to the selected features. The output of the method consists of a list of relevant features, and cluster assignments. We provide explicit bounds on the error rate of the resulting clustering. In addition, we provide the first error bounds on mode based clustering.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.