Reliable Clustering of Bernoulli Mixture Models

Abolfazl Motahari; Amir Najafi; Hamid R. Rabiee

arxiv: 1710.02101 · v3 · pith:VGFXN4DAnew · submitted 2017-10-05 · 💻 cs.LG · cs.IT· math.IT· stat.ML

Reliable Clustering of Bernoulli Mixture Models

Amir Najafi , Abolfazl Motahari , Hamid R. Rabiee This is my paper

classification 💻 cs.LG cs.ITmath.ITstat.ML

keywords clusteringmixturebernoullibmmsclusterabilitycomplexitymodelsample

0 comments

read the original abstract

A Bernoulli Mixture Model (BMM) is a finite mixture of random binary vectors with independent dimensions. The problem of clustering BMM data arises in a variety of real-world applications, ranging from population genetics to activity analysis in social networks. In this paper, we analyze the clusterability of BMMs from a theoretical perspective, when the number of clusters is unknown. In particular, we stipulate a set of conditions on the sample complexity and dimension of the model in order to guarantee the Probably Approximately Correct (PAC)-clusterability of a dataset. To the best of our knowledge, these findings are the first non-asymptotic bounds on the sample complexity of learning or clustering BMMs.

This paper has not been read by Pith yet.

Reliable Clustering of Bernoulli Mixture Models

discussion (0)