Weighted total variation based convex clustering
read the original abstract
Data clustering is a fundamental problem with a wide range of applications. Standard methods, eg the $k$-means method, usually require solving a non-convex optimization problem. Recently, total variation based convex relaxation to the $k$-means model has emerged as an attractive alternative for data clustering. However, the existing results on its exact clustering property, ie, the condition imposed on data so that the method can provably give correct identification of all cluster memberships, is only applicable to very specific data and is also much more restrictive than that of some other methods. This paper aims at the revisit of total variation based convex clustering, by proposing a weighted sum-of-$\ell_1$-norm relating convex model. Its exact clustering property established in this paper, in both deterministic and probabilistic context, is applicable to general data and is much sharper than the existing results. These results provided good insights to advance the research on convex clustering. Moreover, the experiments also demonstrated that the proposed convex model has better empirical performance when be compared to standard clustering methods, and thus it can see its potential in practice.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
A Unified Framework for Structure-Aware Clustering and Heterogeneous Causal Graph Learning
DAG-DC-ADMM jointly clusters subjects and learns their cluster-specific causal DAGs via structural equation modeling, groupwise truncated Lasso fusion penalties, and an ADMM solver for the resulting nonconvex problem.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.