pith. sign in

arxiv: 1901.00214 · v1 · pith:PXW7HQFBnew · submitted 2019-01-01 · 💻 cs.LG · math.OC· stat.ML

Clustering with Distributed Data

classification 💻 cs.LG math.OCstat.ML
keywords meansclusteringminimaalgorithmdatadistributedassociatedconsider
0
0 comments X
read the original abstract

We consider $K$-means clustering in networked environments (e.g., internet of things (IoT) and sensor networks) where data is inherently distributed across nodes and processing power at each node may be limited. We consider a clustering algorithm referred to as networked $K$-means, or $NK$-means, which relies only on local neighborhood information exchange. Information exchange is limited to low-dimensional statistics and not raw data at the agents. The proposed approach develops a parametric family of multi-agent clustering objectives (parameterized by $\rho$) and associated distributed $NK$-means algorithms (also parameterized by $\rho$). The $NK$-means algorithm with parameter $\rho$ converges to a set of fixed points relative to the associated multi-agent objective (designated as `generalized minima'). By appropriate choice of $\rho$, the set of generalized minima may be brought arbitrarily close to the set of Lloyd's minima. Thus, the $NK$-means algorithm may be used to compute Lloyd's minima of the collective dataset up to arbitrary accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Distributed Global Optimization by Annealing

    math.OC 2019-07 unverdicted novelty 5.0

    A consensus + innovations algorithm with decaying additive Gaussian noise converges to the global minima of nonconvex functions under technical assumptions, with verification methods and a target-localization example.