Weighted quantization using MMD: From mean field to mean shift via gradient flows

Ayoub Belhadji; Daniel Sharp; Youssef Marzouk

arxiv: 2502.10600 · v4 · submitted 2025-02-14 · 📊 stat.ML · cs.LG· cs.NA· math.NA

Weighted quantization using MMD: From mean field to mean shift via gradient flows

Ayoub Belhadji , Daniel Sharp , Youssef Marzouk This is my paper

classification 📊 stat.ML cs.LGcs.NAmath.NA

keywords meangradientshiftalgorithmmsipparticlesquantizationclustering

0 comments

read the original abstract

Approximating a probability distribution using a set of particles is a fundamental problem in machine learning and statistics, with applications including clustering and quantization. Formally, we seek a weighted mixture of Dirac measures that best approximates the target distribution. While much existing work relies on the Wasserstein distance to quantify approximation errors, maximum mean discrepancy (MMD) has received comparatively less attention, especially when allowing for variable particle weights. We argue that a Wasserstein-Fisher-Rao gradient flow is well-suited for designing quantizations optimal under MMD. We show that a system of interacting particles satisfying a set of ODEs discretizes this flow. We further derive a new fixed-point algorithm called mean shift interacting particles (MSIP). We show that MSIP extends the classical mean shift algorithm, widely used for identifying modes in kernel density estimators. Moreover, we show that MSIP can be interpreted as preconditioned gradient descent and that it acts as a relaxation of Lloyd's algorithm for clustering. Our unification of gradient flows, mean shift, and MMD-optimal quantization yields algorithms that are more robust than state-of-the-art methods, as demonstrated via high-dimensional and multi-modal numerical experiments.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Stationary MMD Points
stat.ML 2025-05 unverdicted novelty 7.0

Stationary MMD points show super-convergence in integration error over MMD for RKHS integrands, and MMD gradient flows compute them with a new non-asymptotic finite-particle error bound.
A note on the unique properties of the Kullback--Leibler divergence for sampling via gradient flows
stat.ME 2025-07 unverdicted novelty 6.0

The Kullback-Leibler divergence is the only Bregman divergence whose gradient flow with respect to many popular metrics does not require the normalizing constant of the target distribution π.