pith. sign in

arxiv: 2007.04287 · v1 · pith:2AWN26BQnew · submitted 2020-07-08 · 📊 stat.ML · cs.LG

Learning from DPPs via Sampling: Beyond HKPV and symmetry

classification 📊 stat.ML cs.LG
keywords dppshkpvlinearsampleabilityapproachbeyonddistribution
0
0 comments X
read the original abstract

Determinantal point processes (DPPs) have become a significant tool for recommendation systems, feature selection, or summary extraction, harnessing the intrinsic ability of these probabilistic models to facilitate sample diversity. The ability to sample from DPPs is paramount to the empirical investigation of these models. Most exact samplers are variants of a spectral meta-algorithm due to Hough, Krishnapur, Peres and Vir\'ag (henceforth HKPV), which is in general time and resource intensive. For DPPs with symmetric kernels, scalable HKPV samplers have been proposed that either first downsample the ground set of items, or force the kernel to be low-rank, using e.g. Nystr\"om-type decompositions. In the present work, we contribute a radically different approach than HKPV. Exploiting the fact that many statistical and learning objectives can be effectively accomplished by only sampling certain key observables of a DPP (so-called linear statistics), we invoke an expression for the Laplace transform of such an observable as a single determinant, which holds in complete generality. Combining traditional low-rank approximation techniques with Laplace inversion algorithms from numerical analysis, we show how to directly approximate the distribution function of a linear statistic of a DPP. This distribution function can then be used in hypothesis testing or to actually sample the linear statistic, as per requirement. Our approach is scalable and applies to very general DPPs, beyond traditional symmetric kernels.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. State-of-art minibatches via novel DPP kernels: discretization, wavelets, and rough objectives

    stat.ML 2026-05 unverdicted novelty 7.0

    Wavelet DPP kernels deliver improved continuous variance reduction and a discretization procedure that preserves decay rates for discrete ML subsampling tasks including rough objectives.