pith. sign in

arxiv: 1609.04120 · v3 · pith:SUBOIOOYnew · submitted 2016-09-14 · 📊 stat.ML · cs.CR

Private Topic Modeling

classification 📊 stat.ML cs.CR
keywords inferenceprivacyvariationaldataalgorithmamountdistributionsiterations
0
0 comments X p. Extension
pith:SUBOIOOY Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{SUBOIOOY}

Prints a linked pith:SUBOIOOY badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We develop a privatised stochastic variational inference method for Latent Dirichlet Allocation (LDA). The iterative nature of stochastic variational inference presents challenges: multiple iterations are required to obtain accurate posterior distributions, yet each iteration increases the amount of noise that must be added to achieve a reasonable degree of privacy. We propose a practical algorithm that overcomes this challenge by combining: (1) an improved composition method for differential privacy, called the moments accountant, which provides a tight bound on the privacy cost of multiple variational inference iterations and thus significantly decreases the amount of additive noise; and (2) privacy amplification resulting from subsampling of large-scale data. Focusing on conjugate exponential family models, in our private variational inference, all the posterior distributions will be privatised by simply perturbing expected sufficient statistics. Using Wikipedia data, we illustrate the effectiveness of our algorithm for large-scale data.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.