Scalable Bayesian Nonparametric Clustering and Classification

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

We develop a scalable multi-step Monte Carlo algorithm for inference under a large class of nonparametric Bayesian models for clustering and classification. Each step is "embarrassingly parallel" and can be implemented using the same Markov chain Monte Carlo sampler. The simplicity and generality of our approach make inference for a wide range of Bayesian nonparametric mixture models applicable to large datasets. Specifically, we apply the approach to inference under a product partition model with regression on covariates. We show results for inference with two motivating datasets: a large set of electronic health records (EHR) and a bank telemarketing dataset. We find interesting clusters and favorable classification performance relative to other widely used classifiers.

fields

stat.ME 1

years

2019 1

verdicts

UNVERDICTED 1

representative citing papers

Dynamic time series clustering via volatility change-points

stat.ME · 2019-06-25 · unverdicted · novelty 4.0

A Bayesian method clusters time series by similarity in the timing of their most recent volatility change-points via a metric on posterior distributions, demonstrated on S&P 500 returns.

citing papers explorer

Showing 1 of 1 citing paper.

  • Dynamic time series clustering via volatility change-points stat.ME · 2019-06-25 · unverdicted · none · ref 19 · internal anchor
