A Generative Product-of-Filters Model of Audio

Dawen Liang; Gautham J. Mysore; Matthew D. Hoffman

arxiv: 1312.5857 · v5 · pith:BPAY34BDnew · submitted 2013-12-20 · 📊 stat.ML · cs.LG

A Generative Product-of-Filters Model of Audio

Dawen Liang , Matthew D. Hoffman , Gautham J. Mysore This is my paper

classification 📊 stat.ML cs.LG

keywords modelaudioprocessinggenerativeinferenceproduct-of-filterssignaltask

0 comments

read the original abstract

We propose the product-of-filters (PoF) model, a generative model that decomposes audio spectra as sparse linear combinations of "filters" in the log-spectral domain. PoF makes similar assumptions to those used in the classic homomorphic filtering approach to signal processing, but replaces hand-designed decompositions built of basic signal processing operations with a learned decomposition based on statistical inference. This paper formulates the PoF model and derives a mean-field method for posterior inference and a variational EM algorithm to estimate the model's free parameters. We demonstrate PoF's potential for audio processing on a bandwidth expansion task, and show that PoF can serve as an effective unsupervised feature extractor for a speaker identification task.

This paper has not been read by Pith yet.

A Generative Product-of-Filters Model of Audio

discussion (0)