VAE-based regularization for deep speaker embedding

Dong Wang; Lantian Li; Yang Zhang

arxiv: 1904.03617 · v1 · pith:MSSRQSNAnew · submitted 2019-04-07 · 💻 cs.SD · cs.LG · eess.AS

keywords speaker · deep · embedding · gaussian · latent · performance · plda · regularization

Deep speaker embedding has achieved state-of-the-art performance in speaker recognition. A potential problem is that these embedded vectors (called 'x-vectors') are not Gaussian, which degrades performance with the widely used probabilistic linear discriminant analysis (PLDA) back-end scoring. In this paper, we propose a regularization approach based on a Variational Auto-Encoder (VAE). The model transforms x-vectors into a latent space where the mapped latent codes are more Gaussian and hence more suitable for PLDA scoring.
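
The abstract does not spell out the training objective. As a minimal sketch, assuming the standard VAE formulation (a diagonal-Gaussian posterior over latent codes and a standard-normal prior, which this page does not confirm for the paper), the term that pulls latent codes toward a Gaussian is the KL divergence computed below. The function names and the numeric encoder outputs are hypothetical and chosen only for illustration.

```python
import math
import random


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a single latent code."""
    return 0.5 * sum(
        m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mu, log_var)
    )


def sample_latent(mu, log_var, rng):
    """Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, 1)."""
    return [
        m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
        for m, lv in zip(mu, log_var)
    ]


# Hypothetical encoder outputs for one x-vector (invented values, 3-dim latent).
mu = [0.3, -1.2, 0.0]
log_var = [0.0, -0.5, 0.2]

rng = random.Random(0)  # fixed seed so the sketch is repeatable
z = sample_latent(mu, log_var, rng)      # latent code that would be passed to PLDA
kl = kl_to_standard_normal(mu, log_var)  # regularization penalty for this code
```

In a full VAE this penalty would be added to a reconstruction loss on the x-vector. The KL term is zero only when the posterior already matches N(0, I), which is why minimizing it nudges the latent codes toward the Gaussian shape that PLDA assumes.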
