Investigation of Frame Alignments for GMM-based Digit-prompted Speaker Verification

Jia Liu; Liang He; Michael T. Johnson; Weiqiang Zhang; Yi Liu

arxiv: 1710.10436 · v4 · pith:RJTMPYSTnew · submitted 2017-10-28 · 💻 cs.SD · eess.AS


Yi Liu, Liang He, Weiqiang Zhang, Jia Liu, Michael T. Johnson


keywords: alignments, verification, digit-prompted, speaker, alignment, different, frame, gmm-based


Frame alignments can be computed by different methods in GMM-based speaker verification. By incorporating a phonetic Gaussian mixture model (PGMM), we compare the performance of alignments extracted from a deep neural network (DNN) with those from the conventional hidden Markov model (HMM) in digit-prompted speaker verification. Based on the different characteristics of these two alignments, we present a novel content verification method that improves system security without much computational overhead. Our experiments on the RSR2015 Part-3 digit-prompted task show that the DNN-based alignment performs on par with the HMM alignment. The results also demonstrate the effectiveness of the proposed Kullback-Leibler (KL) divergence-based scoring in rejecting speech with incorrect pass-phrases.
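The abstract does not spell out how the KL divergence-based score is computed, so the following is only an illustrative sketch of the general idea, not the authors' formulation. It assumes that each alignment yields a per-frame posterior distribution over the same set of phonetic classes, that the score is the mean per-frame KL divergence between the two sequences, and that an utterance is accepted when this score falls below a threshold. The function names, the toy posteriors, and the threshold value of 0.5 are all hypothetical.

```python
import math


def kl_divergence(p, q, eps=1e-10):
    """Return KL(p || q) for two discrete distributions over the same classes."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0.0:
            # Floor q to avoid division by zero (an assumption of this sketch).
            total += pi * math.log(pi / max(qi, eps))
    return total


def mean_frame_kl(posteriors_a, posteriors_b):
    """Average the per-frame KL divergence between two posterior sequences."""
    if not posteriors_a or len(posteriors_a) != len(posteriors_b):
        raise ValueError("sequences must be non-empty and of equal length")
    frame_scores = [kl_divergence(a, b) for a, b in zip(posteriors_a, posteriors_b)]
    return sum(frame_scores) / len(frame_scores)


def accept_content(posteriors_a, posteriors_b, threshold=0.5):
    """Accept the utterance if the two alignments agree closely enough.

    The threshold is a hypothetical value chosen for this toy example.
    """
    return mean_frame_kl(posteriors_a, posteriors_b) <= threshold


# Toy data: two frames, two phonetic classes (hypothetical values).
prompted = [[0.9, 0.1], [0.2, 0.8]]
agreeing = [[0.8, 0.2], [0.3, 0.7]]
conflicting = [[0.1, 0.9], [0.9, 0.1]]

print(accept_content(prompted, agreeing))     # alignments agree
print(accept_content(prompted, conflicting))  # alignments disagree
```

In this toy setup a low divergence indicates that both alignments describe the same spoken content, while a high divergence suggests that the utterance does not match the prompted digits. In practice the threshold would have to be tuned on held-out data.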
