pith. sign in

arxiv: 1410.6905 · v1 · pith:STV5427Xnew · submitted 2014-10-25 · 💻 cs.SD

On the use of Stress information in Speech for Speaker Recognition

classification 💻 cs.SD
keywords speakerrecognitionspeechstressadditionaldifferentinformationinherent
0
0 comments X
read the original abstract

The performance of a speaker recognition system decreases when the speaker is under stress or emotion. In this paper we explore and identify a mechanism that enables use of inherent stress-in-speech or speaking style information present in speech of a person as additional cues for speaker recognition. We quantify the the inherent stress present in the speech of a speaker mainly using 3 features, namely, pitch, amplitude and duration (together called PAD) We experimentally observe that the PAD vectors of similar phones in different words of a speaker are close to each other in the three dimensional (PAD) space confirming that the way a speaker stresses different syllables in their speech is unique to them, thus we propose the use of PAD based speaking style of a speaker as an additional feature for speaker recognition applications.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.