Post-processing with an encoder-decoder model yields 22% relative EER reduction on normal-vs-whispered trials and 1.88% EER on whispered-vs-whispered, outperforming ReDimNet-B2.
V., A.R., Ghosh, P.K.: Formant-gaps features for speaker verification using whispered speech
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.SD 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Enhancing Speaker Verification with Whispered Speech via Post-Processing
Post-processing with an encoder-decoder model yields 22% relative EER reduction on normal-vs-whispered trials and 1.88% EER on whispered-vs-whispered, outperforming ReDimNet-B2.