The proposed neural network is a CNN based encoder and an atten- tion based pooling layer followed by a set of dense layers

System Description Figure 2 shows the overall architecture used for this work

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Self Multi-Head Attention for Speaker Recognition

cs.SD · 2019-06-24 · unverdicted · novelty 6.0

Self multi-head attention applied after CNN encoding of spectrograms outperforms temporal and statistical pooling for speaker verification on VoxCeleb1 with 18% relative EER reduction.

citing papers explorer

Showing 1 of 1 citing paper.

Self Multi-Head Attention for Speaker Recognition cs.SD · 2019-06-24 · unverdicted · none · ref 4
Self multi-head attention applied after CNN encoding of spectrograms outperforms temporal and statistical pooling for speaker verification on VoxCeleb1 with 18% relative EER reduction.

The proposed neural network is a CNN based encoder and an atten- tion based pooling layer followed by a set of dense layers

fields

years

verdicts

representative citing papers

citing papers explorer