Introduces auxiliary interference speaker loss for target-speaker ASR achieving 6.6% relative WER reduction from 18.06% to 16.87% on mixed speech.
For training data, we randomly sampled an SIR value from uniform distribution between -10 dB and 10 dB for each mixture
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2019 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
Introduces auxiliary interference speaker loss for target-speaker ASR achieving 6.6% relative WER reduction from 18.06% to 16.87% on mixed speech.