USTC-NELSLIP System Description for DIHARD-III Challenge

Chin-Hui Lee; Jia Pan; Jun Du; Lei Sun; Maokui He; Shutong Niu; Tian Gao; Xin Fang; Yuxuan Wang

arxiv: 2103.10661 · v1 · pith:Y4577UMRnew · submitted 2021-03-19 · 💻 cs.SD · cs.LG· eess.AS

USTC-NELSLIP System Description for DIHARD-III Challenge

Yuxuan Wang , Maokui He , Shutong Niu , Lei Sun , Tian Gao , Xin Fang , Jia Pan , Jun Du

show 1 more author

Chin-Hui Lee

This is my paper

classification 💻 cs.SD cs.LGeess.AS

keywords systemchallengedescriptiondiarizationprocessingspeechtrackachieved

0 comments

read the original abstract

This system description describes our submission system to the Third DIHARD Speech Diarization Challenge. Besides the traditional clustering based system, the innovation of our system lies in the combination of various front-end techniques to solve the diarization problem, including speech separation and target-speaker based voice activity detection (TS-VAD), combined with iterative data purification. We also adopted audio domain classification to design domain-dependent processing. Finally, we performed post processing to do system fusion and selection. Our best system achieved DERs of 11.30% in track 1 and 16.78% in track 2 on evaluation set, respectively.

This paper has not been read by Pith yet.

USTC-NELSLIP System Description for DIHARD-III Challenge

discussion (0)