Multi-Domain Processing via Hybrid Denoising Networks for Speech Enhancement

Adrian Kim; Jaejun Yoo; Jang-Hyun Kim; Jung-Woo Ha; Sanghyuk Chun

arXiv: 1812.08914 · v1 · pith:JF7SHXWDnew · submitted 2018-12-21 · eess.AS · cs.SD

Jang-Hyun Kim, Jaejun Yoo, Sanghyuk Chun, Adrian Kim, Jung-Woo Ha

classification: eess.AS, cs.SD

keywords: hybrid, model, approaches, different, enhancement, multi-domain, noise, performance

We present a hybrid framework that leverages the trade-off between temporal and frequency precision in audio representations to improve speech enhancement performance. We first show that conventional approaches built on a single representation, such as raw audio or spectrograms, are each effective against different types of noise. By integrating both approaches, our model learns multi-scale and multi-domain features and removes noise occupying different regions of the time-frequency space in a complementary way. Experimental results show that the proposed hybrid model yields better performance and robustness than either model used individually.
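The trade-off between temporal and frequency precision mentioned in the abstract can be illustrated with a short-time Fourier transform computed at two window lengths. The sketch below is not the authors' model. It only shows, under assumed settings (a 16 kHz sample rate, a synthetic 1 kHz tone with a single-sample click, and hypothetical helper names `stft_magnitude` and `resolutions`), how a short window localizes events in time while a long window separates frequencies more finely.

```python
import numpy as np


def stft_magnitude(signal, frame_len, hop):
    """Magnitude STFT with a Hann window.

    Returns an array of shape (num_frames, frame_len // 2 + 1).
    """
    window = np.hanning(frame_len)
    num_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(num_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))


def resolutions(frame_len, hop, sample_rate):
    """Return (frequency bin spacing in Hz, frame step in seconds)."""
    return sample_rate / frame_len, hop / sample_rate


# Assumed illustration settings, not taken from the paper.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate      # one second of audio
signal = np.sin(2 * np.pi * 1000 * t)         # stationary 1 kHz tone
signal[sample_rate // 2] += 5.0               # impulsive click at 0.5 s

# Short window: fine time steps, coarse frequency bins.
short = stft_magnitude(signal, frame_len=64, hop=32)
# Long window: coarse time steps, fine frequency bins.
long = stft_magnitude(signal, frame_len=1024, hop=512)

print(short.shape, resolutions(64, 32, sample_rate))
print(long.shape, resolutions(1024, 512, sample_rate))
```

With these settings the short window steps every 2 ms but spaces its bins 250 Hz apart, while the long window resolves 15.625 Hz bins at a 32 ms step. A click is easier to localize in the first representation and a narrowband tone in the second, which is the kind of complementarity a multi-domain model could exploit.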
