pith. sign in

arxiv: 1705.02514 · v2 · pith:6G6VTUY6new · submitted 2017-05-06 · 💻 cs.SD

End-to-end Source Separation with Adaptive Front-Ends

classification 💻 cs.SD
keywords separationsourcetransformsend-to-endfront-endnetworkadaptiveapplications
0
0 comments X
read the original abstract

Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We present an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for supervised source separation. In terms of separation performance, these transforms significantly outperform their Fourier counterparts. Finally, we also propose a novel source to distortion ratio based cost function for end-to-end source separation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.