pith. sign in

arxiv: 1902.01053 · v1 · pith:QAGNUW7Anew · submitted 2019-02-04 · 📡 eess.AS

Overlap-Add Windows with Maximum Energy Concentration for Speech and Audio Processing

classification 📡 eess.AS
keywords overlap-addprocessingwindowsconcentrationenergymaximumapproachaudio
0
0 comments X
read the original abstract

Processing of speech and audio signals with time-frequency representations require windowing methods which allow perfect reconstruction of the original signal and where processing artifacts have a predictable behavior. The most common approach for this purpose is overlap-add windowing, where signal segments are windowed before and after processing. Commonly used windows include the half-sine and a Kaiser-Bessel derived window. The latter is an approximation of the discrete prolate spherical sequence, and thus a maximum energy concentration window, adapted for overlap-add. We demonstrate that performance can be improved by including the overlap-add structure as a constraint in optimization of the maximum energy concentration criteria. The same approach can be used to find further special cases such as optimal low-overlap windows. Our experiments demonstrate that the proposed windows provide notable improvements in terms of reduction in side-lobe magnitude.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.