pith. sign in

arxiv: 1304.6763 · v2 · pith:FMXHIRAInew · submitted 2013-04-24 · 💻 cs.SD · cs.IT· math.IT

Deep Scattering Spectrum

classification 💻 cs.SD cs.ITmath.IT
keywords scatteringclassificationcoefficientsinvariantmodulationobtainedrepresentationspectrum
0
0 comments X
read the original abstract

A scattering transform defines a locally translation invariant representation which is stable to time-warping deformations. It extends MFCC representations by computing modulation spectrum coefficients of multiple orders, through cascades of wavelet convolutions and modulus operators. Second-order scattering coefficients characterize transient phenomena such as attacks and amplitude modulation. A frequency transposition invariant representation is obtained by applying a scattering transform along log-frequency. State-the-of-art classification results are obtained for musical genre and phone classification on GTZAN and TIMIT databases, respectively.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Integrating the Data Augmentation Scheme with Various Classifiers for Acoustic Scene Modeling

    eess.AS 2019-07 unverdicted novelty 2.0

    DCASE 2019 challenge submission reports >85% accuracy on acoustic scene classification via GAN augmentation fused with 1D and 2D CNN classifiers.