Publication detail

Gabor frames and deep scattering networks in audio processing

BAMMER, R. DÖRFLER, M. HARÁR, P.

Original Title

Gabor frames and deep scattering networks in audio processing

English Title

Gabor frames and deep scattering networks in audio processing

Type

journal article in Web of Science

Language

en

Original Abstract

This paper introduces Gabor scattering, a feature extractor based on Gabor frames and Mallat's scattering transform. By using a simple signal model for audio signals specific properties of Gabor scattering are studied. It is shown that for each layer, specific invariances to certain signal characteristics occur. Furthermore, deformation stability of the coefficient vector generated by the feature extractor is derived by using a decoupling technique which exploits the contractivity of general scattering networks. Deformations are introduced as changes in spectral shape and frequency modulation. The theoretical results are illustrated by numerical examples and experiments. Numerical evidence is given by evaluation on a synthetic and a "real" data set, that the invariances encoded by the Gabor scattering transform lead to higher performance in comparison with just using Gabor transform, especially when few training samples are available.

English abstract

This paper introduces Gabor scattering, a feature extractor based on Gabor frames and Mallat's scattering transform. By using a simple signal model for audio signals specific properties of Gabor scattering are studied. It is shown that for each layer, specific invariances to certain signal characteristics occur. Furthermore, deformation stability of the coefficient vector generated by the feature extractor is derived by using a decoupling technique which exploits the contractivity of general scattering networks. Deformations are introduced as changes in spectral shape and frequency modulation. The theoretical results are illustrated by numerical examples and experiments. Numerical evidence is given by evaluation on a synthetic and a "real" data set, that the invariances encoded by the Gabor scattering transform lead to higher performance in comparison with just using Gabor transform, especially when few training samples are available.

Keywords

machine learning; scattering transform; Gabor transform; deep learning; time-frequency analysis; CNN;

Released

26.09.2019

Publisher

MDPI

Location

Switzerland

Pages from

1

Pages to

25

Pages count

25

URL

Full text in the Digital Library

BibTex


@article{BUT159057,
  author="Roswitha {Bammer} and Monika {Dörfler} and Pavol {Harár}",
  title="Gabor frames and deep scattering networks in audio processing",
  annote="This paper introduces Gabor scattering, a feature extractor based on Gabor frames and Mallat's scattering transform. By using a simple signal model for audio signals specific properties of Gabor scattering are studied. It is shown that for each layer, specific invariances to certain signal characteristics occur. Furthermore, deformation stability of the coefficient vector generated by the feature extractor is derived by using a decoupling technique which exploits the contractivity of general scattering networks. Deformations are introduced as changes in spectral shape and frequency modulation. The theoretical results are illustrated by numerical examples and experiments. Numerical evidence is given by evaluation on a synthetic and a "real" data set, that the invariances encoded by the Gabor scattering transform lead to higher performance in comparison with just using Gabor transform, especially when few training samples are available.",
  address="MDPI",
  chapter="159057",
  doi="10.3390/axioms8040106",
  howpublished="online",
  institution="MDPI",
  number="4",
  volume="8",
  year="2019",
  month="september",
  pages="1--25",
  publisher="MDPI",
  type="journal article in Web of Science"
}