Environmental sounds classification based on visual features

  • Authors:
  • Sameh Souli;Zied Lachiri

  • Affiliations:
  • Signal, Image and pattern recognition research unit Dept. of Genie Electrique, ENIT, Le Belvédère, Tunisia;Signal, Image and pattern recognition research unit Dept. of Genie Electrique, ENIT, Le Belvédère, Tunisia

  • Venue:
  • CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a method aimed at classification of the environmental sounds in the visual domain by using the scale and translation invariance. We present a new approach that extracts visual features from sound spectrograms. We suggest to apply support vector machines (SVM's) in order to address sound classification. Indeed, in the proposed method we explore sound spectrograms as texture images, and extracts the time-frequency structures by using a translation-invariant wavelet transform and a patch transform alternated with local maximum and global maximum to pursuit scale and translation invariance. We illustrate the performance of this method on an audio database, which composed of 10 sounds classes. The obtained recognition rate is of the order 91.82 % with the multiclass decomposition method: One-Against-One.