Environmental sound classification for scene recognition using local discriminant bases and HMM

  • Authors:
  • Feng Su;Li Yang;Tong Lu;Gongyou Wang

  • Affiliations:
  • Nanjing University, Nanjing, China;Nanjing University, Nanjing, China;Nanjing University, Nanjing, China;Nanjing University, Nanjing, China

  • Venue:
  • MM '11 Proceedings of the 19th ACM international conference on Multimedia
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analysis and classification of auditory scenes or contexts play important roles in content-based indexing and retrieval of multimedia databases and context-aware applications. In this paper, we propose an environmental sound and auditory scene recognition scheme that focuses on efficient feature representation and classfication of the unstructured composition of a scene (for example, restaurant, street, beach, etc.). We propose to use the local discriminant bases (LDB) technique to identify the discriminatory time-frequency subspace for environmental sounds and then use it for corresponding feature extraction. Based on LDB, we present two recognition models, with or without explicit sound event modeling, for auditory scenes, in which the hidden Markov model (HMM) is used to depict the characteristics and correlations among various events that constitute the scene. The experimental results demonstrate the effectiveness of the proposed approach for auditory scene classification.