SVM-based audio classification for content-based multimedia retrieval

  • Authors:
  • Yingying Zhu;Zhong Ming;Qiang Huang

  • Affiliations:
  • Faculty of Information Engineering, Shenzhen University, Shenzhen, P.R.China and Software Engineering Ltd. of Harbin Institute of Technology, Haerbin, China;Faculty of Information Engineering, Shenzhen University, Shenzhen, P.R.China;Faculty of Information Engineering, Shenzhen University, Shenzhen, P.R.China

  • Venue:
  • MCAM'07 Proceedings of the 2007 international conference on Multimedia content analysis and mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Audio classification is very important in multimedia retrieval such as audio indexing, analysis and content-based video retrieval. In this paper, we have proposed a clip-based support vector machine (SVM) approach to classify audio signals into six classes, which are pure speech, music, silence, environmental sound, speech with music and speech with environmental sound. The classification results are then used to partition a video into homogeneous audio segments, which is used to analyze and retrieve its higher-level content. The experimental results show that the proposed system not only improves classification accuracy, but also performs better than the other classification systems using the decision tree (DT), K Nearest Neighbor (K-NN) and Neural Network (NN).