Short user-generated videos classification using accompanied audio categories

  • Authors:
  • Jinlin Guo; Cathal Gurrin

  • Affiliations:
  • CLARITY and School of Computing, Dublin City University, Dublin, Ireland; CLARITY and School of Computing, Dublin City University, Dublin, Ireland

  • Venue:
  • Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis
  • Year:
  • 2012

Abstract

This paper investigates the classification of short user-generated videos (UGVs) using their accompanying audio, since short UGVs account for a large proportion of the UGVs on the Internet and many of them are accompanied by single-category soundtracks. We define seven types of UGVs, each corresponding to one of seven audio categories. We investigate three modeling approaches for audio feature representation: single Gaussian (1G), Gaussian mixture (GMM), and Bag-of-Audio-Word (BoAW) models. Classifiers are then trained with Support Vector Machines (SVMs), using a distance measure matched to each of the three feature representations, to categorize the UGVs. The evaluation results show that these approaches are effective for categorizing short UGVs based on their audio tracks. The GMM representation with the approximated Bhattacharyya distance (ABD) performs best, and the BoAW representation with a chi-square kernel gives comparable results.
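
To make the distance measures named in the abstract more concrete, the following is a minimal Python sketch, not the authors' implementation. It shows the closed-form Bhattacharyya distance between two single Gaussians (the 1G case) and a chi-square distance between two BoAW histograms. Several things here are assumptions: the diagonal covariances, the function names, the chi-square form without a 1/2 factor, and the `exp(-gamma * d)` mapping from a distance to an SVM kernel value. The approximated Bhattacharyya distance (ABD) used for GMMs is not reproduced, since the abstract does not specify the approximation.

```python
import math


def bhattacharyya_diag(mean1, var1, mean2, var2):
    """Bhattacharyya distance between two Gaussians.

    Assumption: both covariances are diagonal, given as per-dimension
    variances (all strictly positive).
    """
    mahal = 0.0    # (mu1 - mu2)^T Sigma^{-1} (mu1 - mu2), Sigma = (S1 + S2) / 2
    log_det = 0.0  # ln( det(Sigma) / sqrt(det(S1) * det(S2)) )
    for m1, v1, m2, v2 in zip(mean1, var1, mean2, var2):
        v = 0.5 * (v1 + v2)
        mahal += (m1 - m2) ** 2 / v
        log_det += math.log(v) - 0.5 * (math.log(v1) + math.log(v2))
    return 0.125 * mahal + 0.5 * log_det


def chi_square_distance(h1, h2):
    """Chi-square distance between two non-negative histograms.

    Assumption: the form sum((a - b)^2 / (a + b)) without a 1/2 factor;
    bins where both histograms are empty are skipped.
    """
    total = 0.0
    for a, b in zip(h1, h2):
        if a + b > 0:
            total += (a - b) ** 2 / (a + b)
    return total


def distance_kernel(d, gamma=1.0):
    """Map a distance to a similarity (assumed exponential form)."""
    return math.exp(-gamma * d)


# Hypothetical toy inputs, not data from the paper.
d_gauss = bhattacharyya_diag([0.0], [1.0], [2.0], [1.0])
d_hist = chi_square_distance([1.0, 0.0], [0.0, 1.0])
k_gauss = distance_kernel(d_gauss)
```

A matrix of such kernel values, computed for every pair of training clips, could then be passed to an SVM that accepts precomputed kernels. Whether the paper builds its kernels this way is not stated in the abstract.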