Non-reference audio quality assessment for online live music recordings

  • Authors:
  • Zhonghua Li;Ju-Chiang Wang;Jingli Cai;Zhiyan Duan;Hsin-Min Wang;Ye Wang

  • Affiliations:
  • School of Computing, National University of Singapore, Singapore, Singapore;Institute of Information Science, Academia Sinica, Taipei, Taiwan Roc;School of Computing, National University of Singapore, Singapore, Singapore;School of Computing, National University of Singapore, Singapore, Singapore;Institute of Information Science, Academia Sinica, Taipei, Taiwan Roc;School of Computing, National University of Singapore, Singapore, Singapore

  • Venue:
  • Proceedings of the 21st ACM international conference on Multimedia
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Immensely popular video sharing websites such as YouTube have become the most important sources of music information for Internet users and the most prominent platform for sharing live music. The audio quality of this huge amount of live music recordings, however, varies significantly due to factors such as environmental noise, location, and recording device. However, most video search engines do not take audio quality into consideration when retrieving and ranking results. Given the fact that most users prefer live music videos with better audio quality, we propose the first automatic, non-reference audio quality assessment framework for live music video search online. We first construct two annotated datasets of live music recordings. The first dataset contains 500 human-annotated pieces, and the second contains 2,400 synthetic pieces systematically generated by adding noise effects to clean recordings. Then, we formulate the assessment task as a ranking problem and try to solve it using a learning-based scheme. To validate the effectiveness of our framework, we perform both objective and subjective evaluations. Results show that our framework significantly improves the ranking performance of live music recording retrieval and can prove useful for various real-world music applications.