A novel framework for efficient automated singer identification in large music databases

  • Authors:
  • Jialie Shen; John Shepherd; Bin Cui; Kian-Lee Tan

  • Affiliations:
  • Singapore Management University, Singapore; The University of New South Wales, Sydney, Australia; Peking University, Beijing, China; National University of Singapore, Kent Ridge, Singapore

  • Venue:
  • ACM Transactions on Information Systems (TOIS)
  • Year:
  • 2009


Abstract

Over the past decade, there has been explosive growth in the availability of multimedia data, particularly image, video, and music. Because of this, content-based music retrieval has attracted attention from the multimedia database and information retrieval communities. Content-based music retrieval requires the ability to automatically identify particular characteristics of music data. One such characteristic, useful in a range of applications, is the identity of the singer in a musical piece. Unfortunately, existing approaches to this problem suffer from either low accuracy or poor scalability. In this article, we propose a novel scheme, called Hybrid Singer Identifier (HSI), for efficient automated singer recognition. HSI uses multiple low-level features extracted from both vocal and nonvocal music segments to enhance the identification process; it achieves this via a hybrid architecture that builds profiles of individual singer characteristics based on statistical mixture models. An extensive experimental study on a large music database demonstrates the superiority of our method over state-of-the-art approaches in terms of effectiveness, efficiency, scalability, and robustness.
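
The abstract describes singer profiles built from statistical mixture models over low-level audio features. As a rough illustration of that general idea only (not the authors' actual HSI architecture), the sketch below trains one Gaussian mixture model per singer on placeholder frame-level feature vectors and attributes a query to the singer whose model yields the highest average log-likelihood; the feature dimensionality, data, and singer names are all hypothetical.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

# Hypothetical frame-level feature vectors (e.g., MFCC-like, 13-dimensional)
# standing in for features extracted from vocal/nonvocal segments.
rng = np.random.default_rng(0)
train_frames = {
    "singer_a": rng.normal(loc=0.0, scale=1.0, size=(500, 13)),
    "singer_b": rng.normal(loc=1.5, scale=1.2, size=(500, 13)),
}

# Build one mixture-model profile per singer.
profiles = {}
for singer, frames in train_frames.items():
    gmm = GaussianMixture(n_components=8, covariance_type="diag", random_state=0)
    gmm.fit(frames)
    profiles[singer] = gmm

def identify(query_frames):
    """Return the singer whose mixture model assigns the query frames
    the highest average per-frame log-likelihood."""
    scores = {s: gmm.score(query_frames) for s, gmm in profiles.items()}
    return max(scores, key=scores.get)

# A query drawn from singer_b's feature distribution should be labeled singer_b.
query = rng.normal(loc=1.5, scale=1.2, size=(200, 13))
print(identify(query))
```

In a real system the synthetic arrays would be replaced by features extracted from segmented audio, and the per-singer models could be combined with other classifiers, as the hybrid scheme in the paper suggests.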