Singing voice recognition based on matching of spectrogram pattern

Authors:
Peerapol Khunarsal;Chidchanok Lursinsap;Thanapant Raicharoen
Affiliations:
Advanced Virtual and Intelligent Computing Center, Department of Mathematics Chulalongkom University, Bangkok, Thailand;Advanced Virtual and Intelligent Computing Center, Department of Mathematics Chulalongkom University, Bangkok, Thailand;Defence Information and Space Technology Department The Office of Permanent Secretary of Defence Ministry of Defence, Bangkok, Thailand
Venue:
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Year:
2009

Citing 6
Cited 1

Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
The Effects of Background Music on Speech Recognition Accuracy

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Computationally measurable differences between speech and song

Computationally measurable differences between speech and song
Environmental Sound Recognition by Multilayered Neural Networks

CIT '04 Proceedings of the The Fourth International Conference on Computer and Information Technology
Song wave retrieval based on frame-wise phoneme recognition

AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression

IEEE Transactions on Audio, Speech, and Language Processing

Singer identification using time-frequency audio feature

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Singing voice recognition is a difficult topic in Music information retrieval research area. The first approaches borrowed successful techniques widely used in Automatic speech Recognition (ASR) as speech and singing share similar acoustical feature since they are produced by the same apparatus. Moving from monophonic to polyphonic audio signal the problem become more complex as the background instrumental accompaniment is regarded as a noise source that has to be attenuated. This paper proposes a singing voice recognition algorithm that is able to automatically recognize the word in a singing signal with background music by using the concept of spectrogram pattern matching. The main idea is to apply both the spectrogram and the image processing methods to solve the problem of singing voice recognition. Each signal that accompanies music is analyzed and generated to its spectrogram that is used to train data for the classifier. Several classification functions are compared, such as Fisher classifier, Feed-Forward can effectively recognize the word in music with the accuracy rate more than 84%.