Neural Networks for Pattern Recognition
Neural Networks for Pattern Recognition
The Effects of Background Music on Speech Recognition Accuracy
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Computationally measurable differences between speech and song
Computationally measurable differences between speech and song
Environmental Sound Recognition by Multilayered Neural Networks
CIT '04 Proceedings of the The Fourth International Conference on Computer and Information Technology
Song wave retrieval based on frame-wise phoneme recognition
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
IEEE Transactions on Audio, Speech, and Language Processing
Singer identification using time-frequency audio feature
ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Hi-index | 0.00 |
Singing voice recognition is a difficult topic in Music information retrieval research area. The first approaches borrowed successful techniques widely used in Automatic speech Recognition (ASR) as speech and singing share similar acoustical feature since they are produced by the same apparatus. Moving from monophonic to polyphonic audio signal the problem become more complex as the background instrumental accompaniment is regarded as a noise source that has to be attenuated. This paper proposes a singing voice recognition algorithm that is able to automatically recognize the word in a singing signal with background music by using the concept of spectrogram pattern matching. The main idea is to apply both the spectrogram and the image processing methods to solve the problem of singing voice recognition. Each signal that accompanies music is analyzed and generated to its spectrogram that is used to train data for the classifier. Several classification functions are compared, such as Fisher classifier, Feed-Forward can effectively recognize the word in music with the accuracy rate more than 84%.