Drum loops retrieval from spoken queries

Authors:
Olivier Gillet;Gaël Richard
Affiliations:
GET-ENST (TELECOM Paris), Paris, France;GET-ENST (TELECOM Paris), Paris, France
Venue:
Journal of Intelligent Information Systems - Special issue: Intelligent multimedia applications
Year:
2005

Citing 13
Cited 0

Fundamentals of speech recognition

Fundamentals of speech recognition
The nature of statistical learning theory

The nature of statistical learning theory
Query by humming: musical information retrieval in an audio database

Proceedings of the third ACM international conference on Multimedia
Musical content-based retrieval: an overview of the Melodiscov approach and system

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
HMM-based musical query retrieval

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Robust temporal and spectral modeling for query By melody

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A hybrid graphical model for rhythmic parsing

Artificial Intelligence
Problems of music information retrieval in the real world

Information Processing and Management: an International Journal
Query by Rhythm: An Approach for Song Retrieval in Music Databases

RIDE '98 Proceedings of the Workshop on Research Issues in Database Engineering
The New Zealand Digital Library MELody in DEX

The New Zealand Digital Library MELody in DEX
Conventional and periodic N-grams in the transcription of drum sequences

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Sound onset detection by applying psychoacoustic knowledge

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
Automatic extraction of drum tracks from polyphonic music signals

WEDELMUSIC'02 Proceedings of the Second international conference on Web delivering of music

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent efforts in audio indexing and music information retrieval mostly focus on melody. If this is appropriate for polyphonic music signals, specific approaches are needed for systems dealing with percussive audio signals such as those produced by drums, tabla or djembé. In this article, we present a complete system allowing the management of a drum patterns (or drumloops) database. Queries in this database are formulated with spoken onomatopoeias-short meaningless words imitating the different sounds of the drumkit. The transcription task necessary to index the database is performed using Hidden Markov Models (HMM) and Support Vector Machines (SVM) and achieves a 86.4% correct recognition rate. The syllables of spoken queries are recognized and a relevant statistical model allows the comparison and alignment of the query with the rythmic sequences stored in the database, in order to provide a set of the most relevant drum loops.