Super MBox: an efficient/effective content-based music retrieval system
MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
This paper presents an implementation of a content-based music retrieval system that takes a user's acoustic input (an 8-second clip of singing or humming) via a microphone and retrieves the intended song from a database of over 3000 candidate songs. The system, known as Super MBox, demonstrates the feasibility of real-time music retrieval with a high success rate. Super MBox first converts the acoustic input into a pitch vector. A hierarchical filtering method (HFM) then filters out the 80% of candidates that are unlikely matches and compares the query with the remaining 20% in detail. The output of Super MBox is a list of songs ranked by computed similarity score. A brief mathematical analysis of the two-step HFM is given in the paper to explain how to derive the optimum parameters of the comparison engine. The proposed HFM and its analysis framework can be directly applied to other multimedia information retrieval systems. We have tested Super MBox extensively and found that the top-20 success rate is over 85%, based on a dataset of about 2000 singing/humming clips from people with mediocre singing skills. Our studies demonstrate the feasibility of using Super MBox as a prototype for music search engines over the Internet and/or query engines in digital music libraries.
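The two-step filtering idea can be illustrated with a minimal Python sketch. The abstract does not say which distance measures Super MBox uses, so everything concrete here is an assumption made for illustration: the cheap first pass (an L1 distance on downsampled, mean-removed pitch vectors), the detailed second pass (plain dynamic time warping), and all names (`coarse_distance`, `dtw_distance`, `hierarchical_search`, `keep_ratio`, `step`). Only the default `keep_ratio=0.2` mirrors the 80%/20% split described above.

```python
import math


def normalize(pitch):
    """Subtract the mean pitch so matching ignores the singer's key."""
    mean = sum(pitch) / len(pitch)
    return [p - mean for p in pitch]


def coarse_distance(query, candidate, step=4):
    """Cheap first-pass score (assumed): mean L1 distance between
    downsampled, mean-removed pitch vectors of equal length."""
    n = min(len(query), len(candidate))
    q = normalize(query[:n:step])
    c = normalize(candidate[:n:step])
    return sum(abs(a - b) for a, b in zip(q, c)) / len(q)


def dtw_distance(query, candidate):
    """Detailed second-pass score (assumed): O(n*m) dynamic time warping
    on mean-removed pitch vectors, using two rolling rows."""
    q, c = normalize(query), normalize(candidate)
    prev = [math.inf] * (len(c) + 1)
    prev[0] = 0.0
    for i in range(1, len(q) + 1):
        curr = [math.inf] * (len(c) + 1)
        for j in range(1, len(c) + 1):
            cost = abs(q[i - 1] - c[j - 1])
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[len(c)]


def hierarchical_search(query, database, keep_ratio=0.2, top_k=20):
    """Two-step search: rank every song with the cheap score, keep the
    best `keep_ratio` fraction, then re-rank the survivors with DTW."""
    # Step 1: discard the candidates that look least like the query.
    ranked = sorted(database, key=lambda name: coarse_distance(query, database[name]))
    survivors = ranked[:max(1, math.ceil(keep_ratio * len(ranked)))]
    # Step 2: run the expensive comparison only on the survivors.
    scored = sorted((dtw_distance(query, database[name]), name) for name in survivors)
    return [name for _, name in scored[:top_k]]


if __name__ == "__main__":
    # Hypothetical toy database: ten synthetic pitch contours in semitones.
    songs = {
        f"song_{k}": [60.0 + k * math.sin(0.3 * i) for i in range(40)]
        for k in range(10)
    }
    # The "hummed" query is song_5 sung seven semitones higher.
    hummed = [p + 7.0 for p in songs["song_5"]]
    print(hierarchical_search(hummed, songs))
```

The design trade-off is the usual one for filter-and-refine retrieval: a smaller `keep_ratio` reduces the number of expensive comparisons but raises the risk that the cheap first pass discards the correct song. The paper's analysis of optimum parameters addresses this balance; the sketch simply fixes the ratio.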