A comparative evaluation of search techniques for query-by-humming using the MUSART testbed

Authors:
Roger B. Dannenberg;William P. Birmingham;Bryan Pardo;Ning Hu;Colin Meek;George Tzanetakis
Affiliations:
Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213;Grove City College, Box 3123, Grove City, PA 16127;Department of Electrical Engineering and Computer Science, Northwestern University, Evanston, IL 60208;Google Inc. New York Office, New York, NY 10018;Microsoft Corporation, SQL Server, Building 35/2165, 1 Microsoft Way, Redmond, WA 98052;Computer Science Department, University of Victoria, P.O. Box 3055 STN CSC, Victoria BC V8W 3p6, Canada
Venue:
Journal of the American Society for Information Science and Technology
Year:
2007

Citing 14
Cited 10

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
Towards the digital music library: tune retrieval from acoustic input

Proceedings of the first ACM international conference on Digital libraries
Content-based retrieval for music collections

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Towards a digital library of popular music

Proceedings of the fourth ACM conference on Digital libraries
Melodic matching techniques for large music databases

MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Evaluation of a simple and effective music information retrieval method

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A comparison of melodic database retrieval techniques using sung queries

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
The effectiveness study of various music information retrieval approaches

Proceedings of the eleventh international conference on Information and knowledge management
Searching Digital Music Libraries

ICADL '02 Proceedings of the 5th International Conference on Asian Digital Libraries: Digital Libraries: People, Knowledge, and Technology
Robust Polyphonic Music Retrieval with N-grams

Journal of Intelligent Information Systems
Name that tune: a pilot study in finding a melody from a sung query

Journal of the American Society for Information Science and Technology
A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
A comprehensive trainable error model for sung music queries

Journal of Artificial Intelligence Research

Query by humming with the VocalSearch system

Communications of the ACM - Music information retrieval
Humming control interface for hand-held devices

Proceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility
The vocalsearch music search engine

Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Toward a General Framework for Polyphonic Comparison

Fundamenta Informaticae - Special Issue on Stringology
Accurate and efficient retrieval of multimedia time series data under uniform scaling and time warping

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
User specific training of a music search engine

MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
Polyphonic alignment algorithms for symbolic music retrieval

CMMR/ICAD'09 Proceedings of the 6th international conference on Auditory Display
Melody, bass line, and harmony representations for music version identification

Proceedings of the 21st international conference companion on World Wide Web
Toward a General Framework for Polyphonic Comparison

Fundamenta Informaticae - Special Issue on Stringology
A survey of query-by-humming similarity methods

Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments

Quantified Score

Hi-index	0.00

Visualization

Abstract

Query-by-humming systems offer content-based searching for melodies and require no special musical training or knowledge. Many such systems have been built, but there has not been much useful evaluation and comparison in the literature due to the lack of shared databases and queries. The MUSART project testbed allows various search algorithms to be compared using a shared framework that automatically runs experiments and summarizes results. Using this testbed, the authors compared algorithms based on string alignment, melodic contour matching, a hidden Markov model, n-grams, and CubyHum. Retrieval performance is very sensitive to distance functions and the representation of pitch and rhythm, which raises questions about some previously published conclusions. Some algorithms are particularly sensitive to the quality of queries. Our queries, which are taken from human subjects in a realistic setting, are quite difficult, especially for n-gram models. Finally, simulations on query-by-humming performance as a function of database size indicate that retrieval performance falls only slowly as the database size increases. © 2007 Wiley Periodicals, Inc.