A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge

Authors:
B. Schuller;M. Zobl;G. Rigoll;M. Lang
Affiliations:
Inst. for Human-Computer Commun., Technische Univ. Munchen, Germany;Inst. for Human-Computer Commun., Technische Univ. Munchen, Germany;Inst. for Human-Computer Commun., Technische Univ. Munchen, Germany;Inst. for Human-Computer Commun., Technische Univ. Munchen, Germany
Venue:
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Year:
2003

Citing 2
Cited 1

Workshop on the creation of standardized test collections, tasks and metrics for music information retrieval (MIR) and music digital library (MDL) evaluation

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Introduction to Bayesian Networks

Introduction to Bayesian Networks

Music thumbnailing incorporating harmony- and rhythm structure

AMR'08 Proceedings of the 6th international conference on Adaptive Multimedia Retrieval: identifying, Summarizing, and Recommending Image and Music

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recently an increasing interest in music retrieval can be observed. Due to the growing amount of online and offline available music and a broadening user spectrum more efficient query methods are needed. We believe that only a parallel multimodal combination of different input modalities forms the most intuitive way to access desired media for any user. In this paper we introduce a query by humming, speaking, writing, and typing. The strengths of each modality are combined in a synergetic manner by a soft decision fusion. Songs can be referenced by their according melody, artist, title or other specific information. Further more the recognition of the actual user's emotion and external contextual knowledge helps to build an expectance of the intended song at a time. This constrains the hypothesis sphere of possible songs and leads to a more robust recognition or even a suggestive query. A combination of artificial neural networks, hidden Markov models and dynamic time warping integrated in a Bayesian belief network framework build the mathematical background of the chosen hybrid architecture. We address the implementation of a working system and results achieved by the introduced methods.