Research and developments of a multi-modal MIR engine for commercial applications in East Asia

  • Authors:
  • Jyh-Shing Roger Jang;Hong-Ru Lee;Jiang-Chuen Chen;Cheng-Yuan Lin

  • Affiliations:
  • Multimedia Information Retrieval Laboratory, Computer Science Department, National Tsing Hua University, Room 444, EECS Building, Hsinchu, Taiwan;Multimedia Information Retrieval Laboratory, Computer Science Department, National Tsing Hua University, Room 444, EECS Building, Hsinchu, Taiwan;Multimedia Information Retrieval Laboratory, Computer Science Department, National Tsing Hua University, Room 444, EECS Building, Hsinchu, Taiwan;Multimedia Information Retrieval Laboratory, Computer Science Department, National Tsing Hua University, Room 444, EECS Building, Hsinchu, Taiwan

  • Venue:
  • Journal of the American Society for Information Science and Technology - Music information retrieval
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article describes the research and development of an efficient Music Information Retrieval (MIR) engine that is embedded in a karaoke software package targeted for Asian people's need of music retrieval. The MIR engine has a multi-modal interface that allows queries by singing, humming, tapping, speaking, and writing. In particular, we discuss the design philosophy, technical barriers, and performance evaluation of such an engine, as well as its current and potential commercial applications. Feedbacks and feature requests from users, which greatly influence our future work, are also addressed.