Knowledge discovery-based identification of musical pitches and instruments in polyphonic sounds

  • Authors:
  • Rory A. Lewis;Xin Zhang;Zbigniew W. Ra

  • Affiliations:
  • Computer Science Department, University of North Carolina, 9201 University City Blvd. Charlotte, NC 28223, USA;Computer Science Department, University of North Carolina, 9201 University City Blvd. Charlotte, NC 28223, USA;Computer Science Department, University of North Carolina, 9201 University City Blvd. Charlotte, NC 28223, USA

  • Venue:
  • Engineering Applications of Artificial Intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Pitch and timber detection methods applicable to monophonic digital signals are common. Conversely, successful detection of multiple pitches and timbers in polyphonic time-invariant music signals remains a challenge. A review of these methods, sometimes called ''Blind Signal Separation'', is presented in this paper. We analyze how musically trained human listeners overcome resonance, noise, and overlapping signals to identify and isolate what instruments are playing and then what pitch each instrument is playing. The part of the instrument and pitch recognition system, presented in this paper, responsible for identifying the dominant instrument from a base signal uses temporal features proposed by Wieczorkowska [Slezak, D., Synak, P., Wieczorkowska, A., Wroblewski, J., 2002. Kdd-based approach to musical instrument sound recognition. Hacid, M.-S., Ras, Z.W., Zighed, D.A., Kodratoff, Y. (Eds.), Foundations of Intelligent Systems. Proceedings of 13th Symposium ISMIS 2002, Lyon, Franc 4519 Berlin, Heidelberg, pp. 28-36.] in addition to the standard 11 MPEG7 features. After retrieving a semantical match for that dominant instrument from the database, it creates a resulting foreign set of features to form a new synthetic basen signal which no longer bears the previously extracted dominant sound. The system may repeat this process until all recognizable dominant instruments are accounted for in the segment. The proposed methodology incorporates Knowledge Discovery, MPEG7 segmentation and Inverse Fourier Transforms.