Toward segmentation of popular music

  • Authors:
  • Yun-Sheng Wang

  • Affiliations:
  • George Mason University, Fairfax, VA, USA

  • Venue:
  • Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents my dissertation framework to extract local keys, chords, and segment popular music from audio signals; all unsupervised. Music signals are denoised using wavelet transform to obtain a smoother approximation for chroma extraction. We extract a bag of local keys from the chromagram using an infinite Gaussian mixture and use the key information to extract a time series of chords. Using chords, we transform the bag of keys into a timed sequence of local keys. The two time series, local keys and chords, are used to construct multi-dimensioned "harmonic rhythm" as segmentation cues. We propose to calculate the strangeness of the cues from the perspective of keys, speed, and dependence as a basis for change detection in the framework of a martingale-based algorithm to find segmentation boundaries. Given the structural information, the chord sequence can be further improved in a refinement loop consisting of keys, chords, and segmentations.