Discovering Musical Structure in Audio Recordings

Authors:
Roger B. Dannenberg;Ning Hu
Affiliations:
-;-
Venue:
ICMAI '02 Proceedings of the Second International Conference on Music and Artificial Intelligence
Year:
2002

Citing 2
Cited 3

A predominant-F/sub 0/ estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 05
A generative theory of shape

A generative theory of shape

Automatic music video summarization based on audio-visual-text analysis and alignment

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic Structure Detection for Popular Music

IEEE MultiMedia
Using duration models to reduce fragmentation in audio segmentation

Machine Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Music is often described in terms of the structure of repeated phrases. For example, many songs have the form AABA, where each letter represents an instance of a phrase. This research aims to construct descriptions or explanations of music in this form, using only audio recordings as input. A system of programs is described that transcribes the melody of a recording, identifies similar segments, clusters these segments to form patterns, and then constructs an explanation of the music in terms of these patterns. Additional work using spectral information rather than melodic transcription is also described. Examples of successful machine "listening" and music analysis are presented.