ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 05
A generative theory of shape
Automatic music video summarization based on audio-visual-text analysis and alignment
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic Structure Detection for Popular Music
IEEE MultiMedia
Using duration models to reduce fragmentation in audio segmentation
Machine Learning
Hi-index | 0.00 |
Music is often described in terms of the structure of repeated phrases. For example, many songs have the form AABA, where each letter represents an instance of a phrase. This research aims to construct descriptions or explanations of music in this form, using only audio recordings as input. A system of programs is described that transcribes the melody of a recording, identifies similar segments, clusters these segments to form patterns, and then constructs an explanation of the music in terms of these patterns. Additional work using spectral information rather than melodic transcription is also described. Examples of successful machine "listening" and music analysis are presented.