Using duration models to reduce fragmentation in audio segmentation

  • Authors:
  • Samer Abdallah; Mark Sandler; Christophe Rhodes; Michael Casey

  • Affiliations:
  • Queen Mary, University of London, London E1 4NS; Queen Mary, University of London, London E1 4NS; Goldsmiths College, University of London, London SE14 6NW; Goldsmiths College, University of London, London SE14 6NW

  • Venue:
  • Machine Learning
  • Year:
  • 2006

Abstract

We investigate explicit segment duration models for addressing the problem of fragmentation in musical audio segmentation. The resulting probabilistic models are optimised using Markov chain Monte Carlo methods; in particular, we introduce a modification to Wolff's algorithm that makes it applicable to a segment classification model with an arbitrary duration prior. We apply the model to a collection of pop songs, and show experimentally that the generated segmentations suffer much less from fragmentation than those produced by segmentation algorithms based on clustering, and are closer to an expert listener's annotations, as evaluated by two different performance measures.
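
As a loose illustration of the idea described in the abstract (not the authors' model or code), the sketch below samples a frame-wise segment labelling under an explicit segment duration prior using plain single-site Metropolis updates. The Gaussian observation model, the Poisson duration prior, the toy data and all parameter values are assumptions made only for this example; the paper itself introduces a modified Wolff cluster algorithm rather than single-site moves.

```python
import numpy as np
from math import lgamma

rng = np.random.default_rng(0)

# Toy observations: four alternating sections drawn from two classes.
true_labels = np.repeat([0, 1, 0, 1], 50)
obs = true_labels + rng.normal(0.0, 0.8, size=true_labels.size)

K = 2                        # number of segment classes (assumed known here)
class_means = np.array([0.0, 1.0])
noise_sd = 0.8
mean_dur = 40.0              # prior expected segment length, in frames

def run_lengths(labels):
    """Lengths of the maximal constant runs in a label sequence."""
    starts = np.flatnonzero(labels[1:] != labels[:-1]) + 1
    bounds = np.concatenate(([0], starts, [labels.size]))
    return np.diff(bounds)

def log_posterior(labels):
    # Gaussian observation model per frame, given the frame's class mean.
    loglik = np.sum(-0.5 * ((obs - class_means[labels]) / noise_sd) ** 2)
    # Explicit duration prior: Poisson(mean_dur) log-probability of every
    # segment length, which penalises segmentations of many short fragments.
    logdur = sum(d * np.log(mean_dur) - mean_dur - lgamma(d + 1)
                 for d in run_lengths(labels))
    return loglik + logdur

# Single-site Metropolis updates over the frame labelling.
labels = rng.integers(K, size=obs.size)
lp = log_posterior(labels)
for sweep in range(200):
    for t in rng.permutation(obs.size):
        proposal = labels.copy()
        proposal[t] = (proposal[t] + rng.integers(1, K)) % K
        lp_new = log_posterior(proposal)
        if np.log(rng.random()) < lp_new - lp:   # Metropolis acceptance test
            labels, lp = proposal, lp_new

print("segments found:", run_lengths(labels).size, "(ground truth: 4)")
```

Single-site moves like these mix slowly under a strong duration prior, since flipping one interior frame creates a length-one fragment that the prior heavily penalises; this is the kind of problem that cluster moves in the style of Wolff's algorithm, as adapted in the paper, are intended to avoid by re-labelling whole runs of frames at once.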