Heuristic approach for generic audio data segmentation and annotation
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Visualizing music and audio using self-similarity
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Overlap-add methods for time-scaling of speech
Speech Communication
Music thumbnailing via structural analysis
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Singing voice detection in popular music
Proceedings of the 12th annual ACM international conference on Multimedia
Using duration models to reduce fragmentation in audio segmentation
Machine Learning
Seam carving for media retargeting
Communications of the ACM - Rural engineering development
Consumer video retargeting: context assisted spatial-temporal grid optimization
MM '09 Proceedings of the 17th ACM international conference on Multimedia
FSCAV: fast seam carving for size adaptation of videos
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Seam carving extension: a compression perspective
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Music analysis, retrieval and synthesis of audio signals MARSYAS
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Modeling music as a dynamic texture
IEEE Transactions on Audio, Speech, and Language Processing
Structure-aware music resizing using lyrics
Proceedings of the 19th international conference on World wide web
Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients
IEEE Transactions on Audio, Speech, and Language Processing
Audio thumbnailing of popular music using chroma-based representations
IEEE Transactions on Multimedia
Hi-index | 0.00 |
Content-aware music adaption, i.e. music resizing, in temporal constraints starts drawing attention from multimedia communities because of the need of real-world scenarios, e.g. animation production and radio advertisement production. The goal of music resizing is to change the length of a music track to a user preferred length using a series of basic operations, e.g. compression, prolonging, cropping and repeating. The only existing music resizing approach so far, called LyDAR, suffers from some limitations. For example, it cannot support prolonging a music track and cannot compress music pieces with very small stretch rates. In this paper, we propose MUSIZ, a generic framework for MUsic reSIZing. Observing the diversity of quality degradation for different segments, we propose the concept of stretch-resistance to measure the degree of quality degradation after a segment is stretched. MUSIZ stretches high stretch-resistance segments intensively and relieves low stretch-resistance segments to reduce the negative impact on the stretched music piece. For short length resizing requests, we develop a contiguity-preservative cropping algorithm to remove segments before stretching, while smoothing the abrupt change at the joint between two segments. Comprehensive experimental results show that MUSIZ is superior to the existing approaches.