Development of an instrument measuring user satisfaction of the human-computer interface
CHI '88 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Content-Based Video Indexing and Retrieval
IEEE MultiMedia
A guided tour to approximate string matching
ACM Computing Surveys (CSUR)
Automatic location of text in video frames
MULTIMEDIA '01 Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrieval
Discrete Time Processing of Speech Signals
Discrete Time Processing of Speech Signals
Creating music videos using automatic media analysis
Proceedings of the tenth ACM international conference on Multimedia
Statistical Learning of Multi-view Face Detection
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Discovering Musical Structure in Audio Recordings
ICMAI '02 Proceedings of the Second International Conference on Music and Artificial Intelligence
SVMTorch: support vector machines for large-scale regression problems
The Journal of Machine Learning Research
Automated extraction of music snippets
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Music thumbnailing via structural analysis
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Content-based music structure analysis with applications to music semantics understanding
Proceedings of the 12th annual ACM international conference on Multimedia
Music summarization using key phrases
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Contextual factors affecting the utility of surrogates within exploratory search
Information Processing and Management: an International Journal
Clip based video summarization and ranking
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
An integrated music video browsing system for personalized television
Expert Systems with Applications: An International Journal
Static and dynamic video summaries
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Video summarization based on balanced AV-MMR
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Hi-index | 0.00 |
In this paper, we propose a novel approach for automatic music video summarization based on audio-visual-text analysis and alignment. The music video is separated into the music and video tracks. For the music track, the chorus is detected based on music structure analysis. For the video track, we first segment the shots and classify the shots into close-up face shots and non-face shots, then we extract the lyrics and detect the most repeated lyrics from the shots. The music video summary is generated based on the alignment of boundaries of the detected chorus, shot class and the most repeated lyrics from the music video. The experiments on chorus detection, shot classification, and lyrics detection using 20 English music videos are described. Subjective user studies have been conducted to evaluate the quality and effectiveness of summary. The comparisons with the summaries based on our previous method and the manual method indicate that the results of summarization using the proposed method are better at meeting users' expectations.