Live speaker identification in conversations
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Visual speaker localization aided by acoustic models
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Semantically enabled exploratory video search
Proceedings of the 3rd International Semantic Search Workshop
Joke-o-Mat HD: browsing sitcoms with human derived transcripts
Proceedings of the international conference on Multimedia
Narrative theme navigation for sitcoms supported by fan-generated scripts
Proceedings of the 3rd international workshop on Automated information extraction in media production
StoViz: story visualization of TV series
Proceedings of the 20th ACM international conference on Multimedia
Narrative theme navigation for sitcoms supported by fan-generated scripts
Multimedia Tools and Applications
Hi-index | 0.00 |
This paper summarizes our contribution to the Yahoo! task of the ACM Multimedia Grand Challenge. This challenge asks for the robust automatic segmentation of videos according to "narrative themes". Based on the automatic segmentation methods presented in [1] and partly [2], we describe a system to navigate Seinfeld episodes based on automatic segmentation of the audio track only. The system distinguishes laughter, music, and other noise as well as speech segments. Speech segments are identified against pre-trained speaker models of the actors. Given this segmentation and the artistic production rules that underlie the genre situation comedy and Seinfeld in particular, the system enables a user to browse an episode by scene, by punchline, and by dialog segments. The themes can be filtered by the main actors, e.g. the user can select to see only punchlines by Jerry and Kramer. Based on the length of the laughter, the top 5 punchlines are also identified and presented to the user.