Segmentation and annotation of audiovisual recordings based on automated speech recognition

Authors:
Stephan Repp;Jörg Waitelonis;Harald Sack;Christoph Meinel
Affiliations:
Hasso-Plattner-Institut für Softwaresystemtechnik GmbH, Potsdam, Germany;Friedrich-Schiller-Universität Jena, Jena, Germany;Friedrich-Schiller-Universität Jena, Jena, Germany;Hasso-Plattner-Institut für Softwaresystemtechnik GmbH, Potsdam, Germany
Venue:
IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Year:
2007



Abstract

Searching multimedia data, in particular audiovisual data, is still a challenging task. The number of digital video recordings has increased dramatically as recording technology has become more affordable and network infrastructure has become capable enough to support download and streaming. However, the accessibility and traceability of their content for further use remain rather limited. In this paper we describe and evaluate a new approach to synchronizing auxiliary text-based material, e.g. presentation slides, with lecture video recordings. Our goal is to show that the tentative transliteration produced by automated speech recognition is sufficient for synchronization. We discuss and evaluate different approaches to synchronizing textual material with deficient transliterations of lecture recordings. Our evaluation data set comprises recordings in different languages and by various speakers.