Augmented segmentation and visualization for presentation videos
Proceedings of the 13th annual ACM international conference on Multimedia
Analysis and processing of lecture audio data: preliminary investigations
SpeechIR '04 Proceedings of the Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval at HLT-NAACL 2004
TalkMiner: a lecture webcast search engine
Proceedings of the international conference on Multimedia
Lecture Video Indexing and Analysis Using Video OCR Technology
SITIS '11 Proceedings of the 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems
Hi-index | 0.00 |
This paper presents an automated framework for lecture video indexing in the tele-teaching context. The major issues involved in our approach are content-based lecture video analysis and integration of proposed analysis engine into a lecture video portal. In video visual analysis, we apply automated video segmentation, video OCR (Optical Character Recognition) technologies for extracting lecture structural and textual metadata. Concerning ASR (Automated Speech Recognition) analysis, we have optimized the workflow for the creation of a German speech corpus from raw lecture audio data. This enables us to minimize the time and effort required for extending the speech corpus and thus improving the recognition rate. Both, OCR and ASR results have been applied for the further video indexing. In order to integrate the analysis engine into the lecture video portal, we have developed an architecture for the corresponding tasks such as, e.g., data transmission, analysis management, and result visualization etc. The accuracy of each individual analysis method has been evaluated by using publicly available test data sets.