VideoCLEF 2008: ASR classification with Wikipedia categories

Authors:
Jens Kürsten;Daniel Richter;Maximilian Eibl
Affiliations:
Chemnitz University of Technology, Faculty of Computer Science, Chair Computer Science and Media, Chemnitz, Germany;Chemnitz University of Technology, Faculty of Computer Science, Chair Computer Science and Media, Chemnitz, Germany;Chemnitz University of Technology, Faculty of Computer Science, Chair Computer Science and Media, Chemnitz, Germany
Venue:
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Year:
2008

Citing 2
Cited 3

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Overview of VideoCLEF 2008: automatic generation of topic-based feeds for dual language audio-visual content

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access

Video classification as IR task: experiments and observations

CLEF'09 Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments
Putting it all together: the Xtrieval framework at Grid@CLEF 2009

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Wikipedia based news video topic modeling for information extraction

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

This article describes our participation at the VideoCLEF track. We designed and implemented a prototype for the classification of the Video ASR data. Our approach was to regard the task as text classification problem. We used terms from Wikipedia categories as training data for our text classifiers. For the text classification the Naive-Bayes and kNN classifier from the WEKA toolkit were used. We submitted experiments for classification task 1 and 2. For the translation of the feeds to English (translation task) Google's AJAX language API was used. Although our experiments achieved only low precision of 10 to 15 percent, we assume those results will be useful in a combined setting with the retrieval approach that was widely used. Interestingly, we could not improve the quality of the classification by using the provided metadata.