A semi-automatic text-based semantic video annotation system for Turkish facilitating multilingual retrieval

Authors:
Dilek KüçüK;Adnan YazıCı
Affiliations:
Power Electronics Department, TíBİTAK Energy Institute, 06531 Ankara, Turkey;Department of Computer Engineering, Middle East Technical University, 06531 Ankara, Turkey
Venue:
Expert Systems with Applications: An International Journal
Year:
2013

Citing 37
Cited 0

Content-Based Image Retrieval at the End of the Early Years

IEEE Transactions on Pattern Analysis and Machine Intelligence
A rule-based video database system architecture

Information Sciences—Informatics and Computer Science: An International Journal
Lexical cohesion computed by thesaural relations as an indicator of the structure of text

Computational Linguistics
TextTiling: segmenting text into multi-paragraph subtopic passages

Computational Linguistics
Discourse segmentation by human and automated means

Computational Linguistics
Multimedia indexing through multi-source and multi-language information extraction: the MUMIS project

Data & Knowledge Engineering - NLDB2002
A statistical information extraction system for Turkish

Natural Language Engineering
Advances in domain independent linear text segmentation

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Spatio-temporal querying in video databases

Information Sciences—Informatics and Computer Science: An International Journal
Message Understanding Conference-6: a brief history

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
SeLeCT: a lexical cohesion based news story segmentation system

AI Communications - STAIRS 2002
Story boundary detection in large broadcast news video archives: techniques, experience and trends

Proceedings of the 12th annual ACM international conference on Multimedia
Multimodal Video Indexing: A Review of the State-of-the-art

Multimedia Tools and Applications
KIM – a semantic platform for information extraction and retrieval

Natural Language Engineering
Web-assisted annotation, semantic indexing and search of television and radio news

WWW '05 Proceedings of the 14th international conference on World Wide Web
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Spoken and written news story segmentation using lexical chains

NAACLstudent '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Proceedings of the HLT-NAACL 2003 student research workshop - Volume 3
Large-Scale Concept Ontology for Multimedia

IEEE MultiMedia
The challenge problem for automated detection of 101 semantic concepts in multimedia

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
SemanticVox: a multilingual video search engine

Proceedings of the 6th ACM international conference on Image and video retrieval
A review of text and image retrieval approaches for broadcast news video

Information Retrieval
User-assisted query translation for interactive cross-language information retrieval

Information Processing and Management: an International Journal
Information retrieval on Turkish texts

Journal of the American Society for Information Science and Technology
A novel block intensity comparison code for video classification and retrieval

Expert Systems with Applications: An International Journal
A generalised cross-modal clustering method applied to multimedia news semantic indexing and retrieval

Proceedings of the 18th international conference on World wide web
Lattice parsing to integrate speech recognition and rule-based machine translation

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Story segmentation of brodcast news in English, Mandarin and Arabic

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Named Entity Recognition Experiments on Turkish Texts

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Story segmentation and topic classification of broadcast news via a topic-based segmental model and a genetic algorithm

IEEE Transactions on Audio, Speech, and Language Processing
News story segmentation in multiple modalities

Multimedia Tools and Applications
Effective content-based video retrieval using pattern-indexing and matching techniques

Expert Systems with Applications: An International Journal
Exploiting information extraction techniques for automatic semantic video indexing with an application to Turkish news videos

Knowledge-Based Systems
RitroveRAI: a web application for semantic indexing and hyperlinking of multimedia news

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
Story segmentation in news videos using visual and text cues

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
TV news story segmentation based on semantic coherence and content similarity

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Multilingual video indexing and retrieval employing an information extraction tool for turkish news texts: a case study

FQAS'11 Proceedings of the 9th international conference on Flexible Query Answering Systems
Using Webcast Text for Semantic Event Detection in Broadcast Sports Video

IEEE Transactions on Multimedia

Quantified Score

Hi-index	12.05

Visualization

Abstract

It is commonly acknowledged that ever-increasing video archives should be conveniently indexed with the conveyed semantic information to facilitate later video retrieval. Domain-independent semantic video indexing is usually carried out through manual means which is too time-consuming and labor-intensive to be employed in practical settings. On the other hand, fully automated approaches are usually proposed for very specialized domains such as team sports videos. In this paper, we propose a generic text-based semi-automatic system for off-line semantic indexing and retrieval of news videos, since video texts such as speech transcripts stand as a plausible source of semantic information. The proposed system has a pipelined flow of execution where the sole manual intervention takes place during text extraction, yet it could execute in fully automated mode in case the associated video text is already available or a convenient text extractor is available to be incorporated into the system. At the core of the system is an information extraction component - a named entity recognizer - which extracts representative semantic information from the video texts. Based on the proposed generic system, a novel semantic annotation and retrieval system for Turkish is designed, implemented, and evaluated on two distinct news video data sets. By equipping it with the necessary components, the ultimate system is also turned into a multilingual video retrieval system and executed on a video data set in English, thereby facilitating multilingual semantic video retrieval.