Enabling video annotation using a semantic database extended with visual knowledge

Authors:
G. C. Stein;J. Rittscher;A. Hoogs
Affiliations:
One Res. Circle, GE Global Res., NY, USA;One Res. Circle, GE Global Res., NY, USA;One Res. Circle, GE Global Res., NY, USA
Venue:
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Year:
2003

Citing 4
Cited 0

Neural Network-Based Face Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Building Large Knowledge-Based Systems; Representation and Inference in the Cyc Project

Building Large Knowledge-Based Systems; Representation and Inference in the Cyc Project
Show&Tell: A Semi-Automated Image Annotation System

IEEE MultiMedia
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV

Quantified Score

Hi-index	0.00

Visualization

Abstract

A semantic database has been extended with visual information to enable video annotation. This paper describes a lexical database, WordNet. We show its limitations with respect to describing visual characteristics, and describe an extension to WordNet that contains specific visual information. Having such a semantic database makes video annotation possible for broadcast news: a domain that can cover any topic and involve a wide variety of events, objects and scenes. Combining basic visual analysis techniques and a semantic database containing visual descriptions avoids the problem developing large numbers of specific object and event detectors. Such a semantic database can be of great value for the analysis of multi-modal information. As far as we know, such a database has not been developed before.