Discriminative model fusion for semantic concept detection and annotation in video

  • Authors:
  • G. Iyengar; H. J. Nock

  • Affiliations:
  • IBM TJ Watson Research Center, NY (both authors)

  • Venue:
  • MULTIMEDIA '03: Proceedings of the eleventh ACM international conference on Multimedia
  • Year:
  • 2003


Abstract

In this paper we describe a general information fusion algorithm that incorporates multimodal cues to build user-defined semantic concept models. We compare this technique with a Bayesian network-based approach on a semantic concept detection task; results indicate that our technique yields superior performance. We further demonstrate the approach by building classifiers for arbitrary concepts in a score space defined by a pre-deployed set of multimodal concept detectors. Results show that annotation performance for user-defined concepts, both inside and outside the pre-deployed set, is competitive with our best video-only models on the TREC Video 2002 corpus.
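The score-space idea in the abstract can be illustrated with a minimal sketch: each video shot is represented by the confidence scores of a fixed bank of pre-deployed concept detectors, and a discriminative classifier for a new user-defined concept is trained on those score vectors. The detector names, data, and use of logistic regression below are all illustrative assumptions, not the paper's actual models or corpus.

```python
import math

# Hypothetical setup: each shot is a vector of scores from three
# pre-deployed detectors, e.g. [outdoors, crowd, speech]. We fit a
# discriminative model (logistic regression via stochastic gradient
# descent) for a new user-defined concept such as "sport".

def train_logreg(X, y, lr=0.5, epochs=500):
    """Fit logistic regression weights on score vectors X with labels y."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi)) + b
            p = 1.0 / (1.0 + math.exp(-z))   # sigmoid
            g = p - yi                        # gradient of log-loss
            w = [wj - lr * g * xj for wj, xj in zip(w, xi)]
            b -= lr * g
    return w, b

def predict(w, b, x):
    """Probability that a shot with detector scores x shows the concept."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))

# Toy training data (invented): two "sport" shots, two non-sport shots.
X = [[0.9, 0.8, 0.2], [0.8, 0.9, 0.1], [0.1, 0.2, 0.9], [0.2, 0.1, 0.8]]
y = [1, 1, 0, 0]
w, b = train_logreg(X, y)
score = predict(w, b, [0.85, 0.9, 0.15])  # unseen sporty-looking shot
```

Because the classifier operates only on detector scores, the same recipe extends to concepts outside the pre-deployed set: any new concept needs only a handful of labeled shots, not new low-level feature models.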