Exploring knowledge of sub-domain in a multi-resolution bootstrapping framework for concept detection in news video

Authors:
Gang Wang;Tat-Seng Chua;Ming Zhao
Affiliations:
National University of Singapore, Singapore, Singapore;National University of Singapore, Singapore, Singapore;GOOGLE, Mountain View, USA
Venue:
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Year:
2008

Citing 14
Cited 6

Using statistical testing in the evaluation of retrieval experiments

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Data clustering: a review

ACM Computing Surveys (CSUR)
The LIMSI Broadcast News transcription system

Speech Communication - Special issue on automatic transcription of broadcast news data
Robust automated topic identification

Robust automated topic identification
Learning query-class dependent weights in automatic video retrieval

Proceedings of the 12th annual ACM international conference on Multimedia
Story boundary detection in large broadcast news video archives: techniques, experience and trends

Proceedings of the 12th annual ACM international conference on Multimedia
On the detection of semantic concepts at TRECVID

Proceedings of the 12th annual ACM international conference on Multimedia
A bootstrapping framework for annotating and retrieving WWW images

Proceedings of the 12th annual ACM international conference on Multimedia
ACM SIGMM retreat report on future directions in multimedia research

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Semi-Supervised Cross Feature Learning for Semantic Concept Detection in Videos

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
The challenge problem for automated detection of 101 semantic concepts in multimedia

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Speech and Language Processing (2nd Edition)

Speech and Language Processing (2nd Edition)
Cross-domain video concept detection using adaptive svms

Proceedings of the 15th international conference on Multimedia
Proposing a new term weighting scheme for text categorization

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1

On the sampling of web images for learning visual concept classifiers

Proceedings of the ACM International Conference on Image and Video Retrieval
Automatic generation of semantic fields for annotating web images

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Robust Video Content Analysis via Transductive Learning

ACM Transactions on Intelligent Systems and Technology (TIST)
Knowledge adaptation for ad hoc multimedia event detection with few exemplars

Proceedings of the 20th ACM international conference on Multimedia
We are not equally negative: fine-grained labeling for multimedia event detection

Proceedings of the 21st ACM international conference on Multimedia
E-LAMP: integration of innovative ideas for multimedia event detection

Machine Vision and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a model based on a multi-resolution, multi-source and multi-modal (M3) bootstrapping framework that exploits knowledge of sub-domains for concept detection in news video. Because the characteristics and distributions of data in different sub-domains are different, we model and analyze the video in each sub-domain separately using a transductive framework. Along with this framework, we propose a "pseudo-Vapnik combined error bound" to tackle the problem of imbalanced distribution of training data in certain segments of sub-domains. For effective fusion of multi-modal features, we utilize multi-resolution inference and constraints to permit evidences from different modal features to support each other. Finally, we employ a bootstrapping technique to leverage unlabeled data to boost the overall system performance. We test our framework by detecting semantic concepts in the TRECVID 2004 dataset. Experimental results demonstrate that our approach is effective.