Multi-cue fusion for semantic video indexing

Authors:
Ming-Fang Weng;Yung-Yu Chuang
Affiliations:
National Taiwan University, Taipei, Taiwan, ROC;National Taiwan University, Taipei, Taiwan, ROC
Venue:
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Year:
2008

Abstract

The sheer number of videos now available makes semantic video retrieval difficult. Query-by-concept, recently proposed to address this problem, depends heavily on the accuracy of concept-based video indexing. This paper describes a multi-cue fusion approach for improving the accuracy of semantic video indexing. The approach rests on a unified framework that explores and integrates both the contextual correlation among concepts and the temporal dependency among shots. The framework is novel in two ways. First, a recursive algorithm learns both inter-concept and inter-shot relationships from ground-truth annotations of tens of thousands of shots for hundreds of concepts. Second, labels for all concepts and all shots are solved simultaneously by optimizing a graphical model. Experiments on the widely used TRECVID 2006 data set show that our framework is effective for semantic concept detection in video, improving inferred average precision by around 30% on two popular benchmarks, VIREO-374 and Columbia374.
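
To make the idea of combining a contextual cue with a temporal cue concrete, the following is a minimal sketch and not the paper's method: it replaces the learned relationships and the graphical-model optimization with a simple iterative weighted average. The function name `fuse_scores`, the `concept_affinity` matrix, and the weights `alpha` and `beta` are all hypothetical, chosen only for illustration.

```python
def fuse_scores(scores, concept_affinity, alpha=0.5, beta=0.25, iterations=10):
    """Refine detector scores with a contextual cue and a temporal cue.

    Illustrative assumption only; this is NOT the algorithm from the paper.

    scores[t][c]           initial confidence in [0, 1] for concept c in shot t
    concept_affinity[c][d] non-negative weight for how strongly d supports c
    alpha                  weight kept on the original detector score
    beta                   weight on the inter-concept cue; the remainder
                           (1 - alpha - beta) goes to the inter-shot cue
    """
    n_shots = len(scores)
    n_concepts = len(scores[0])
    current = [row[:] for row in scores]

    for _ in range(iterations):
        updated = []
        for t in range(n_shots):
            row = []
            for c in range(n_concepts):
                # Contextual cue: affinity-weighted mean of the other
                # concepts' current scores in the same shot.
                others = [d for d in range(n_concepts) if d != c]
                total = sum(concept_affinity[c][d] for d in others)
                if total > 0:
                    context = sum(
                        concept_affinity[c][d] * current[t][d] for d in others
                    ) / total
                else:
                    context = current[t][c]

                # Temporal cue: mean score of the same concept in the
                # immediately adjacent shots.
                neighbors = [
                    current[u][c] for u in (t - 1, t + 1) if 0 <= u < n_shots
                ]
                if neighbors:
                    temporal = sum(neighbors) / len(neighbors)
                else:
                    temporal = current[t][c]

                row.append(
                    alpha * scores[t][c]
                    + beta * context
                    + (1 - alpha - beta) * temporal
                )
            updated.append(row)
        current = updated
    return current


# Three consecutive shots, two concepts. Concept 0 has an isolated low
# score in the middle shot that both cues suggest is a missed detection.
example_scores = [[0.9, 0.8], [0.2, 0.8], [0.9, 0.7]]
example_affinity = [[0.0, 1.0], [1.0, 0.0]]
fused = fuse_scores(example_scores, example_affinity)
```

In this toy input, the middle shot's score for concept 0 is pulled upward because the co-occurring concept and the neighboring shots both score high. With `alpha + beta <= 1` and inputs in [0, 1], every update is a convex combination, so the fused scores stay in [0, 1].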