Ensemble multi-instance multi-label learning approach for video annotation task

Authors:
Xin-Shun Xu;Xiangyang Xue;Zhi-Hua Zhou
Affiliations:
Nanjing University & Shandong University , Nanjing, China;Fudan University, Shanghai, China;Nanjing University, Nanjing, China
Venue:
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Year:
2011

Citing 16
Cited 3

Solving the multiple instance problem with axis-parallel rectangles

Artificial Intelligence
Multi-Instance Kernels

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Hidden Markov models for automatic annotation and content-based retrieval of images and video

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Multimedia semantic indexing using model vectors

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 1
Exploring temporal consistency for video analysis and retrieval

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
The challenge problem for automated detection of 101 semantic concepts in multimedia

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
The Pyramid Match Kernel: Efficient Learning with Sets of Features

The Journal of Machine Learning Research
Correlative multilabel video annotation with temporal kernels

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
M3MIML: A Maximum Margin Method for Multi-instance Multi-label Learning

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Multi-instance learning by treating instances as non-I.I.D. samples

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Exploratory undersampling for class-imbalance learning

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Drosophila gene expression pattern annotation through multi-instance multi-label learning

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Evaluating Color Descriptors for Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
A New SVM Approach to Multi-instance Multi-label Learning

ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Multi-Layer Multi-Instance Learning for Video Concept Detection

IEEE Transactions on Multimedia
Factor graph framework for semantic video indexing

IEEE Transactions on Circuits and Systems for Video Technology

Semi-supervised multi-instance multi-label learning for video annotation task

Proceedings of the 20th ACM international conference on Multimedia
Instance Annotation for Multi-Instance Multi-Label Learning

ACM Transactions on Knowledge Discovery from Data (TKDD) - Special Issue on ACM SIGKDD 2012
Constrained instance clustering in multi-instance multi-label learning

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic video annotation is an important ingredient for video indexing, browsing, and retrieval. Traditional studies represent one video clip with a flat feature vector; however, video data usually has natural structure. Moreover, a video clip is generally relevant to multiple concepts. Indeed, the video annotation task is inherently a Multi-Instance Multi-Label (MIML) learning problem. In this paper, we propose the En-MIMLSVM approach for the video annotation task. It considers the class imbalance and long time training problems of most video annotation tasks. In addition, a temporally consistent weighted multi-instance kernel is developed to take into account both the temporal consistency in video data and the significance of instances of different levels in pyramid representation. The En-MIMLSVM is evaluated on TRECVID 2005 data set, and the results show that it outperforms several state-of-the-art methods.