Semantics reinforcement and fusion learning for multimedia streams

Authors:
Dhiraj Joshi;Milind Naphade;Apostol Natsev
Affiliations:
The Pennsylvania State University, University Park, PA;IBM Thomas J. Watson Research Center, Hawthorne, NY;IBM Thomas J. Watson Research Center, Hawthorne, NY
Venue:
Proceedings of the 6th ACM international conference on Image and video retrieval
Year:
2007

References (19):

Combination of Multiple Classifiers Using Local Accuracy Estimates
IEEE Transactions on Pattern Analysis and Machine Intelligence

On Combining Classifiers
IEEE Transactions on Pattern Analysis and Machine Intelligence

Content-Based Image Retrieval at the End of the Early Years
IEEE Transactions on Pattern Analysis and Machine Intelligence

A Theoretical Study on Six Classifier Fusion Strategies
IEEE Transactions on Pattern Analysis and Machine Intelligence

A computationally efficient evolutionary algorithm for real-parameter optimization
Evolutionary Computation

Video Retrieval by Feature Learning in Key Frames
CIVR '02 Proceedings of the International Conference on Image and Video Retrieval

Boosting Image Orientation Detection with Indoor vs. Outdoor Classification
WACV '02 Proceedings of the Sixth IEEE Workshop on Applications of Computer Vision

Detection and Location of People in Video Images Using Adaptive Fusion of Color and Edge Information
ICPR '00 Proceedings of the International Conference on Pattern Recognition - Volume 4

Boosting for Fast Face Recognition
RATFG-RTS '01 Proceedings of the IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems (RATFG-RTS'01)

Boosting Image Retrieval
International Journal of Computer Vision - Special Issue on Content-Based Image Retrieval

Ensemble selection from libraries of models
ICML '04 Proceedings of the twenty-first international conference on Machine learning

Boosting contextual information in content-based image retrieval
Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval

Optimal multimodal fusion for multimedia data analysis
Proceedings of the 12th annual ACM international conference on Multimedia

On the detection of semantic concepts at TRECVID
Proceedings of the 12th annual ACM international conference on Multimedia

Successful approaches in the TREC video retrieval evaluations
Proceedings of the 12th annual ACM international conference on Multimedia

The state of the art in image and video retrieval
CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval

A closer look at boosted image retrieval
CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval

Detection of documentary scene changes by audio-visual fusion
CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval

Factor graph framework for semantic video indexing
IEEE Transactions on Circuits and Systems for Video Technology

Cited by (5):

Inferring generic activities and events from image content and bags of geo-tags
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval

Event recognition: viewing the world with a third eye
MM '08 Proceedings of the 16th ACM international conference on Multimedia

Web news categorization using a cross-media document graph
Proceedings of the ACM International Conference on Image and Video Retrieval

HMNews: a multimodal news data association framework
Proceedings of the 2010 ACM Symposium on Applied Computing

MultiFusion: A boosting approach for multimedia fusion
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)

Abstract

Fusion of multimedia streams for enhanced performance is a critical problem for retrieval. However, learned fusion rules tend to overfit the hillclimb set on which they are tuned. In this paper, we perform fusion learning for multimedia streams using a greedy, performance-driven algorithm. In our fusion learning paradigm, the fused output is a linear combination of multiple classifiers or ranked streams. The algorithm is inspired by Ensemble Learning [2] but extends that idea to improve generalization capability. A key application of our fusion learning algorithm, described in this work, is semantics reinforcement using an ensemble of classifiers built on the same training dataset but with ground truth corresponding to different concepts. We expect classifiers built for semantically close concepts to reinforce each other's performance, and fusion learning is an effective post-classification way to reinforce both semantics and performance. Fusion learning experiments were performed on the TRECVID 2005 test set. Experiments using the well-established retrieval effectiveness measure of mean average precision show that our proposed algorithm improves over the best classifier (oracle) by 3.8%. We also present and discuss some interesting and intuitive semantic reinforcement trends observed during fusion learning.
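
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it names: greedy, performance-driven selection of streams whose fused output is a linear combination of their scores. It follows plain forward selection with replacement in the spirit of ensemble selection, not the authors' method, and it omits whatever the paper adds to improve generalization. The function names `average_precision` and `greedy_fusion`, the `max_rounds` parameter, and the stopping rule are all assumptions made for illustration.

```python
import numpy as np


def average_precision(scores, labels):
    """Non-interpolated average precision of the ranking induced by scores."""
    order = np.argsort(-np.asarray(scores, dtype=float), kind="stable")
    ranked = np.asarray(labels)[order]
    n_relevant = ranked.sum()
    if n_relevant == 0:
        return 0.0
    hits = np.cumsum(ranked)
    precision_at_k = hits / np.arange(1, len(ranked) + 1)
    # Average the precision values at the ranks of the relevant items.
    return float((precision_at_k * ranked).sum() / n_relevant)


def greedy_fusion(stream_scores, labels, max_rounds=20):
    """Greedy forward selection with replacement (hypothetical sketch).

    stream_scores: array of shape (n_streams, n_items), one row per
    classifier or ranked stream, scored on the hillclimb set.
    Returns (weights, ap): linear fusion weights that sum to 1 and the
    average precision of the fused ranking on the hillclimb set.
    """
    stream_scores = np.asarray(stream_scores, dtype=float)
    counts = np.zeros(stream_scores.shape[0], dtype=int)
    running_sum = np.zeros(stream_scores.shape[1])
    best_ap = -1.0
    for _ in range(max_rounds):
        # Score the fusion obtained by adding each stream once more.
        candidate_aps = [
            average_precision(running_sum + s, labels) for s in stream_scores
        ]
        pick = int(np.argmax(candidate_aps))
        if candidate_aps[pick] <= best_ap:
            break  # assumed stopping rule: no stream improves the fusion
        best_ap = candidate_aps[pick]
        counts[pick] += 1
        running_sum += stream_scores[pick]
    return counts / counts.sum(), best_ap


if __name__ == "__main__":
    # Toy hillclimb set: two streams scoring six items.
    labels = np.array([1, 0, 1, 0, 1, 0])
    streams = np.array([
        [0.9, 0.8, 0.2, 0.1, 0.7, 0.3],
        [0.2, 0.1, 0.9, 0.3, 0.4, 0.8],
    ])
    weights, ap = greedy_fusion(streams, labels)
    print("weights:", weights, "hillclimb AP:", ap)
```

Because each stream may be picked more than once, the selection counts act as integer weights, and normalizing them gives the coefficients of the linear combination. A sketch like this is tuned directly on the hillclimb set, so it is exposed to exactly the overfitting the abstract warns about; a separate held-out set would be needed to judge generalization.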