Content-Based Multimedia Retrieval Using Feature Correlation Clustering and Fusion

Authors:
Shu-Ching Chen;Hsin-Yu Ha;Fausto C. Fleites
Affiliations:
School of Computing and Information Sciences, Florida International University, Miami, FL, USA;School of Computing and Information Sciences, Florida International University, Miami, FL, USA;School of Computing and Information Sciences, Florida International University, Miami, FL, USA
Venue:
International Journal of Multimedia Data Engineering & Management
Year:
2013


Abstract

Processing visual features alone is no longer sufficient for multimedia semantic retrieval, because multimedia data are complex and usually involve a variety of modalities, e.g., graphics, text, speech, and video. It is therefore crucial to fully utilize the correlation between each feature and the target concept, the feature correlation within modalities, and the feature correlation across modalities. In this paper, the authors propose a Feature Correlation Clustering-based Multi-Modality Fusion Framework (FCC-MMF) for multimedia semantic retrieval. Features from different modalities are combined into one feature set with the same representation via a normalization and discretization process. Within and across modalities, multiple correspondence analysis is used to obtain the correlation between feature-value pairs, which are then projected onto the first two principal components. The K-medoids algorithm, a widely used partitional clustering algorithm, is applied to minimize the Euclidean distance within the resulting clusters and thus produce clusters of highly intra-correlated feature-value pairs. A majority vote then decides which cluster each feature belongs to. Once the feature clusters are formed, one classifier is built and trained for each cluster. The correlation and confidence of each classifier are considered when fusing the classification scores, and mean average precision is used to evaluate the final ranked classification scores. Finally, the proposed framework is applied to the NUS-WIDE-Lite data set to demonstrate its effectiveness in multimedia semantic retrieval.
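
The clustering and majority-vote steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 2-D coordinates stand in for feature-value pairs that have already been projected by multiple correspondence analysis (that step is not shown), the feature names `color`, `tag`, and `texture` are hypothetical, and seeding the medoids with the first k points is an assumption made only to keep the example deterministic.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two 2-D points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def assign(points, medoids):
    """Label each point with the index of its nearest medoid."""
    return [min(range(len(medoids)),
                key=lambda c: euclidean(p, points[medoids[c]]))
            for p in points]


def k_medoids(points, k, max_iter=100):
    """Alternating k-medoids. Seeding with the first k points is an
    assumption of this sketch, not something the abstract specifies."""
    medoids = list(range(k))
    for _ in range(max_iter):
        labels = assign(points, medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if not members:
                # Keep the old medoid if a cluster ends up empty.
                new_medoids.append(medoids[c])
                continue
            # New medoid: the member with the smallest total distance
            # to all other members of its cluster.
            best = min(members,
                       key=lambda i: sum(euclidean(points[i], points[j])
                                         for j in members))
            new_medoids.append(best)
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return assign(points, medoids), medoids


def majority_vote(pair_features, labels):
    """Assign each feature to the cluster holding most of its
    feature-value pairs."""
    votes = {}
    for feat, lab in zip(pair_features, labels):
        votes.setdefault(feat, Counter())[lab] += 1
    return {feat: cnt.most_common(1)[0][0] for feat, cnt in votes.items()}


# Hypothetical feature-value pairs: (source feature, projected 2-D point).
pairs = [
    ("color", (0.0, 0.1)), ("tag", (5.2, 4.9)), ("color", (0.2, 0.0)),
    ("tag", (4.8, 5.0)), ("color", (5.0, 5.1)), ("tag", (0.1, 0.0)),
    ("texture", (0.1, 0.3)), ("texture", (0.3, 0.2)), ("texture", (4.9, 4.8)),
]
features = [f for f, _ in pairs]
points = [p for _, p in pairs]

labels, medoids = k_medoids(points, k=2)
feature_cluster = majority_vote(features, labels)
print(feature_cluster)
```

In this toy data each feature has two pairs in one group and one in the other, so the vote places `color` and `texture` in one cluster and `tag` in the other. In the framework described above, one classifier would then be trained per feature cluster.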