Farthest neighbors, maximum spanning trees and related problems in higher dimensions
Computational Geometry: Theory and Applications
Learning query-class dependent weights in automatic video retrieval
Proceedings of the 12th annual ACM international conference on Multimedia
Optimal multimodal fusion for multimedia data analysis
Proceedings of the 12th annual ACM international conference on Multimedia
Tracking Humans using Multi-modal Fusion
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Learning the semantics of multimedia queries and concepts from a small number of examples
Proceedings of the 13th annual ACM international conference on Multimedia
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Journal of Cognitive Neuroscience
Label Propagation through Linear Neighborhoods
IEEE Transactions on Knowledge and Data Engineering
Information Fusion in Multimedia Information Retrieval
Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
Correlation-Based Video Semantic Concept Detection Using Multiple Correspondence Analysis
ISM '08 Proceedings of the 2008 Tenth IEEE International Symposium on Multimedia
Multimedia Evidence Fusion for Video Concept Detection via OWA Operator
MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Inferring semantic concepts from community-contributed images and noisy tags
MM '09 Proceedings of the 17th ACM international conference on Multimedia
NUS-WIDE: a real-world web image database from National University of Singapore
Proceedings of the ACM International Conference on Image and Video Retrieval
IEEE Transactions on Audio, Speech, and Language Processing - Special issue on multimodal processing in speech-based interactions
Correlation-based interestingness measure for video semantic concept detection
IRI'09 Proceedings of the 10th IEEE international conference on Information Reuse & Integration
Performance evaluation of score level fusion in multimodal biometric systems
Pattern Recognition
Classifier fusion for SVM-based multimedia semantic indexing
ECIR'07 Proceedings of the 29th European conference on IR research
Multiple feature fusion for social media applications
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Multimodal information fusion application to human emotion recognition from face and speech
Multimedia Tools and Applications
Cross-Media Retrieval Method Based on Temporal-spatial Clustering and Multimodal Fusion
ICICSE '09 Proceedings of the 2009 Fourth International Conference on Internet Computing for Science and Engineering
Efficient large-scale image annotation by probabilistic collaborative multi-label propagation
Proceedings of the international conference on Multimedia
A new approach to cross-modal multimedia retrieval
Proceedings of the international conference on Multimedia
Audio-visual spontaneous emotion recognition
ICMI'06/IJCAI'07 Proceedings of the ICMI 2006 and IJCAI 2007 international conference on Artifical intelligence for human computing
Audio-Visual Classification and Fusion of Spontaneous Affective Data in Likelihood Space
ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Information Fusion for Combining Visual and Textual Image Retrieval
ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
ICSC '10 Proceedings of the 2010 IEEE Fourth International Conference on Semantic Computing
Semantic combination of textual and visual information in multimedia retrieval
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Acoustic super models for large scale video event detection
J-MRE '11 Proceedings of the 2011 joint ACM workshop on Modeling and representing events
Human tracking from a mobile agent: Optical flow and Kalman filter arbitration
Image Communication
A comparison of score, rank and probability-based fusion methods for video shot retrieval
CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Double fusion for multimedia event detection
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Audio–Visual Affective Expression Recognition Through Multistream Fused HMM
IEEE Transactions on Multimedia
Design-based texture feature fusion using Gabor filters and co-occurrence probabilities
IEEE Transactions on Image Processing
Concept-Driven Multi-Modality Fusion for Video Search
IEEE Transactions on Circuits and Systems for Video Technology
Joint audio-visual bi-modal codewords for video event detection
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Multibiometric Cryptosystems Based on Feature-Level Fusion
IEEE Transactions on Information Forensics and Security - Part 2
Leveraging high-level and low-level features for multimedia event detection
Proceedings of the 20th ACM international conference on Multimedia
Cross-Media semantics mining based on sparse canonical correlation analysis and relevance feedback
PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Weighted Association Rule Mining for Video Semantic Detection
International Journal of Multimedia Data Engineering & Management
Correlation-Based Ranking for Large-Scale Video Concept Retrieval
International Journal of Multimedia Data Engineering & Management
Sparse Representation Based Discriminative Canonical Correlation Analysis for Face Recognition
ICMLA '12 Proceedings of the 2012 11th International Conference on Machine Learning and Applications - Volume 01
ISM '12 Proceedings of the 2012 IEEE International Symposium on Multimedia
Hi-index | 0.00 |
Nowadays, only processing visual features is not enough for multimedia semantic retrieval due to the complexity of multimedia data, which usually involve a variety of modalities, e.g. graphics, text, speech, video, etc. It becomes crucial to fully utilize the correlation between each feature and the target concept, the feature correlation within modalities, and the feature correlation across modalities. In this paper, the authors propose a Feature Correlation Clustering-based Multi-Modality Fusion Framework FCC-MMF for multimedia semantic retrieval. Features from different modalities are combined into one feature set with the same representation via a normalization and discretization process. Within and across modalities, multiple correspondence analysis is utilized to obtain the correlation between feature-value pairs, which are then projected onto the two principal components. K-medoids algorithm, which is a widely used partitioned clustering algorithm, is selected to minimize the Euclidean distance within the resulted clusters and produce high intra-correlated feature-value pair clusters. Majority vote is applied to subsequently decide which cluster each feature belongs to. Once the feature clusters are formed, one classifier is built and trained for each cluster. The correlation and confidence of each classifier are considered while fusing the classification scores, and mean average precision is used to evaluate the final ranked classification scores. Finally, the proposed framework is applied on NUS-wide Lite data set to demonstrate the effectiveness in multimedia semantic retrieval.