Heterogeneous image feature integration via multi-modal spectral clustering

Authors:
Xiao Cai; Feiping Nie; Heng Huang;F. Kamangar
Affiliations:
Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA;Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA;Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA;Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA
Venue:
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Year:
2011

Citing 0
Cited 5

Social event detection using multimodal clustering and integrating supervisory signals

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Group-Wise consistent parcellation of gyri via adaptive multi-view spectral clustering of fiber shapes

MICCAI'12 Proceedings of the 15th international conference on Medical Image Computing and Computer-Assisted Intervention - Volume Part II
Towards metric fusion on multi-view data: a cross-view based graph random walk approach

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
SMI 2013: Unsupervised co-segmentation of 3D shapes via affinity aggregation spectral clustering

Computers and Graphics
Multi-view K-means clustering on big data

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years, more and more visual descriptors have been proposed to describe objects and scenes appearing in images. Different features describe different aspects of the visual characteristics. How to combine these heterogeneous features has become an increasing critical problem. In this paper, we propose a novel approach to unsupervised integrate such heterogeneous features by performing multi-modal spectral clustering on unlabeled images and unsegmented images. Considering each type of feature as one modal, our new multi-modal spectral clustering (MMSC) algorithm is to learn a commonly shared graph Laplacian matrix by unifying different modals (image features). A non-negative relaxation is also added in our method to improve the robustness and efficiency of image clustering. We applied our MMSC method to integrate five types of popularly used image features, including SIFT, HOG, GIST, LBP, CENTRIST and evaluated the performance by two benchmark data sets: Caltech-101 and MSRC-v1. Compared with existing unsupervised scene and object categorization methods, our approach always achieves superior performances measured by three standard clustering evaluation metrices.