Learning semantic distance from community-tagged media collection

  • Authors:
  • Guo-Jun Qi;Xian-Sheng Hua;Hong-Jiang Zhang

  • Affiliations:
  • University of Illinois at Urbana-Champaign, Urbana, IL, USA;Microsoft Research Asia, Beijing, China;Microsoft Advanced Technology Center, Beijing, China

  • Venue:
  • MM '09 Proceedings of the 17th ACM international conference on Multimedia
  • Year:
  • 2009


Abstract

This paper proposes a novel semantic-aware distance metric for images, learned by mining multimedia data on the Internet, in particular web images and their associated tags. As is well known, a proper distance metric between images is a key ingredient in many practical web image retrieval engines, as well as in many image understanding techniques. In this paper, we attempt to mine such a distance metric from web images by integrating their visual content with their associated user tags. Unlike many existing distance metric learning algorithms, which rely only on similarity or dissimilarity information between image pixels or features at the signal level, the proposed scheme also takes the associated user-input tags into account. The visual content of images is further leveraged to respect the intuitive assumption that visually similar images ought to have a smaller distance. A semi-definite program is formulated to encode these two criteria for learning the distance metric, and we show that this optimization problem can be solved efficiently with a closed-form solution. We evaluate the proposed algorithm on two datasets: the benchmark Corel dataset and a real-world dataset crawled from the image sharing website Flickr. In comparison with other existing distance learning algorithms, the proposed algorithm obtains competitive results in experiments.
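The core idea of the abstract, learning a Mahalanobis-style image distance in which tag-similar images end up closer, can be illustrated with a minimal sketch. This is not the paper's SDP formulation; it is a simpler RCA-style closed-form stand-in, and the Jaccard tag-overlap measure, the similarity threshold, and all function names are illustrative assumptions.

```python
import numpy as np

def tag_similarity(tags_a, tags_b):
    """Jaccard overlap between two images' tag sets (assumed proxy
    for semantic similarity; not the paper's exact criterion)."""
    a, b = set(tags_a), set(tags_b)
    return len(a & b) / len(a | b) if a | b else 0.0

def learn_metric(X, tags, sim_threshold=0.5, reg=1e-3):
    """Closed-form metric sketch: M is the regularized inverse of the
    covariance of feature differences between tag-similar image pairs,
    so directions in which tag-similar images vary are down-weighted
    and such images become closer under the learned distance."""
    n, d = X.shape
    C = np.zeros((d, d))
    count = 0
    for i in range(n):
        for j in range(i + 1, n):
            if tag_similarity(tags[i], tags[j]) >= sim_threshold:
                diff = (X[i] - X[j])[:, None]  # column vector of differences
                C += diff @ diff.T
                count += 1
    # Regularization keeps C positive definite, hence M is a valid metric.
    C = C / max(count, 1) + reg * np.eye(d)
    return np.linalg.inv(C)

def metric_distance(M, x, y):
    """Mahalanobis distance sqrt((x - y)^T M (x - y)) under metric M."""
    diff = x - y
    return float(np.sqrt(diff @ M @ diff))
```

In the actual paper the metric is instead obtained by solving a semi-definite program that jointly encodes the tag and visual-similarity criteria, with an efficient closed-form solution; the sketch above only conveys the shape of the learned object (a positive-definite matrix parameterizing the distance).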