Towards indexing representative images on the web

Authors:
Xin-Jing Wang;Zheng Xu;Lei Zhang;Ce Liu;Yong Rui
Affiliations:
Microsoft Research Asia, Beijing, China;University of Science and Technology of China, Hefei, China;Microsoft Research Asia, Beijing, China;Micrososft Research New England, Boston, MA, USA;Microsoft Research Asia, Beijing, China
Venue:
Proceedings of the 20th ACM international conference on Multimedia
Year:
2012

Citing 15
Cited 3

A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Access-ordered indexes

ACSC '04 Proceedings of the 27th Australasian conference on Computer science - Volume 26
An efficient parts-based near-duplicate and sub-image retrieval system

Proceedings of the 12th annual ACM international conference on Multimedia
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Inverted files for text search engines

ACM Computing Surveys (CSUR)
AnnoSearch: Image Auto-Annotation by Search

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
LabelMe: A Database and Web-Based Tool for Image Annotation

International Journal of Computer Vision
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Large-Scale Discovery of Spatially Related Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Corpus-based semantic class mining: distributional vs. pattern-based approaches

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Partition min-hash for partial duplicate image discovery

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
Visual and semantic similarity in ImageNet

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition

Image search—from thousands to billions in 20 years

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special Sections on the 20th Anniversary of ACM International Conference on Multimedia, Best Papers of ACM Multimedia 2012
Picture tags and world knowledge: learning tag relations from visual semantic sources

Proceedings of the 21st ACM international conference on Multimedia
Image context discovery from socially curated contents

Proceedings of the 21st ACM international conference on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Even after 20 years of research on real-world image retrieval, there is still a big gap between what search engines can provide and what users expect to see. To bridge this gap, we present an image knowledge base, ImageKB, a graph representation of structured entities, categories, and representative images, as a new basis for practical image indexing and search. ImageKB is automatically constructed via a both bottom-up and top-down, scalable approach that efficiently matches 2 billion web images onto an ontology with millions of nodes. Our approach consists of identifying duplicate image clusters from billions of images, obtaining a candidate set of entities and their images, discovering definitive texts to represent an image and identifying representative images for an entity. To date, ImageKB contains 235.3M representative images corresponding to 0.52M entities, much larger than the state-of-the-art alternative ImageNet that contains 14.2M images for 0.02M synsets. Compared to existing image databases, ImageKB reflects the distributions of both images on the web and users' interests, contains rich semantic descriptions for images and entities, and can be widely used for both text to image search and image to text understanding.