Image collector II: a system for gathering more than one thousand images from the Web for one keyword

Authors:
K. Yanai
Affiliations:
Dept. of Comput. Sci., Electro-Commun. Univ., Japan
Venue:
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Year:
2003

Citing 7
Cited 3

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Unifying textual and visual cues for content-based image retrieval on the World Wide Web

Computer Vision and Image Understanding - Special issue on content-based access for image and video libraries
Visually Searching the Web for Content

IEEE MultiMedia
Content-Based Image Retrieval Systems

Computer
Efficient Color Histogram Indexing for Quadratic Form Distance Functions

IEEE Transactions on Pattern Analysis and Machine Intelligence
An Experiment on Generic Image Classification Using Web Images

PCM '02 Proceedings of the Third IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
WebSeer: An Image Search Engine for the World Wide Web

WebSeer: An Image Search Engine for the World Wide Web

Generic image classification using visual knowledge on the web

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Region-based automatic web image selection

Proceedings of the international conference on Multimedia information retrieval
Automatic categorization for WWW images with applications for retrieval navigation

PCM'04 Proceedings of the 5th Pacific Rim conference on Advances in Multimedia Information Processing - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a system that enables us to gather more than one thousand images from the World Wide Web. The system is called Image Collector II. The image collector, which we proposed previously, can gather only several hundreds images. We made the two following improvements to extend the ability of our previous system in terms of the number of gathered images and their precision: (1) We extracted some words appearing with high frequency from all HTML files embedding output images in an initial image gathering, and using them as keywords, we made a second image gathering again. Through this, we obtained more than one thousand images for one keyword. (2) The more images we gathered, the more he precision of gathered images decreased. To raise the precision, we introduced word vectors of HTML files embedding images into the image selecting process in addition to image feature vectors.