An interactive approach for filtering out junk images from keyword-based google search results

  • Authors:
  • Yuli Gao;Jinye Peng;Hangzai Luo;Daniel A. Keim;Jianping Fan

  • Affiliations:
  • Hewlett-Packard Laboratories, Palo Alto, CA;School of Electronics and Information, Northwestern Polytechnical University, Xi'an, China;Software Engineering Institute, East China Normal University, Shanghai, China;Computer and Information Science Department, University of Konstanz, Konstanz, Germany;Department of Computer Science, University of North Carolina, Charlotte, NC

  • Venue:
  • IEEE Transactions on Circuits and Systems for Video Technology
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The keyword-based Google Images search engine is now becoming very popular for online image search. Unfortunately, only the text terms that are explicitly or implicitly linked with the images are used for image indexing but the associated text terms may not have exact correspondence with the underlying image semantics, thus the keyword-based Google Images search engine may return large amounts of junk images which are irrelevant to the given keyword-based queries. Based on this observation, we have developed an interactive approach to filter out the junk images from keyword-based Google Images search results and our approach consists of the following major components. a) A kernel-based image clustering technique is developed to partition the returned images into multiple clusters and outliers. b) Hyperbolic visualization is incorporated to display large amounts of returned images according to their nonlinear visual similarity contexts, so that users can assess the relevance between the returned images and their real query intentions interactively and select one or multiple images to express their query intentions and personal preferences precisely. c) An incremental kernel learning algorithm is developed to translate the users' query intentions and personal preferences for updating the mixture-of-kernels and generating better hypotheses to achieve more accurate clustering of the returned images and filter out the junk images more effectively. Experiments on diverse keyword-based queries from Google Images search engine have obtained very positive results. Our junk image filtering system is released for public evaluation at: http://www.cs.uncc.edu/~jfan/google_demo/.