K-way min-max cut for image clustering and junk images filtering from Google images

  • Authors:
  • Feng Xie;Yi Shen;Xiaofei He

  • Affiliations:
  • State Key Lab of CAD&CD / Zhejiang University, Hangzhou, China;UNC-Charlotte, Charlotte, NC, USA;State Key Lab of CAD&CD / Zhejiang University, Hangzhou, China

  • Venue:
  • Proceedings of the international conference on Multimedia
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

Currently most existing image search engines such as Google Images index web images majorly using text keywords extracted from the context, which may return large amount of junk information. We propose a novel clustering based filtering method to filter those junk images. Firstly we apply K-way min-max cut to cluster images returned by Google into multiple clusters based on the mixture of feature kernels, with kernel weights being determined automatically instead of hard fix. Secondly we select the best cluster in a robust way, and rank all the rest clusters according to their similarity with the best one. Finally those low-rank clusters can be filtered out as junk clusters. In experiments we obtain very comparative filtering performance to the current state-of-the-art, and improve Google Images search results significantly.