Fast search in large-scale image database using vector quantization

  • Authors:
  • Hangjun Ye;Guangyou Xu

  • Affiliations:
  • Department of Computer Science and Technology, Tsinghua University, Beijing, China;Department of Computer Science and Technology, Tsinghua University, Beijing, China

  • Venue:
  • CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Practical content-based image retrieval systems require efficient indexing schemes for fast searches. Researchers have proposed many methods using space and data partitioning for exact similarity searches. However, traditional indexing methods perform poorly and will degrade to simple sequential scans at high dimensionality - that is so-called "curse of dimensionality". Recently, several filtering approaches based on vector approximation (VA) were proposed and showed promising performance. In fact, existing VA-based methods assume independent distribution of dataset and utilize scalar quantizer to partition each dimension of data space. In real databases, however, images are from different categories and often clustered. In this paper, a novel indexing method using vector quantization is proposed. This approach introduces a vector quantizer to partition data space. It assumes a Gaussian mixture distribution and estimates this distribution through Expectation-Maximization (EM) method. Experiments on a large database of 275,465 images demonstrated a remarkable improvement of retrieval efficiency.