Quantization-based probabilistic feature modeling for kernel design in content-based image retrieval

  • Authors:
  • Hua Xie;Victor Andreu;Antonio Ortega

  • Affiliations:
  • California Institute of Technology, Pasadena, CA;University of Southern California, Los Angeles, CA;University of Southern California, Los Angeles, CA

  • Venue:
  • MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a quantization-based probabilistic feature modeling approach is proposed for relevance feedback in content-based image retrieval. We demonstrate its performance by using the resulting models within a support vector machine (SVM) based technique. Each feature component is quantized and mapped to probabilistic quantities representing the likelihood of the image being relevant (and irrelevant). These probabilistic quantities are then used to derive an information divergence-based kernel function for SVM classification which we introduced in earlier work. We show that the proposed method leads to the optimal maximum likelihood solution as the knowledge of the actual underlying probability model improves (i.e.,as the feature space is partitioned into arbitrarily small "regions "and accurate models are known for all regions). vWe investigate several practical quantization designs for feature modeling specifically in relevance feedback applications,where the scarcity of the data and high dimensionality prevent usage of vector quantization and parametric modeling approaches.Our proposed framework naturally takes into account the statistics of the data that is available during relevance feedback for the purpose of discriminating between relevant and irrelevant images.Experiments with the Corel dataset show that quantizers specifically designed for this application achieve gains over simple uniform quantizers (e.g.,5% to 10% in retrieval accuracy) when combined with our information divergence kernel. This kernel achieves gains (e.g.,17% in retrieval accuracy after first relevance feedback)as compared to the standard radial basis function (RBF) kernel used for SVM-based relevance feedback.