Spatial pooling for transformation invariant image representation

  • Authors:
  • Xia Li;Yan Song;Yijuan Lu;Qi Tian

  • Affiliations:
  • University of Texas at San Antonio, San Antonio, TX, USA;University of Science and Technology of China, Hefei, China;Texas State University, San Marcos, TX, USA;University of Texas at San Antonio, San Antonio, TX, USA

  • Venue:
  • MM '11 Proceedings of the 19th ACM international conference on Multimedia
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Spatial Pyramid Matching (SPM) [2] has been proposed to extend the Bag-of-Word (BoW) model for object classification. By re-serving the finer level information, it makes image matching more accurate. However, for not well-aligned images, where the object is rotated, flipped or translated, SPM may lose its discrimination power. To tackle this problem, we propose novel spatial pooling layouts to address various transformations, and generate a more general image representation. To evaluate the effectiveness of the proposed approach, we conduct extensive experiments on three transformation emphasized datasets for object classification task. Experimental results demonstrate its superiority over the state-of-the-arts. Besides, the proposed image representation is compact and consistent with the BoW model, which makes it applicable to image retrieval task as well.