Large-scale EMM identification based on geometry-constrained visual word correspondence voting

  • Authors:
  • Xin Yang;Qiong Liu;Chunyuan Liao;Kwang-Ting Cheng;Andreas Girgensohn

  • Affiliations:
  • University of California, Santa Barbara, CA;FX Palo Alto Laboratory, Bldg., Palo Alto, CA;FX Palo Alto Laboratory, Bldg., Palo Alto, CA;University of California, Santa Barbara, CA;FX Palo Alto Laboratory, Bldg., Palo Alto, CA

  • Venue:
  • Proceedings of the 1st ACM International Conference on Multimedia Retrieval
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a large-scale Embedded Media Marker (EMM) identification system which allows users to retrieve relevant dynamic media associated with a static paper document via camera-phones. The user supplies a query image by capturing an EMM-signified patch of a paper document through a camera phone. The system recognizes the query and in turn retrieves and plays the corresponding media on the phone. Accurate image matching is crucial for positive user experience in this application. To address the challenges posed by large datasets and variation in camera-phone-captured query images, we introduce a novel image matching scheme based on geometrically consistent correspondences. A hierarchical scheme, combined with two constraining methods, is designed to detect geometric constrained correspondences between images. A spatial neighborhood search approach is further proposed to address challenging cases of query images with a large translational shift. Experimental results on a 200k+ dataset show that our solution achieves high accuracy with low memory and time complexity and outperforms the baseline bag-of-words approach.