Detection of similar advertisements in media databases

  • Authors:
  • Karel Palecek

  • Affiliations:
  • Institute of Information Technology and Electronics, Technical University of Liberec, Czech Republic

  • Venue:
  • COST'10 Proceedings of the 2010 international conference on Analysis of Verbal and Nonverbal Communication and Enactment
  • Year:
  • 2010

Abstract

This contribution presents a system for detecting similar images of advertisements in moderate-size datasets. These datasets are updated daily and consist mainly of advertisements from TV, newspapers, journals, etc. The task is to identify clusters of duplicate advertisements in a given dataset. Images differ in translation, scale, and amount of compression. The presented approach is based on the recently popular bag-of-features approach, which has been used successfully in image retrieval and related areas. Each image is represented as a weighted histogram of local features. Similarities computed from the extracted features are projected onto a separating hyperplane and clustered using agglomerative hierarchical clustering. Experiments show that this simple and efficient scheme yields good results and finds corresponding images, with a reasonable false positive rate, even for advertisements that differ substantially in spatial arrangement and color composition.
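The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation. It assumes that local features have already been extracted and quantized into integer "visual word" ids, uses tf-idf as one possible histogram weighting, compares histograms by cosine similarity, and applies average-linkage agglomerative clustering with a stopping threshold. The hyperplane projection step is omitted because the abstract does not specify it. All function names, the weighting scheme, the linkage criterion, and the threshold are assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_histograms(images):
    """Build one weighted histogram per image.

    `images` is a list of lists of visual-word ids (assumed to come from
    a separate feature extraction and quantization step).
    """
    n = len(images)
    # Document frequency: in how many images each visual word occurs.
    df = Counter(w for words in images for w in set(words))
    hists = []
    for words in images:
        tf = Counter(words)
        total = len(words)
        hists.append({w: (c / total) * math.log(n / df[w])
                      for w, c in tf.items()})
    return hists


def cosine(a, b):
    """Cosine similarity of two sparse histograms stored as dicts."""
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def agglomerative(hists, threshold):
    """Average-linkage clustering on cosine similarity.

    Repeatedly merges the two most similar clusters and stops once the
    best available similarity drops below `threshold` (an assumed,
    hand-picked parameter).
    """
    sim = [[cosine(a, b) for b in hists] for a in hists]
    clusters = [[i] for i in range(len(hists))]

    def link(c1, c2):
        return sum(sim[i][j] for i in c1 for j in c2) / (len(c1) * len(c2))

    while len(clusters) > 1:
        best, (i, j) = max(
            (link(clusters[i], clusters[j]), (i, j))
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if best < threshold:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return sorted(sorted(c) for c in clusters)


if __name__ == "__main__":
    # Toy data: images 0 and 1 share words 1-3, images 2 and 3 share 4-6.
    # Word 7 occurs everywhere, so its tf-idf weight is zero.
    toy = [[1, 1, 2, 3, 7], [1, 2, 3, 3, 7], [4, 5, 5, 6, 7], [4, 4, 5, 6, 7]]
    print(agglomerative(tfidf_histograms(toy), threshold=0.5))
```

In this toy input the two pairs of images end up in separate clusters, since a word shared by every image carries no weight and the pairs have no other words in common. The pairwise search is cubic in the number of images, which is acceptable for a sketch but would need a more efficient linkage update for larger datasets.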