Near-Duplicate detection for online-shops owners: an FCA-Based approach

  • Authors:
  • Dmitry I. Ignatov;Andrey V. Konstantinov;Yana Chubis

  • Affiliations:
  • National Research University Higher School of Economics, Moscow, Russia;National Research University Higher School of Economics, Moscow, Russia;National Research University Higher School of Economics, Moscow, Russia

  • Venue:
  • ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
  • Year:
  • 2013

Abstract

We propose a prototype of a near-duplicate detection system for web-shop owners. Such online businesses typically buy descriptions of their goods from copywriters. A copywriter may occasionally cheat and provide the owner with almost identical descriptions for different items. In this paper we demonstrate how Formal Concept Analysis (FCA) can be used for fast clustering and for revealing such duplicates in datasets from a real online perfume shop.
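
The idea of using FCA to group near-duplicate descriptions can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how descriptions are represented, so the sketch assumes a formal context whose objects are item descriptions and whose attributes are word shingles. The function names (shingles, near_duplicate_clusters), the parameters k and min_shared, and the sample texts are all hypothetical. For simplicity it builds only the formal concepts generated by pairs of descriptions, not the full concept lattice.

```python
from itertools import combinations


def shingles(text, k=3):
    """Return the set of k-word shingles of a text (assumed representation)."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def near_duplicate_clusters(docs, k=3, min_shared=2):
    """Map each candidate cluster (extent) to its shared shingles (intent).

    docs: dict mapping an item name to its description text.
    Only concepts generated by pairs of descriptions are built.
    """
    # Formal context: object (description) -> set of attributes (shingles).
    context = {name: shingles(text, k) for name, text in docs.items()}
    concepts = {}
    for a, b in combinations(sorted(context), 2):
        # Intent: shingles common to both descriptions (a closed attribute set).
        intent = context[a] & context[b]
        if len(intent) < min_shared:
            continue
        # Extent: every description that contains all shingles of the intent.
        extent = frozenset(d for d, s in context.items() if intent <= s)
        concepts[extent] = frozenset(intent)
    return concepts


if __name__ == "__main__":
    # Hypothetical sample descriptions, invented for illustration only.
    sample = {
        "p1": "fresh floral scent with notes of jasmine and rose",
        "p2": "fresh floral scent with notes of jasmine and musk",
        "p3": "woody oriental fragrance for evening wear",
    }
    for extent, intent in near_duplicate_clusters(sample).items():
        print(sorted(extent), len(intent))
```

Each returned pair is a formal concept of the assumed context: a set of descriptions together with exactly the shingles they all share. A large intent signals a likely near-duplicate cluster, and the min_shared threshold would need tuning on real data.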