On scalability of the similarity search in the world of peers

  • Authors:
  • Michal Batko;David Novak;Fabrizio Falchi;Pavel Zezula

  • Affiliations:
  • Masaryk University, Brno, Czech Republic;Masaryk University, Brno, Czech Republic;ISTI-CNR, Pisa, Italy;Masaryk University, Brno, Czech Republic

  • Venue:
  • InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Due to the increasing complexity of current digital data, similarity search has become a fundamental computational task in many applications. Unfortunately, its costs are still high and the linear scalability of single server implementations prevents from efficient searching in large data volumes. In this paper, we shortly describe four recent scalable distributed similarity search techniques and study their performance of executing queries on three different datasets. Though all the methods employ parallelism to speed up query execution, different advantages for different objectives have been identified by experiments. The reported results can be exploited for choosing the best implementations for specific applications. They can also be used for designing new and better indexing structures in the future.