A fast indexing algorithm optimization with user behavior pattern

  • Authors:
  • Zhu Wang;Tiejian Luo;Yanxiang Xu;Fuxing Cheng;Xin Zhang;Xiang Wang

  • Affiliations:
  • Graduate University of Chinese Academy of Sciences (GUCAS), Beijing, China;Graduate University of Chinese Academy of Sciences (GUCAS), Beijing, China;Graduate University of Chinese Academy of Sciences (GUCAS), Beijing, China;Graduate University of Chinese Academy of Sciences (GUCAS), Beijing, China;Graduate University of Chinese Academy of Sciences (GUCAS), Beijing, China;Graduate University of Chinese Academy of Sciences (GUCAS), Beijing, China

  • Venue:
  • ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Internet users' access pattern for objects has been observed to follow Zipf's law. The preference for network resource is showing strong influence on real-time lookup performance in large-scale distributed systems. In order to guarantee search response rate with limited memory space, we develop a new object indexing and locating algorithm called Bloom filter Arrays based on Zipf's-distributed user Preference (ZPBA). The algorithm uses a compact data structure to achieve high accuracy in item lookup. We give the theoretical analysis of ZPBA and then conduct experiments with one million item corpus and 100,000 queries to validate our design. Comparison shows that our solution can be 77% more space efficient than traditional bloom filter based index approaches for applications of concentrated user access preference. The algorithm demonstrates practical application potential in fault tolerant large-scale distributed indexing and item lookup.