Perfect Hashing Schemes for Mining Traversal Patterns

  • Authors:
  • Chin-Chen Chang;Chih-Yang Lin;Henry Chou

  • Affiliations:
  • Department of Information Engineering and Computer Science Feng Chia University Taichung, Taiwan, 40724, R.O.C. E-mail: ccc@cs.ccu.edu.tw;Department of Computer Science and Information Engineering National Chung Cheng University Chiayi, Taiwan, 621, R.O.C. gary, E-mail: chch90@cs.ccu.edu.tw;Department of Computer Science and Information Engineering National Chung Cheng University Chiayi, Taiwan, 621, R.O.C. gary, E-mail: chch90@cs.ccu.edu.tw

  • Venue:
  • Fundamenta Informaticae
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hashing schemes are a common technique to improve the performance in mining not only association rules but also sequential patterns or traversal patters. However, the collision problem in hash schemes may result in severe performance degradation. In this paper, we propose perfect hashing schemes for mining traversal patterns to avoid collisions in the hash table. The main idea is to transform each large itemsets into one large 2-itemset by employing a delicate encoding scheme. Then perfect hash schemes designed only for itemsets of length two, rather than varied lengths, are applied. The experimental results show that our method is more than twice as faster than FS algorithm. The results also show our method is scalable to database sizes. One variant of our perfect hash scheme, called partial hash, is proposed to cope with the enormous memory space required by typical perfect hash functions. We also give a comparison of the performances of different perfect hash variants and investigate their properties.