Massive data mining based on item sequence set grid space

  • Authors:
  • Lijuan Zhou;Zhang Zhang;Mingsheng Xu

  • Affiliations:
  • Information Engineering College, Capital Normal University, Beijing, China;Information Engineering College, Capital Normal University, Beijing, China;Information Engineering College, Capital Normal University, Beijing, China

  • Venue:
  • CAR'10 Proceedings of the 2nd international Asia conference on Informatics in control, automation and robotics - Volume 3
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

According to the stored mode of massive data in the relational database, this paper proposed a fast mining algorithm to find maximum frequent item sets based on item sequence set grid space. The traditional methods for mining association rules generate frequent item sets from small to large. These approaches are either time consuming or computationally expensive, and often generate a large number of redundant candidates or frequent item sets, which is fatal for controlling mining speed as data to mass-level. The goal of this paper is first to use a self-defined structure linked list to storage item sequence then to find the frequent item sets from large to small. Several applications of association rules mining using item sequence set grid space has a good performance but it demonstrated inefficiency in massive data mining. The problem involves time spent on sub item sets finding. Experimental results will be presented to show that the fast mining algorithm ISSDL-DM proposed in this paper use much less time than the similar existing algorithm ISS-DM for achieving the same outcomes.