Massive data mining based on item sequence set grid space

Authors:
Lijuan Zhou;Zhang Zhang;Mingsheng Xu
Affiliations:
Information Engineering College, Capital Normal University, Beijing, China;Information Engineering College, Capital Normal University, Beijing, China;Information Engineering College, Capital Normal University, Beijing, China
Venue:
CAR'10 Proceedings of the 2nd international Asia conference on Informatics in control, automation and robotics - Volume 3
Year:
2010

Citing 3
Cited 0

A Constrained Maximum Frequent Itemsets Incremental Mining Algorithm

NPC '07 Proceedings of the 2007 IFIP International Conference on Network and Parallel Computing Workshops
Top 10 algorithms in data mining

Knowledge and Information Systems
Deriving class association rules based on levelwise subspace clustering

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases

Quantified Score

Hi-index	0.00

Visualization

Abstract

According to the stored mode of massive data in the relational database, this paper proposed a fast mining algorithm to find maximum frequent item sets based on item sequence set grid space. The traditional methods for mining association rules generate frequent item sets from small to large. These approaches are either time consuming or computationally expensive, and often generate a large number of redundant candidates or frequent item sets, which is fatal for controlling mining speed as data to mass-level. The goal of this paper is first to use a self-defined structure linked list to storage item sequence then to find the frequent item sets from large to small. Several applications of association rules mining using item sequence set grid space has a good performance but it demonstrated inefficiency in massive data mining. The problem involves time spent on sub item sets finding. Experimental results will be presented to show that the fast mining algorithm ISSDL-DM proposed in this paper use much less time than the similar existing algorithm ISS-DM for achieving the same outcomes.