Maintaining discovered frequent itemsets: cases for changeable database and support

  • Authors:
  • Xiao Ping Du;Shi Wei Tang;Akifumi Makinouchi

  • Affiliations:
  • School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, P.R. China and College of Software, Beijing University of Aeronautics and Astronautics, Beijing 100083, P ...;School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, P.R. China;Graduate School of Information Science, Kyushu University, Fukuoka 812-8581, Japan

  • Venue:
  • Journal of Computer Science and Technology
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mining frequent itemsets from large databases has played an essential role in many data mining tasks. It is also important to maintain the discovered frequent itemsets for these data mining tasks when the database is updated. All algorithms proposed so far for the maintenance of discovered frequent itemsets are only performed with a fixed minimum support, which is the same as that used to obtain the discovered frequent itemsets. That is, users cannot change the minimum support even if the new results are unsatisfactory to the users. In this paper two new complementary algorithms, FMP (First Maintaining Process) and RMP (Repeated Maintaining Process), are proposed to maintain discovered frequent itemsets in the case that new transaction data are added to a transaction database. Both algorithms allow users to change the minimum support for the maintenance processes. FMP is used for the first maintaining process, and when the result derived from the FMP is unsatisfactory, RMP will be performed repeatedly until satisfactory results are obtained. The proposed algorithms re-use the previous results to cut down the cost of maintenance. Extensive experiments have been conducted to assess the performance of the algorithms. The experimental results show that the proposed algorithms are very resultful compared with the previous mining and maintenance algorithms for maintenance of discovered frequent itemsets.