Constrained frequent pattern mining on univariate uncertain data

  • Authors:
  • Ying-Ho Liu;Chun-Sheng Wang

  • Affiliations:
  • Department of Information Management, National Dong Hwa University, No. 1, Sec. 2, Da Hsueh Road, Hualien 97401, Taiwan, ROC;Department of Information Management, Jinwen University of Science and Technology, No. 99, An-Chung Road, Hsin-Tien, Dist., New Taipei City 23154, Taiwan, ROC

  • Venue:
  • Journal of Systems and Software
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a new algorithm called CUP-Miner (Constrained Univariate Uncertain Data Pattern Miner) for mining frequent patterns from univariate uncertain data under user-specified constraints. The discovered frequent patterns are called constrained frequent U2 patterns (where ''U2'' represents ''univariate uncertain''). In univariate uncertain data, each attribute in a transaction is associated with a quantitative interval and a probability density function. The CUP-Miner algorithm is implemented in two phases: In the first phase, a U2P-tree (Univariate Uncertain Pattern tree) is constructed by compressing the target database transactions into a compact tree structure. Then, in the second phase, the constrained frequent U2 pattern is enumerated by traversing the U2P-tree with different strategies that correspond to different types of constraints. The algorithm speeds up the mining process by exploiting five constraint properties: succinctness, anti-monotonicity, monotonicity, convertible anti-monotonicity, and convertible monotonicity. Our experimental results demonstrate that CUP-Miner outperforms the modified CAP algorithm, the modified FIC algorithm, the modified U2P-Miner algorithm, and the modified Apriori algorithm.