A Theoretical Framework for Association Mining Based on the Boolean Retrieval Model

Authors:
Peter Bollmann-Sdorra;Aladdin Hafez;Vijay V. Raghavan
Affiliations:
-;-;-
Venue:
DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
Year:
2001

Citing 12
Cited 1

Automatic text processing

Automatic text processing
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Dynamic itemset counting and implication rules for market basket data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Inductive databases and condensed representations for data mining (extended abstract)

ILPS '97 Proceedings of the 1997 international symposium on Logic programming
Editorial

Data Mining and Knowledge Discovery
Parallel Mining of Association Rules

IEEE Transactions on Knowledge and Data Engineering
Methods and Problems in Data Mining

ICDT '97 Proceedings of the 6th International Conference on Database Theory
Integration of Data Mining with Database Technology

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Data Mining: Machine Learning, Statistics, and Databases

SSDBM '96 Proceedings of the Eighth International Conference on Scientific and Statistical Database Management
Data Mining and Knowledge Discovery in Databases: Implications for Scientific Databases

SSDBM '97 Proceedings of the Ninth International Conference on Scientific and Statistical Database Management
New Algorithms for Fast Discovery of Association Rules

New Algorithms for Fast Discovery of Association Rules

Generalizing the Notion of Confidence

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data mining has been defined as the non-trivial extraction of implicit, previously unknown and potentially useful information from data. Association mining is one of the important sub-fields in data mining, where rules that imply certain association relationships among a set of items in a transaction database are discovered. The efforts of most researchers focus on discovering rules in the form of implications between itemsets, which are subsets of items that have adequate supports. Having itemsets as both antecedent and precedent parts was motivated by the original application pertaining to market baskets and they represent only the simplest form of predicates. This simplicity is also due in part to the lack of a theoretical framework that includes more expressive predicates. The framework we develop derives from the observation that information retrieval and association mining are two complementary processes on the same data records or transactions. In information retrieval, given a query, we need to find the subset of records that matches the query. In contrast, in data mining, we need to find the queries (rules) having adequate number of records that support them. In this paper we introduce the theory of association mining that is based on a model of retrieval known as the Boolean Retrieval Model. The potential implications of the proposed theory are presented.