The POSTGRES next generation database management system
Communications of the ACM
Understanding the new SQL: a complete guide
Understanding the new SQL: a complete guide
Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Using the new DB2: IBM's object-relational database system
Using the new DB2: IBM's object-relational database system
A database perspective on knowledge discovery
Communications of the ACM
Dynamic itemset counting and implication rules for market basket data
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Fast discovery of association rules
Advances in knowledge discovery and data mining
Query flocks: a generalization of association-rule mining
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Integrating association rule mining with relational database systems: alternatives and implications
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Parallel Mining of Association Rules
IEEE Transactions on Knowledge and Data Engineering
Mining Sequential Patterns: Generalizations and Performance Improvements
EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
SLIQ: A Fast Scalable Classifier for Data Mining
EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
An Efficient Algorithm for Mining Association Rules in Large Databases
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Mining Generalized Association Rules
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Sampling Large Databases for Association Rules
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
A New SQL-like Operator for Mining Association Rules
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Scalable Mining for Classification Rules in Relational Databases
IDEAS '98 Proceedings of the 1998 International Symposium on Database Engineering & Applications
Spatial Subgroup Discovery Applied to the Analysis of Vegetation Data
PAKM '02 Proceedings of the 4th International Conference on Practical Aspects of Knowledge Management
Specifying Mining Algorithms with Iterative User-Defined Aggregates: A Case Study
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Spatial Subgroup Mining Integrated in an Object-Relational Spatial Database
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
Integration of Data Mining with Database Technology
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Storage and Querying of E-Commerce Data
Proceedings of the 27th International Conference on Very Large Data Bases
Knowledge discovery in databases: the purpose, necessity, and challenges
Handbook of data mining and knowledge discovery
Handbook of data mining and knowledge discovery
Optimizing subset queries: a step towards SQL-based inductive databases for itemsets
Proceedings of the 2004 ACM symposium on Applied computing
Specifying Mining Algorithms with Iterative User-Defined Aggregates
IEEE Transactions on Knowledge and Data Engineering
Mining tree queries in a graph
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Embedded predictive modeling in a parallel relational database
Proceedings of the 2006 ACM symposium on Applied computing
Processing forecasting queries
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Online analytical mining association rules using Chi-square test
International Journal of Business Intelligence and Data Mining
A Logic-Based Approach to Mining Inductive Databases
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Unary and n-ary inclusion dependency discovery in relational databases
Journal of Intelligent Information Systems
Association rule mining: models and algorithms
Association rule mining: models and algorithms
Cosmetics purchasing behavior - An analysis using association reasoning neural networks
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
Journal of Biomedical Informatics
Hi-index | 0.00 |
Data mining on large data warehouses is becoming increasingly important. In support of this trend, we consider a spectrum of architectural alternatives for coupling mining with database systems. These alternatives include: loose-coupling through a SQL cursor interface; encapsulation of a mining algorithm in a stored procedure; caching the data to a file system on-the-fly and mining; tight-coupling using primarily user-defined functions; and SQL implementations for processing in the DBMS. We comprehensively study the option of expressing the mining algorithm in the form of SQL queries using Association rule mining as a case in point. We consider four options in SQL-92 and six options in SQL enhanced with object-relational extensions (SQL-OR). Our evaluation of the different architectural alternatives shows that from a performance perspective, the Cache option is superior, although the performance of the SQL-OR option is within a factor of two. Both the Cache and the SQL-OR approaches incur a higher storage penalty than the loose-coupling approach which performance-wise is a factor of 3 to 4 worse than Cache. The SQL-92 implementations were too slow to qualify as a competitive option. We also compare these alternatives on the basis of qualitative factors like automatic parallelization, development ease, portability and inter-operability. As a byproduct of this study, we identify some primitives for native support in database systems for decision-support applications.