Extensible query processing in starburst
SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
An Extension to SQL for Mining Association Rules
Data Mining and Knowledge Discovery
Set-Oriented Mining for Association Rules in Relational Databases
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Data Mining and Knowledge Discovery Handbook
Data Mining and Knowledge Discovery Handbook
YALE: rapid prototyping for complex data mining tasks
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Database Systems: The Complete Book
Database Systems: The Complete Book
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
PLANET: massively parallel learning of tree ensembles with MapReduce
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
Data mining deals with the extraction of hidden knowledge from large amounts of data. Nowadays, coarse-grained data mining modules are used. This traditional black box approach focuses on specific algorithm improvements and is not flexible enough to be used for more general optimization and beneficial component reuse. The work presented in this paper elaborates on decomposing data mining tasks as data mining execution process plans which are composed of finer-grained data mining operators. The cost of an operator can be analyzed and provides means for more holistic optimizations. This process-based data mining concept is evaluated via an OGSA-DAI based implementations for association rule mining which show the feasibility of our approach as well as the re-usability of some of the data mining operators.