The grid: blueprint for a new computing infrastructure
The grid: blueprint for a new computing infrastructure
Techniques for mapping tasks to machines in heterogeneous computing systems
Journal of Systems Architecture: the EUROMICRO Journal - Heterogeneous distributed and parallel architectures: hardware, software and design tools
Enhancing the Apriori Algorithm for Frequent Set Counting
DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
Dynamic Matching and Scheduling of a Class of Independent Tasks onto Heterogeneous Computing Systems
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
A Directory Service for Configuring High-Performance Distributed Computations
HPDC '97 Proceedings of the 6th IEEE International Symposium on High Performance Distributed Computing
Evaluation of sampling for data mining of association rules
RIDE '97 Proceedings of the 7th International Workshop on Research Issues in Data Engineering (RIDE '97) High Performance Database Management for Large-Scale Applications
Promoting performance and separation of concerns for data mining applications on the grid
Future Generation Computer Systems - Special section: Data mining in grid computing environments
Parallel learning using decision trees: a novel approach
AMCOS'05 Proceedings of the 4th WSEAS International Conference on Applied Mathematics and Computer Science
Predicting Grid Performance Based on Novel Reduct Algorithm
KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
Predicting performance of grid based on rough set
WSEAS TRANSACTIONS on SYSTEMS
Cooperative caching for grid-enabled OLAP
International Journal of Grid and Utility Computing
Brain Injury Detection and Monitoring through fMRI Time Series Data Mining
Proceedings of the 2006 conference on Advances in Intelligent IT: Active Media Technology 2006
Performance-based data distribution for data mining applications on grid computing environments
The Journal of Supercomputing
Rough set based computation times estimation on knowledge grid
EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing
Hi-index | 0.01 |
Increasingly the datasets used for data mining are becoming huge and physically distributed. Since the distributed knowledge discovery process is both data and computational intensive, the Grid is a natural platform for deploying a high performance data mining service. The focus of this paper is on the core services of such a Grid infrastructure. In particular we concentrate our attention on the design and implementation of specialized broker aware of data source locations and resource needs of data mining tasks. Allocation and scheduling decisions are taken on the basis of performance cost metrics and models that exploit knowledge about previous executions, and use sampling to acquire estimate about execution behavior.