Development of a grid service for scalable decision tree construction from grid databases

  • Authors:
  • Peter Brezany;Christian Kloner;A. Min Tjoa

  • Affiliations:
  • Institute of Scientific Computing, University of Vienna, Vienna, AUT;Institute of Scientific Computing, University of Vienna, Vienna, AUT;Institute for Software Technology and Multimedia Systems, Vienna University of Technology, Vienna, AUT

  • Venue:
  • PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Classification deals with discovery of a predictive learning function that classifies a data object into one of several predefined classes. We present a novel decision-tree-based classification service which can be used either autonomously or as a building block to construct distributed and scalable classifiers that operate on data repositories integrated into the Grid that typically involve large, complex, heterogeneous, and geographically distributed datasets. Although classification is considered as a well-studied problem – a lot of classification methods were proposed for sequential, parallel and distributed environments, so far, to our best knowledge, no effort was devoted to building classifiers based on federation of Grid resources. The Grid service described in this paper was fully implemented and integrated into the GridMiner framework (www.gridminer.org). Scalability and performance of the prototype implementation have been examined and the results are presented.