Digging Deep into the Data Mine with DataMiningGrid

  • Authors:
  • Vlado Stankovski;Martin Swain;Valentin Kravtsov;Thomas Niessen;Dennis Wegener;Matthias Röhm;Jernej Trnkoczy;Michael May;Jürgen Franke;Assaf Schuster;Werner Dubitzky

  • Affiliations:
  • University of Ljubljana;University of Ulster;Technion Israel Institute of Technology;Scopevisio AG;Fraunhofer Institute for Intelligent Analysis and Information Systems;DaimlerChrysler;University of Ljubljana;Fraunhofer Institute for Intelligent Analysis and Information Systems;DaimlerChrysler;Technion Israel Institute of Technology;University of Ulster

  • Venue:
  • IEEE Internet Computing
  • Year:
  • 2008

Quantified Score

Hi-index 0.02

Visualization

Abstract

The growing computerization in modern knowledge and technology sectors is generating huge volumes of electronically stored data. Data mining technology is often employed to make sense of these data. However, as modern data mining applications increase in complexity, so do their demands for resources. Grid computing is one of several emerging networked computing paradigms promising to meet the requirements of heterogeneous, large-scale and distributed data mining applications. Despite this promise, there are still too many issues to be resolved before grid technology is commonly applied to large-scale data mining tasks. To address some of these issues, we developed the DataMiningGrid system, which principally differs from similar systems by its ability to integrate a diverse set of programs and application scenarios within a single framework. The system's key features include high performance and scalability, sophisticated support for relevant standards, different user types, and flexible extensibility. The software is available as open source.