Data mining tasks and methods: parallel methods for scaling data mining algorithms to large data sets

  • Authors:
  • Robert Grossman;Yike Guo

  • Affiliations:
  • Chairman, Magnify/ and Director, Laboratory for Advanced Computing, University of Illinois at Chicago;Reader in Computing Science, Imperial College, University of London, United Kingdom

  • Venue:
  • Handbook of data mining and knowledge discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this article, we describe some approaches and specific techniques for scaling data mining algorithms to large data sets through parallel processing. We then analyze in more detail three core algorithms that can be scaled to large data sets: building decision trees, discovering association rules, and creating clusters.