The Integrated Delivery of Large-Scale Data Mining: The ACSys Data Mining Project

  • Authors:
  • Graham J. Williams;Irfan Altas;Sergey Bakin;Peter Christen;Markus Hegland;Alonso Marquez;Peter Milne;Rajehndra Nagappan;Stephen Roberts

  • Affiliations:
  • -;-;-;-;-;-;-;-;-

  • Venue:
  • Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data Mining draws on many technologies to deliver novel and actionable discoveries from very large collections of data. The Australian Government's Cooperative Research Centre for Advanced Computational Systems (ACSys) is a link between industry and research focusing on the deployment of high performance computers for data mining. We present an overview of the work of the ACSys Data Mining projects where the use of large-scale, high performance computers plays a key role. We highlight the use of large-scale computing within three complimentary areas: the development of parallel algorithms for data analysis, the deployment of virtual environments for data mining, and issues in data management for data mining. We also introduce the Data Miner's Arcade which provides simple abstractions to integrate these components providing high performance data access for a variety of data mining tools communicating through XML.