Scalable 2-Pass Data Mining Technique for Large Scale Spatio-temporal Datasets

  • Authors:
  • Tahar Kechadi;Michela Bertolotto

  • Affiliations:
  • University College Dublin, Belfield, Dublin 4, Ireland.;Sergio Di Martino, Filomena Ferrucci, Dip. di Matematica e Informatica, Università degli Studi di Salerno, Email: sdimartino,fferrucci@unisa.it, Italy

  • Venue:
  • KES '07 Knowledge-Based Intelligent Information and Engineering Systems and the XVII Italian Workshop on Neural Networks on Proceedings of the 11th International Conference
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a system for mining very large spatio-temporal datasets. The system comprises two main layers: the mining layer and the visualization layer. The mining layer implements a new approach based on a 2-pass strategy to efficiently support the data-mining process, address the spatial and temporal dimensions of the dataset, and visualize and interpret results. In the first pass, the data objects are grouped according to their close similarity. In the second pass these groups are clustered to produce new models or patterns. The main reason for this 2-pass strategy is that the datasets are too large for traditional mining and cannot support the interactivity required by the visualization layer.