TECNO-STREAMS: Tracking Evolving Clusters in Noisy Data Streams with a Scalable Immune System Learning Model

  • Authors:
  • Olfa Nasraoui;Cesar Cardona Uribe;Carlos Rojas Coronel;Fabio Gonzalez

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Artificial Immune System (AIS) models hold many promises inthe field of unsupervised learning. However, existing models arenot scalable, which makes them of limited use in data mining. Wepropose a new AIS based clustering approach (TECNO-STREAMS)that addresses the weaknesses of current AIS models. Comparedto existing AIS based techniques, our approach exhibits superiorlearning abilities, while at the same time, requiring low memoryand computational costs. Like the natural immune system, thestrongest advantage of immune based learning compared to otherapproaches is expected to be its ease of adaptation to the dynamicenvironment that characterizes several applications, particularlyin mining data streams. We illustrate the ability of the proposedapproach in detecting clusters in noisy data sets, and in miningevolving user profiles from Web clickstream data in a single pass.TECNO-STREAMS adheres to all the requirements of clusteringdata streams: compactness of representation, fast incremental processingof new data points, and clear and fast identification of outliers.