Learning from Data Streams: Synopsis and Change Detection

  • Authors:
  • Raquel Sebastião;João Gama;Teresa Mendonça

  • Affiliations:
  • LIAAD-INESC Porto L.A., University of Porto, Rua de Ceuta, 118-6, 4050-190 Porto, Portugal and Mathematics Department, Faculty of Science, University of Porto, Rua do Campo Alegre, 823, 4150-180 P ...;LIAAD-INESC Porto L.A., University of Porto, Rua de Ceuta, 118-6, 4050-190 Porto, Portugal and Faculty of Economics, University of Porto, Rua Dr. Roberto Frias, 4200-464 Porto, Portugal;Mathematics Department, Faculty of Science, University of Porto, Rua do Campo Alegre, 823, 4150-180 Porto, Portugal

  • Venue:
  • Proceedings of the 2008 conference on STAIRS 2008: Proceedings of the Fourth Starting AI Researchers' Symposium
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The aim of this PhD program is the study of algorithms for learning histograms, with the capacity of representing continuous high-speed flows of data and dealing with the current problem of change detection on data streams. In many modern applications, information is no longer gathered as finite stored data sets, but assuming the form of infinite data streams. As a large volume of information is produced at a high-speed rate it is no longer possible to use memory algorithms which require the full historic data stored in the main memory, so new ones are needed to process data online at the rate it is available. Moreover, the process generating data is not strictly stationary and evolves over time; so algorithms should, while extracting some sort of knowledge from this incessantly growing data, be able to adapt themselves to changes, maintaining a representation consistent with the most recent status of nature. In this work, we presented a feasible approach, using incremental histograms and monitoring data distributions, to detect concept drift in data stream context.