A comparative study of simple online learning strategies for streaming data

Authors:
M. Millán-Giraldo;J. S. Sánchez
Affiliations:
Universitat Jaume I, Dept. Llenguatges i Sistemes Informátics, Castelló la Plana, Spain;Universitat Jaume I, Dept. Llenguatges i Sistemes Informátics, Castelló la Plana, Spain
Venue:
WSEAS Transactions on Circuits and Systems
Year:
2008

Citing 8
Cited 0

Instance-Based Learning Algorithms

Machine Learning
Selecting Examples for Partial Memory Learning

Machine Learning
A streaming ensemble algorithm (SEA) for large-scale classification

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Pattern Recognition and Neural Networks

Pattern Recognition and Neural Networks
Data streams: algorithms and applications

Foundations and Trends® in Theoretical Computer Science
Face recognition with semi-supervised learning and multiple classifiers

CIMMACS'06 Proceedings of the 5th WSEAS International Conference on Computational Intelligence, Man-Machine Systems and Cybernetics
Push-pull incentive-based P2P live media streaming system

WSEAS TRANSACTIONS on COMMUNICATIONS
Semi-Supervised Learning

Semi-Supervised Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Since several years ago, the analysis of data streams has attracted considerably the attention in various research fields, such as databases systems and data mining. The continuous increase in volume of data and the high speed that they arrive to the systems challenge the computing systems to store, process and transmit. Furthermore, it has caused the development of new online learning strategies capable to predict the behavior of the streaming data. This paper compares three very simple learning methods applied to static data streams when we use the 1-Nearest Neighbor classifier, a linear discriminant, a quadratic classifier, a decision tree, and the Naïve Bayes classifier. The three strategies have been taken from the literature. One of them includes a time-weighted strategy to remove obsolete objects from the reference set. The experiments were carried out on twelve real data sets. The aim of this experimental study is to establish the most suitable online learning model according to the performance of each classifier.