Density Estimation Technique for Data Stream Classification

Authors:
Nittaya Kerdprasop;Kittisak Kerdprasop
Affiliations:
Suranaree University of Technology, Thailand;Suranaree University of Technology, Thailand
Venue:
DEXA '06 Proceedings of the 17th International Conference on Database and Expert Systems Applications
Year:
2006

Citing 0
Cited 2

New perspectives in autonomic design patterns for stream-classification-systems

Proceedings of the 2007 workshop on Automating service quality: Held at the International Conference on Automated Software Engineering (ASE)
Mining in Large Noisy Domains

Journal of Data and Information Quality (JDIQ)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Density estimation is an important pre-processing step in the problem of data stream classification in which the number of data is overwhelming and the exact data distribution is unknown. We simplify the problem by employing a statistical sampling technique to obtain an approximate solution. With the proposed method, an unbounded large data set can be sampled in a number of random configurations, and that data can be used to describe the data set as a whole. The efficiency of the method depends largely on the ability to draw samples effectively which in turn depends on how close we can estimate the target density. We use finite mixture models to represent the probability density functions of the data stream. Then, we apply the EM algorithm twice to learn the model parameters. The efficiency of our estimation technique has been shown in the experimental results.