GEP-Induced Expression Trees as Weak Classifiers
ICDM '08 Proceedings of the 8th industrial conference on Advances in Data Mining: Medical Applications, E-Commerce, Marketing, and Theoretical Aspects
Hi-index | 0.00 |
Knowledge Discovery and Data Mining(KDD) process includes preprocessing, transformation, data mining and knowledge extraction. The two important tasks of data mining are clustering and classification. In this paper, we propose a generic feature extraction for classification using Fuzzy C-Means(FCM) clustering. The raw data is preprocessed, normalized and then data points are clustered using fuzzy c-means technique. Feature vectors for all the classes are generated by extracting the most relevant features from the corresponding clusters and used for further classification. Artificial Neural Network and Support Vector Machines are used to perform the classification task. Experiments are conducted on four datasets and the accuracy obtained by performing specific feature extraction for a particular data set is compared with generic feature extraction scheme. The algorithm performs relatively well with respect to classification results when compared with the specific feature extraction technique.