Use of Circle-Segments as a Data Visualization Technique for Feature Selection in Pattern Classification

  • Authors:
  • Shir Li Wang;Chen Change Loy;Chee Peng Lim;Weng Kin Lai;Kay Sin Tan

  • Affiliations:
  • School of Electrical & Electronic Engineering, University of Science Malaysia, Penang, Malaysia;Centre for Advanced Informatics, MIMOS Berhad, Kuala Lumpur, Malaysia 57000;School of Electrical & Electronic Engineering, University of Science Malaysia, Penang, Malaysia;Centre for Advanced Informatics, MIMOS Berhad, Kuala Lumpur, Malaysia 57000;Department of Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia 50603

  • Venue:
  • Neural Information Processing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

One of the issues associated with pattern classification using data-based machine learning systems is the "curse of dimensionality". In this paper, the circle-segments method is proposed as a feature selection method to identify important input features before the entire data set is provided for learning with machine learning systems. Specifically, four machine learning systems are deployed for classification, viz.Multilayer Perceptron (MLP), Support Vector Machine (SVM), Fuzzy ARTMAP (FAM), and k-Nearest Neighbour(kNN). The integration between the circle-segments method and the machine learning systems has been applied to two case studies comprising one benchmark and one real data sets. Overall, the results after feature selection using the circle-segments method demonstrate improvements in performance even with more than 50% of the input features eliminated from the original data sets.