Data dimensionality reduction with application to improving classification performance and explaining concepts of data sets

Authors:
Xiuju Fu;Lipo Wang
Affiliations:
Institute of High Performance Computing, Science Park 2, 117528, Singapore.;School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Venue:
International Journal of Business Intelligence and Data Mining
Year:
2005

Citing 12
Cited 10

Estimating attributes: analysis and extensions of RELIEF

ECML-94 Proceedings of the European conference on machine learning on Machine Learning
An algorithm to generate radial basis function (RBF)-like nets for classification problems

Neural Networks
Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
Approximation, Dimension Reduction, and Nonconvex Optimization Using Linear Superpositions of Gaussians

IEEE Transactions on Computers
Effective Data Mining Using Neural Networks

IEEE Transactions on Knowledge and Data Engineering
Dimensionality Reduction of Unsupervised Data

ICTAI '97 Proceedings of the 9th International Conference on Tools with Artificial Intelligence
Handwritten Kanji Recognition with the LDA Method

ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
Enhanced Fisher Linear Discriminant Models for Face Recognition

ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 2 - Volume 2
Fast learning in networks of locally-tuned processing units

Neural Computation
Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A neural-network learning theory and a polynomial time RBF algorithm

IEEE Transactions on Neural Networks
Extracting M-of-N rules from trained neural networks

IEEE Transactions on Neural Networks

An efficient weighted nearest neighbour classifier using vertical data representation

International Journal of Business Intelligence and Data Mining
A comparison of imputation methods in the presence of imprecise data when employing a neural network s-Sigmoid function

International Journal of Business Intelligence and Data Mining
Vote prediction by iterative domain knowledge and attribute elimination

International Journal of Business Intelligence and Data Mining
Decision trees for binary classification variables grow equally with the Gini impurity measure and Pearson's chi-square test

International Journal of Business Intelligence and Data Mining
Efficient online mining of large databases

International Journal of Business Information Systems
Preprocessing enhancements to improve data mining algorithms

International Journal of Business Intelligence and Data Mining
Finding "persistent rules": Combining association and classification results

Expert Systems with Applications: An International Journal
ReliefMSS: a variation on a feature ranking ReliefF algorithm

International Journal of Business Intelligence and Data Mining
Testing terrorism theory with data mining

International Journal of Data Analysis Techniques and Strategies
Fractal and neural networks based watermark identification

Multimedia Tools and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data dimensionality reduction is usually carried out before patterns are input to classifiers. In order to obtain good results in data mining, selecting relevant data is desirable. In many cases, irrelevant or redundant attributes are included in data sets, which interfere with knowledge discovery from data sets. In this paper, we propose a rule-extraction method based on a novel separability-correlation measure (SCM) ranking the importance of attributes. According to the attribute ranking results, the attribute subsets that lead to the best classification results are selected and used as inputs to a classifier, such as an RBF neural network in our paper. The complexity of the classifier can thus be reduced and its classification performance improved. Our method uses the classification results with reduced attribute sets to extract rules. Computer simulations show that our method leads to smaller rule sets with higher accuracies compared with other methods.