Performance evaluation for classification methods: A comparative simulation study

Authors:
Yong Soo Kim
Affiliations:
Division of Industrial and Management Engineering, Sungkyul University, 400-10, Anyang 8-dong, Manan-gu, Anyang-City, Gyeonggi-do 430-742, Republic of Korea
Venue:
Expert Systems with Applications: An International Journal
Year:
2010

Abstract

In this article, the performance of classification methods was compared empirically while varying the number of classes of the dependent variable, the number of independent variables, the types of independent variables, the number of classes of the independent variables, and the sample size. The study employed 324 simulated examples, with artificial neural networks and decision trees as the data mining techniques and logistic regression as the statistical method. Using misclassification error as the performance metric, we obtained several additional findings: (i) for continuous independent variables, the statistical technique (logistic regression) was superior to the data mining techniques (artificial neural network and decision tree) when the dependent variable was binary, while the artificial neural network was best when the dependent variable had three or more classes; (ii) for continuous and categorical independent variables, logistic regression performed better than the artificial neural network and the decision tree with a small number of independent variables and a small sample size, while the artificial neural network was best in other cases; and (iii) the artificial neural network's performance improved faster than that of the other methods as the number of independent variables and the number of classes of the dependent variable increased.
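
The misclassification error named in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact formula or the software used, so this assumes the usual definition (the fraction of cases assigned to the wrong class). The function name, the labels, and the two sets of predictions are hypothetical and are not data from the study.

```python
from typing import Hashable, Sequence


def misclassification_error(y_true: Sequence[Hashable],
                            y_pred: Sequence[Hashable]) -> float:
    """Return the fraction of cases whose predicted class differs from the true class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one case is required")
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)


# Hypothetical three-class labels and predictions from two classifiers
# (illustrative values only, not results from the paper).
y_true = ["a", "b", "c", "a", "b", "c", "a", "b"]
pred_model_1 = ["a", "b", "c", "a", "c", "c", "a", "a"]
pred_model_2 = ["a", "b", "b", "b", "c", "c", "a", "a"]

errors = {
    "model_1": misclassification_error(y_true, pred_model_1),
    "model_2": misclassification_error(y_true, pred_model_2),
}

# The method with the lowest error is preferred under this metric.
best = min(errors, key=errors.get)
print(errors, best)
```

The same function works for any number of classes, so it applies equally to the binary and the multi-class settings that the abstract distinguishes.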