Evaluation of k-Nearest Neighbor classifier performance for direct marketing

Authors:
M. Govindarajan; RM. Chandrasekaran
Affiliations:
Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar, 608 002 Tamil Nadu, India (both authors)
Venue:
Expert Systems with Applications: An International Journal
Year:
2010

Abstract

Text data mining is a process of exploratory data analysis. Classification maps data into predefined groups or classes; it is often referred to as supervised learning because the classes are determined before the data are examined. This paper describes a proposed k-Nearest Neighbor classifier that performs comparative cross-validation against the existing k-Nearest Neighbor classifier. The feasibility and benefits of the proposed approach are demonstrated on a data mining problem, direct marketing, which has become an important application field of data mining. Comparative cross-validation estimates accuracy by either stratified k-fold cross-validation or equivalent repeated random subsampling. While the proposed method may have a high bias, its performance (accuracy estimation in our case) may be poor due to a high variance. Thus the accuracy with the proposed k-Nearest Neighbor classifier was lower than that with the existing k-Nearest Neighbor classifier, and the smaller the improvement in runtime, the larger the improvement in precision and recall. For the proposed method we determined both the classification accuracy and the prediction accuracy; the prediction accuracy is comparatively high.
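
The two accuracy-estimation schemes named in the abstract, stratified k-fold cross-validation and repeated random subsampling, can be illustrated with a minimal sketch. This is not the authors' implementation: the plain majority-vote k-NN, the synthetic two-class data, the class labels, and every function name and parameter below are assumptions introduced only for illustration.

```python
import random
from collections import Counter, defaultdict


def knn_predict(train, query, k=3):
    """Majority vote among the k training points closest to `query`."""
    def sq_dist(item):
        return sum((a - b) ** 2 for a, b in zip(item[0], query))
    nearest = sorted(train, key=sq_dist)[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def accuracy(train, test, k=3):
    """Fraction of test points whose predicted label matches the true label."""
    hits = sum(knn_predict(train, x, k) == y for x, y in test)
    return hits / len(test)


def stratified_folds(data, n_folds, rng):
    """Split data into folds that keep class proportions roughly equal."""
    by_class = defaultdict(list)
    for item in data:
        by_class[item[1]].append(item)
    folds = [[] for _ in range(n_folds)]
    for items in by_class.values():
        rng.shuffle(items)
        for i, item in enumerate(items):
            folds[i % n_folds].append(item)  # deal each class round-robin
    return folds


def stratified_kfold_accuracy(data, n_folds=5, k=3, seed=0):
    """Mean accuracy over folds; each fold serves as the test set once."""
    folds = stratified_folds(data, n_folds, random.Random(seed))
    scores = []
    for i, test in enumerate(folds):
        train = [item for j, fold in enumerate(folds) if j != i for item in fold]
        scores.append(accuracy(train, test, k))
    return sum(scores) / len(scores)


def random_subsampling_accuracy(data, n_rounds=5, test_fraction=0.2, k=3, seed=0):
    """Mean accuracy over repeated random train/test splits (no stratification)."""
    rng = random.Random(seed)
    n_test = max(1, int(len(data) * test_fraction))
    scores = []
    for _ in range(n_rounds):
        shuffled = data[:]
        rng.shuffle(shuffled)
        test, train = shuffled[:n_test], shuffled[n_test:]
        scores.append(accuracy(train, test, k))
    return sum(scores) / len(scores)


def make_toy_data(n_per_class=30, seed=0):
    """Synthetic, well-separated 2-D points; NOT the paper's marketing data."""
    rng = random.Random(seed)
    data = []
    for label, center in (("respond", 0.0), ("ignore", 10.0)):
        for _ in range(n_per_class):
            point = (rng.gauss(center, 1.0), rng.gauss(center, 1.0))
            data.append((point, label))
    return data


if __name__ == "__main__":
    toy = make_toy_data()
    print("stratified 5-fold:", stratified_kfold_accuracy(toy))
    print("random subsampling:", random_subsampling_accuracy(toy))
```

In this sketch the stratified variant guarantees that every fold contains both classes in equal proportion, while random subsampling may draw test sets with skewed class balance and may reuse or omit points across rounds. That difference is one reason the two estimators can disagree on small or imbalanced data sets.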