Data Reduction Method for Categorical Data Clustering
IBERAMIA '08 Proceedings of the 11th Ibero-American conference on AI: Advances in Artificial Intelligence
Analysis of sampling techniques for association rule mining
Proceedings of the 12th International Conference on Database Theory
A new sampling technique for association rule mining
Journal of Information Science
A Hybrid Higher Order Neural Classifier for handling classification problems
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Data sets contain very large amount of information, which is not an easy task for the users to scan the entire data set. The researcher's initial task is to formulate a realistic explanation for the use of sampling in his research. Sampling has been often suggested as an effective tool to reduce the size of the dataset operated at some cost to accuracy. It is the the process of selecting a representative part of a data set for the purpose of determining parameters or characteristics of the whole data set. Due to sampling we overcome the problems like; i) in research it is not possible to collect and test each and every element from the data base individually; and ii) study of sample rather than the entire dataset is also sometimes likely to produce more reliable results. This paper focuses on different types of sampling strategies applied on hybrid higher order neural network classifier (HHONC) rather than artificial neural network which is having several limitations. To overcome such limitations HHONC have been used. Here sampling technique has been applied on four real, integers and categorical dataset such as breast cancer, pima Indian diabetes, leukaemia and lung cancer data set prior to classification. The main objective of this paper is an empirical comparison of different sampling strategies for classification which gives more accuracy.