Cluster-based instance selection for machine classification

Authors:
Ireneusz Czarnowski
Affiliations:
Gdynia Maritime University, Department of Information Systems, Gdynia, Poland
Venue:
Knowledge and Information Systems
Year:
2012

Citing 0
Cited 7

Cluster integration for the cluster-based instance selection

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume PartI
Distributed learning with data reduction

Transactions on computational collective intelligence IV
Instance selection in text classification using the silhouette coefficient measure

MICAI'11 Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
InstanceRank based on borders for instance selection

Pattern Recognition
Towards enhancing centroid classifier for text classification-A border-instance approach

Neurocomputing
Agent-Based approach to RBF network training with floating centroids

ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part II
Multi-level rough set reduction for decision rule mining

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Instance selection in the supervised machine learning, often referred to as the data reduction, aims at deciding which instances from the training set should be retained for further use during the learning process. Instance selection can result in increased capabilities and generalization properties of the learning model, shorter time of the learning process, or it can help in scaling up to large data sources. The paper proposes a cluster-based instance selection approach with the learning process executed by the team of agents and discusses its four variants. The basic assumption is that instance selection is carried out after the training data have been grouped into clusters. To validate the proposed approach and to investigate the influence of the clustering method used on the quality of the classification, the computational experiment has been carried out.