Sampling of virtual examples to improve classification accuracy for nominal attribute data

  • Authors:
  • Yujung Lee;Jaeho Kang;Byoungho Kang;Kwang Ryel Ryu

  • Affiliations:
  • Department of Computer Engineering, Pusan National University, Busan, Korea;Department of Computer Engineering, Pusan National University, Busan, Korea;Department of Computer Engineering, Pusan National University, Busan, Korea;Department of Computer Engineering, Pusan National University, Busan, Korea

  • Venue:
  • RSCTC'06 Proceedings of the 5th international conference on Rough Sets and Current Trends in Computing
  • Year:
  • 2006

Abstract

This paper presents a method that uses virtual examples to improve classification accuracy on data with nominal attributes. Most previous research on virtual examples has focused on data with numeric attributes and has relied on domain-specific knowledge to generate virtual examples useful for a particular target learning algorithm. Instead of using domain-specific knowledge, our method samples virtual examples from a naïve Bayesian network constructed from the given training set. A sampled example is considered useful if adding it to the training set increases the network’s conditional likelihood. A set of useful virtual examples is collected by repeating this cycle of sampling and evaluation. Experiments show that virtual examples collected this way help various learning algorithms derive more accurate classifiers.
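
The sample-then-evaluate loop described in the abstract can be sketched roughly as follows. This is an illustrative reading, not the authors' implementation: the abstract does not specify the smoothing, the acceptance rule, or the data on which the conditional likelihood is measured. The sketch assumes Laplace smoothing (`alpha`), a greedy rule that keeps a candidate only if it strictly raises the conditional log-likelihood of the original training labels, and a sampling network fitted once on the original training set. All function and parameter names are hypothetical, and attribute values and class labels are assumed to be encoded as integers starting at 0.

```python
import math
import random
from collections import Counter


def fit_nb(data, n_values, n_classes, alpha=1.0):
    """Fit a naive Bayes model with Laplace smoothing (alpha is an assumption).

    data: list of (attrs, label) pairs; attrs is a tuple of ints.
    n_values[j] is the number of distinct values of attribute j.
    Returns (prior, cond), where cond[c][j][v] = P(attr j = v | class c).
    """
    class_counts = Counter(y for _, y in data)
    n = len(data)
    prior = [(class_counts[c] + alpha) / (n + alpha * n_classes)
             for c in range(n_classes)]
    cond = []
    for c in range(n_classes):
        per_attr = []
        for j, k in enumerate(n_values):
            counts = Counter(x[j] for x, y in data if y == c)
            per_attr.append([(counts[v] + alpha) / (class_counts[c] + alpha * k)
                             for v in range(k)])
        cond.append(per_attr)
    return prior, cond


def log_joint(model, x, c):
    """log P(c, x) under the naive Bayes factorization."""
    prior, cond = model
    return math.log(prior[c]) + sum(math.log(cond[c][j][v])
                                    for j, v in enumerate(x))


def conditional_log_likelihood(model, data):
    """Sum over examples of log P(label | attrs), via log-sum-exp."""
    n_classes = len(model[0])
    total = 0.0
    for x, y in data:
        joints = [log_joint(model, x, c) for c in range(n_classes)]
        m = max(joints)
        log_norm = m + math.log(sum(math.exp(j - m) for j in joints))
        total += joints[y] - log_norm
    return total


def sample_example(model, rng):
    """Draw a class from the prior, then each attribute given that class."""
    prior, cond = model
    c = rng.choices(range(len(prior)), weights=prior)[0]
    x = tuple(rng.choices(range(len(p)), weights=p)[0] for p in cond[c])
    return x, c


def collect_virtual_examples(train, n_values, n_classes, n_trials=200, seed=0):
    """Greedy sampling-and-evaluation loop (acceptance rule is an assumption)."""
    rng = random.Random(seed)
    sampler = fit_nb(train, n_values, n_classes)  # fitted once on the originals
    augmented = list(train)
    best = conditional_log_likelihood(sampler, train)
    kept = []
    for _ in range(n_trials):
        candidate = sample_example(sampler, rng)
        model = fit_nb(augmented + [candidate], n_values, n_classes)
        score = conditional_log_likelihood(model, train)
        if score > best:  # keep only candidates that raise the likelihood
            augmented.append(candidate)
            kept.append(candidate)
            best = score
    return kept
```

Calling `collect_virtual_examples(train, n_values, n_classes)` returns the accepted virtual examples, which could then be appended to the training set before fitting any other learner. Refitting the network from scratch for every candidate keeps the sketch short; incremental count updates would be the obvious optimization for larger data sets.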