Not so greedy: Randomly Selected Naive Bayes

  • Authors:
  • Liangxiao Jiang;Zhihua Cai;Harry Zhang;Dianhong Wang

  • Affiliations:
  • Department of Computer Science, China University of Geosciences, Wuhan, Hubei 430074, China;Department of Computer Science, China University of Geosciences, Wuhan, Hubei 430074, China;Department of Computer Science, University of New Brunswick, Fredericton, New Brunswick, Canada E3B 5A3;Department of Electronic Engineering, China University of Geosciences, Wuhan, Hubei 430074, China

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

Many approaches are proposed to improve Naive Bayes, among which the attribute selection approach has demonstrated remarkable performance. Algorithms for attribute selection fall into two broad categories: filters and wrappers. Filters use the general data characteristics to evaluate the selected attribute subset before the learning algorithm is run, while wrappers use the learning algorithm itself as a black box to evaluate the selected attribute subset. In this paper, we work on the attribute selection approach of wrapper and propose an improved Naive Bayes algorithm by carrying a random search through the whole space of attributes. We simply called it Randomly Selected Naive Bayes (RSNB). In order to meet the need of classification, ranking, and class probability estimation, we discriminatively design three different versions: RSNB-ACC, RSNB-AUC, and RSNB-CLL. The experimental results based on a large number of UCI datasets validate their effectiveness in terms of classification accuracy (ACC), area under the ROC curve (AUC), and conditional log likelihood (CLL), respectively.