Feature selection for multi-label naive Bayes classification

  • Authors:
  • Min-Ling Zhang;José M. Peña;Victor Robles

  • Affiliations:
  • College of Computer and Information Engineering, Hohai University, Nanjing 210098, China and National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2009

Quantified Score

Hi-index 0.07

Visualization

Abstract

In multi-label learning, the training set is made up of instances each associated with a set of labels, and the task is to predict the label sets of unseen instances. In this paper, this learning problem is addressed by using a method called Mlnb which adapts the traditional naive Bayes classifiers to deal with multi-label instances. Feature selection mechanisms are incorporated into Mlnb to improve its performance. Firstly, feature extraction techniques based on principal component analysis are applied to remove irrelevant and redundant features. After that, feature subset selection techniques based on genetic algorithms are used to choose the most appropriate subset of features for prediction. Experiments on synthetic and real-world data show that Mlnb achieves comparable performance to other well-established multi-label learning algorithms.