Feature selection for multi-label naive Bayes classification

Authors:
Min-Ling Zhang;José M. Peña;Victor Robles
Affiliations:
College of Computer and Information Engineering, Hohai University, Nanjing 210098, China and National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain
Venue:
Information Sciences: an International Journal
Year:
2009

Citing 32
Cited 31

Learning internal representations by error propagation

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
C4.5: programs for machine learning

C4.5: programs for machine learning
BoosTexter: A Boosting-based Systemfor Text Categorization

Machine Learning - Special issue on information retrieval
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
A new family of online algorithms for category ranking

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
The Alternating Decision Tree Learning Algorithm

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
A maximal figure-of-merit learning approach to text categorization

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A MFoM learning approach to robust multiclass multi-label text categorization

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Hierarchical document categorization with support vector machines

Proceedings of the thirteenth ACM international conference on Information and knowledge management
MMAC: A New Multi-Class, Multi-Label Associative Classification Approach

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Multi-label informed latent semantic indexing

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Multi-labelled classification using maximum entropy method

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Collective multi-label classification

Proceedings of the 14th ACM international conference on Information and knowledge management
Learning hierarchical multi-category text classification models

ICML '05 Proceedings of the 22nd international conference on Machine learning
Multi-label Associative Classification of Medical Documents from MEDLINE

ICMLA '05 Proceedings of the Fourth International Conference on Machine Learning and Applications
Correlated Label Propagation with Application to Multi-label Learning

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization

IEEE Transactions on Knowledge and Data Engineering
Hierarchical multi-label prediction of gene function

Bioinformatics
ML-KNN: A lazy learning approach to multi-label learning

Pattern Recognition
Correlative multi-label video annotation

Proceedings of the 15th international conference on Multimedia
Subspace based feature selection for pattern recognition

Information Sciences: an International Journal
Decision trees for hierarchical multi-label classification

Machine Learning
Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection

Information Sciences: an International Journal
Investigating the effect of dataset size, metrics sets, and feature selection techniques on software fault prediction problem

Information Sciences: an International Journal
Ml-rbf: RBF Neural Networks for Multi-Label Learning

Neural Processing Letters
A wrapper method for feature selection using Support Vector Machines

Information Sciences: an International Journal
A Unified Model for Multilabel Classification and Ranking

Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Semi-supervised multi-label learning by constrained non-negative matrix factorization

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Case-based multilabel ranking

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Learning multi-label alternating decision trees from texts and data

MLDM'03 Proceedings of the 3rd international conference on Machine learning and data mining in pattern recognition

GP-COACH: Genetic Programming-based learning of COmpact and ACcurate fuzzy rule-based classification systems for High-dimensional problems

Information Sciences: an International Journal
On the relevance of linear discriminative features

Information Sciences: an International Journal
Simultaneous feature selection and classification using kernel-penalized support vector machines

Information Sciences: an International Journal
Applying electromagnetism-like mechanism for feature selection

Information Sciences: an International Journal
Designing a multi-label kernel machine with two-objective optimization

AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
mr2PSO: A maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification

Information Sciences: an International Journal
Graphical feature selection for multilabel classification tasks

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
hGA: Hybrid genetic algorithm in fuzzy rule-based classification systems for high-dimensional problems

Applied Soft Computing
An efficient multi-label support vector machine with a zero label

Expert Systems with Applications: An International Journal
Strengthening learning algorithms by feature discovery

Information Sciences: an International Journal
Multi-instance multi-label learning based on Gaussian process with application to visual mobile robot navigation

Information Sciences: an International Journal
Multi-label weighted k-nearest neighbor classifier with adaptive weight estimation

ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Local analgesia adverse effects prediction using multi-label classification

Neurocomputing
An extended one-versus-rest support vector machine for multi-label classification

Neurocomputing
Towards more efficient multi-label classification using dependent and independent dual space reduction

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Pattern classification of dermoscopy images: A perceptually uniform model

Pattern Recognition
Fuzzy Passive-Aggressive classification: A robust and efficient algorithm for online classification problems

Information Sciences: an International Journal
Multi-label ensemble based on variable pairwise constraint projection

Information Sciences: an International Journal
Fast multi-label core vector machine

Pattern Recognition
Feature selection for multi-label classification using multivariate mutual information

Pattern Recognition Letters
Filter approach feature selection methods to support multi-label learning based on relieff and information gain

SBIA'12 Proceedings of the 21st Brazilian conference on Advances in Artificial Intelligence
A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach

Electronic Notes in Theoretical Computer Science (ENTCS)
Exploiting label dependencies for improved sample complexity

Machine Learning
Multi-label learning with millions of labels: recommending advertiser bid phrases for web pages

Proceedings of the 22nd international conference on World Wide Web
Feature selection for classification of animal feed ingredients from near infrared microscopy spectra

Information Sciences: an International Journal
LCMKL: latent-community and multi-kernel learning based image annotation

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Enhancing learning algorithms to support data with short sequence features by automated feature discovery

Knowledge-Based Systems
Letters: Mutual information-based feature selection for multilabel classification

Neurocomputing
Reversible privacy preserving data mining: a combination of difference expansion and privacy preserving

The Journal of Supercomputing
A Framework to Generate Synthetic Multi-label Datasets

Electronic Notes in Theoretical Computer Science (ENTCS)
Multi-label learning under feature extraction budgets

Pattern Recognition Letters

Quantified Score

Hi-index	0.07

Visualization

Abstract

In multi-label learning, the training set is made up of instances each associated with a set of labels, and the task is to predict the label sets of unseen instances. In this paper, this learning problem is addressed by using a method called Mlnb which adapts the traditional naive Bayes classifiers to deal with multi-label instances. Feature selection mechanisms are incorporated into Mlnb to improve its performance. Firstly, feature extraction techniques based on principal component analysis are applied to remove irrelevant and redundant features. After that, feature subset selection techniques based on genetic algorithms are used to choose the most appropriate subset of features for prediction. Experiments on synthetic and real-world data show that Mlnb achieves comparable performance to other well-established multi-label learning algorithms.