Capturing correlations of multiple labels: A generative probabilistic model for multi-label learning

Authors:
Haiping Ma;Enhong Chen;Linli Xu;Hui Xiong
Affiliations:
Computer Science Department, University of Science and Technology of China (USTC), China;Computer Science Department, University of Science and Technology of China (USTC), China;Computer Science Department, University of Science and Technology of China (USTC), China;Management Science and Information Systems Department, Rutgers University, United States
Venue:
Neurocomputing
Year:
2012

Citing 25
Cited 2

BoosTexter: A Boosting-based Systemfor Text Categorization

Machine Learning - Special issue on information retrieval
An Evaluation of Statistical Approaches to Text Categorization

Information Retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Single-shot detection of multiple categories of text using parametric mixture models

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Latent dirichlet allocation

The Journal of Machine Learning Research
RCV1: A New Benchmark Collection for Text Categorization Research

The Journal of Machine Learning Research
Multi-labelled classification using maximum entropy method

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Short comings of latent models in supervised settings

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Collective multi-label classification

Proceedings of the 14th ACM international conference on Information and knowledge management
Pachinko allocation: DAG-structured mixture models of topic correlations

ICML '06 Proceedings of the 23rd international conference on Machine learning
Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization

IEEE Transactions on Knowledge and Data Engineering
Model-shared subspace boosting for multi-label classification

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Extracting shared subspace for multi-label classification

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Label ranking by learning pairwise preferences

Artificial Intelligence
Multilabel classification via calibrated label ranking

Machine Learning
Random k-Labelsets: An Ensemble Method for Multilabel Classification

ECML '07 Proceedings of the 18th European conference on Machine Learning
A Generative Probabilistic Model for Multi-label Classification

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Distributional Features for Text Categorization

IEEE Transactions on Knowledge and Data Engineering
Effective multi-label active learning for text classification

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Classifier Chains for Multi-label Classification

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Operations for learning with graphical models

Journal of Artificial Intelligence Research
Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Multi-label learning by exploiting label dependency

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Exploiting user interests for collaborative filtering: interests expansion via personalized ranking

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)

Semi-supervised multi-label image classification based on nearest neighbor editing

Neurocomputing
A PAM-based ontology concept and hierarchy learning method

Journal of Information Science

Quantified Score

Hi-index	0.01

Visualization

Abstract

Recent years have witnessed a considerable surge of interest in the multi-label learning problem. It has been shown that a key factor for a successful multi-label learning algorithm is to effectively exploit relations between labels. However, most of the previous work exploiting label relations focuses on pairwise relations. To handle the situations where there are intrinsic correlations among multiple labels, in this paper, we propose a generative model, Labeled Four-Level Pachinko Allocation Model (L-F-L-PAM), to capture correlations among multiple labels. In our approach of multi-label learning on text data, we apply the proposed model for inferring the training data and the standard Four-Level Pachinko Allocation Model for the test data. Furthermore, we propose a pruned Gibbs Sampling algorithm in the test stage to reduce the inference time. Finally, extensive experiments have been performed to validate the effectiveness and efficiency of our new approach. The results demonstrate significant improvements of our model over Labeled LDA (L-LDA) and superiority in terms of both effectiveness and computational efficiency over other high-performing multi-label learning methods.