Two stage architecture for multi-label learning

Authors:
Gjorgji Madjarov;Dejan Gjorgjevikj;Sašo Džeroski
Affiliations:
Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Rugjer Boshkovikj 16, 1000 Skopje, Macedonia and Department of Knowledge Technologies, Jožef Stefan Institute, Jamov ...;Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Rugjer Boshkovikj 16, 1000 Skopje, Macedonia;Department of Knowledge Technologies, Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana, Slovenia
Venue:
Pattern Recognition
Year:
2012

Abstract

A common approach to solving multi-label learning problems is to use problem transformation methods and dichotomizing classifiers, as in the pair-wise decomposition strategy. One problem with this strategy is that making a prediction requires querying a quadratic number of binary classifiers, which can be quite time-consuming, especially in learning problems with a large number of labels. To tackle this problem, we propose a Two Stage Architecture (TSA) for efficient multi-label learning. We analyze three implementations of this architecture: the Two Stage Voting Method (TSVM), the Two Stage Classifier Chain Method (TSCCM) and the Two Stage Pruned Classifier Chain Method (TSPCCM). Eight real-world datasets are used to evaluate the performance of the proposed methods. The performance of our approaches is compared with that of two algorithm adaptation methods (Multi-Label k-NN and Multi-Label C4.5) and five problem transformation methods (Binary Relevance, Classifier Chain, Calibrated Label Ranking with majority voting, the Quick Weighted method for pair-wise multi-label learning and the Label Powerset method). The results suggest that TSCCM and TSPCCM outperform the competing algorithms in terms of predictive accuracy, while TSVM has comparable predictive performance. In terms of testing speed, all three methods are faster than the pair-wise methods for multi-label learning.
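
To make the cost argument concrete, the following minimal Python sketch counts binary-classifier queries per prediction. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not describe how the two stages interact, so the sketch assumes a first stage that scores each label once and a second stage that consults pair-wise classifiers only among labels whose first-stage score reaches a threshold. The function names, the scores and the threshold value are all hypothetical.

```python
from itertools import combinations


def pairwise_queries(num_labels: int) -> int:
    """Classifiers queried by a full pair-wise decomposition: Q(Q-1)/2."""
    return num_labels * (num_labels - 1) // 2


def two_stage_queries(first_stage_scores, threshold=0.5):
    """Count queries for a hypothetical two-stage scheme.

    Assumption (not taken from the paper): stage one produces one score
    per label, and stage two queries pair-wise classifiers only between
    labels whose score is at least `threshold`.
    """
    # Stage one: one query per label; keep the labels that pass the threshold.
    candidates = [i for i, s in enumerate(first_stage_scores) if s >= threshold]
    # Stage two: one query per unordered pair of surviving labels.
    pairs = list(combinations(candidates, 2))
    total = len(first_stage_scores) + len(pairs)
    return total, pairs


if __name__ == "__main__":
    scores = [0.9, 0.1, 0.8, 0.2, 0.7]  # made-up first-stage outputs
    total, pairs = two_stage_queries(scores)
    print("full pair-wise queries:", pairwise_queries(len(scores)))
    print("two-stage queries:", total, "over pairs", pairs)
```

With 100 labels, a full pair-wise decomposition queries 4950 classifiers per prediction. Under the assumed filtering scheme the count depends on how many labels survive the first stage, which is why the saving grows with the number of labels when only a few are relevant to each example.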