Large Margin Classification Using the Perceptron Algorithm
Machine Learning - The Eleventh Annual Conference on computational Learning Theory
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Neural Networks for Pattern Recognition
Neural Networks for Pattern Recognition
The Journal of Machine Learning Research
A family of additive online algorithms for category ranking
The Journal of Machine Learning Research
RCV1: A New Benchmark Collection for Text Categorization Research
The Journal of Machine Learning Research
Online Passive-Aggressive Algorithms
The Journal of Machine Learning Research
Noise Tolerant Variants of the Perceptron Algorithm
The Journal of Machine Learning Research
Multilabel classification via calibrated label ranking
Machine Learning
Efficient Pairwise Classification
ECML '07 Proceedings of the 18th European conference on Machine Learning
A Unified Model for Multilabel Classification and Ranking
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
A new perspective on an old perceptron algorithm
COLT'05 Proceedings of the 18th annual conference on Learning Theory
A comparison of methods for multiclass support vector machines
IEEE Transactions on Neural Networks
Classifier Chains for Multi-label Classification
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Learning from label preferences
DS'11 Proceedings of the 14th international conference on Discovery science
Rule stacking: an approach for compressing an ensemble of rule sets into a single classifier
DS'11 Proceedings of the 14th international conference on Discovery science
Efficient multilabel classification algorithms for large-scale problems in the legal domain
Semantic Processing of Legal Texts
Hi-index | 0.00 |
In this paper we applied multilabel classification algorithms to the EUR-Lex database of legal documents of the European Union. On this document collection, we studied three different multilabel classification problems, the largest being the categorization into the EUROVOC concept hierarchy with almost 4000 classes. We evaluated three algorithms: (i) the binary relevance approach which independently trains one classifier per label; (ii) the multiclass multilabel perceptron algorithm, which respects dependencies between the base classifiers; and (iii) the multilabel pairwise perceptron algorithm, which trains one classifier for each pair of labels. All algorithms use the simple but very efficient perceptron algorithm as the underlying classifier, which makes them very suitable for large-scale multilabel classification problems. The main challenge we had to face was that the almost 8,000,000 perceptrons that had to be trained in the pairwise setting could no longer be stored in memory. We solve this problem by resorting to the dual representation of the perceptron, which makes the pairwise approach feasible for problems of this size. The results on the EUR-Lex database confirm the good predictive performance of the pairwise approach and demonstrates the feasibility of this approach for large-scale tasks.