An Empirical Study of Lazy Multilabel Classification Algorithms

  • Authors:
  • E. Spyromitros;G. Tsoumakas;Ioannis Vlahavas

  • Affiliations:
  • Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece 54124;Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece 54124;Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece 54124

  • Venue:
  • SETN '08 Proceedings of the 5th Hellenic conference on Artificial Intelligence: Theories, Models and Applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Multilabel classification is a rapidly developing field of machine learning. Despite its short life, various methods for solving the task of multilabel classification have been proposed. In this paper we focus on a subset of these methods that adopt a lazy learning approach and are based on the traditional k-nearest neighbor (k NN) algorithm. Two are our main contributions. Firstly, we implement BRk NN, an adaptation of the k NN algorithm for multilabel classification that is conceptually equivalent to using the popular Binary Relevance problem transformation method in conjunction with the k NN algorithm, but much faster. We also identify two useful extensions of BRk NN that improve its overall predictive performance. Secondly, we compare this method against two other lazy multilabel classification methods, in order to determine the overall best performer. Experiments on different real-world multilabel datasets, using a variety of evaluation metrics, expose the advantages and limitations of each method with respect to specific dataset characteristics.