Comparing Methods for Multilabel Classification of Proteins Using Machine Learning Techniques

  • Authors:
  • Ricardo Cerri;Renato R. Silva;André C. Carvalho

  • Affiliations:
  • Instituto de Ciências Matemáticas e de Computação - ICMC/USP Avenida Trabalhador São-carlense --- 400 --- Centro, São Carlos - SP, Brasil Caixa Postal 668 --- CEP: 13 ...;Instituto de Ciências Matemáticas e de Computação - ICMC/USP Avenida Trabalhador São-carlense --- 400 --- Centro, São Carlos - SP, Brasil Caixa Postal 668 --- CEP: 13 ...;Instituto de Ciências Matemáticas e de Computação - ICMC/USP Avenida Trabalhador São-carlense --- 400 --- Centro, São Carlos - SP, Brasil Caixa Postal 668 --- CEP: 13 ...

  • Venue:
  • BSB '09 Proceedings of the 4th Brazilian Symposium on Bioinformatics: Advances in Bioinformatics and Computational Biology
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Multilabel classification is an important problem in bioinformatics and Machine Learning. In a conventional classification problem, examples belong to just one among many classes. When an example can simultaneously belong to more than one class, the classification problem is named multilabel classification problem. Protein function classification is a typical example of multilabel classification, since a protein may have more than one function. This paper describes the main characteristics of some multilabel classification methods and applies five methods to protein classification problems. For an experimental comparison of these methods, traditional machine learning techniques are used. The paper also compares different evaluation metrics used in multilabel problems.