An incremental node embedding technique for error correcting output codes

  • Authors:
  • Oriol Pujol;Sergio Escalera;Petia Radeva

  • Affiliations:
  • Dept. Matemàtica Aplicada i Anàlisi, UB, Gran Via 585, 08007 Barcelona, Spain and Centre de Visió per Computador and Computer Science Dept, Campus UAB, 08193 Bellaterra, Barcelona, ...;Centre de Visió per Computador and Computer Science Dept, Campus UAB, 08193 Bellaterra, Barcelona, Spain;Centre de Visió per Computador and Computer Science Dept, Campus UAB, 08193 Bellaterra, Barcelona, Spain

  • Venue:
  • Pattern Recognition
  • Year:
  • 2008

Abstract

The error correcting output codes (ECOC) technique is a useful way to extend any binary classifier to the multiclass case. The design of an ECOC matrix usually assumes a number of dichotomizers that is fixed a priori. We argue that the selection and number of dichotomizers must depend on the performance of the ensemble code in relation to the problem domain. In this paper, we present a novel approach that improves the performance of any initial output coding by extending it in a sub-optimal way. The proposed strategy creates new dichotomizers that minimize the confusion among classes, as measured by the confusion matrix on a validation subset. A weighted methodology is proposed to take into account the different relevance of each dichotomizer. As a result, overfitting is avoided and small codes with good generalization performance are obtained. In the decoding step, we introduce a new strategy that follows the principle that positions coded with the symbol zero should have little influence on the results. We compare our strategy to other well-known ECOC strategies on the UCI database, and the results show a significant improvement.
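
To make the decoding principle concrete, the following is a minimal sketch of ternary ECOC decoding in Python. It is not the decoding rule proposed in the paper, which the abstract does not specify. It assumes a coding matrix with entries in {-1, 0, +1}, dichotomizer outputs in {-1, +1}, and an illustrative rule in which positions coded with zero are skipped and the weighted mismatch is normalized by the weight of the remaining positions. The function name `decode_ternary_ecoc` and the `weights` parameter are hypothetical.

```python
from typing import Optional, Sequence


def decode_ternary_ecoc(
    code_matrix: Sequence[Sequence[int]],
    predictions: Sequence[int],
    weights: Optional[Sequence[float]] = None,
) -> int:
    """Return the index of the class whose codeword best matches the predictions.

    Assumption (not from the paper): zero-coded positions are ignored, and the
    weighted mismatch is divided by the total weight of the non-zero positions
    so that classes with many zeros are not favored.
    """
    if weights is None:
        weights = [1.0] * len(predictions)

    best_class, best_distance = -1, float("inf")
    for cls, codeword in enumerate(code_matrix):
        mismatch = 0.0  # weighted count of disagreeing non-zero positions
        active = 0.0    # total weight of non-zero positions in this codeword
        for code, pred, w in zip(codeword, predictions, weights):
            if code == 0:
                continue  # the dichotomizer was not trained on this class
            active += w
            if code != pred:
                mismatch += w
        distance = mismatch / active if active > 0 else float("inf")
        if distance < best_distance:
            best_class, best_distance = cls, distance
    return best_class


# One-versus-one coding for three classes; the columns are the dichotomizers
# (class 0 vs 1), (class 0 vs 2), and (class 1 vs 2).
ONE_VS_ONE = [
    [+1, +1, 0],
    [-1, 0, +1],
    [0, -1, -1],
]

predicted_class = decode_ternary_ecoc(ONE_VS_ONE, [-1, +1, +1])
```

In this example, the outputs `[-1, +1, +1]` agree with both non-zero positions of the second codeword, so `predicted_class` is 1. Passing per-dichotomizer `weights` lets more reliable dichotomizers count more, which is one simple way to reflect the different relevance of each dichotomizer.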