A mixed integer optimisation model for data classification

  • Authors:
  • Gang Xu;Lazaros G. Papageorgiou

  • Affiliations:
  • Centre for Process Systems Engineering, Department of Chemical Engineering, UCL (University College London), Torrington Place, London WC1E 7JE, UK;Centre for Process Systems Engineering, Department of Chemical Engineering, UCL (University College London), Torrington Place, London WC1E 7JE, UK

  • Venue:
  • Computers and Industrial Engineering
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this work, a mixed integer linear programming (MILP) model is proposed for the multi-class data classification problem using a hyper-box representation. The latter representation is particularly suitable for capturing disjoint data regions. The objective function used is the minimisation of the total number of misclassified data samples. In order to improve the training and testing accuracy of our approach, an iterative solution procedure is developed to assign potential multiple boxes to each single class. Finally, the applicability of the proposed approach is demonstrated through a number of illustrative examples. According to the computational results obtained, the proposed optimisation-based approach is competitive in terms of prediction accuracy when compared with various standard classifiers.