The benefits of using multi-objectivization for mining pittsburgh partial classification rules in imbalanced and discrete data

  • Authors:
  • Julie Jacques;Julien Taillard;David Delerue;Laetitia Jourdan;Clarisse Dhaenens

  • Affiliations:
  • Société Alicante, Seclin, France;Société Alicante, Seclin, France;Société Alicante, Seclin, France;LIFL, Université Lille 1, Villeneuve d'Ascq, France;LIFL, Université Lille 1, Villeneuve d'Ascq, France

  • Venue:
  • Proceedings of the 15th annual conference on Genetic and evolutionary computation
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A large number of rule interestingness measures have been used as objectives in multi-objective classification rule mining algorithms. Aggregation or Pareto dominance are commonly used to deal with these multiple objectives. This paper compares these approaches on a partial classification problem over discrete and imbalanced data. After performing a Principal Component Analysis (PCA) to select candidate objectives and find conflictive ones, the two approaches are evaluated. The Pareto dominance-based approach is implemented as a dominance-based local search (DMLS) algorithm using confidence and sensitivity as objectives, while the other is implemented as a single-objective hill climbing using F-Measure as an objective, which combines confidence and sensitivity. Results shows that the dominance-based approach obtains statistically better results than the single-objective approach.