Pareto-optimal patterns in logical analysis of data

  • Authors:
  • Peter L. Hammer;Alexander Kogan;Bruno Simeone;Sándor Szedmák

  • Affiliations:
  • RUTCOR, Rutgers University, 640 Bartholomew Road, Piscataway, New Jersey 08854-8003, USA;Accounting and Information Systems, Rutgers Business School, Rutgers University, 180 University Ave., Newark, New Jersey 07102, USA;Dipartimento di Statistica, Probabilití e Statistiche Applicate,University of Rome I "La Sapienza",Piazzale Aldo Moro 5, 00185 - Roma, Italy;RUTCOR, Rutgers University, 640 Bartholomew Road, Piscataway, New Jersey 08854-8003, USA

  • Venue:
  • Discrete Applied Mathematics
  • Year:
  • 2004

Quantified Score

Hi-index 0.04

Visualization

Abstract

Patterns are the key building blocks in the logical analysis of data (LAD). It has been observed in empirical studies and practical applications that some patterns are more ''suitable'' than others for use in LAD. In this paper, we model various such suitability criteria as partial preorders defined on the set of patterns. We introduce three such preferences, and describe patterns which are Pareto-optimal with respect to any one of them, or to certain combinations of them. We develop polynomial time algorithms for recognizing Pareto-optimal patterns, as well as for transforming an arbitrary pattern to a better Pareto-optimal one with respect to any one of the considered criteria, or their combinations. We obtain analytical representations characterizing some of the sets of Pareto-optimal patterns, and investigate the computational complexity of generating all Pareto-optimal patterns. The empirical evaluation of the relative merits of various types of Pareto-optimality is carried out by comparing the classification accuracy of Pareto-optimal theories on several real life data sets. This evaluation indicates the advantages of ''strong patterns'', i.e. those patterns which are Pareto-optimal with respect to the ''evidential preference'' introduced in this paper.