EGEA: a new hybrid approach towards extracting reduced generic association rule set (application to AML blood cancer therapy)

  • Authors:
  • M. A. Esseghir;G. Gasmi;S. Ben Yahia;Y. Slimani

  • Affiliations:
  • Département des Sciences de l’Informatique, Faculté des Sciences de Tunis, Tunis, Tunisie;Département des Sciences de l’Informatique, Faculté des Sciences de Tunis, Tunis, Tunisie;Département des Sciences de l’Informatique, Faculté des Sciences de Tunis, Tunis, Tunisie;Département des Sciences de l’Informatique, Faculté des Sciences de Tunis, Tunis, Tunisie

  • Venue:
  • DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

To avoid obtaining an unmanageable highly sized association rule sets– compounded with their low precision– that often make the perusal of knowledge ineffective, the extraction and exploitation of compact and informative generic basis of association rules is a becoming a must. Moreover, they provide a powerful verification technique for hampering gene mis-annotating or badly clustering in the Unigene library. However, extracted generic basis is still oversized and their exploitation is impractical. Thus, providing critical nuggets of extra-valued knowledge is a compellingly addressable issue. To tackle such a drawback, we propose in this paper a novel approach, called EGEA (Evolutionary Gene Extraction Approach). Such approach aims to considerably reduce the quantity of knowledge, extracted from a gene expression dataset, presented to an expert. Thus, we use a genetic algorithm to select the more predictive set of genes related to patient situations. Once, the relevant attributes (genes) have been selected, they serve as an input for a second approach stage, i.e., extracting generic association rules from this reduced set of genes. The notably decrease of the generic association rule cardinality, extracted from the selected gene set, permits to improve the quality of knowledge exploitation. Carried out experiments on a benchmark dataset pointed out that among this set, there are genes which are previously unknown prognosis-associated genes. This may serve as molecular targets for new therapeutic strategies to repress the relapse of pediatric acute myeloid leukemia (AML).