Genome-wide efficient attribute selection for purely epistatic models via Shannon entropy

  • Authors:
  • Amirhossein Manzourolajdad;Mohammad Saraee;Aghafakhr Mirlohi;Abolfazl Javan

  • Affiliations:
  • Department of Electrical and Computer Engineering, Isfahan University of Technology (IUT), Isfahan, Iran.;Department of Electrical and Computer Engineering, Isfahan University of Technology (IUT), Isfahan, Iran.;Department of Agricultural Biotechnology, College of Agriculture, Isfahan University of Technology (IUT), Isfahan, Iran.;Department of Electrical and Computer Engineering, Isfahan University of Technology (IUT), Isfahan, Iran

  • Venue:
  • International Journal of Business Intelligence and Data Mining
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Epistasis plays an important role in the genetic architecture ofcommon human diseases. Most complex diseases are believed to havemultiple contributing loci that often have subtle patterns whichmake them fairly difficult to find in large data sets. Disordersthat follow purely epistatic models cannot be detected bycases/control studies based on individual analysis of susceptibleloci. The computational complexity of performing exhaustivesearches for detecting such models in genome-wide applications ispractically unfeasible. Furthermore, with ever-increasing number ofboth genotypes and individuals on one side, and little knowledge ofcomplex traits on the other, it is becoming fairly difficult andtime consuming to perform systematic genome-wide studies on suchtraits. We present and discuss a convenient framework for modellingepistasis using information theoretic concepts and algorithmsinspired by such an approach. These generalised algorithms, whichare especially in favour of purely epistatic models, are applied toboth simulated and real data. The real data represents thegenotype-phenotype values for Age-Related Macular Degeneration(AMD) disease. Many two-locus purely epistatic patterns were foundfor AMD. A new visualisation approach is also presented for thepurpose of better illustrating epistasy for cases where the numberof loci is more than two or three.