Data mining of gene expression changes in Alzheimer brain

  • Authors:
  • P. Roy Walker;Brandon Smith;Qing Yan Liu;A. Fazel Famili;Julio J. Valdés;Ziying Liu;Boleslaw Lach

  • Affiliations:
  • NeuroGenomics Group, Institute for Biological Sciences, National Research Council of Canada, 1200 Montreal Rd., Ottawa, Ont., Canada K1A 0R6;NeuroGenomics Group, Institute for Biological Sciences, National Research Council of Canada, 1200 Montreal Rd., Ottawa, Ont., Canada K1A 0R6;NeuroGenomics Group, Institute for Biological Sciences, National Research Council of Canada, 1200 Montreal Rd., Ottawa, Ont., Canada K1A 0R6;Integrated Reasoning Group, Institute for Information Technology, National Research Council of Canada, 1200 Montreal Rd., Ottawa, Ont., Canada K1A 0R6;Integrated Reasoning Group, Institute for Information Technology, National Research Council of Canada, 1200 Montreal Rd., Ottawa, Ont., Canada K1A 0R6;Integrated Reasoning Group, Institute for Information Technology, National Research Council of Canada, 1200 Montreal Rd., Ottawa, Ont., Canada K1A 0R6;Department of Laboratory Medicine, The Ottawa Hospital Civic Campus, 1053 Carling Avenue, Ottawa, Ont., Canada K1Y 4E9

  • Venue:
  • Artificial Intelligence in Medicine
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Genome-wide transcription profiling is a powerful technique for studying the enormous complexity of cellular states. Moreover, when applied to disease tissue it may reveal quantitative and qualitative alterations in gene expression that give information on the context or underlying basis for the disease and may provide a new diagnostic approach. However, the data obtained from high-density microarrays is highly complex and poses considerable challenges in data mining. The data requires care in both pre-processing and the application of data mining techniques. This paper addresses the problem of dealing with microarray data that come from two known classes (Alzheimer and normal). We have applied three separate techniques to discover genes associated with Alzheimer disease (AD). The 67 genes identified in this study included a total of 17 genes that are already known to be associated with Alzheimer's or other neurological diseases. This is higher than any of the previously published Alzheimer's studies. Twenty known genes, not previously associated with the disease, have been identified as well as 30 uncharacterized expressed sequence tags (ESTs). Given the success in identifying genes already associated with AD, we can have some confidence in the involvement of the latter genes and ESTs. From these studies we can attempt to define therapeutic strategies that would prevent the loss of specific components of neuronal function in susceptible patients or be in a position to stimulate the replacement of lost cellular function in damaged neurons. Although our study is based on a relatively small number of patients (four AD and five normal), we think our approach sets the stage for a major step in using gene expression data for disease modeling (i.e. classification and diagnosis). It can also contribute to the future of gene function identification, pathology, toxicogenomics, and pharmacogenomics.