Information Retrieval Meets Gene Analysis

  • Authors:
  • Hagit Shatkay;Stephen Edwards;Mark Boguski

  • Affiliations:
  • -;-;-

  • Venue:
  • IEEE Intelligent Systems
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Current genomic research has generated an immense volume of data and a tremendous increase in the number of gene-related publications. This wealth of information presents a major data analysis challenge. The ultimate goal is to understand the complex biological interrelationships among all discovered genes and proteins. Meeting this goal will require both scanning the abundant literature about each gene and plenty of human expertise. As several research groups have recently noted, automated systems for extracting relevant information from the literature can complement existing techniques, speed up analysis, and greatly enhance our understanding of genetic processes. The authors present a method, based on probabilistic information retrieval, that uses the literature to establish functional relationships among genes on a genome-wide scale. Experiments applied to documents discussing yeast genes, and a comparison of the results to well-established gene functions, demonstrate the method's effectiveness.