A Point of Contact Between Computer Science and Molecular Biology

Authors:
Webb Miller;Scott Schwartz;Ross C. Hardison
Venue:
IEEE Computational Science & Engineering
Year:
1994

Abstract

Molecular biology is rapidly becoming a data-rich science with extensive computational needs. The volume of data is growing exponentially, which poses a serious challenge for storing and retrieving biological information. Linking the heterogeneous data libraries of molecular biology, organizing its diverse and interrelated data sets, and developing effective query options for its databases are all areas for cross-fertilization between molecular biology and computer science. However, even the apparently simple task of analyzing a single DNA sequence requires complex collaboration. For several years, we have been developing a computer toolkit for analyzing DNA sequences. The biology of gene regulation in mammals has driven the design of this sequence comparison toolkit toward space-efficient algorithms with a high degree of sensitivity, and it has profoundly affected both the choice of tools and the development of algorithms. We sketch the biology of this class of problems and show how it specifically drives the software development. We also outline the main components of the toolkit.
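
To make the phrase "space-efficient algorithms" concrete, the following is a minimal sketch of one well-known idea: computing a local alignment score (Smith-Waterman style) while keeping only two rows of the dynamic-programming table in memory. It is an illustration only and is not taken from the authors' toolkit; the function name `local_alignment_score` and the scoring values (`match`, `mismatch`, `gap`) are assumptions chosen for the example.

```python
def local_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the best local alignment score of sequences a and b.

    Hypothetical helper for illustration. Memory use is O(len(b))
    because only the previous and current DP rows are kept, rather
    than the full len(a) x len(b) table.
    """
    prev = [0] * (len(b) + 1)  # DP row for the previous character of a
    best = 0
    for ch_a in a:
        curr = [0] * (len(b) + 1)  # DP row for the current character of a
        for j, ch_b in enumerate(b, start=1):
            # Extend the alignment diagonally with a match or mismatch.
            diag = prev[j - 1] + (match if ch_a == ch_b else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            if curr[j] > best:
                best = curr[j]
        prev = curr
    return best


if __name__ == "__main__":
    print(local_alignment_score("TTACGTTT", "GGACGGG"))
```

This version returns only the score. Recovering the alignment itself in linear space requires an additional divide-and-conquer step, which is omitted here to keep the sketch short.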