Biological information as set-based complexity

  • Authors:
  • David J. Galas;Matti Nykter;Gregory W. Carter;Nathan D. Price;Ilya Shmulevich

  • Affiliations:
  • Institute for Systems Biology, Seattle, WA;Institute for Systems Biology, Seattle, WA and Department of Signal Processing, Tampere University of Technology, Tampere, Finland;Institute for Systems Biology, Seattle, WA;Department of Chemical and Biomolecular Engineering and Institute for Genomic Biology, University of Illinois, Urbana-Champaign, IL and Institute for Systems Biology, Seattle, WA;Institute for Systems Biology, Seattle, WA

  • Venue:
  • IEEE Transactions on Information Theory - Special issue on information theory in molecular biology and neuroscience
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The significant and meaningful fraction of all the potential information residing in the molecules and structures of living systems is unknown. Sets of random molecular sequences or identically repeated sequences, for example, would be expected to contribute little or no useful information to a cell. This issue of quantitation of information is important since the ebb and flow of biologically significant information is essential to our quantitative understanding of biological function and evolution. Motivated specifically by these problems of biological information, a class of measures is proposed to quantify the contextual nature of the information in sets of objects, based on Kolmogorov's intrinsic complexity. Such measures discount both random and redundant information and are inherent in that they do not require a defined state space to quantify the information. The maximization of this new measure, which can be formulated in terms of the universal information distance, appears to have several useful and interesting properties, some of which we illustrate with examples.