DNA codes based on stem similarities between DNA sequences

  • Authors:
  • Arkadii D'yachkov;Anthony Macula;Vyacheslav Rykov;Vladimir Ufimtsev

  • Affiliations:
  • Moscow State University, Moscow, Russia;Air Force Res. Lab., IFTC, Rome Research Site, Rome, NY;University of Nebraska at Omaha, Omaha, NE;University of Nebraska at Omaha, Omaha, NE

  • Venue:
  • DNA13'07 Proceedings of the 13th international conference on DNA computing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

DNA codes consisting of DNA sequences are necessary for DNA computing. The minimum distance parameter of such codes is a measure of how dissimilar the codewords are, and thus is indirectly a measure of the likelihood of undetectedable or uncorrectable errors occurring during hybridization. To compute distance, an abstract metric, for example, longest common subsequence, must be used to model the actual bonding energies of DNA strands. In this paper we continue the development [1,2,3] of similarity functions for q-ary n-sequences The theoretical lower bound on the maximal possible size of codes, built on the space endowed with this metric, is obtained. that can be used (for q = 4) to model a thermodynamic similarity on DNA sequences. We introduce the concept of a stem similarity function and discuss DNA codes [2] based on the stem similarity. We suggest an optimal construction [2] and obtain random coding bounds on the maximum size and rate for such codes.