RSMA matching algorithm for searching biological sequences

  • Authors:
  • Ahmad Fadel Klaib;Hugh Osborne

  • Affiliations:
  • Informatics Department, University of Huddersfield, Queensgate, Huddersfield, UK;Informatics Department, University of Huddersfield, Queensgate, Huddersfield, UK

  • Venue:
  • IIT'09 Proceedings of the 6th international conference on Innovations in information technology
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Huge amounts of biological data are stored in linear files. Biological proteins are sequences of amino acids. The quantities of data in these fields tend to increase year on year. String matching algorithms playa key role in many computer science problems, and in the implementation of computer software. For this reason efficient string-matching algorithms should be used which use minimal computer storage and which minimize the searching response time. In this study, we propose a new algorithm called the Random String Matching Algorithm (RSMA). RSMA combines our enhanced preprocessing phase from the Berry Ravindran algorithm with our proposed new searching phase procedure. This variety ofsearching order allows our proposed algorithm to reduce the number of comparison characters and enhances the searching response time. Experimental results show that the RSMA algorithm offers a smaller number of comparisons and offers improved elapsed searching time when compared to other well-known algorithms.