A parallel LCS algorithm for biosequences alignment

  • Authors:
  • Wei Liu; Ling Chen; Lingjun Zou

  • Affiliations:
  • Nanjing University of Aeronautics and Astronautics, Nanjing, China; Yangzhou University, Yangzhou, China; Yangzhou University, Yangzhou, China

  • Venue:
  • Proceedings of the 2nd international conference on Scalable information systems
  • Year:
  • 2007

Abstract

Searching for the longest common subsequence (LCS) of biosequences is one of the most important tasks in bioinformatics. A fast algorithm for the LCS problem, named FAST_LCS, is presented. The algorithm first seeks the successors of the initial identical character pairs according to a successor table, obtaining all the identical pairs and their levels. Then, by tracing back from the identical character pair at the largest level, the LCS is obtained. For two sequences X and Y with lengths n and m, the memory required by FAST_LCS is max{8*(n+1)+8*(m+1), L}, where L is the number of identical character pairs, and the time complexity of the parallel implementation is O(|LCS(X, Y)|), where |LCS(X, Y)| is the length of the LCS of X and Y. Experimental results on gene sequences from the TIGR database show that our algorithm produces exactly correct results and is faster and more efficient than other LCS algorithms.
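
The successor-table idea in the abstract can be illustrated with a small sequential sketch. It is not the authors' FAST_LCS implementation: it treats LCS as the longest common subsequence (which is what chaining identical character pairs by level computes), omits any pruning or parallelism the paper may use, and the names `successor_table` and `lcs_by_levels` are hypothetical. Positions are 1-based, with 0 serving as a virtual start and as the "no successor" marker.

```python
def successor_table(seq, alphabet):
    """table[c][i] = smallest 1-based position p > i with seq[p-1] == c, or 0 if none."""
    n = len(seq)
    table = {c: [0] * (n + 1) for c in alphabet}
    for c in alphabet:
        nxt = 0
        for i in range(n - 1, -1, -1):
            if seq[i] == c:
                nxt = i + 1          # 1-based position of this occurrence
            table[c][i] = nxt
    return table


def lcs_by_levels(x, y):
    """Return one longest common subsequence of x and y (illustrative sketch)."""
    alphabet = sorted(set(x) & set(y))
    tx = successor_table(x, alphabet)
    ty = successor_table(y, alphabet)

    pred = {}                 # pair -> predecessor recorded at the pair's largest level
    frontier = {(0, 0)}       # virtual start pair before both sequences
    level = 0
    while True:
        nxt = {}
        for (i, j) in frontier:
            for c in alphabet:
                p, q = tx[c][i], ty[c][j]
                if p and q:   # both sequences still contain c after (i, j)
                    nxt[(p, q)] = (i, j)
        if not nxt:
            break
        pred.update(nxt)      # later levels overwrite earlier ones
        frontier = set(nxt)
        level += 1

    if level == 0:
        return ""
    # Trace back from any identical pair at the largest level.
    pair = next(iter(frontier))
    chars = []
    while pair != (0, 0):
        chars.append(x[pair[0] - 1])
        pair = pred[pair]
    return "".join(reversed(chars))
```

In this sketch the number of passes through the outer loop equals |LCS(X, Y)| plus one final empty pass, which mirrors why a parallel version that expands all pairs of a level at once could finish in a number of steps proportional to the LCS length. All pairs within one level are independent of each other, so the inner loop is the natural place to parallelize.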