Towards systolic hardware acceleration for local complexity analysis of massive genomic data

  • Authors:
  • Agathoklis Papadopoulos;Vasilis J. Promponas;Theocharis Theocharides

  • Affiliations:
  • KIOS Research Center, University of Cyprus, Nicosia, Cyprus;Bioinformatics Research Laboratory, University of Cyprus, Nicosia, Cyprus;KIOS Research Center, University of Cyprus, Nicosia, Cyprus

  • Venue:
  • Proceedings of the great lakes symposium on VLSI
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern biological research has greatly benefited from genomics. Such research however requires extensive computational power, traditionally employed on large-scale cluster machines as well as multi-core systems. Recent research in reconfigurable architectures however suggests that FPGA-based acceleration of genomic algorithms greatly improves the performance and energy efficiency when compared to multi-core systems and clusters. In this work, we present an initial attempt for massive systolic acceleration of the popular CAST algorithm employed by biologists for complexity analysis of genomic data. CAST is used for detecting (and subsequently masking) low-complexity regions (LCRs) in protein sequences. We designed and implemented a high-performance hardware-accelerated version of CAST for which we built an FPGA prototype, and benchmarked its performance against serial and multithreaded versions of the CAST algorithm in software. The proposed architecture achieves remarkable speedup compared to both serial and multithreaded CAST implementations ranging from approx. 100x-9500x, depending on the dataset features, such as low-complexity content and sequence length distribution. Such performance may enable complex analyses of voluminous sequence datasets, and has the potential to interoperate with other hardware architectures for protein sequence analysis.