Workload characteristics of DNA sequence analysis: from storage systems' perspective

  • Authors:
  • Kyeongyeol Lim;Geehan Park;Minsuk Choi;Youjip Won;Dongoh Kim;Hongyeon Kim

  • Affiliations:
  • Hanyang University, Seoul, Korea;Hanyang University, Seoul, Korea;Hanyang University, Seoul, Korea;Hanyang University, Seoul, Korea;Electronics and Telecommunications Research Institute, Daejeon, Korea;Electronics and Telecommunications Research Institute, Daejeon, Korea

  • Venue:
  • Proceedings of the 6th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

The recent development of NGS (Next Generation Sequencing) methods has greatly increased the amount of genome data and created the need for high-performance computing and high-performance storage systems. The key issue in developing high-performance storage systems is building a storage system that is optimized for NGS analysis pipeline. In this paper, we implemented a tool to collect and analyze I/O workload in NGS analysis pipeline. Using this tool, we executed NGS analysis pipeline and analyzed the characteristics of I/Os collected in the experiment.