Enabling data and compute intensive workflows in bioinformatics

  • Authors:
  • Gaurang Mehta;Ewa Deelman;James A. Knowles;Ting Chen;Ying Wang;Jens Vöckler;Steven Buyske;Tara Matise

  • Affiliations:
  • USC Information Sciences Institute;USC Information Sciences Institute;Keck School of Medicine of USC;University of Southern California;University of Southern California, USA and Xiamen University, P.R. China;USC Information Sciences Institute;Rutgers University;Rutgers University

  • Venue:
  • Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing - Volume 2
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Accelerated growth in the field of bioinformatics has resulted in large data sets being produced and analyzed. With this rapid growth has come the need to analyze these data in a quick, easy, scalable, and reliable manner on a variety of computing infrastructures including desktops, clusters, grids and clouds. This paper presents the application of workflow technologies, and, specifically, Pegasus WMS, a robust scientific workflow management system, to a variety of bioinformatics projects from RNA sequencing, proteomics, and data quality control in population studies using GWAS data.