A moldable online scheduling algorithm and its application to parallel short sequence mapping

  • Authors:
  • Erik Saule;Doruk Bozdağ;Umit V. Catalyurek

  • Affiliations:
  • Department of Biomedical Informatics, The Ohio State University;Department of Biomedical Informatics, The Ohio State University;Department of Biomedical Informatics, The Ohio State University

  • Venue:
  • JSSPP'10 Proceedings of the 15th international conference on Job scheduling strategies for parallel processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A crucial step in DNA sequence analysis is mapping short sequences generated by next-generation instruments to a reference genome. In this paper, we focus on efficient online scheduling of multi-user parallel short sequence mapping queries on a multiprocessor system. With the availability of parallel execution models, the problem at hand becomes a moldable task scheduling problem where the number of processors needed to execute a task is determined by the scheduler. We propose an online scheduling algorithm to minimize the stretch of the tasks in the system. This metric provides improved fairness to small tasks compared to flow time metric and suits well to the nature of the problem. Experimental evaluation on two workload scenarios indicate that the algorithm results in significantly smaller stretch compared to a recent algorithm and it is more fair to small sized tasks.