Simultaneous scheduling of replication and computation for bioinformatic applications on the grid

  • Authors:
  • Frédéric Desprez;Antoine Vernois;Christophe Blanchet

  • Affiliations:
  • LIP, UMR CNRS-INRIA-UCBL 5668, ENS Lyon, France;LIP, UMR CNRS-INRIA-UCBL 5668, ENS Lyon, France;LBRS Laboratory / IBCP Institute, UMR 5086 CNRS, Univ. Claude Bernard Lyon 1, Lyon, France

  • Venue:
  • ISBMDA'05 Proceedings of the 6th International conference on Biological and Medical Data Analysis
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

One of the first motivations of using grids comes from applications managing large data sets like for example in High Energy Physic or Life Sciences. To improve the global throughput of software environments, replicas are usually put at wisely selected sites. Moreover, computation requests have to be scheduled among the available resources. To get the best performance, scheduling and data replication have to be tightly coupled which is not always the case in existing approaches. This paper presents an algorithm that combines data management and scheduling at the same time using a steady-state approach. Our theoretical results are validated using simulation and logs from a large life science application (ACI GRID GriPPS). The PattInProt application searches sites and signatures of proteins into databanks of protein sequences.