Executing multiple pipelined data analysis operations in the grid

  • Authors:
  • Matthew Spencer;Renato Ferreira;Michael Beynon;Tahsin Kurc;Umit Catalyurek;Alan Sussman;Joel Saltz

  • Affiliations:
  • The Ohio State University, Columbus, OH;The Ohio State University, Columbus, OH;University of Maryland, College Park, MD;The Ohio State University, Columbus, OH;The Ohio State University, Columbus, OH;University of Maryland, College Park, MD;The Ohio State University, Columbus, OH

  • Venue:
  • Proceedings of the 2002 ACM/IEEE conference on Supercomputing
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Processing of data in many data analysis applications can be represented as an acyclic, coarse grain data flow, from data sources to the client. This paper is concerned with scheduling of multiple data analysis operations, each of which is represented as a pipelined chain of processing on data. We define the scheduling problem for effectively placing components onto Grid resources, and propose two scheduling algorithms. Experimental results are presented using a visualization application.