A software architecture for parallel list processing on grids

  • Authors:
  • Apolo H. Hernández; Graciela Román-Alonso; Miguel A. Castro-García; Manuel Aguilar-Cornejo; Santiago Domínguez-Domínguez; Jorge Buenabad-Chávez

  • Affiliations:
  • Departamento de Ing. Eléctrica, Universidad Autónoma Metropolitana, México, DF., México (Hernández, Román-Alonso, Castro-García, Aguilar-Cornejo); Departamento de Computación, Centro de Investigación y de Estudios Avanzados del IPN, México, DF., México (Domínguez-Domínguez, Buenabad-Chávez)

  • Venue:
  • PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I
  • Year:
  • 2011

Abstract

The Data List Management Library (DLML) processes data lists in parallel and balances the workload transparently to programmers. Programmers only need to organise data into a list, use DLML functions to insert and get data items, and specify the sequential function(s) that process each data item according to the application logic. The first design of DLML targeted a single cluster. This paper presents DLML-Grid, a software architecture that lets DLML run in Grid environments composed of multiple distributed clusters. The architecture is hierarchical and tends to localise communication within clusters, thus reducing communication overhead. Using OpenVPN, we implemented a prototype of DLML-Grid and gathered empirical performance results on two clusters with two applications, one with a static workload and one with a dynamically generated workload. Overall, DLML-Grid performs much better than DLML.
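
The list-processing model described in the abstract can be illustrated with a minimal sketch. It is not DLML and does not use the DLML API: the names `process_list`, `process_item`, `square` and `split_or_sum` are hypothetical, and Python threads in one process stand in for the processors of a cluster. The sketch only shows the division of labour: the programmer supplies a list and a sequential function for each item, and the runtime hands items to whichever worker is free, including items inserted while processing is under way.

```python
import queue
import threading


def process_list(items, process_item, num_workers=4):
    """Process a list in parallel (illustrative only, not the DLML API).

    process_item(item) must return (result, new_items); new_items are
    inserted into the shared list, which models a dynamic workload.
    """
    work = queue.Queue()
    for item in items:
        work.put(item)

    results = []
    lock = threading.Lock()

    def worker():
        while True:
            item = work.get()          # "get" a data item from the list
            if item is None:           # sentinel: no more work
                work.task_done()
                return
            try:
                result, new_items = process_item(item)
                for new in new_items:  # "insert" dynamically generated items
                    work.put(new)
                with lock:
                    results.append(result)
            finally:
                work.task_done()

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    work.join()                        # wait until every item is processed
    for _ in threads:
        work.put(None)                 # tell each worker to stop
    for t in threads:
        t.join()
    return results


def square(x):
    """Static workload: every item is known up front."""
    return x * x, []


def split_or_sum(interval):
    """Dynamic workload: large intervals are split into new list items."""
    lo, hi = interval
    if hi - lo <= 4:
        return sum(range(lo, hi)), []
    mid = (lo + hi) // 2
    return 0, [(lo, mid), (mid, hi)]


if __name__ == "__main__":
    print(sum(process_list(range(10), square)))          # 285
    print(sum(process_list([(0, 100)], split_or_sum)))   # 4950
```

In this sketch all workers share a single queue, so idle workers pick up pending items automatically. A distributed system such as the one the abstract describes has to move list items between processors, and between clusters, by explicit communication, which is where a hierarchy that keeps traffic local becomes relevant.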