A PTS-PGATS based approach for data-intensive scheduling in data grids

  • Authors:
  • Kenli Li;Zhao Tong;Dan Liu;Teklay Tesfazghi;Xiangke Liao

  • Affiliations:
  • College of Information Science and Engineering, Hunan University, Changsha, China 410082;College of Information Science and Engineering, Hunan University, Changsha, China 410082;College of Information Science and Engineering, Hunan University, Changsha, China 410082;College of Information Science and Engineering, Hunan University, Changsha, China 410082;Computer School, National University of Defense Technology, Changsha, China 410072

  • Venue:
  • Frontiers of Computer Science in China
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Grid computing is the combination of computer resources in a loosely coupled, heterogeneous, and geographically dispersed environment. Grid data are the data used in grid computing, which consists of large-scale data-intensive applications, producing and consuming huge amounts of data, distributed across a large number of machines. Data grid computing composes sets of independent tasks each of which require massive distributed data sets that may each be replicated on different resources. To reduce the completion time of the application and improve the performance of the grid, appropriate computing resources should be selected to execute the tasks and appropriate storage resources selected to serve the files required by the tasks. So the problem can be broken into two sub-problems: selection of storage resources and assignment of tasks to computing resources. This paper proposes a scheduler, which is broken into three parts that can run in parallel and uses both parallel tabu search and a parallel genetic algorithm. Finally, the proposed algorithm is evaluated by comparing it with other related algorithms, which target minimizing makespan. Simulation results show that the proposed approach can be a good choice for scheduling large data grid applications.