A design of cooperation management system to improve reliability in resource sharing computing environment

  • Authors:
  • Ji Su Park;Kwang Sik Chung;Jin Gon Shon

  • Affiliations:
  • Dept. of Computer Science, Korea National Open University, Seoul, Korea;Dept. of Computer Science, Korea National Open University, Seoul, Korea;Dept. of Computer Science, Korea National Open University, Seoul, Korea

  • Venue:
  • GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Resource sharing computing is a project that realizes high performance computing by utilizing the resources of peers that are connected to the Internet. Resource sharing computing provides a dynamic internet environment where peers can freely participate, but it raises questions on the reliability of operation processing. Existing resource sharing computing stores intermediate operation results in peers' local disks. Thus, when faults happen on peers' side, some peers need to wait for processing to reconnect with possibility of considerable delay. In case there is no reconnection, the intermediate operation results cannot be used. In addition, it is difficult to support cooperation due to incompatible modes of operation processing among heterogeneous systems. This thesis is to propose a cooperation management system, and define cooperation and cooperation groups necessary to improve the reliability in the resource sharing computing environment. Cooperation is a series of tasks that involve sorting tasks, processing tasks sequentially, and producing results. Cooperation group is a gathering of peers that can cooperate. Groups are created among different types of systems to enable cooperation among peers within the same group. Also, middle DB Server is proposed in a hierarchical structure to shorten delay and increase the reusability of intermediate operation results. As the intermediate operation results are stored in the middle DB Server in case there occurs a fault on peers' side, waiting time for reconnection is reduced through cooperation, and the reusability of intermediate operation result is improved. In this paper, we propose a structure that can store intermediate operation result in middle DB Server to improve reliability in resource sharing computing environment, and suggest a design for cooperation group service, discovery service, and task management service of cooperation management system.