Performance analysis and optimization of AMGA for the large-scale virtual screening

  • Authors:
  • Sunil Ahn;Namgyu Kim;Seehoon Lee;Dukyun Nam;Soonwook Hwang;Birger Koblitz;Vincent Breton;Sangyong Han

  • Affiliations:
  • Korea Institute of Science and Technology Information (KISTI), 52-11, Eoeun-Dong, Yuseong-Gu, Daejeon, Korea;Korea Institute of Science and Technology Information (KISTI), 52-11, Eoeun-Dong, Yuseong-Gu, Daejeon, Korea;Korea Institute of Science and Technology Information (KISTI), 52-11, Eoeun-Dong, Yuseong-Gu, Daejeon, Korea;Korea Institute of Science and Technology Information (KISTI), 52-11, Eoeun-Dong, Yuseong-Gu, Daejeon, Korea;Korea Institute of Science and Technology Information (KISTI), 52-11, Eoeun-Dong, Yuseong-Gu, Daejeon, Korea;European Organization for Nuclear Research (CERN), 1211 Geneva, Switzerland;Laboratoire de Physique Corpusculaire, IN2P3-UMR CNRS 6533, 24 avenue des Landais, 63177 AUBIERE, France;School of Computer Science and Engineering, Seoul National University, 56-1, Sinlim-Dong, Kwanak-Gu, Seoul, Korea

  • Venue:
  • Software—Practice & Experience
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses performance issues on ARDA Metadata Grid Application (AMGA) and presents new techniques to improve the throughput of AMGA for the WISDOM environment. The first issue is a performance degradation problem when AMGA is used as a metadata service for task retrieval in the WISDOM environment. To deal with the issue, a new AMGA operation designed to reduce the communication overhead required to retrieve a task from AMGA is proposed. According to a performance study conducted with the new operation, the throughput of task retrieval using the proposed operation can be as much as 70 times higher than the throughput when using the existing AMGA operations. The second issue is an AMGA throughput issue in large-scale grid-enabled applications such as WISDOM, where it is not uncommon that thousands of jobs running on grid nodes access the AGMA service simultaneously. To address this issue, integration of a load-balancing technique and a DB connection pool technique into the AMGA are proposed. Test results demonstrate that the performance can be improved linearly in proportion to the number of AMGA servers set up for load balancing; the performance improvement continues until the performance limit of the backend database system is reached. Copyright © 2009 John Wiley & Sons, Ltd.