Four-Ary Tree-Based Barrier Synchronization for 2D Meshes without Nonmember Involvement

  • Authors:
  • Sangman Moh;Chansu Yu;Ben Lee;Hee Young Youn;Dongsoo Han;Dongman Lee

  • Affiliations:
  • Information and Communications Univ., Taejon, Korea;Information and Communications Univ., Taejon, Korea;Oregon State Univ., Corvallis;Sungkyunkwan Univ., Suwon, Korea;Information and Communications Univ., Taejon, Korea;Information and Communications Univ., Taejon, Korea

  • Venue:
  • IEEE Transactions on Computers - Special issue on the parallel architecture and compilation techniques conference
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a Barrier Tree for Meshes (BTM) to minimize the barrier synchronization latency for two-dimensional (2D) meshes. The proposed BTM scheme has two distinguishing features. First, the synchronization tree is 4-ary. The synchronization latency of the BTM scheme is asymptotically $\Theta (\log_{4} n)$, while that of the fastest scheme reported in the literature is bounded between $\Omega (\log_{3} n)$ and $O (n^{1/2})$, where $n$ is the number of member nodes. Second, nonmember nodes are neither involved in the construction of a BTM nor actively participate in the synchronization operations, which avoids interference among different process groups during synchronization. This not only results in low setup overhead, but also reduces the synchronization latency. The low setup overhead is particularly effective for the dynamic process model provided in MPI-2. Extensive simulation study shows that, for up to $64 \times 64$ meshes, the BTM scheme results in about $40 \sim 70$ percent shorter synchronization latency and is more scalable than conventional schemes.