Two-Phase Barrier: A Synchronization Primitive for Improving the Processor Utilization

Authors:
Inbum Jung;Jongwoong Hyun;Joonwon Lee;Joongsoo Ma
Affiliations:
Division of Computer, Information & Telecommunication Engineering, Kangwon National University, 192-1, Hyoja 2-Dong Chunchon, Kangwon-Do 200-701, Korea. ibjung@kangwon.ac.kr;Division of Computer, Information & Telecommunication Engineering, Kangwon National University, 192-1, Hyoja 2-Dong Chunchon, Kangwon-Do 200-701, Korea;Division of Computer, Information & Telecommunication Engineering, Kangwon National University, 192-1, Hyoja 2-Dong Chunchon, Kangwon-Do 200-701, Korea;Division of Computer, Information & Telecommunication Engineering, Kangwon National University, 192-1, Hyoja 2-Dong Chunchon, Kangwon-Do 200-701, Korea
Venue:
International Journal of Parallel Programming
Year:
2001

Citing 13
Cited 0

Cache coherence protocols: evaluation using a multiprocessor simulation model

ACM Transactions on Computer Systems (TOCS)
Effects of synchronization barriers on multiprocessor performance

Parallel Computing
The fuzzy barrier: a mechanism for high speed synchronization of processors

ASPLOS III Proceedings of the third international conference on Architectural support for programming languages and operating systems
Scheduling parallel program tasks onto arbitrary target machines

Journal of Parallel and Distributed Computing - Special issue: software tools for parallel programming and visualization
Algorithms for scalable synchronization on shared-memory multiprocessors

ACM Transactions on Computer Systems (TOCS)
PYRROS: static task scheduling and code generation for message passing multiprocessors

ICS '92 Proceedings of the 6th international conference on Supercomputing
A dynamic processor allocation policy for multiprogrammed shared-memory multiprocessors

ACM Transactions on Computer Systems (TOCS)
Waiting algorithms for synchronization in large-scale multiprocessors

ACM Transactions on Computer Systems (TOCS)
Introduction to parallel computing: design and analysis of algorithms

Introduction to parallel computing: design and analysis of algorithms
Fast, contention-free combining tree barriers for shared-memory multiprocessors

International Journal of Parallel Programming
Scheduler-conscious synchronization

ACM Transactions on Computer Systems (TOCS)
MINT: A Front End for Efficient Simulation of Shared-Memory Multiprocessors

MASCOTS '94 Proceedings of the Second International Workshop on Modeling, Analysis, and Simulation On Computer and Telecommunication Systems
The Topological Barrier: A Synchronization Abstraction for Regularly-Structured Parallel Applications

The Topological Barrier: A Synchronization Abstraction for Regularly-Structured Parallel Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Barrier is widely used for synchronization in parallel programs. Since the process arrived earlier than others should wait at the barrier, the total processor utilization decreases. In this paper, to find the sources of the barrier waiting time, parallel programs are executed on the various grain sizes through execution-driven simulations. In simulation studies, we found that even if approximately equal amounts of work are distributed to each processor, all processes may not arrive at a barrier at the same time. The reasons are that the different numbers of cache misses and instructions within in partitioned grains result in the difference in arrival time of processors at the barrier. In this paper, the two-phased barrier is considered to reduce the blind waiting time in the traditional barrier scheme, which can be simply constructed by dividing one specific stage for the synchronization into two stages. On each stage, processes decide their stall or not, which is dependent on the current execution state of grains running on any given processors. Simulation results show that the reduced barrier waiting times attributed to the two-phased barrier contribute to the performance improvement of parallel programs.