Receiving message prediction method

Authors:
Yoshiyuki Iwamoto;Koichi Suga;Kanemitsu Ootsu;Takashi Yokota;Takanobu Baba
Affiliations:
Nasu-Seiho High School;Hitachi Business Solution Co., Ltd.;Department of Information Science, Faculty of Engineering, Utsunomiya University, 7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan;Department of Information Science, Faculty of Engineering, Utsunomiya University, 7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan;Department of Information Science, Faculty of Engineering, Utsunomiya University, 7-1-2 Yoto, Utsunomiya, Tochigi 321-8585, Japan
Venue:
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Year:
2003

Citing 12
Cited 2

Active messages: a mechanism for integrated communication and computation

ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
Prefetching using Markov predictors

Proceedings of the 24th annual international symposium on Computer architecture
Data prefetching on the HP PA-8000

Proceedings of the 24th annual international symposium on Computer architecture
Selective, accurate, and timely self-invalidation using last-touch prediction

Proceedings of the 27th annual international symposium on Computer architecture
Parallel Computer Architecture: A Hardware/Software Approach

Parallel Computer Architecture: A Hardware/Software Approach
Improving Data Value Prediction Accuracy Using Path Correlation

HiPC '99 Proceedings of the 6th International Conference on High Performance Computing
Exploiting Speculative Thread-Level Parallelism on a SMT Processor

HPCN Europe '99 Proceedings of the 7th International Conference on High-Performance Computing and Networking
Transparent Real-Time Monitoring in MPI

Proceedings of the 11 IPPS/SPDP'99 Workshops Held in Conjunction with the 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing
Characterization of Communication Patterns in Message-Passing Parallel Scientific Application Programs

CANPC '98 Proceedings of the Second International Workshop on Network-Based Parallel Computing: Communication, Architecture, and Applications
Implementation of Hybrid Context Based Value Predictors Using Value Sequence Classification

Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Multi-stage Cascaded Prediction

Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Receiving message prediction method

Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing

Receiving message prediction method

Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Scalable memory registration for high performance networks using helper threads

Proceedings of the 8th ACM International Conference on Computing Frontiers

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes and evaluates the Receiving Message Prediction Method for high performance message passing. In this method, a node in the idle state predicts the next message reception, and speculatively executes the message reception and user processes. This method is independent of underlying computer architecture and message passing libraries. We propose the algorithms for the message prediction, and evaluate them from the viewpoint of the success ratio and speed-ups. We use the NAS parallel benchmark programs as typical parallel applications running on two different types of parallel platforms, i.e., a workstation cluster and a shared memory multiprocessor. The experimental results show that the method can be applied to various platforms. The method can also be implemented just by changing the software inside their message passing libraries without any support from the underlying system software or hardware. This mean that we do not require any change of application software that uses the libraries. The application of the method to the message passing interface libraries achieves a speed-up of 6.8% for the NAS Parallel Benchmarks, and the static and dynamic selection of prediction methods based on profiling results improve the performance.