Performance evaluation of CP-PACS on CG benchmark

Authors:
Ken'ichi Itakura
Affiliations:
-
Venue:
HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
Year:
1997

Citing 2
Cited 0

A scalar architecture for pseudo vector processing based on slide-windowed registers

ICS '93 Proceedings of the 7th international conference on Supercomputing
High-performance parallel implementations of the NAS kernel benchmarks on the IBM SP2

IBM Systems Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this research, we evaluate NAS Parallel Benchmarks ver.1 Kernel CG on massively parallel processor CP-PACS, and analyze the result. CP-PACS' CPU has a special register which is auto-incremented by clock cycle, and we can instrument time spent for any function routine with very high accuracy. As a result of performance analysis, especially for data transfer time, our desk-top estimation fits to the instrumented result almost perfectly. From this analysis, we could show the bottleneck of the program when executing with a large number of PUs.