High-performance computing of 1/√xi and exp(±xi) for a vector of inputs xi on Alpha and IA-64 CPUs

  • Authors:
  • Md. Haidar Sharif;Achim Basermann;Christian Seidel;Axel Hunger

  • Affiliations:
  • ISE, Computer Engineering, University of Duisburg-Essen, 47057 Duisburg, Germany;NEC Europe Ltd., C&C Research Laboratories, 53757 Sankt Augustin, Germany;Max Planck Institute of Colloids and Interfaces, 14424 Potsdam, Germany;Technische Informatik, University of Duisburg-Essen, 47057 Duisburg, Germany

  • Venue:
  • Journal of Systems Architecture: the EUROMICRO Journal
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The modern microprocessors have become more sophisticated, the performance of software on modern architectures has grown more and more difficult to dissect and prognosticate. The execution of a program nowadays entails the complex interaction of code, compiler and processor micro-architecture. The built-in functions to compute 1/x or exp(+/-x) of math library and hardware are often incapable of achieving the challenging performance of high-performance numerical computing. To meet this demand, the current trend in constructing high-performance numerical computing for specific processors Alpha 21264 & 21364, and IA-64 has been optimized for 1/x"i and exp(+/-x"i) for a vector of inputs x"i which is significantly faster than optimized library routines. A detailed deliberation of how the processor micro-architecture as well as the manual optimization techniques improve the computing performance has been developed.