What every computer scientist should know about floating-point arithmetic
ACM Computing Surveys (CSUR)
High performance computing
Computer Architecture: A Quantitative Approach
Computer Architecture: A Quantitative Approach
Speeding up N-body Calculations on Machines without Hardware Square Root
Scientific Programming
Hi-index | 0.00 |
The modern microprocessors have become more sophisticated, the performance of software on modern architectures has grown more and more difficult to dissect and prognosticate. The execution of a program nowadays entails the complex interaction of code, compiler and processor micro-architecture. The built-in functions to compute 1/x or exp(+/-x) of math library and hardware are often incapable of achieving the challenging performance of high-performance numerical computing. To meet this demand, the current trend in constructing high-performance numerical computing for specific processors Alpha 21264 & 21364, and IA-64 has been optimized for 1/x"i and exp(+/-x"i) for a vector of inputs x"i which is significantly faster than optimized library routines. A detailed deliberation of how the processor micro-architecture as well as the manual optimization techniques improve the computing performance has been developed.