Floating-Point Computation with Just Enough Accuracy

  • Authors:
  • Hank Dietz; Bill Dieter; Randy Fisher; Kungyen Chang

  • Affiliations:
  • Department of Electrical & Computer Engineering, University of Kentucky (all authors)

  • Venue:
  • ICCS'06: Proceedings of the 6th International Conference on Computational Science, Part I
  • Year:
  • 2006

Abstract

Most mathematical formulae are defined in terms of operations on real numbers, but computers can only operate on numeric values of finite precision and range. Treating floating-point values as real numbers does not clearly identify the precision with which each value must be represented: too little precision yields inaccurate results, while too much wastes computational resources. The popularity of multimedia applications has made fast hardware support for low-precision floating-point arithmetic common in Digital Signal Processors (DSPs), in SIMD Within A Register (SWAR) instruction set extensions for general-purpose processors, and in Graphics Processing Units (GPUs). In this paper, we describe a simple approach by which the speed of these low-precision operations can be speculatively employed to meet user-specified accuracy constraints. Where the native precision(s) yield insufficient accuracy, a simple technique is used to efficiently synthesize enhanced precision using pairs of native values.
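The pair-of-native-values synthesis the abstract mentions is in the spirit of classic error-free transformations (Knuth's TwoSum and an FMA-based TwoProd), as used in double-single or double-double arithmetic. The sketch below illustrates that general idea in C; the struct name `float2`, the helper names, and the normalization strategy are illustrative assumptions, not the paper's actual implementation, which may organize the pair arithmetic differently to suit DSP, SWAR, or GPU hardware.

```c
#include <math.h>   /* fmaf */

/* A value stored as an unevaluated sum of two native floats:
   hi carries the leading bits, lo the rounding residual.
   (Illustrative sketch; not the paper's code.)                 */
typedef struct { float hi, lo; } float2;

/* Knuth's TwoSum: returns s, e such that a + b == s + e exactly. */
static float2 two_sum(float a, float b) {
    float s  = a + b;
    float bv = s - a;
    float e  = (a - (s - bv)) + (b - bv);
    return (float2){ s, e };
}

/* FMA-based TwoProd: returns p, e such that a * b == p + e exactly,
   assuming the target provides a fused multiply-add.               */
static float2 two_prod(float a, float b) {
    float p = a * b;
    float e = fmaf(a, b, -p);
    return (float2){ p, e };
}

/* Pair addition: roughly doubles the effective precision of the
   native float format at the cost of a handful of native adds.    */
static float2 pair_add(float2 x, float2 y) {
    float2 s = two_sum(x.hi, y.hi);
    float  e = s.lo + x.lo + y.lo;
    return two_sum(s.hi, e);   /* renormalize so |lo| stays small */
}

/* Pair multiplication built on TwoProd.                           */
static float2 pair_mul(float2 x, float2 y) {
    float2 p = two_prod(x.hi, y.hi);
    float  e = p.lo + x.hi * y.lo + x.lo * y.hi;
    return two_sum(p.hi, e);
}
```

In a speculative scheme of the kind the abstract describes, a computation would first run entirely in the fast native precision; only if an error estimate exceeds the user-specified accuracy bound would the affected values be promoted to pairs and the work redone with routines like `pair_add` and `pair_mul`. The specific error-estimation and promotion policy here is assumed for illustration rather than taken from the paper.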