Accurate arithmetic for vector processors
Journal of Parallel and Distributed Computing - Parallelism in Computer Arithmetic
Advanced Arithmetic for the Digital Computer: Design of Arithmetic Units
Advanced Arithmetic for the Digital Computer: Design of Arithmetic Units
FCCM '06 Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
An Accelerator for Physics Simulations
Computing in Science and Engineering
Floating-Point Computations on Reconfigurable Computers
HPCMP-UGC '07 Proceedings of the 2007 DoD High Performance Computing Modernization Program Users Group Conference
An Implementation of the Conjugate Gradient Algorithm on FPGAs
FCCM '08 Proceedings of the 2008 16th International Symposium on Field-Programmable Custom Computing Machines
FPGA Based High Performance Double-Precision Matrix Multiplication
VLSID '09 Proceedings of the 2009 22nd International Conference on VLSI Design
ARCS '09 Proceedings of the 22nd International Conference on Architecture of Computing Systems
Complete Interval Arithmetic and Its Implementation on the Computer
Numerical Validation in Current Hardware Architectures
Hi-index | 0.00 |
Processor speed and available computing power constantly increases, enabling computation of more and more complex problems such as numerical simulations of physical processes. In this domain, however, the problem of accuracy arises due to rounding of intermediate results. One solution is to avoid intermediate rounding by using exact arithmetic. The use of FPGAs as application-specific accelerators can speed up such operations compared to their software implementation. In this paper, we present a system approach employing state-of-the art FPGA and interconnection technology for exact arithmetic with double-precision operands, delivering up to 400M exact MACs/s in total and providing a speedup of up to 88 times over competing software implementations in the case of matrix multiplication.