Multi-functional floating-point MAF designs with dot product support

Authors:
Mustafa Gök;Metin Mete Özbilen
Affiliations:
Department of Electrical and Electronics Engineering, Cukurova University, Balcalı, Adana 01330, Turkey;Department of Computer Engineering, Mersin University, Çiftlikköy, Mersin 33342, Turkey
Venue:
Microelectronics Journal
Year:
2008

Citing 7
Cited 1

Comparison of Single- and Dual-Pass Multiply-Add Fused Floating-Point Units

IEEE Transactions on Computers
Exploiting Parallelism in Geometry Processing with General Purpose Processors and Floating-Point SIMD Instructions

IEEE Transactions on Computers
167 MHz Radix-4 Floating Point Multiplier

ARITH '95 Proceedings of the 12th Symposium on Computer Arithmetic
Dual-mode floating-point multiplier architectures with parallel operations

Journal of Systems Architecture: the EUROMICRO Journal
A New Architecture For Multiple-Precision Floating-Point Multiply-Add Fused Unit Design

ARITH '07 Proceedings of the 18th IEEE Symposium on Computer Arithmetic
A Two's Complement Parallel Array Multiplication Algorithm

IEEE Transactions on Computers
POWER3: the next generation of PowerPC processors

IBM Journal of Research and Development

VLIW coprocessor for IEEE-754 quadruple-precision elementary functions

ACM Transactions on Architecture and Code Optimization (TACO)

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents multi-functional double-precision and quadruple-precision floating-point multiply-add fused (FPMAF) designs. The double-precision FPMAF design can execute adouble-precision floating-point multiply-add, or two single-precision floating-point multiplications, or a single-precision floating-point dot product. The quadruple-precision FPMAF can perform similar operations with quadruple, double and single precision operands. These architectures can perform a dot-product operation two times or more faster than a basic FPMAF design. The presented multi-functional designs are compared with basic double-precision and quadruple-precision FPMAF designs by ASIC syntheses. The syntheses results show that the proposed double-precision implementation has 8%more area than a standard double-precision FPMAF implementation, and the proposed quadruple-precision design has 12.5% more area than a standard quadruple-precision FPMAF. Both of the proposed designs have one more pipeline stage compared to the basic designs.