Computer graphics (2nd ed. in C): principles and practice
Computer graphics (2nd ed. in C): principles and practice
Computer arithmetic: algorithms and hardware designs
Computer arithmetic: algorithms and hardware designs
A hardware cost minimized fast Phong shader
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Hardware Designs for Exactly Rounded Elementary Functions
IEEE Transactions on Computers
Microsoft DirectX 9 Programmable Graphics Pipeline
Microsoft DirectX 9 Programmable Graphics Pipeline
High-Performance Architectures for Elementary Function Generation
ARITH '01 Proceedings of the 15th IEEE Symposium on Computer Arithmetic
High-Speed Function Approximation Using a Minimax Quadratic Interpolator
IEEE Transactions on Computers
IEEE Transactions on Computers
IEEE Micro
A High-Performance Area-Efficient Multifunction Interpolator
ARITH '05 Proceedings of the 17th IEEE Symposium on Computer Arithmetic
Table-based polynomials for fast hardware function evaluation
ASAP '05 Proceedings of the 2005 IEEE International Conference on Application-Specific Systems, Architecture Processors
Optimizing Hardware Function Evaluation
IEEE Transactions on Computers
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Elementary Functions: Algorithms and Implementation
Elementary Functions: Algorithms and Implementation
Computer
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Hi-index | 0.00 |
An high-speed special function unit (SFU) is presented in this paper. The system supports the single-precision IEEE-754 floating-point standard and implements faithfully rounded reciprocal, square root, reciprocal square root, logarithm, and exponential functions. The functions are approximated by using a novel constrained piecewise quadratic interpolation technique. In this way, the lookup table size is reduced by 40% with respect to previously proposed techniques, without any loss in accuracy. Error analysis and sizing methodology are presented in the paper. The SFU has been implemented in a 0.18-µm CMOS technology. The circuit is able to operate up to 420-MHz clock frequency, with a power dissipation of 160 mW at 420 MHz. The system can be employed in programmable graphics accelerators and in other applications where high-performance function evaluation is needed.