Efficient realizations of squaring circuit and reciprocal used in adaptive sample rate notch filters
Journal of VLSI Signal Processing Systems - Special issue on VLSI arithmetic and implementations
A Simplified Architecture for Modulo (2n + 1) Multiplication
IEEE Transactions on Computers
High-Speed Parallel-Prefix Modulo 2n - 1 Adders
IEEE Transactions on Computers - Special issue on computer arithmetic
Residue Number Systems: Algorithms and Architectures
Residue Number Systems: Algorithms and Architectures
Implementation of a Communications Channelizer using FPGAs and RNS Arithmetic
Journal of VLSI Signal Processing Systems
A Reduced-Area Scheme for Carry-Select Adders
IEEE Transactions on Computers
An RNS Montgomery Modular Multiplication Algorithm
IEEE Transactions on Computers
Diminished-One Modulo 2^n +1 Adder Design
IEEE Transactions on Computers
Modified Booth Modulo 2^n-1 Multipliers
IEEE Transactions on Computers
A Full RNS Implementation of RSA
IEEE Transactions on Computers
Efficient Diminished-1 Modulo 2^n+1 Multipliers
IEEE Transactions on Computers
Residue Number Systems: Theory and Implementation
Residue Number Systems: Theory and Implementation
Efficient architectures for modulo 2n-1 squarers
DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
How to Teach Residue Number System to Computer Scientists and Engineers
IEEE Transactions on Education
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.00 |
Modulo 2^n+/-1 squarers are useful components for designing special purpose digital signal processors that internally use a residue number system and for implementing the modulo exponentiators and multiplicative inverses required in cryptographic algorithms. In this paper we propose, in a unified way, architectures for their design that are based on the radix-4 modified Booth encoding. For the modulo 2^n+1 case, both the normal and the diminished-one representations are considered. Experimental results show that the proposed squarers offer significant savings in the implementation area over previous proposals that can reach up to 38% for sufficiently large operand widths, while in many cases a small improvement in execution delay can also be achieved.