Implementation of single precision floating point square root on FPGAs

  • Authors:
  • Yamin Li;Wanming Chu

  • Affiliations:
  • -;-

  • Venue:
  • FCCM '97 Proceedings of the 5th IEEE Symposium on FPGA-Based Custom Computing Machines
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

The square root operation is hard to implement on FPGAs because of the complexity of the algorithms. In this paper, we present a non-restoring square root algorithm and two very simple single precision floating point square root implementations based on the algorithm on FPGAs. One is low-cost iterative implementation that uses a traditional adder/subtracter. The operation latency is 25 clock cycles and the issue rate is 24 clock cycles. The other is high-throughput pipelined implementation that uses multiple adder/subtracters. The operation latency is 15 clock cycles and the issue rate is one clock cycle. It means that the pipelined implementation is capable of accepting a square root instruction on every clock cycle.