Recursive algorithm with C++ program for floating-point arithmetic

  • Authors:
  • Sei-Jong Chung

  • Affiliations:
  • Northern Illinois University, DeKalb, Illinois

  • Venue:
  • ACM SIGCSE Bulletin
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

Floating point Arithmetic is a topic included in virtually all textbooks for Computer Systems (CS 3: ACM's Curriculum Recommendation) or for Computer Organization (CS 4: ACM's Curriculum Recommendation). This paper presents a mathematical optimization model for the topic. The problem of converting real (float) numbers into binary equivalents is first modeled as a Zero-One Integer Programming problem. Then, a Recursive Algorithm is formulated for Floating-Point Formats. Computer programs are written in both C and C++ for a 32-bit floating-point format, using the recursive algorithm. [The computer programs are available at the email