SlideShare a Scribd company logo
Natarajan Meghanathan et al. (Eds) : NeCoM, CSITEC - 2016
pp. 19–30, 2016. © CS & IT-CSCP 2016 DOI : 10.5121/csit.2016.61203
A FLOATING POINT DIVISION UNIT
BASED ON TAYLOR-SERIES EXPANSION
ALGORITHM AND ITERATIVE
LOGARITHMIC MULTIPLIER
Riyansh K. Karani, Akash K. Rana, Dhruv H. Reshamwala and
Kishore Saldanha
Electronics Department,
Dwarkadas J. Sanghvi College of Engineering, Mumbai
riyansh.karani.011235@gmail.com
akash9182akash@gmail.com
dhr.reshamwala@gmail.com
kishoresaldanha@gmail.com
ABSTRACT
Floating point division, even though being an infrequent operation in the traditional sense, is
indis-pensable when it comes to a range of non-traditional applications such as K-Means
Clustering and QR Decomposition just to name a few. In such applications, hardware support
for floating point division would boost the performance of the entire system. In this paper, we
present a novel architecture for a floating point division unit based on the Taylor-series
expansion algorithm. We show that the Iterative Logarithmic Multiplier is very well suited to be
used as a part of this architecture. We propose an implementation of the powering unit that can
calculate an odd power and an even power of a number simultaneously, meanwhile having little
hardware overhead when compared to the Iterative Logarithmic Multiplier.
KEYWORDS
Floating point division, Iterative Logarithmic Multiplier, Taylor-series
1. INTRODUCTION
Approximation methods such as Newton-Raphson and Taylor-series can be used to approximate
functions where direct computation of these functions is either computationally very expensive or
not possible. The Taylor-series expansion is an approximation method that generates a high order
polynomial approximation of a function at some value in its domain. The idea is simple; given
that a function and its first derivatives are continuous at some point in its domain, the function
can be approximated by a polynomial of degree at that point. The higher the order of this
polynomial, the better is the approximation. Following on this idea, the reciprocal of a number
can be approximated as a very simple Taylor-series polynomial, and thus the problem of division
of one number by another is essentially reduced to multiplication of one number and the Taylor-
20 Computer Science & Information Technology (CS & IT)
series polynomial of the other [6]. Calculating the terms of the Taylor-series polynomial that
approximates the reciprocal of a number involves calculating the powers of itself
where calculating each higher power generates a better approximation.
Naturally, there is a need of a multiplier unit to calculate these powers, and the performance of
the floating point division unit then depends almost solely on the performance of the multiplier
unit. There are many popular multiplier architectures in use today [3]. The Iterative Logarithmic
Multiplier is one such multiplier architecture [12]. What makes it an attractive choice is that it is
highly programmable. The accuracy of the product generated by this multiplier can be precisely
controlled, which comes in very handy when one does not need full precision multiplication, like
in the case of digital signal processing.
But perhaps what is even more important is that because of its inherent nature, its implementation
is very hardware efficient when it comes to computation of squares. This is because the
mathematical description of the Iterative Logarithmic Multiplier is quite simplified when
multiplying a number with itself, as when compared to multiplying two different numbers.
Because every even power of a number can be represented as a square of a smaller power
of the number every alternate power of is representable as a square of some other
number. Hence, the Iterative Logarithmic Multiplier is a very suitable candidate when choosing a
multiplier architecture for calculating the terms of the Taylor-series polynomial approximation of
a reciprocal.
In this paper, we present an architecture for a floating point division unit based on the Iterative
Logarithmic Multiplication algorithm, and the Taylor-series expansion algorithm. The proposed
architecture is designed to be hardware efficient, as is the requirement when designing
architectures for high speed computational units. We start by describing the Taylor-series
expansion algorithm in section II, and analyse the approximation errors generated when using this
approach. We then introduce the methodology for calculating the reciprocal of a number using
the Taylor-series approach, and derive a quantitative measure of the generated error. Since the
approach above requires an initial approximation of the reciprocal, in section III, we start by
describing and analysing linear approximation as a possible approach. We then build on this to
present the piecewise linear approximation based approach that we have employed in our
implementation of the floating point division unit. In section IV, we describe the Iterative
Logarithmic Multiplier as proposed by Babić , Avramović and Bulić [12], followed by the
discussion and implementation of the proposed squaring unit in section V. Finally, we present the
architecture for the proposed powering unit in section VI, and discuss its implementation and
features.
2. TAYLOR-SERIES EXPANSION ALGORITHM
Computer Science & Information Technology (CS & IT) 21
22 Computer Science & Information Technology (CS & IT)
3. INITIAL APPROXIMATION
As stated in the previous section, we need an initial approximation of the inverse of a number, in
order to calculate a more precise approximation using the Taylor-series expansion algorithm, and
according to equation (7), the number of iterations required to obtain an approximation with a
desired precision depends on the initial approximation. So, it is very important to select an
appropriate method for finding an initial approximation. There are different kinds of methods [5]
based on linear approximation, direct lookup tables [7] [8] [11], table lookup followed by
multiplication [4] and polynomial approximations [9]. In our implementation, we choose a
different approach. We employ a piecewise linear approximation for generating the initial
approximation, and we show that one can obtain any desired amount of precision using this
method, without much increase in complexity.
Figure 1: Plot showing the actual and linear Figure 2: Plot showing the values of
approximation values of x-1
, m for x in the range [1,2]
for a = 1 and b = 2
Computer Science & Information Technology (CS & IT) 23
24 Computer Science & Information Technology (CS & IT)
Computer Science & Information Technology (CS & IT) 25
Figure 3: Piecewise linear approximation of x-1
for x in the range [1,2], derived for n = 5
Table I: Piecewise Linear approximation segments
4. ITERATIVE LOGARITHMIC MULTIPLIER
Logarithmic Number System (LNS) based multipliers are a good choice when there is a
possibility of trading accuracy for speed (such as in Digital Signal Processing). The main
advantage of LNS based multipliers is the substitution of multiplication with addition, which is a
much simpler operation in terms of complexity. LNS multipliers can be divided into two
categories [12], one based on methods that use lookup-tables, and the others based on Mitchell’s
algorithm [10]. The major drawback with Mitchell’s algorithm is the error in the product due to
the piecewise linear approximation of the logarithmic curve. The Iterative Logarithmic
Multiplier, as the name suggests, proposes an iterative solution to computer this error term, and
hence generate a better approximation to the product.
26 Computer Science & Information Technology (CS & IT)
The binary representation of a number can be written as
Figure 4: Block diagram of an Figure 5: Architecture of the
Iterative Logarithmic Multiplier proposed squaring unit
Computer Science & Information Technology (CS & IT) 27
5. SQUARING UNIT
28 Computer Science & Information Technology (CS & IT)
6. POWERING UNIT
Figure 6: Flow diagram indicating the operation Figure 7: System implementation
of the powering unit for calculating
up to 12 powers of x
Computer Science & Information Technology (CS & IT) 29
7. CONCLUSION
In this paper, we propose and investigate a new architecture for a floating point division unit. We
show that the Taylor-series expansion algorithm can be used to generate approximations for the
reciprocal of a number up to an arbitrary precision, and analyse the errors for the same. We
propose a new piecewise linear approximation based method to generate the first approximation
required by the Taylor-series expansion algorithm, and present an extensive analysis. We then
present the architecture for a squaring unit derived from the Iterative Logarithmic Multiplier, and
argue that it requires less than 50% hardware, as compared to the Iterative Logarithmic
Multiplier. Finally, we present a cumulative implementation of the powering unit, and discuss
some of the enhancements made in order to further boost its performance. The complete system is
illustrated in Figure 7.
The performance of the system can be improved by pipelining the architecture for the Iterative
Logarithmic Multiplier [12] and the squaring unit, but at the cost of increase in hardware
utilization.
REFERENCES
[1] Tom M. Apostol. Calculus Vol. 1. 2nd ed. 2013, pp. 272–303.
[2] Birne Binegar. “Math 4513: Numerical Analysis”. 1998.
URL:https://math.okstate.edu/people/binegar/4513-F98/4513-l01.pdf.
[3] Deepak Bordiya and Lalit Bandil. “Comparative Analysis Of Multipliers (serial and parallel with
radix based on booth algorithm)”. In: International Journal of Engineering Research and Technology
Vol.2 - Issue 9 (Sept. 2013).
[4] Zhan Guo, Dongdong Chen, Bintian Zhou and Peter Nilsson. “Design and Implementation of
Reciprocal Unit”. In: IEEE (Jan. 2007)
[5] Ankit Khandelwal and Gaurav Agarwal. A Newton Raphson Divider Based on Improved Reciprocal
Approximation Algorithm. Dec. 2006.
[6] Taek-Jun Kwon. “Floating-Point unit design using Taylor-series Expansion algorithms”. PhD thesis.
University of Southern California, 2009.
[7] Albert A. Liddicoat and Michael J. Flynn. “High Performance Floating Point Divide”. In: Digital
Systems Design, 2001. Proceedings. Euromicro Symposium on. Sept. 2001.
[8] Naofumi Takagi, Masayuki Ito, and Shuzo Yajima. “Efficient Initial Approximation and Fast
Converging Methods for Division and Square Root”. In: ARITH ’95 Proceedings of the 12th
Symposium on Computer Arithmetic. 1995.
30 Computer Science & Information Technology (CS & IT)
[9] Jean-Michel Muller, Milos Ercegovac and Arnaud Tisserand. “Simple Seed Architectures for
Reciprocal and Square Root Reciprocal”. In: INRIA RR-5720 (2005), p. 25.
[10] J.N. Mitchell. “Computer multiplication and division using binary logarithms”. In: IRE Transactions
on Electronic Computers EC-11 (Aug. 1962), pp. 512–517.
[11] Oskar Mencer, Patrick Hung, Hossam Fahmy and Michael J. Flynn. “Fast Division algorithm with a
Small Lookup Table”. In: Circuits, Systems and Computers, 1997. Conference Record. 1997 11th
Asilomar Conference on. 1997.
[12] Patricio Bulić, Zdenka Babić, Aleksej Avramović.“An Iterative Logarithmic Multiplier”. In:
Elektrotehnis̆ ki vestnik 77(1) (2010), pp. 25–30.

More Related Content

PDF
CORRELATION OF EIGENVECTOR CENTRALITY TO OTHER CENTRALITY MEASURES: RANDOM, S...
PDF
An experimental evaluation of similarity-based and embedding-based link predi...
PDF
EVALUATING SYMMETRIC INFORMATION GAP BETWEEN DYNAMICAL SYSTEMS USING PARTICLE...
PDF
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
PDF
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
PDF
Using spectral radius ratio for node degree
PDF
Energy Efficient Power Failure Diagonisis For Wireless Network Using Random G...
PDF
CENTRALITY OF GRAPH ON DIFFERENT NETWORK TOPOLOGIES
CORRELATION OF EIGENVECTOR CENTRALITY TO OTHER CENTRALITY MEASURES: RANDOM, S...
An experimental evaluation of similarity-based and embedding-based link predi...
EVALUATING SYMMETRIC INFORMATION GAP BETWEEN DYNAMICAL SYSTEMS USING PARTICLE...
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
Using spectral radius ratio for node degree
Energy Efficient Power Failure Diagonisis For Wireless Network Using Random G...
CENTRALITY OF GRAPH ON DIFFERENT NETWORK TOPOLOGIES

What's hot (20)

PDF
APPROXIMATING NASH EQUILIBRIUM UNIQUENESS OF POWER CONTROL IN PRACTICAL WSNS
PDF
Density Based Clustering Approach for Solving the Software Component Restruct...
PDF
Dynamic approach to k means clustering algorithm-2
PDF
Centrality Prediction in Mobile Social Networks
PDF
Datapath
PDF
Spatial correlation based clustering algorithm for random and uniform topolog...
PDF
STUDY OF TASK SCHEDULING STRATEGY BASED ON TRUSTWORTHINESS
PDF
A reduced complexity and an efficient channel
DOCX
K means report
PDF
International Journal of Engineering Research and Development
PPTX
Implement principal component analysis (PCA) in python from scratch
PDF
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
PDF
One dimensional vector based pattern
DOCX
8.clustering algorithm.k means.em algorithm
DOC
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
PPTX
Quantum-Min-Cut/Max-Flow-Based Vertex Importance Ranking
PDF
Vol 16 No 2 - July-December 2016
PDF
Clustering Algorithms for Data Stream
PPTX
Quantum persistent k cores for community detection
PDF
Welcome to International Journal of Engineering Research and Development (IJERD)
APPROXIMATING NASH EQUILIBRIUM UNIQUENESS OF POWER CONTROL IN PRACTICAL WSNS
Density Based Clustering Approach for Solving the Software Component Restruct...
Dynamic approach to k means clustering algorithm-2
Centrality Prediction in Mobile Social Networks
Datapath
Spatial correlation based clustering algorithm for random and uniform topolog...
STUDY OF TASK SCHEDULING STRATEGY BASED ON TRUSTWORTHINESS
A reduced complexity and an efficient channel
K means report
International Journal of Engineering Research and Development
Implement principal component analysis (PCA) in python from scratch
Macromodel of High Speed Interconnect using Vector Fitting Algorithm
One dimensional vector based pattern
8.clustering algorithm.k means.em algorithm
K-MEDOIDS CLUSTERING USING PARTITIONING AROUND MEDOIDS FOR PERFORMING FACE R...
Quantum-Min-Cut/Max-Flow-Based Vertex Importance Ranking
Vol 16 No 2 - July-December 2016
Clustering Algorithms for Data Stream
Quantum persistent k cores for community detection
Welcome to International Journal of Engineering Research and Development (IJERD)
Ad

Viewers also liked (17)

PDF
Robust Visual Tracking Based on Sparse PCA-L1
PDF
A LITERATURE REVIEW ON SEMANTIC WEB – UNDERSTANDING THE PIONEERS’ PERSPECTIVE
PDF
Exploring The Dynamic Integration of Heterogeneous Services
PDF
FILE SYNCHRONIZATION SYSTEMS SURVEY
PDF
Basic Evaluation of Antennas Used in Microwave Imaging for Breast Cancer Dete...
PDF
CREATING DATA OUTPUTS FROM MULTI AGENT TRAFFIC MICRO SIMULATION TO ASSIMILATI...
PDF
Key Management Scheme for Secure Group Communication in WSN with Multiple Gr...
PPTX
Understanding the Impact of the Social Media on Digital Marketing and E-comme...
PDF
WIRELESS SENSORS INTEGRATION INTO INTERNET OF THINGS AND THE SECURITY PRIMITIVES
PDF
Google Apps
PDF
EVALUATION AND STUDY OF SOFTWARE DEGRADATION IN THE EVOLUTION OF SIX VERSIONS...
PDF
Explore the Effects of Emoticons on Twitter Sentiment Analysis
PPTX
Starbucks Case Study
PPTX
Facebook & Twitter Analytics
PPTX
Economic study on TATA Steel 2015
PPTX
What is Sleeping pralysis and how it effect us ?
PPTX
Market based instruments as a policy instrument for environmental problems
Robust Visual Tracking Based on Sparse PCA-L1
A LITERATURE REVIEW ON SEMANTIC WEB – UNDERSTANDING THE PIONEERS’ PERSPECTIVE
Exploring The Dynamic Integration of Heterogeneous Services
FILE SYNCHRONIZATION SYSTEMS SURVEY
Basic Evaluation of Antennas Used in Microwave Imaging for Breast Cancer Dete...
CREATING DATA OUTPUTS FROM MULTI AGENT TRAFFIC MICRO SIMULATION TO ASSIMILATI...
Key Management Scheme for Secure Group Communication in WSN with Multiple Gr...
Understanding the Impact of the Social Media on Digital Marketing and E-comme...
WIRELESS SENSORS INTEGRATION INTO INTERNET OF THINGS AND THE SECURITY PRIMITIVES
Google Apps
EVALUATION AND STUDY OF SOFTWARE DEGRADATION IN THE EVOLUTION OF SIX VERSIONS...
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Starbucks Case Study
Facebook & Twitter Analytics
Economic study on TATA Steel 2015
What is Sleeping pralysis and how it effect us ?
Market based instruments as a policy instrument for environmental problems
Ad

Similar to A FLOATING POINT DIVISION UNIT BASED ON TAYLOR-SERIES EXPANSION ALGORITHM AND ITERATIVE LOGARITHMIC MULTIPLIER (20)

PDF
Fpga based efficient multiplier for image processing applications using recur...
PDF
IRJET-Design and Implementation of LNS based Approximate Multiplier using Mit...
PDF
Implementation and Simulation of Ieee 754 Single-Precision Floating Point Mul...
PDF
Iy3116761679
PDF
Implementation of Low-Complexity Redundant Multiplier Architecture for Finite...
PDF
Implementation of an Effective Self-Timed Multiplier for Single Precision Flo...
PDF
SINGLE PRECISION FLOATING POINT MULTIPLIER USING SHIFT AND ADD ALGORITHM
PDF
Ap32283286
PDF
Design and Implementation of Single Precision Pipelined Floating Point Co-Pro...
PDF
International Journal of Engineering Research and Development
PDF
A Fast Floating Point Double Precision Implementation on Fpga
PDF
IRJET- A Review on Single Precision Floating Point Arithmetic Unit of 32 Bit ...
PDF
Bs25412419
PDF
Lp2520162020
PDF
Lp2520162020
PDF
Bu33436438
PDF
Bu33436438
PDF
IRJET- Single Precision Floating Point Arithmetic using VHDL Coding
PDF
Comparative study of single precision floating point division using differen...
PDF
IRJET - Design and Implementation of Double Precision FPU for Optimised Speed
Fpga based efficient multiplier for image processing applications using recur...
IRJET-Design and Implementation of LNS based Approximate Multiplier using Mit...
Implementation and Simulation of Ieee 754 Single-Precision Floating Point Mul...
Iy3116761679
Implementation of Low-Complexity Redundant Multiplier Architecture for Finite...
Implementation of an Effective Self-Timed Multiplier for Single Precision Flo...
SINGLE PRECISION FLOATING POINT MULTIPLIER USING SHIFT AND ADD ALGORITHM
Ap32283286
Design and Implementation of Single Precision Pipelined Floating Point Co-Pro...
International Journal of Engineering Research and Development
A Fast Floating Point Double Precision Implementation on Fpga
IRJET- A Review on Single Precision Floating Point Arithmetic Unit of 32 Bit ...
Bs25412419
Lp2520162020
Lp2520162020
Bu33436438
Bu33436438
IRJET- Single Precision Floating Point Arithmetic using VHDL Coding
Comparative study of single precision floating point division using differen...
IRJET - Design and Implementation of Double Precision FPU for Optimised Speed

Recently uploaded (20)

PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Digital Strategies for Manufacturing Companies
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
medical staffing services at VALiNTRY
PPTX
Essential Infomation Tech presentation.pptx
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
AI in Product Development-omnex systems
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
history of c programming in notes for students .pptx
Reimagine Home Health with the Power of Agentic AI​
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Digital Strategies for Manufacturing Companies
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
medical staffing services at VALiNTRY
Essential Infomation Tech presentation.pptx
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
CHAPTER 2 - PM Management and IT Context
Design an Analysis of Algorithms I-SECS-1021-03
Which alternative to Crystal Reports is best for small or large businesses.pdf
VVF-Customer-Presentation2025-Ver1.9.pptx
AI in Product Development-omnex systems
How to Choose the Right IT Partner for Your Business in Malaysia
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
history of c programming in notes for students .pptx

A FLOATING POINT DIVISION UNIT BASED ON TAYLOR-SERIES EXPANSION ALGORITHM AND ITERATIVE LOGARITHMIC MULTIPLIER

  • 1. Natarajan Meghanathan et al. (Eds) : NeCoM, CSITEC - 2016 pp. 19–30, 2016. © CS & IT-CSCP 2016 DOI : 10.5121/csit.2016.61203 A FLOATING POINT DIVISION UNIT BASED ON TAYLOR-SERIES EXPANSION ALGORITHM AND ITERATIVE LOGARITHMIC MULTIPLIER Riyansh K. Karani, Akash K. Rana, Dhruv H. Reshamwala and Kishore Saldanha Electronics Department, Dwarkadas J. Sanghvi College of Engineering, Mumbai riyansh.karani.011235@gmail.com akash9182akash@gmail.com dhr.reshamwala@gmail.com kishoresaldanha@gmail.com ABSTRACT Floating point division, even though being an infrequent operation in the traditional sense, is indis-pensable when it comes to a range of non-traditional applications such as K-Means Clustering and QR Decomposition just to name a few. In such applications, hardware support for floating point division would boost the performance of the entire system. In this paper, we present a novel architecture for a floating point division unit based on the Taylor-series expansion algorithm. We show that the Iterative Logarithmic Multiplier is very well suited to be used as a part of this architecture. We propose an implementation of the powering unit that can calculate an odd power and an even power of a number simultaneously, meanwhile having little hardware overhead when compared to the Iterative Logarithmic Multiplier. KEYWORDS Floating point division, Iterative Logarithmic Multiplier, Taylor-series 1. INTRODUCTION Approximation methods such as Newton-Raphson and Taylor-series can be used to approximate functions where direct computation of these functions is either computationally very expensive or not possible. The Taylor-series expansion is an approximation method that generates a high order polynomial approximation of a function at some value in its domain. The idea is simple; given that a function and its first derivatives are continuous at some point in its domain, the function can be approximated by a polynomial of degree at that point. The higher the order of this polynomial, the better is the approximation. Following on this idea, the reciprocal of a number can be approximated as a very simple Taylor-series polynomial, and thus the problem of division of one number by another is essentially reduced to multiplication of one number and the Taylor-
  • 2. 20 Computer Science & Information Technology (CS & IT) series polynomial of the other [6]. Calculating the terms of the Taylor-series polynomial that approximates the reciprocal of a number involves calculating the powers of itself where calculating each higher power generates a better approximation. Naturally, there is a need of a multiplier unit to calculate these powers, and the performance of the floating point division unit then depends almost solely on the performance of the multiplier unit. There are many popular multiplier architectures in use today [3]. The Iterative Logarithmic Multiplier is one such multiplier architecture [12]. What makes it an attractive choice is that it is highly programmable. The accuracy of the product generated by this multiplier can be precisely controlled, which comes in very handy when one does not need full precision multiplication, like in the case of digital signal processing. But perhaps what is even more important is that because of its inherent nature, its implementation is very hardware efficient when it comes to computation of squares. This is because the mathematical description of the Iterative Logarithmic Multiplier is quite simplified when multiplying a number with itself, as when compared to multiplying two different numbers. Because every even power of a number can be represented as a square of a smaller power of the number every alternate power of is representable as a square of some other number. Hence, the Iterative Logarithmic Multiplier is a very suitable candidate when choosing a multiplier architecture for calculating the terms of the Taylor-series polynomial approximation of a reciprocal. In this paper, we present an architecture for a floating point division unit based on the Iterative Logarithmic Multiplication algorithm, and the Taylor-series expansion algorithm. The proposed architecture is designed to be hardware efficient, as is the requirement when designing architectures for high speed computational units. We start by describing the Taylor-series expansion algorithm in section II, and analyse the approximation errors generated when using this approach. We then introduce the methodology for calculating the reciprocal of a number using the Taylor-series approach, and derive a quantitative measure of the generated error. Since the approach above requires an initial approximation of the reciprocal, in section III, we start by describing and analysing linear approximation as a possible approach. We then build on this to present the piecewise linear approximation based approach that we have employed in our implementation of the floating point division unit. In section IV, we describe the Iterative Logarithmic Multiplier as proposed by Babić , Avramović and Bulić [12], followed by the discussion and implementation of the proposed squaring unit in section V. Finally, we present the architecture for the proposed powering unit in section VI, and discuss its implementation and features. 2. TAYLOR-SERIES EXPANSION ALGORITHM
  • 3. Computer Science & Information Technology (CS & IT) 21
  • 4. 22 Computer Science & Information Technology (CS & IT) 3. INITIAL APPROXIMATION As stated in the previous section, we need an initial approximation of the inverse of a number, in order to calculate a more precise approximation using the Taylor-series expansion algorithm, and according to equation (7), the number of iterations required to obtain an approximation with a desired precision depends on the initial approximation. So, it is very important to select an appropriate method for finding an initial approximation. There are different kinds of methods [5] based on linear approximation, direct lookup tables [7] [8] [11], table lookup followed by multiplication [4] and polynomial approximations [9]. In our implementation, we choose a different approach. We employ a piecewise linear approximation for generating the initial approximation, and we show that one can obtain any desired amount of precision using this method, without much increase in complexity. Figure 1: Plot showing the actual and linear Figure 2: Plot showing the values of approximation values of x-1 , m for x in the range [1,2] for a = 1 and b = 2
  • 5. Computer Science & Information Technology (CS & IT) 23
  • 6. 24 Computer Science & Information Technology (CS & IT)
  • 7. Computer Science & Information Technology (CS & IT) 25 Figure 3: Piecewise linear approximation of x-1 for x in the range [1,2], derived for n = 5 Table I: Piecewise Linear approximation segments 4. ITERATIVE LOGARITHMIC MULTIPLIER Logarithmic Number System (LNS) based multipliers are a good choice when there is a possibility of trading accuracy for speed (such as in Digital Signal Processing). The main advantage of LNS based multipliers is the substitution of multiplication with addition, which is a much simpler operation in terms of complexity. LNS multipliers can be divided into two categories [12], one based on methods that use lookup-tables, and the others based on Mitchell’s algorithm [10]. The major drawback with Mitchell’s algorithm is the error in the product due to the piecewise linear approximation of the logarithmic curve. The Iterative Logarithmic Multiplier, as the name suggests, proposes an iterative solution to computer this error term, and hence generate a better approximation to the product.
  • 8. 26 Computer Science & Information Technology (CS & IT) The binary representation of a number can be written as Figure 4: Block diagram of an Figure 5: Architecture of the Iterative Logarithmic Multiplier proposed squaring unit
  • 9. Computer Science & Information Technology (CS & IT) 27 5. SQUARING UNIT
  • 10. 28 Computer Science & Information Technology (CS & IT) 6. POWERING UNIT Figure 6: Flow diagram indicating the operation Figure 7: System implementation of the powering unit for calculating up to 12 powers of x
  • 11. Computer Science & Information Technology (CS & IT) 29 7. CONCLUSION In this paper, we propose and investigate a new architecture for a floating point division unit. We show that the Taylor-series expansion algorithm can be used to generate approximations for the reciprocal of a number up to an arbitrary precision, and analyse the errors for the same. We propose a new piecewise linear approximation based method to generate the first approximation required by the Taylor-series expansion algorithm, and present an extensive analysis. We then present the architecture for a squaring unit derived from the Iterative Logarithmic Multiplier, and argue that it requires less than 50% hardware, as compared to the Iterative Logarithmic Multiplier. Finally, we present a cumulative implementation of the powering unit, and discuss some of the enhancements made in order to further boost its performance. The complete system is illustrated in Figure 7. The performance of the system can be improved by pipelining the architecture for the Iterative Logarithmic Multiplier [12] and the squaring unit, but at the cost of increase in hardware utilization. REFERENCES [1] Tom M. Apostol. Calculus Vol. 1. 2nd ed. 2013, pp. 272–303. [2] Birne Binegar. “Math 4513: Numerical Analysis”. 1998. URL:https://math.okstate.edu/people/binegar/4513-F98/4513-l01.pdf. [3] Deepak Bordiya and Lalit Bandil. “Comparative Analysis Of Multipliers (serial and parallel with radix based on booth algorithm)”. In: International Journal of Engineering Research and Technology Vol.2 - Issue 9 (Sept. 2013). [4] Zhan Guo, Dongdong Chen, Bintian Zhou and Peter Nilsson. “Design and Implementation of Reciprocal Unit”. In: IEEE (Jan. 2007) [5] Ankit Khandelwal and Gaurav Agarwal. A Newton Raphson Divider Based on Improved Reciprocal Approximation Algorithm. Dec. 2006. [6] Taek-Jun Kwon. “Floating-Point unit design using Taylor-series Expansion algorithms”. PhD thesis. University of Southern California, 2009. [7] Albert A. Liddicoat and Michael J. Flynn. “High Performance Floating Point Divide”. In: Digital Systems Design, 2001. Proceedings. Euromicro Symposium on. Sept. 2001. [8] Naofumi Takagi, Masayuki Ito, and Shuzo Yajima. “Efficient Initial Approximation and Fast Converging Methods for Division and Square Root”. In: ARITH ’95 Proceedings of the 12th Symposium on Computer Arithmetic. 1995.
  • 12. 30 Computer Science & Information Technology (CS & IT) [9] Jean-Michel Muller, Milos Ercegovac and Arnaud Tisserand. “Simple Seed Architectures for Reciprocal and Square Root Reciprocal”. In: INRIA RR-5720 (2005), p. 25. [10] J.N. Mitchell. “Computer multiplication and division using binary logarithms”. In: IRE Transactions on Electronic Computers EC-11 (Aug. 1962), pp. 512–517. [11] Oskar Mencer, Patrick Hung, Hossam Fahmy and Michael J. Flynn. “Fast Division algorithm with a Small Lookup Table”. In: Circuits, Systems and Computers, 1997. Conference Record. 1997 11th Asilomar Conference on. 1997. [12] Patricio Bulić, Zdenka Babić, Aleksej Avramović.“An Iterative Logarithmic Multiplier”. In: Elektrotehnis̆ ki vestnik 77(1) (2010), pp. 25–30.