SlideShare a Scribd company logo
FPGA-Based Multi-Level Approximate Multipliers
for High-Performance Error-Resilient Applications
Objective:
The main objective of the paper is to approximate multipliers which are efficiently deployed on Field
Programmable Gate Arrays (FPGAs) by using newly proposed approximate logic compressors at different
levels of accuracy.
Introduction:
These days in many of the error-resilient applications such as multimedia, data mining, image processing,
machine learning, etc.,precise computations are not always necessary. The computation results with some
degradation of accuracy can be acceptable and meaningful enough for such applications. By taking
advantage of this property, we take into consideration the trade-offs between the accuracy and electrical
performances of a circuit. That is, we can sacrifice some loss of accuracy for beneficial gains of power
dissipation, occupied area,and delay. In this paper the approximate 8×8 , 16×16 , and 32×32 multipliers
with different accuracies will be efficiently implemented on FPGAs by using proposed approximate
compressors.
Exact and Approximate 4–2 Compressors:
A conventional exact 4–2 compressor consists of two full adders as shown in the figure below. It has a
total of 5 inputs and 3 outputs. The output sum has the same weight of 1 as the inputs while the
outputs cout and carry have a weight of 2. The reduction tree of an 8×8 multiplier is illustrated in the
second figure in which the dashed shapes represent half and full adders and the solid shapes are the exact
4–2 compressors.
4–2 compressors: (a) a conventional exact 4–2 compressor, (b) an approximate 4–2 compressor.
The partial product reduction tree of an 8×8 multipliers by using conventional 4–2 compressor (solid),
half-, and full-adders (dashed).
In an approximate 4–2 compressor there are two outputs which can approximately count all 1’s at its four
inputs itself. The block diagram of an approximate 4–2 compressor is illustrated above. These two outputs
of the approximate 4–2 compressor can have the same or different weights. Compared to the conventional
exact 4–2 compressor, this approximate compressor does not have the carry input in it. This shortens the
delay and reduces the occupied area and power consumption drastically.
Approximate 8×8 Multipliers:
The proposed 8×8 approximate multiplier implementations consist of severalsteps including partial
product generation, PPM reduction at the first stage by using approximate 4–2 compressors in class 2, the
PPM reduction at the following stages n (n>1 ) by using approximate 4–2 compressors class 1 and the
generation of the final result by using a ripple carry adder (RCA). It is to be noted that the partial product
generation and the PPM reduction at the first stage are combined together to save the hardware resource,
thereby assisting in power consumption. This is a considerable difference between the proposed FPGA-
based implementations and ASIC-based ones.
A dot diagram of the proposed 8×8 approximate multiplier has been illustrated. The different kinds of
approximate compressors to compress the PPM at different reduction stages are also applied. In terms of
hardware resources and accuracy,each kind of approximate compressors are used properly for the PPM
reduction at different stages. Here for example, the approximate 4–2 compressors in class 2 is efficiently
applied to compress the PPM at the first stage since it costs small hardware resources and less power
dissipation. The approximate 4–2 compressor class 1 is appropriate with reducing the PPM height at
stages n (n >1) because they show considerable higher accuracies and consume less dynamic power.
The dot diagram of the proposed approximate 8×8 multiplier. Approximate 4–2 compressors CP3
(CP4) are used to reduce the PPM at the first reduction stage while the compressors CP1 (CP2)
are utilized for the reduction stage 2.
Moreover, the proposed approximate multiplier has been clustered into two parts with different
accuracies. The most significant partial products are accumulated by using accurate compressors and
adders while the least significant PPs are accumulated by utilizing presented approximate compressors. In
this work, different configurations of approximate multipliers with various accuracies were implemented.
For example, in the above figure the approximate multiplier is implemented with 5 leftmost columns of
the partial products compressed by using accurate compressors and adders. An RCA is used to add two
last rows of the final stage to generate the final result since the dedicated RCA is known as one of the
fastest adders on FPGAs till date.
Approximate 8×8 Multiplier Configurations
The proposed multi-level approximate architecture in this paper improves the accuracy,shortens the
delay, and reduces the power dissipation and the occupied area of the approximate multiplier. Finally, all
the proposed approximate 8×8 multipliers with different accuracy configurations are summarized in in the
above table. The M8_CP13_k group includes approximate 8×8 multipliers using the approximate 4–2
compressor CP3 at the first reduction stage and the compressor CP1 in the second reduction stage. k is the
number of leftmost partial product columns compressed by accurate compressors,full-adders and half-
adders.
In this work, the approximate 4–2 compressor CP2 is not used to construct approximate 8×8 multipliers.
Since only a small amount of approximate 4–2 compressors are used for the reduction stage 2, the
reduction in dynamic power dissipation is small while the loss of accuracy could be considerable. The
effectiveness of the approximate 4–2 compressor CP2 on the dynamic power reduction is evaluated on
larger operand size multipliers (i.e., 16×16 , 32×32 multipliers).
Conclusion:
In this paper the proposed approximate 8×8 , 16×16 , and 32×32 multipliers with different accuracies
were efficiently implemented on FPGAs by using proposed approximate compressors. The proposed
approximate multipliers were evaluated on both circuit and application levels to demonstrate their
effectiveness and applicability. This was the first work to implement the approximate multipliers and
measure their dynamic power consumption on an FPGA board. Finally, the proposed multipliers in this
paper were proven suitable for high-performance and low-power error-resilient applications.

More Related Content

PDF
Compressor based approximate multiplier architectures for media processing ap...
PDF
Design and Analysis of 4-2 Compressor for Arithmetic Application
PDF
IRJET- Approximate Multiplier and 8 Bit Dadda Multiplier Implemented through ...
PDF
IRJET - Design of a Low Power Serial- Parallel Multiplier with Low Transition...
PDF
IRJET- Comparison of Multiplier Design with Various Full Adders
PPTX
Low power high_speed
PDF
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
PPTX
MAC UNIT USING DIFFERENT MULTIPLIERS
Compressor based approximate multiplier architectures for media processing ap...
Design and Analysis of 4-2 Compressor for Arithmetic Application
IRJET- Approximate Multiplier and 8 Bit Dadda Multiplier Implemented through ...
IRJET - Design of a Low Power Serial- Parallel Multiplier with Low Transition...
IRJET- Comparison of Multiplier Design with Various Full Adders
Low power high_speed
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
MAC UNIT USING DIFFERENT MULTIPLIERS

What's hot (20)

PDF
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
PDF
Fast Multiplier for FIR Filters
PDF
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
PDF
A Pipelined Fused Processing Unit for DSP Applications
PPTX
Array multiplier
PDF
Design and implementation of high speed baugh wooley and modified booth multi...
PDF
Hz3115131516
PDF
Gv3512031207
PDF
Low Power VLSI Design of Modified Booth Multiplier
PDF
VLSI Implementation of High Speed & Low Power Multiplier in FPGA
PPTX
Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
PDF
Compare Efficiency of Different Multipliers Using Verilog Simulation & Modify...
PDF
N046018089
PDF
Design of 8-Bit Comparator Using 45nm CMOS Technology
PDF
Bivariatealgebraic integerencoded arai algorithm for
PDF
Layout Design Analysis of CMOS Comparator using 180nm Technology
PDF
J045075661
PDF
A comparative study of different multiplier designs
PPTX
design of high speed performance 64bit mac unit
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
Fast Multiplier for FIR Filters
A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast ...
A Pipelined Fused Processing Unit for DSP Applications
Array multiplier
Design and implementation of high speed baugh wooley and modified booth multi...
Hz3115131516
Gv3512031207
Low Power VLSI Design of Modified Booth Multiplier
VLSI Implementation of High Speed & Low Power Multiplier in FPGA
Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
Compare Efficiency of Different Multipliers Using Verilog Simulation & Modify...
N046018089
Design of 8-Bit Comparator Using 45nm CMOS Technology
Bivariatealgebraic integerencoded arai algorithm for
Layout Design Analysis of CMOS Comparator using 180nm Technology
J045075661
A comparative study of different multiplier designs
design of high speed performance 64bit mac unit
Ad

Similar to FPGA-Based Multi-Level Approximate Multipliers for High-Performance Error-Resilient Applications (20)

PPTX
A Low-Power and High-Accuracy Approximate Multiplier With Reconfigurable Trun...
PDF
IRJET - Approximate Unsigned Multiplier with Varied Error Rate
PDF
Fa35880883
PDF
IRJET - Design and Analysis of Kogge Stone Adder using Approximate Compressor
PDF
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
PDF
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
PDF
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
PDF
IRJET- Image and Signal Filtering using Fir Filter Made using Approximate Hyb...
PDF
Recent articles published in VLSI design & Communication Systems
PDF
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
PDF
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
PDF
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
PDF
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
DOCX
Dual-Quality 4:2 Compressors for Utilizing in Dynamic Accuracy Configurable M...
DOCX
Design and analysis of approximate
PDF
Approximate single precision floating point adder for low power applications
PPTX
Truncated booth multiplier design of hdl
PDF
An efficient floating point adder for low-power devices
DOCX
Design of Power and Area Efficient Approximate Multipliers
PPTX
PROJECT PPT.Wallace tree multiplier using Adder
A Low-Power and High-Accuracy Approximate Multiplier With Reconfigurable Trun...
IRJET - Approximate Unsigned Multiplier with Varied Error Rate
Fa35880883
IRJET - Design and Analysis of Kogge Stone Adder using Approximate Compressor
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
IRJET- Image and Signal Filtering using Fir Filter Made using Approximate Hyb...
Recent articles published in VLSI design & Communication Systems
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
A METHODOLOGY FOR IMPROVEMENT OF ROBA MULTIPLIER FOR ELECTRONIC APPLICATIONS
Dual-Quality 4:2 Compressors for Utilizing in Dynamic Accuracy Configurable M...
Design and analysis of approximate
Approximate single precision floating point adder for low power applications
Truncated booth multiplier design of hdl
An efficient floating point adder for low-power devices
Design of Power and Area Efficient Approximate Multipliers
PROJECT PPT.Wallace tree multiplier using Adder
Ad

Recently uploaded (20)

PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Construction Project Organization Group 2.pptx
PPTX
Safety Seminar civil to be ensured for safe working.
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PPT
introduction to datamining and warehousing
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Sustainable Sites - Green Building Construction
PPTX
Geodesy 1.pptx...............................................
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
PPT on Performance Review to get promotions
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Construction Project Organization Group 2.pptx
Safety Seminar civil to be ensured for safe working.
CH1 Production IntroductoryConcepts.pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
introduction to datamining and warehousing
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Embodied AI: Ushering in the Next Era of Intelligent Systems
Model Code of Practice - Construction Work - 21102022 .pdf
Sustainable Sites - Green Building Construction
Geodesy 1.pptx...............................................
Operating System & Kernel Study Guide-1 - converted.pdf
Automation-in-Manufacturing-Chapter-Introduction.pdf
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPT on Performance Review to get promotions
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...

FPGA-Based Multi-Level Approximate Multipliers for High-Performance Error-Resilient Applications

  • 1. FPGA-Based Multi-Level Approximate Multipliers for High-Performance Error-Resilient Applications Objective: The main objective of the paper is to approximate multipliers which are efficiently deployed on Field Programmable Gate Arrays (FPGAs) by using newly proposed approximate logic compressors at different levels of accuracy. Introduction: These days in many of the error-resilient applications such as multimedia, data mining, image processing, machine learning, etc.,precise computations are not always necessary. The computation results with some degradation of accuracy can be acceptable and meaningful enough for such applications. By taking advantage of this property, we take into consideration the trade-offs between the accuracy and electrical performances of a circuit. That is, we can sacrifice some loss of accuracy for beneficial gains of power dissipation, occupied area,and delay. In this paper the approximate 8×8 , 16×16 , and 32×32 multipliers with different accuracies will be efficiently implemented on FPGAs by using proposed approximate compressors. Exact and Approximate 4–2 Compressors: A conventional exact 4–2 compressor consists of two full adders as shown in the figure below. It has a total of 5 inputs and 3 outputs. The output sum has the same weight of 1 as the inputs while the outputs cout and carry have a weight of 2. The reduction tree of an 8×8 multiplier is illustrated in the second figure in which the dashed shapes represent half and full adders and the solid shapes are the exact 4–2 compressors. 4–2 compressors: (a) a conventional exact 4–2 compressor, (b) an approximate 4–2 compressor.
  • 2. The partial product reduction tree of an 8×8 multipliers by using conventional 4–2 compressor (solid), half-, and full-adders (dashed). In an approximate 4–2 compressor there are two outputs which can approximately count all 1’s at its four inputs itself. The block diagram of an approximate 4–2 compressor is illustrated above. These two outputs of the approximate 4–2 compressor can have the same or different weights. Compared to the conventional exact 4–2 compressor, this approximate compressor does not have the carry input in it. This shortens the delay and reduces the occupied area and power consumption drastically. Approximate 8×8 Multipliers: The proposed 8×8 approximate multiplier implementations consist of severalsteps including partial product generation, PPM reduction at the first stage by using approximate 4–2 compressors in class 2, the PPM reduction at the following stages n (n>1 ) by using approximate 4–2 compressors class 1 and the generation of the final result by using a ripple carry adder (RCA). It is to be noted that the partial product generation and the PPM reduction at the first stage are combined together to save the hardware resource, thereby assisting in power consumption. This is a considerable difference between the proposed FPGA- based implementations and ASIC-based ones. A dot diagram of the proposed 8×8 approximate multiplier has been illustrated. The different kinds of approximate compressors to compress the PPM at different reduction stages are also applied. In terms of hardware resources and accuracy,each kind of approximate compressors are used properly for the PPM reduction at different stages. Here for example, the approximate 4–2 compressors in class 2 is efficiently applied to compress the PPM at the first stage since it costs small hardware resources and less power dissipation. The approximate 4–2 compressor class 1 is appropriate with reducing the PPM height at stages n (n >1) because they show considerable higher accuracies and consume less dynamic power.
  • 3. The dot diagram of the proposed approximate 8×8 multiplier. Approximate 4–2 compressors CP3 (CP4) are used to reduce the PPM at the first reduction stage while the compressors CP1 (CP2) are utilized for the reduction stage 2. Moreover, the proposed approximate multiplier has been clustered into two parts with different accuracies. The most significant partial products are accumulated by using accurate compressors and adders while the least significant PPs are accumulated by utilizing presented approximate compressors. In this work, different configurations of approximate multipliers with various accuracies were implemented. For example, in the above figure the approximate multiplier is implemented with 5 leftmost columns of the partial products compressed by using accurate compressors and adders. An RCA is used to add two last rows of the final stage to generate the final result since the dedicated RCA is known as one of the fastest adders on FPGAs till date. Approximate 8×8 Multiplier Configurations The proposed multi-level approximate architecture in this paper improves the accuracy,shortens the delay, and reduces the power dissipation and the occupied area of the approximate multiplier. Finally, all the proposed approximate 8×8 multipliers with different accuracy configurations are summarized in in the above table. The M8_CP13_k group includes approximate 8×8 multipliers using the approximate 4–2 compressor CP3 at the first reduction stage and the compressor CP1 in the second reduction stage. k is the number of leftmost partial product columns compressed by accurate compressors,full-adders and half- adders.
  • 4. In this work, the approximate 4–2 compressor CP2 is not used to construct approximate 8×8 multipliers. Since only a small amount of approximate 4–2 compressors are used for the reduction stage 2, the reduction in dynamic power dissipation is small while the loss of accuracy could be considerable. The effectiveness of the approximate 4–2 compressor CP2 on the dynamic power reduction is evaluated on larger operand size multipliers (i.e., 16×16 , 32×32 multipliers). Conclusion: In this paper the proposed approximate 8×8 , 16×16 , and 32×32 multipliers with different accuracies were efficiently implemented on FPGAs by using proposed approximate compressors. The proposed approximate multipliers were evaluated on both circuit and application levels to demonstrate their effectiveness and applicability. This was the first work to implement the approximate multipliers and measure their dynamic power consumption on an FPGA board. Finally, the proposed multipliers in this paper were proven suitable for high-performance and low-power error-resilient applications.