SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1051
FPGA IMPLEMENTATION OF HIGH SPEED AND LOW POWER
SPECULATIVE ADDER
V.Aparajita1, N. Krishna kumari2, S. Ramesh3
1, 2 Student, Dept. of Electronics and Communication Engineering, Bapatla Engineering College,
Andhra Pradesh, India
3 Asst. Prof, Dept. of Electronics and Communication Engineering, Bapatla Engineering College,
Andhra Pradesh, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract — High speed adders are highly desirable in
present day scenario where power also plays equal role. This
paper displays carry-lookahead adder (CLA) based
configuration of the contemporary inexact-speculative
adder (ISA) which is further fine-grain pipelined that is
addition of registers along its critical path and thereby,
upgrading the process of addition by decreasing the delay of
operation and enhancing the frequency of operation. The
registers we used are nothing but D-Flip-flops which are
clock gated in order to reduce the power consumption.
Functional verification and hardware implementation for
various configurations of the suggested ISA is to be carried
out on field-programmable gate array(FPGA) platform.
The synthesis and post layout simulation of the proposed ISA
is carried out in FPGA using vivado hls for power analysis.
Implementation of pipelining has reduced the delay up to
6ns compared to non-pipelined architecture and it has also
reduced power up to 4w.
Key Words: Inexact speculative adder, Carry
lookahead adder, pipelining, Field programmable gate
array, very-large scale-integration.
1. INTRODUCTION
Speed is one of the important factor along with the
utilizing of less power for the adders in the present day
scenario rather than the exact result. For this we prefer
highly optimized adders which require less delay and low
power and this paper presents an adder of exactly this
type.
With acceptable degradation in accuracy and performance
it is possible to design high speed and low power adder
using speculation technique[4]. Accuracy is the major
compromise to be done to improve power and speed by
speculation. Thereby these adders are referred to as
Inexact speculative adders. Various adders are reported in
the references from [5]-[9] but accuracy is considered as
the major constraint in these adders and concentrated
more on improving accuracy of the results. However there
is a chance to improve the speed of the adders by retaining
a minimum error in the result. So our contributions in this
work as follows:
(1) Design of carry lookahead adder based inexact
speculative adder.
(2) Thereafter this adder is fine grain pipelined to reduce
the critical path delay and also enhancing the speed of
operation. FPGA implementation of 8, 16 and 32 bit
versions of proposed and suggested architectures are
carried out, obtained the post, place and route results and
compared. clock signal fed to various stages of the
pipelined ISA-architecture has been gated to reduce the
power consumption. Synthesis and post-layout simulation
of the clock gated ISA has been performed in FPGA using
vivado hls.
(a)
(b) (c)
Fig 1: Basic block diagram of n-bit conventional inexact-
speculation adder (ISA). (b) Gate-level circuit
representation of speculator block. (c) Digital architecture
of compensator block.
2. INEXACT SPECULATIVE ADDER
In the proposed architecture, we have segregated the n-
bit input into 4-bit blocks (i.e., the value of x = 4 in Fig. 1)
and each of these blocks is fed as operands to the x-bit
adder. Unlike the conventional ISA architecture, the adder
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1052
unit has been replaced with 4-bit CLA to further enhance
the speed of operation. Explanation of different blocks of
the adder are as follows:
a) Adder and Speculator blocks: Consider two n-bit
operands are A={A0, A1, A2,----An-1} and B={B0, B1, B2----Bn-1}.
whereas the sum, carry-in and carry-out are expressed as
S={S0, S1 ,S2----Sn-1}, Cin and Cout respectively. Speculator block
is based on CLA logic to speculate the output carry for
each 4-bit adder block. Speculation is carried out for last
two msb bits of each block. Subsequently, the input carry
for each speculator block is 0 or 1 which introduces
positive or negative errors respectively. The output carry,
which is denoted as Cs0 from each speculator block is fed
as an input carry for the adder block succeeding it. So each
4-bit adder block need not wait for the input carry from
the preceding 4- bit adder block. Instead, all such adder
blocks perform simultaneous additions on receiving input
carries from the concerned speculator blocks. Speculator
block computes carry based on the equation shown below:
Pi=Ai Bi
Gi=Ai . Bi
Ci+1=Gi + Ai . Bi
This block is situated along the critical path of ISA
architecture; however, it doesn’t produce much delay as it
computes the carry for only two bits. On the other hand,
adder block performs addition of 4-bit input blocks using
CLA logic based on
Si=Ai Bi
Here, the local sum obtained from each adder block is not
the exact output because the addition has been performed
using speculated carry inputs. Correction or Balancing of
such sum value is carried out by the compensator block.
b)Compensator Block: Fig.1(c) shows the digital
architecture for compensator block used in the ISA adder.
This block compares carry from each 4-bit adder block
with the corresponding speculated carry using a XOR gate.
Thereafter, the output from XOR gate generates an error
flag (fe) triggers the activation of one of the two
compensation techniques: error correction or reduction. If
the XOR-gate output is ‘0’ then the local sum is directly
passed to the final output. Similarly, if the XOR gate gives
‘1’ then this indicates that an error has occurred which can
be either positive or negative. A positive error indicates a
speculation of ‘0’ instead of ‘1’ and, hence, induces too low
sum and negative error indicates speculation of ‘1’ instead
of ‘0’ which induces too high sum. The components of
compensation block involved in the overall critical path of
ISA are the XOR gate, de-multiplexer and multiplexer.
3. FINE-GRAIN PIPELINED ARCHITECTURE
In the conventional ISA architecture, let us assume that the
combinational delay of 4-bit adder, speculator and
compensator blocks to be ∂4𝑏−𝑎𝑑𝑑𝑒𝑟, ∂𝑠𝑝𝑒𝑐 and ∂𝑐𝑜𝑚𝑝
respectively. In this architecture, carry in is speculated for
each 4-bit adder block and based on this adder block
calculates the local sum. Thereafter, the error speculation
is detected by comparing speculated carry in and prior
carry out from 4-bit adder. Subsequently, compensator
block performs the correction and balancing operation.
Thus, the critical path of the conventional ISA architecture
includes delays of adder of the 𝑖𝑡ℎ instant and the
speculator plus compensator delays of (𝑖+1)𝑡ℎ instant and
the equation is given as
( ) ( ) ( )
The detailed version of the critical path is given by
( ) ( )
( )
Where are the combinational
delays of logical AND, OR and XOR respectively. The
speculator, compensator, 4-bit adder and overall design is
feed forward VLSI architecture. If we carefully analyze and
pipeline these blocks then we may reduce the critical path
delay and gain fast result.
Fig 2: Pipelined VLSI architecture of the proposed ISA for
n=16bits and x=4bits with five pipeline stages.
Pipelining process here is explained using n=16bit ISA
architecture. Even the value of n increases the critical path
is unaffected because the value of x is always a 4-bit and
hence the adder, speculator and compensator blocks
remain unchanged. In the above fig 2 the proposed
architecture is replaced by the pipelined speculator
(PSPEC), pipelined compensator (PCOMP) and the
pipelined CLA (PCLA). Sub blocks PSEPEC, PCOMP, PCLA
contain the pipelined stages. Overall architecture of the
ISA adder has been designed with five pipelined stages
and there are six levels of registers included in this design
as shown in fig 2.In this case the number of pipeline stages
remains constant and on increasing the width of operands
then the bit widths of the operands, retaining the same
critical path delay.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1053
(a)
(b) (c)
Fig 3: Gate level circuit of (a) 4-bit pipelined carry
lookahead adder (b) pipelined compensator(PCOMP) (c)
pipelined speculator(PSPEC)
Fig 3 shows the gate level designs of the sub blocks and
their respective pipelined stages. From the figures we can
conclude that the critical path of the suggested pipelined
architecture lies in PCLA and it includes only one XOR and
three AND gate input delays. Therefore the equation for
delay is given by
This includes clock-q delay and the setup time required to
launch and capture flip-flops respectively. Thus maximum
clock frequency can be obtained by the inverse of the
delay.
4. EXPERMENTAL RESULT AND COMPARISION
This section presents the functional verification and board
level implementation of non-pipelined and pipelined ISA.
Subsequently the post-simulation results are compared.
a) FPGA implementation
In this work, the proposed and suggested ISA adder-
architecture has been coded in hardware descriptive
language (HDL) and then simulated as well as synthesized
in ISE 14.7 design suite. We have synthesized this
architecture for three different configurations : n = 8-bit, n
= 16-bit and n = 32-bit. After the successful syntax check
and synthesis of the design, the generated net-lists are
placed and routed (P&R) on Spartan-3E version of Xilinx
FPGA board. Then after the timing information and the
number of devices utilized are also calculated for both the
adders.
The maximum operating frequency is 127.7MHz.This
value is 69.29%, 57.4%, 56.69% better than the clock
frequencies achieved by 8-bit,16-bit,32-bit non-pipelined
adders. The comparison of exact area occupied in terms of
LUT’s and the power required of these adders are possible
by synthesizing as well as laying out FPGA for each of
these adders.
b) Power and area analysis
In the digital circuit design, pipelining is the process of
shortening the delay in critical path at the cost of area
which is predominated by the registers used to create
pipeline stages in the design. Therefore, the suggested ISA
architecture that is deep pipelined definitely requires
extra registers in comparison with the proposed non-
pipelined ones. On the other side, we have divided the
suggested ISA architecture into different stages by
pipelining it. Now, this makes our architecture suitable for
clock-gating. In the new design, we have gated the clock
signal that is fed into every stage. On doing this, the ideal
stages of our architecture can be deferred from the clock
switching which significantly reduces the power
consumption. Such gating is valid only during the
beginning and ending sessions of the addition process. On
the starting of addition, later pipeline stages (towards the
output side) of the design are ideal and these stages can be
clock gated. Unlike towards the end of addition process,
earlier stages (near the input side) of the design seem to
be ideal and are clock-gated. For example: pipeline stages
five , four, three and two are ideal while the process is
being carried out in the first stage when the addition
begins. Similarly, first stage will be ideal while rest keeps
processing data when addition is towards the completion.
However, while the adder is in-between the process of
adding continuous stream of data then there is no point of
gating the clock because all the stages are busy performing
the operations.
In order to obtain number of LUT’s required and amount
of power consumed by both the adders, this work includes
synthesis and post layout simulation results of three
configurations of both pipelined and non-pipelined adders
for the purpose of comparison.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1054
Table 1: Comparison of post P&R results obtained from FPGA implementations of 8, 16, 32-bit pipelined and non-
pipelined ISA designs.
The 32-bit pipelined ISA consumes a total power of
17.634w and it can be observed that this architecture
consumes 4.235w lesser power than the non-pipelined ISA
and this is possible only due to the implementation of
clock-gating technique and also due to pipelining the
critical path delay of 32-bit non-pipelined adder is
13.762ns where as for pipelined one it is 7.888 ns and
hence due to pipelining the delay is reduced by 6ns. The
above table shows the comparison of total area consumed
in terms of LUT’s, power consumed, critical path delay and
the maximum clock frequency of 8-bit,16-bit,32-bit non-
pipelined as well as pipelined adders. Thereby the adder
presented has area degradation in comparison with the
non-pipelined one.
5. CONCLUSION
In this paper we presented the high-speed and low- power
version of ISA design. This architecture is fine-grain
pipelined and clock-gated to reduce delay as well as to
reduce power consumption respectively. Experimental
results showed that the modified architecture operate at a
maximum frequency of 127.72MHz in FPGA. Subsequently
a 32-bit pipelined architecture consumes power of
17.634w.Thereby, such design would definitely play
significant role in the design of contemporary as well as
future electronic devices for IoE and many other
applications. However, the area issue can be resolved to
some extent by using lower technology nodes in the
design process.
6. REFERENCES
[1] Behzad Razavi, “Cognitive Radio Design Challenges
and Techniques,” IEEE Journals of Solid-State
Circuits (JSSC), vol. 45, no. 8, pp. 1542- 1553,
2010.
[2] Gyanendra Prasad Joshi, Seung Yeob Nam and
Sung Won Kim, “Cog- nitive Radio Wireless Sensor
Networks: Applications, Challenges and Research
Trends,” Sensors, vol. 13, no. 9, pp. 11196-11228,
2013.
[3] D. Blaauw et al., “IoT Design Space Challenges:
Circuits and Systems,” IEEE Symposium on VLSI
Technology (VLSI-Technology): Digest of Technical
Papers, pp. 1-2, 2014.
[4] T. Liu and S. L. Lu, “Performance Improvement
with Circuit-level Speculation,” 33rd Annual IEEE
ACM International Symposium on Mi-
croarchitecture (MICRO-33), pp. 348-355, 2000.
[5] N. Zhu, W.-L. Goh, and K.-S. Yeo, “An Enhanced Low-
power High-speed Adder For Error-tolerant
Application,” 12th International Symposium on
Integrated Circuits (ISIC), pp. 69-72, 2009.
[6] M. Weber, M. Putic, H. Zhang, J. Lach, and J. Huang,
“Balancing Adder for Error Tolerant Applications,”
IEEE International Symposium on Circuits and
Systems (ISCAS), pp. 3038-3041, 2013.
[7] N. Zhu, W.-L. Goh, G. Wang, and K.-S. Yeo,
“Enhanced Low-power High-speed Adder for
Error-tolerant Application,” IEEE International SoC
Design Conference (ISOCC), pp. 323-327, 2010.
[8] Y. Kim, Y. Zhang, and P. Li, “An Energy Efficient
Approximate Adder with Carry Skip for Error
Resilient Neuromorphic VLSI Systems,” IEEE/ACM
International Conference on Computer-Aided
Design (ICCAD), pp. 130- 137, 2013.
[9] Vincent Camus, Jeremy Schlachter and Christian
Enz, “Energy-Efficient Inexact Speculative Adder
with High Performance and Accuracy Control,” IEEE
International Symposium on Circuits and Systems
(ISCAS), pp. 45- 48, 2015.
ISA configurations 8-bit
NPLA
8-bit
PLA
16-bit
NPLA
16-bit
PLA
32-bit
NPLA
32-bit
PLA
FPGA family Spartan-3E Spartan-3E Spartan-3E Spartan-3E Spartan-3E Spartan-3E
FPGA device Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L
4-ip LUT’s 13 31 22 72 46 149
Crit.path delay(ns) 11.292 7.888 13.672 7.888 13.762 7.888
Max.clk freq (MHz) 88.592 126.77 73.141 126.77 72.66 126.77
Power
consumed(w)
5.533 4.95 10.596 8.94 21.865 17.634

More Related Content

PDF
IRJET- Asic Implementation of Efficient Error Detection for Floating Poin...
PDF
IRJET - A Speculative Approximate Adder for Error Recovery Unit
PDF
IRJET- Accuracy Configurable Adder
PDF
IRJET- A Review of Approximate Adders for Energy-Efficient Digital Signal Pro...
PDF
Implementation of Carry Skip Adder using PTL
PDF
Designing of Adders and Vedic Multiplier using Gate Diffusion Input
PDF
1.area efficient carry select adder
PDF
implementation and design of 32-bit adder
IRJET- Asic Implementation of Efficient Error Detection for Floating Poin...
IRJET - A Speculative Approximate Adder for Error Recovery Unit
IRJET- Accuracy Configurable Adder
IRJET- A Review of Approximate Adders for Energy-Efficient Digital Signal Pro...
Implementation of Carry Skip Adder using PTL
Designing of Adders and Vedic Multiplier using Gate Diffusion Input
1.area efficient carry select adder
implementation and design of 32-bit adder

What's hot (20)

PDF
HARDWARE EFFICIENT SCALING FREE VECTORING AND ROTATIONAL CORDIC FOR DSP APPLI...
PPT
Design of Low-Power High-Speed Truncation-Error-Tolerant Adder
PPTX
Design & implementation of high speed carry select adder
PDF
IRJET- An Efficient Wallace Tree Multiplier using Modified Adder
DOCX
Area–delay–power efficient carry select adder
PDF
Iaetsd 128-bit area
PDF
Implementation of Low Power and Area Efficient Carry Select Adder
PDF
J43015355
DOCX
DESIGN AND PERFORMANCE ANALYSIS OF BINARY ADDERS_edited
PDF
Design of high speed adders for efficient digital design blocks
PDF
Paper7 2
PDF
Design and Verification of Area Efficient Carry Select Adder
PDF
High Speed Carryselect Adder
PDF
Design and Implementation of Low-Power and Area-Efficient 64 bit CSLA using VHDL
PPT
32-bit unsigned multiplier by using CSLA & CLAA
PDF
Evaluation of High Speed and Low Memory Parallel Prefix Adders
PDF
A Novel Efficient VLSI Architecture for IEEE 754 Floating point multiplier us...
PDF
a technical review of efficient and high speed adders for vedic multipliers
PDF
Eq36876880
PPT
Low power & area efficient carry select adder
HARDWARE EFFICIENT SCALING FREE VECTORING AND ROTATIONAL CORDIC FOR DSP APPLI...
Design of Low-Power High-Speed Truncation-Error-Tolerant Adder
Design & implementation of high speed carry select adder
IRJET- An Efficient Wallace Tree Multiplier using Modified Adder
Area–delay–power efficient carry select adder
Iaetsd 128-bit area
Implementation of Low Power and Area Efficient Carry Select Adder
J43015355
DESIGN AND PERFORMANCE ANALYSIS OF BINARY ADDERS_edited
Design of high speed adders for efficient digital design blocks
Paper7 2
Design and Verification of Area Efficient Carry Select Adder
High Speed Carryselect Adder
Design and Implementation of Low-Power and Area-Efficient 64 bit CSLA using VHDL
32-bit unsigned multiplier by using CSLA & CLAA
Evaluation of High Speed and Low Memory Parallel Prefix Adders
A Novel Efficient VLSI Architecture for IEEE 754 Floating point multiplier us...
a technical review of efficient and high speed adders for vedic multipliers
Eq36876880
Low power & area efficient carry select adder
Ad

Similar to IRJET- FPGA Implementation of High Speed and Low Power Speculative Adder (20)

PDF
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
PDF
IRJET- FPGA Implementation of Low Power Configurable Adder for Approximate Co...
PDF
Paper id 37201517
PDF
DESIGN AND IMPLEMENTATION OF AN IMPROVED CARRY INCREMENT ADDER
PDF
Performance Analysis of Dual Gate MOSFET in Adder
PDF
Analysis of different bit carry look ahead adder using verilog code 2
PDF
Binary Adder Architectures for Cell-Based VLSI and their Synthesis
PDF
DESIGN AND IMPLEMENTATION OF AN IMPROVED CARRY INCREMENT ADDER
PDF
International Journal of Computational Engineering Research(IJCER)
PDF
128-Bit Area Efficient Reconfigurable Carry Select Adder
PDF
Design of 64 bit error tolerant adder
PDF
Design of 64 bit error tolerant adder
PDF
Design of 64 bit error tolerant adder
PDF
An Area Efficient Adder Design for VLSI Circuits
DOCX
Project report on design & implementation of high speed carry select adder
PDF
Optimized Carry Speculative Adder
PDF
COMPARISON ANALYSIS OF 16-BIT ADDERS
PDF
Designing and Characterization of koggestone, Sparse Kogge stone, Spanning tr...
PDF
High Speed and Area Efficient Booth Multiplier Using SQRT CSLA with Zero Find...
PDF
Gq3511781181
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET- FPGA Implementation of Low Power Configurable Adder for Approximate Co...
Paper id 37201517
DESIGN AND IMPLEMENTATION OF AN IMPROVED CARRY INCREMENT ADDER
Performance Analysis of Dual Gate MOSFET in Adder
Analysis of different bit carry look ahead adder using verilog code 2
Binary Adder Architectures for Cell-Based VLSI and their Synthesis
DESIGN AND IMPLEMENTATION OF AN IMPROVED CARRY INCREMENT ADDER
International Journal of Computational Engineering Research(IJCER)
128-Bit Area Efficient Reconfigurable Carry Select Adder
Design of 64 bit error tolerant adder
Design of 64 bit error tolerant adder
Design of 64 bit error tolerant adder
An Area Efficient Adder Design for VLSI Circuits
Project report on design & implementation of high speed carry select adder
Optimized Carry Speculative Adder
COMPARISON ANALYSIS OF 16-BIT ADDERS
Designing and Characterization of koggestone, Sparse Kogge stone, Spanning tr...
High Speed and Area Efficient Booth Multiplier Using SQRT CSLA with Zero Find...
Gq3511781181
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...

Recently uploaded (20)

PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Digital Logic Computer Design lecture notes
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
DOCX
573137875-Attendance-Management-System-original
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
Lecture Notes Electrical Wiring System Components
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
Sustainable Sites - Green Building Construction
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
Geodesy 1.pptx...............................................
PPTX
additive manufacturing of ss316l using mig welding
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
UNIT 4 Total Quality Management .pptx
PDF
composite construction of structures.pdf
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PPT
Mechanical Engineering MATERIALS Selection
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Digital Logic Computer Design lecture notes
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
R24 SURVEYING LAB MANUAL for civil enggi
CYBER-CRIMES AND SECURITY A guide to understanding
573137875-Attendance-Management-System-original
Automation-in-Manufacturing-Chapter-Introduction.pdf
Lecture Notes Electrical Wiring System Components
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Sustainable Sites - Green Building Construction
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Geodesy 1.pptx...............................................
additive manufacturing of ss316l using mig welding
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
UNIT 4 Total Quality Management .pptx
composite construction of structures.pdf
bas. eng. economics group 4 presentation 1.pptx
Internet of Things (IOT) - A guide to understanding
Mechanical Engineering MATERIALS Selection

IRJET- FPGA Implementation of High Speed and Low Power Speculative Adder

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1051 FPGA IMPLEMENTATION OF HIGH SPEED AND LOW POWER SPECULATIVE ADDER V.Aparajita1, N. Krishna kumari2, S. Ramesh3 1, 2 Student, Dept. of Electronics and Communication Engineering, Bapatla Engineering College, Andhra Pradesh, India 3 Asst. Prof, Dept. of Electronics and Communication Engineering, Bapatla Engineering College, Andhra Pradesh, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract — High speed adders are highly desirable in present day scenario where power also plays equal role. This paper displays carry-lookahead adder (CLA) based configuration of the contemporary inexact-speculative adder (ISA) which is further fine-grain pipelined that is addition of registers along its critical path and thereby, upgrading the process of addition by decreasing the delay of operation and enhancing the frequency of operation. The registers we used are nothing but D-Flip-flops which are clock gated in order to reduce the power consumption. Functional verification and hardware implementation for various configurations of the suggested ISA is to be carried out on field-programmable gate array(FPGA) platform. The synthesis and post layout simulation of the proposed ISA is carried out in FPGA using vivado hls for power analysis. Implementation of pipelining has reduced the delay up to 6ns compared to non-pipelined architecture and it has also reduced power up to 4w. Key Words: Inexact speculative adder, Carry lookahead adder, pipelining, Field programmable gate array, very-large scale-integration. 1. INTRODUCTION Speed is one of the important factor along with the utilizing of less power for the adders in the present day scenario rather than the exact result. For this we prefer highly optimized adders which require less delay and low power and this paper presents an adder of exactly this type. With acceptable degradation in accuracy and performance it is possible to design high speed and low power adder using speculation technique[4]. Accuracy is the major compromise to be done to improve power and speed by speculation. Thereby these adders are referred to as Inexact speculative adders. Various adders are reported in the references from [5]-[9] but accuracy is considered as the major constraint in these adders and concentrated more on improving accuracy of the results. However there is a chance to improve the speed of the adders by retaining a minimum error in the result. So our contributions in this work as follows: (1) Design of carry lookahead adder based inexact speculative adder. (2) Thereafter this adder is fine grain pipelined to reduce the critical path delay and also enhancing the speed of operation. FPGA implementation of 8, 16 and 32 bit versions of proposed and suggested architectures are carried out, obtained the post, place and route results and compared. clock signal fed to various stages of the pipelined ISA-architecture has been gated to reduce the power consumption. Synthesis and post-layout simulation of the clock gated ISA has been performed in FPGA using vivado hls. (a) (b) (c) Fig 1: Basic block diagram of n-bit conventional inexact- speculation adder (ISA). (b) Gate-level circuit representation of speculator block. (c) Digital architecture of compensator block. 2. INEXACT SPECULATIVE ADDER In the proposed architecture, we have segregated the n- bit input into 4-bit blocks (i.e., the value of x = 4 in Fig. 1) and each of these blocks is fed as operands to the x-bit adder. Unlike the conventional ISA architecture, the adder
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1052 unit has been replaced with 4-bit CLA to further enhance the speed of operation. Explanation of different blocks of the adder are as follows: a) Adder and Speculator blocks: Consider two n-bit operands are A={A0, A1, A2,----An-1} and B={B0, B1, B2----Bn-1}. whereas the sum, carry-in and carry-out are expressed as S={S0, S1 ,S2----Sn-1}, Cin and Cout respectively. Speculator block is based on CLA logic to speculate the output carry for each 4-bit adder block. Speculation is carried out for last two msb bits of each block. Subsequently, the input carry for each speculator block is 0 or 1 which introduces positive or negative errors respectively. The output carry, which is denoted as Cs0 from each speculator block is fed as an input carry for the adder block succeeding it. So each 4-bit adder block need not wait for the input carry from the preceding 4- bit adder block. Instead, all such adder blocks perform simultaneous additions on receiving input carries from the concerned speculator blocks. Speculator block computes carry based on the equation shown below: Pi=Ai Bi Gi=Ai . Bi Ci+1=Gi + Ai . Bi This block is situated along the critical path of ISA architecture; however, it doesn’t produce much delay as it computes the carry for only two bits. On the other hand, adder block performs addition of 4-bit input blocks using CLA logic based on Si=Ai Bi Here, the local sum obtained from each adder block is not the exact output because the addition has been performed using speculated carry inputs. Correction or Balancing of such sum value is carried out by the compensator block. b)Compensator Block: Fig.1(c) shows the digital architecture for compensator block used in the ISA adder. This block compares carry from each 4-bit adder block with the corresponding speculated carry using a XOR gate. Thereafter, the output from XOR gate generates an error flag (fe) triggers the activation of one of the two compensation techniques: error correction or reduction. If the XOR-gate output is ‘0’ then the local sum is directly passed to the final output. Similarly, if the XOR gate gives ‘1’ then this indicates that an error has occurred which can be either positive or negative. A positive error indicates a speculation of ‘0’ instead of ‘1’ and, hence, induces too low sum and negative error indicates speculation of ‘1’ instead of ‘0’ which induces too high sum. The components of compensation block involved in the overall critical path of ISA are the XOR gate, de-multiplexer and multiplexer. 3. FINE-GRAIN PIPELINED ARCHITECTURE In the conventional ISA architecture, let us assume that the combinational delay of 4-bit adder, speculator and compensator blocks to be ∂4𝑏−𝑎𝑑𝑑𝑒𝑟, ∂𝑠𝑝𝑒𝑐 and ∂𝑐𝑜𝑚𝑝 respectively. In this architecture, carry in is speculated for each 4-bit adder block and based on this adder block calculates the local sum. Thereafter, the error speculation is detected by comparing speculated carry in and prior carry out from 4-bit adder. Subsequently, compensator block performs the correction and balancing operation. Thus, the critical path of the conventional ISA architecture includes delays of adder of the 𝑖𝑡ℎ instant and the speculator plus compensator delays of (𝑖+1)𝑡ℎ instant and the equation is given as ( ) ( ) ( ) The detailed version of the critical path is given by ( ) ( ) ( ) Where are the combinational delays of logical AND, OR and XOR respectively. The speculator, compensator, 4-bit adder and overall design is feed forward VLSI architecture. If we carefully analyze and pipeline these blocks then we may reduce the critical path delay and gain fast result. Fig 2: Pipelined VLSI architecture of the proposed ISA for n=16bits and x=4bits with five pipeline stages. Pipelining process here is explained using n=16bit ISA architecture. Even the value of n increases the critical path is unaffected because the value of x is always a 4-bit and hence the adder, speculator and compensator blocks remain unchanged. In the above fig 2 the proposed architecture is replaced by the pipelined speculator (PSPEC), pipelined compensator (PCOMP) and the pipelined CLA (PCLA). Sub blocks PSEPEC, PCOMP, PCLA contain the pipelined stages. Overall architecture of the ISA adder has been designed with five pipelined stages and there are six levels of registers included in this design as shown in fig 2.In this case the number of pipeline stages remains constant and on increasing the width of operands then the bit widths of the operands, retaining the same critical path delay.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1053 (a) (b) (c) Fig 3: Gate level circuit of (a) 4-bit pipelined carry lookahead adder (b) pipelined compensator(PCOMP) (c) pipelined speculator(PSPEC) Fig 3 shows the gate level designs of the sub blocks and their respective pipelined stages. From the figures we can conclude that the critical path of the suggested pipelined architecture lies in PCLA and it includes only one XOR and three AND gate input delays. Therefore the equation for delay is given by This includes clock-q delay and the setup time required to launch and capture flip-flops respectively. Thus maximum clock frequency can be obtained by the inverse of the delay. 4. EXPERMENTAL RESULT AND COMPARISION This section presents the functional verification and board level implementation of non-pipelined and pipelined ISA. Subsequently the post-simulation results are compared. a) FPGA implementation In this work, the proposed and suggested ISA adder- architecture has been coded in hardware descriptive language (HDL) and then simulated as well as synthesized in ISE 14.7 design suite. We have synthesized this architecture for three different configurations : n = 8-bit, n = 16-bit and n = 32-bit. After the successful syntax check and synthesis of the design, the generated net-lists are placed and routed (P&R) on Spartan-3E version of Xilinx FPGA board. Then after the timing information and the number of devices utilized are also calculated for both the adders. The maximum operating frequency is 127.7MHz.This value is 69.29%, 57.4%, 56.69% better than the clock frequencies achieved by 8-bit,16-bit,32-bit non-pipelined adders. The comparison of exact area occupied in terms of LUT’s and the power required of these adders are possible by synthesizing as well as laying out FPGA for each of these adders. b) Power and area analysis In the digital circuit design, pipelining is the process of shortening the delay in critical path at the cost of area which is predominated by the registers used to create pipeline stages in the design. Therefore, the suggested ISA architecture that is deep pipelined definitely requires extra registers in comparison with the proposed non- pipelined ones. On the other side, we have divided the suggested ISA architecture into different stages by pipelining it. Now, this makes our architecture suitable for clock-gating. In the new design, we have gated the clock signal that is fed into every stage. On doing this, the ideal stages of our architecture can be deferred from the clock switching which significantly reduces the power consumption. Such gating is valid only during the beginning and ending sessions of the addition process. On the starting of addition, later pipeline stages (towards the output side) of the design are ideal and these stages can be clock gated. Unlike towards the end of addition process, earlier stages (near the input side) of the design seem to be ideal and are clock-gated. For example: pipeline stages five , four, three and two are ideal while the process is being carried out in the first stage when the addition begins. Similarly, first stage will be ideal while rest keeps processing data when addition is towards the completion. However, while the adder is in-between the process of adding continuous stream of data then there is no point of gating the clock because all the stages are busy performing the operations. In order to obtain number of LUT’s required and amount of power consumed by both the adders, this work includes synthesis and post layout simulation results of three configurations of both pipelined and non-pipelined adders for the purpose of comparison.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1054 Table 1: Comparison of post P&R results obtained from FPGA implementations of 8, 16, 32-bit pipelined and non- pipelined ISA designs. The 32-bit pipelined ISA consumes a total power of 17.634w and it can be observed that this architecture consumes 4.235w lesser power than the non-pipelined ISA and this is possible only due to the implementation of clock-gating technique and also due to pipelining the critical path delay of 32-bit non-pipelined adder is 13.762ns where as for pipelined one it is 7.888 ns and hence due to pipelining the delay is reduced by 6ns. The above table shows the comparison of total area consumed in terms of LUT’s, power consumed, critical path delay and the maximum clock frequency of 8-bit,16-bit,32-bit non- pipelined as well as pipelined adders. Thereby the adder presented has area degradation in comparison with the non-pipelined one. 5. CONCLUSION In this paper we presented the high-speed and low- power version of ISA design. This architecture is fine-grain pipelined and clock-gated to reduce delay as well as to reduce power consumption respectively. Experimental results showed that the modified architecture operate at a maximum frequency of 127.72MHz in FPGA. Subsequently a 32-bit pipelined architecture consumes power of 17.634w.Thereby, such design would definitely play significant role in the design of contemporary as well as future electronic devices for IoE and many other applications. However, the area issue can be resolved to some extent by using lower technology nodes in the design process. 6. REFERENCES [1] Behzad Razavi, “Cognitive Radio Design Challenges and Techniques,” IEEE Journals of Solid-State Circuits (JSSC), vol. 45, no. 8, pp. 1542- 1553, 2010. [2] Gyanendra Prasad Joshi, Seung Yeob Nam and Sung Won Kim, “Cog- nitive Radio Wireless Sensor Networks: Applications, Challenges and Research Trends,” Sensors, vol. 13, no. 9, pp. 11196-11228, 2013. [3] D. Blaauw et al., “IoT Design Space Challenges: Circuits and Systems,” IEEE Symposium on VLSI Technology (VLSI-Technology): Digest of Technical Papers, pp. 1-2, 2014. [4] T. Liu and S. L. Lu, “Performance Improvement with Circuit-level Speculation,” 33rd Annual IEEE ACM International Symposium on Mi- croarchitecture (MICRO-33), pp. 348-355, 2000. [5] N. Zhu, W.-L. Goh, and K.-S. Yeo, “An Enhanced Low- power High-speed Adder For Error-tolerant Application,” 12th International Symposium on Integrated Circuits (ISIC), pp. 69-72, 2009. [6] M. Weber, M. Putic, H. Zhang, J. Lach, and J. Huang, “Balancing Adder for Error Tolerant Applications,” IEEE International Symposium on Circuits and Systems (ISCAS), pp. 3038-3041, 2013. [7] N. Zhu, W.-L. Goh, G. Wang, and K.-S. Yeo, “Enhanced Low-power High-speed Adder for Error-tolerant Application,” IEEE International SoC Design Conference (ISOCC), pp. 323-327, 2010. [8] Y. Kim, Y. Zhang, and P. Li, “An Energy Efficient Approximate Adder with Carry Skip for Error Resilient Neuromorphic VLSI Systems,” IEEE/ACM International Conference on Computer-Aided Design (ICCAD), pp. 130- 137, 2013. [9] Vincent Camus, Jeremy Schlachter and Christian Enz, “Energy-Efficient Inexact Speculative Adder with High Performance and Accuracy Control,” IEEE International Symposium on Circuits and Systems (ISCAS), pp. 45- 48, 2015. ISA configurations 8-bit NPLA 8-bit PLA 16-bit NPLA 16-bit PLA 32-bit NPLA 32-bit PLA FPGA family Spartan-3E Spartan-3E Spartan-3E Spartan-3E Spartan-3E Spartan-3E FPGA device Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L Xc-7a100tcsg324-1L 4-ip LUT’s 13 31 22 72 46 149 Crit.path delay(ns) 11.292 7.888 13.672 7.888 13.762 7.888 Max.clk freq (MHz) 88.592 126.77 73.141 126.77 72.66 126.77 Power consumed(w) 5.533 4.95 10.596 8.94 21.865 17.634