SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 691
SRAM BASED IN-MEMORY MATRIX VECTOR MULTIPLIER
K.G.Venkata krishna 1, P. Hema naga sai surya kumar 2, S. Meghana 3, A. Reddy prasad reddy 4,
G. Muni jayanth 5
1 Assistant Professor, Department of Electronics and Communication Engineering, Krishna University College of
Engineering and Technology Krishna University, Machilipatnam Andhra Pradesh, India. .
2 U.G Student, Department of Electronics and Communication Engineering, Krishna University, Machilipatnam,
Andhra Pradesh, India.
3 U.G Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam
Andhra Pradesh, India.
4 U.G Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam
Andhra Pradesh, India.
5 U.G Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam
Andhra Pradesh, India.
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - The weights stored in the SRAM are turned into
proportional voltages using a D/A converter, which ishowthe
SRAM-based matrix-vector multiplier for in-memory
computation functions. These voltages are subsequently
multiplied by a switched-capacitorstageusinganm-bit digital
input activation. Finally, charge sharing is used to gather the
output voltages associated with the various multiplication
outcomes along one column.
The needed circuit size, calculation time, and power
consumption grow linearly withthespecifiedarchitecture. For
the energy usage in switches and capacitors, analytical
formulae are provided. Additionally, the effect of
manufacturing mismatch on the precision of analogue
computing is looked at.
Key Words: Analog Computation,HardwareAccelerator,
In-Memory Computation, SRAM, DRAM
1.INTRODUCTION
During computations, a lot of data is sent back and forth
between the physically distinct memory and processorunits
of standard Von-Neumann computing systems. It is
necessary to reevaluate both the well-established charge-
based memory technologies, such as SRAM, DRAM, and
Flash, as well as the emerging resistance-based nonvolatile
memory technologies in order to get around the limitations
of the traditional Von-Neumann-based architectures, which
enforce an assertive separation of the processing unit and
the memory subsystem.
It is becoming more and more obvious that switching to
computing architectures with co-located logic and memory
is necessary for application domains like artificial
intelligence (AI). IMC, a unique non-Von Neumann
computing paradigm, uses the physical characteristics and
dynamical state of charging resistance-based memory
devices to conduct certain computations directly in the
memory. An IMC-based system may be used to accomplish a
variety of computing tasks, including logical operations,
arithmetic operations, and even certain machine learning
activities.
1.1 Motivation
The need for low-power integrated circuits has greatly
increased over the past several years as a result of the
increasing expansion of battery-operated devices including
wireless communication units, portable entertainment
devices, and implementable bio-medical chips. SRAM will
eventually account for more than 60% of SoCs, predicts the
International Technology Roadmap for Semiconductors
(ITRS). The problem of consuming power and space is
significantly solved when the technology scales by greatly
increasing the transistor density in the SRAM units.
1.2 Objective
The in-memory matrix-vector multiplier built on SRAM
has as its primary goal a reduction in the amount of time
required to complete computations. Performance may be
improved and power consumption can be decreased by
utilising SRAM technology.
2. LITERATURE SURVEY
2.1 Static Random Access Memory.
In SoCs, embedded SRAMs may take up the bulk of the chip
space. Modern scaled-downtechnologies'increasingprocess
spreads and non-catastrophic defect-relatedvulnerability to
external factors mightjeopardiseSRAMcells'stability,which
is measured by their low Static Noise Margin (SNM). In a cell
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 692
whose SNM is sufficiently tiny that it might mistakenly flip
under the worst operating conditions, a Stability Fault (SF)
can occur. The study was done on a thorough SRAM SNM
sensitivity analysis and it pinpointedthemaincausesofpoor
SNM. A weak Cell Fault Model was presented based on the
findings, which may be used in fault simulations to simulate
an SRAM cell with a weakened SNM. The SNM of the freshly
revised load-less 4T SRAM cell was also given an analytical
expression. Several sorts of flaws in the cell's pull-up route
may go undetected while reading a 6T SRAM cell with bit
lines recharged to VDD. These flaws may result in the SFs.
Two fully working SRAMtest chips—anasynchronousSRAM
(CMOS 0.18 m technology) and a synchronous SRAM (CMOS
0.13 m technology)—are created for the verificationofthese
methods. This approach offers better fault coverage and
flexibility than the DRT, shorter test times, and no high-
temperature needs. Regular SRAM March Tests have been
demonstrated to have a very low detection sensitivity for
SRAM cells with possible SFs. The pass/fail threshold's
programmabilityenablestrackingofprocesschangesand/or
changes to the quality standards without the need for post-
silicon design updates.
Fig-1: The block diagram for SRAM.
2.2 SRAM Block Structure
An example of the fundamental SRAM block structure is
shown in the above graphic.A wordlinefromWL0-WLN-1is
chosen by a row decoder that is gated by the timing block
after decoding the X row address bits. An additional Z-
decoder activates the accessed page in the case of an SRAM
array with N rows and M bits set up in a page-like fashion.
Word- or bit-oriented memories are also possible. Each
address in a bit-oriented memory may access a single bit.
2.3 SRAM Cell
The essential elements of anySRAMusedtostorebinarydata
are memory cells. Two cross-coupled inverters that create a
latch and access transistors make up a standard SRAM cell.
Access transistors provide for read-only and write-only cell
access as well as cell isolationin the unaccessedstate.Aslong
as the cell is powered, an SRAM cell must have non-
destructive read access, writecapability, and unlimitedstore
(or data retention) duration. Memory cells are organized
hierarchically into cores, which may then be further
subdivided into blocks and arrays based on the system
performance and power needs. A resistive load four-
transistor (4T) SRAM cell, a six-transistor (6T) CMOS SRAM
cell, and a load-less 4T SRAM cell are three of the most
contemporary SRAM cells. A smaller cell increases the
amount of bits per unit area and lowers the cost per bit.
Because the related capacitances are less with smaller cells,
speed and power consumption can be indirectly improved.
Fig-2: SRAM Cell.
2.4 Designed by 8T-SRAM as ADOT
The schematic for a typical 8T bit-cell may be found here. A
decoupled read port is made up of two extra transistors in
addition to the well-known 6T-SRAM bit-cell. The write
word-line (WWL) must be enabled and the write BLs
(WBLs/WBLBs) must be driventogroundorVDD,depending
on the bit that has to be saved. The read BL (RBL) must be
recharged to VDD and the read WL (RWL) must be activated
in order to read a value from the cell. Keep in mind that the
source-line (SL) is grounded
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 693
Fig-3: Computing Dot-Product with 4-Bitweight using an
8T-SRAM Memory Array.
3. PROPOSED SYSTEM
Demand for fast speed, low power, and low noise systems is
quite strong. Static Random-Access memory (SRAM) can be
utilized for several purposes. The dominant matrix-vector
operations, accordingtotheideaofin-memorycomputingfor
neural network applications, are carried out in the memory
itself. The precision of analogue MAC operationsisaproblem
for in-memory computing. By running MAC operations on a
regular SRAM, the accuracy barrier is overcome. The initial
step in the strategy is to achieve linearly scalable computing
accuracy in terms of time, power, and area.
Fig-4: Block diagram of the proposed system.
3.1 Transmission Gate
A transmission gate (TG) is an analogue gate, similar to a
relay, that may be controlled by a control signal with nearly
any voltage potential to conduct or block current in either
direction. It is a CMOS-based switch in which PMOS
transmits a strong 1 but a poor 0, and NMOS transmits a
powerful 0 but a weak 1. Both NMOS and PMOS function at
the same time. Two field-effect transistors (FET) make up a
transmission gate, however unlike conventional discrete
field-effect transistors, the substrate terminal (bulk) is not
internally linked to the source terminal in Fig. 3.2. The drain
and source terminals of the two transistors, an n-channel
MOSFET and a p-channel MOSFET, are linked together to
form a parallel connection. A NOT gate (inverter) links their
gate terminals together to produce the control terminal.
Fig-5: Transmission gate.
3.2 Switched Capacitor
A switched capacitor (SC) is an electrical circuit that carries
charges into and out of capacitors in responsetotheopening
and closing of electronic switches. The switches are often
controlled by non-overlapping clock signals so that not all
switches close at once. Switched-capacitor filters are those
that use these components as opposed to exact resistorsand
rely solely on the ratios between capacitances and the
switching frequency. As a result, they are far more
appropriate for use in integrated circuits than precisely
defined resistors and capacitors, which are more expensive
to build.
Fig-6: Circuit with switched capacitors.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 694
SC circuits are generally constructed using the
complementary CMOS (CMOS) process and implemented
utilising metal oxide semiconductor (MOS) technology,
including MOS-capacitors and MOS field-effect transistor
(MOSFET) switches. Pulse code modulation (PCM) codec-
filters, analogue to digital converter (ADC) chips, mixed
signal integrated circuits, and PCM digital telephony are a
few examples of common uses for MOS SC circuits.
Fig-7: Capacitor resistor with switch.
Fig-8: Circuit diagram for a switched leave capacitor.
3.3 Analog Multiplication.
The voltage that is proportionate to the weight must thenbe
analogly multiplied with the input as the following step.
Similar to the weight Wn, it is expected that the input Xn is
represented in SMR as a nx-bit fixed-point number. So, by
performing an XOR operation between the respective signs
of the weight (bn sign) and input (i n sign), it is possible to
determine the sign of the multiplicationresultSnresultright
away.
As a consequence, the Sn result may be used to determine
the Vpre recharge voltage. Figure displays an example of a
transistor-level implementation of an SRAM-based 3-bit
signed IMCU and the related circuit implementation. The
recharge voltage selection step and the corresponding
circuitry can be skipped in the event of an unsigned
multiplication. A multibit fixed-point multiplication of an
input Xin by a weight Wn can also be restated as a sum of nx
binary products if the distributive law is applied.
As a result, while processing each bit of the input
individually, the multiplicationmaybedoneconsecutivelyin
nx multiply and add stages. The best way to accomplish this
in hardware is to change the control signals on the switches
of the MSB capacitor Cnw, which is charged to Vw, n at the
ncyc, w-th cycle.
Despite having many similarities to operational amplifiers,
analogue multiplier circuits are far more prone to noise and
offset voltage-related issues since these mistakes can
compound. Phase-related issues may be very complicated
when working with high-frequency transmissions. Wide-
range general-purpose analogue multipliers are far more
difficult to manufacture than operational amplifiers, and
they are frequently made utilizing specialized technologies
and laser trimming, just as high-performance amplifierslike
instrumentation amplifiers. Since they are quite expensive,
they are often only employed in circuits where they are
absolutely necessary.
Fig-9: IMCU implementation at the transistor level using
SRAM cells.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 695
4. SOFTWARE REQUIREMENTS
A set of tools for designing integrated circuits is called
Tanner EDA. With these tools,youmayenterschematics, run
SPICE simulations, create physical designs (such chip
layouts), verify design rules (DRC), and do layout versus
schematic (LVS) comparisons.
Fig:-10: The tool's name.
4.1 Design Tools
Three different tools are
 S-edit
 T-SPICE L- edit
 A schematic capture tools
 the SPICE simulation engine integrated with S-
edit - the physical design tool
5. RESULTS
Fig-11: Schematic view
The schematic perspective is depicted in the above diagram.
They have revealed An Sram's internal relationships as well
as its structural details.
Fig-12: Output waveforms.
The SRAM's waveforms are seen in the above graphic. When
we divide a wave into its many parameters.
Fig-13: Power and Time Results
The effect of weight and input quantizationonthemaximum
power, average power, and multiply operation time. Due to
the fact that every extra weight bit requires a new set of
capacitors, switches, and clock cycles, delay and power
consumption both rise linearly. As a result, their product,
which is energy consumption, exhibitsa squaredependency.
Nevertheless, for the input bits nx, the scaling vs energy
stays perfectly linear since pipelining simply necessitates
three more cycles of operation for the circuit, with no
additional hardware needed inside the IMCU.
3. CONCLUSIONS
The idea of in-memory computing for neural network
applications has been motivated by the time and energy
costs involved with data transportation. This method makes
use of specific physical characteristics of memory
technologies to conduct the dominant matrix-vector
operations in-place, or in the memory itself. Our method is
the first to achieve computing accuracythatgrowslinearlyin
time, power, and area, despite the fact that there are several
SRAM-based matrix-vector multiplication engines in the
literature.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072
© 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 696
The precision of the analogue MAC operations is the
fundamental obstacletoin-memorycomputing.Thearea and
power needed can be decreased by switching to 14 nm
technology from 45 nm technology.
The SRAM-based multibit in-memory matrix vector
multiplier (IMMVM), which has the potential todramatically
increase the speed and power efficiency of a variety of
computing workloads, is a promising technology. Matrix-
vector multiplication (MVM) may be carried out by the
IMMVM architecture directly in thememoryarray,obviating
the requirement to transfer data back and forth between
memory and the processing unit. This can increase overall
system effectiveness and lessen the data flow bottleneck.
REFERENCES
[1] 1.B. Keeth and R. J. Baker, DRAM Circuit Design: A
Tutorial, 1st ed. Hoboken, 9NJ, USA: Wiley, 2000.
[2] P. F. Ferguson, X. Haurie, and G. C. Temes, “A highly
linear low-power 10-bit DAC for GSM,” in Proc. IEEE
Custom Integer. Circuits Conf., May 2000, pp. 261–264.
[3] M. Le Gallo et al., “Mixed-precision in-memory
computing,” NatureElectron., vol. 1, no. 4, pp. 246–253,
Apr. 2018.
[4] W. C. Jeong et al., “True 7 nm platform technology
featuring smallest FinFET and smallest SRAM cell by
EUV, special constructs and 3rdgeneration single
diffusion break,” in Proc. IEEE Symp. VLSI Technol., Jun.
2018.
BIOGRAPHIES
K.G.VENKATA KRISHNA, Assistant
Professor, Krishna University
College of Engineering and
Technology Krishna University,
Machilipatnam.
PARASA HEMA NAGA SAI SURYA
KUMAR, Student of Department of
Electronics and Communication
Engineering, Krishna University,
Machilipatnam.
SARIKOKKU MEGHANA, Student
of Department of Electronics and
Communication Engineering,
Krishna University,
Machilipatnam.
ATURI REDDY PRASAD REDDY,
Student of Department of
Electronics and Communication
Engineering, Krishna University,
Machilipatnam.
GODUGUCHINTHA MUNI
JAYANTH, Student of Department
of Electronics and Communication
Engineering, Krishna University,
Machilipatnam.

More Related Content

PDF
Design and Simulation Low power SRAM Circuits
PDF
CELL STABILITY ANALYSIS OF CONVENTIONAL 6T DYNAMIC 8T SRAM CELL IN 45NM TECHN...
PDF
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
PDF
IRJET- Comparative Analysis of High Speed SRAM Cell for 90nm CMOS Technology
PDF
IRJET- Deisgn of Low Power 16x16 Sram with Adiabatic Logic
PDF
Implementation of an Efficient SRAM for Ultra-Low Voltage Application Based o...
PDF
Implementation of an Efficient SRAM for Ultra-Low Voltage Application Based o...
PDF
Fj3110731078
Design and Simulation Low power SRAM Circuits
CELL STABILITY ANALYSIS OF CONVENTIONAL 6T DYNAMIC 8T SRAM CELL IN 45NM TECHN...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IRJET- Comparative Analysis of High Speed SRAM Cell for 90nm CMOS Technology
IRJET- Deisgn of Low Power 16x16 Sram with Adiabatic Logic
Implementation of an Efficient SRAM for Ultra-Low Voltage Application Based o...
Implementation of an Efficient SRAM for Ultra-Low Voltage Application Based o...
Fj3110731078

Similar to SRAM BASED IN-MEMORY MATRIX VECTOR MULTIPLIER (20)

PDF
Design of Low Power High Density SRAM Bit Cell
PDF
Study and Analysis of Low Power SRAM Memory Array at nano-scaled Technology
PDF
Using CMOS Sub-Micron Technology VLSI Implementation of Low Power, High Spee...
PDF
Using CMOS Sub-Micron Technology VLSI Implementation of Low Power, High Speed...
PDF
Lecture10 sram2
PDF
IRJET- Low Voltage High Speed 8T SRAM Cell for Ultra-Low Power Applications
PDF
MID_TERM_PPT[1].pdf128228282828828282828282
DOCX
sram bit cell ssram bit cellssram bit cellssram bit cell
PPTX
Semi conductor Memories in VLSI design and testing
PDF
Kj2417641769
PDF
Static-Noise-Margin Analysis of Modified 6T SRAM Cell during Read Operation
PDF
Lecture14
PPTX
SRAM- Ultra low voltage operation
PDF
Design and Performance Evaluation of a 64-bit SRAM Memory Array Utilizing Mod...
PDF
IRJET- Design of Energy Efficient 8T SRAM Cell at 90nm Technology
PDF
Mu2421122117
PDF
PDF
Energy optimization of 6T SRAM cell using low-voltage and high-performance in...
PDF
A Comparitive Analysis of Improved 6t Sram Cell With Different Sram Cell
PDF
Design & Implementation of Subthreshold Memory Cell design based on the prima...
Design of Low Power High Density SRAM Bit Cell
Study and Analysis of Low Power SRAM Memory Array at nano-scaled Technology
Using CMOS Sub-Micron Technology VLSI Implementation of Low Power, High Spee...
Using CMOS Sub-Micron Technology VLSI Implementation of Low Power, High Speed...
Lecture10 sram2
IRJET- Low Voltage High Speed 8T SRAM Cell for Ultra-Low Power Applications
MID_TERM_PPT[1].pdf128228282828828282828282
sram bit cell ssram bit cellssram bit cellssram bit cell
Semi conductor Memories in VLSI design and testing
Kj2417641769
Static-Noise-Margin Analysis of Modified 6T SRAM Cell during Read Operation
Lecture14
SRAM- Ultra low voltage operation
Design and Performance Evaluation of a 64-bit SRAM Memory Array Utilizing Mod...
IRJET- Design of Energy Efficient 8T SRAM Cell at 90nm Technology
Mu2421122117
Energy optimization of 6T SRAM cell using low-voltage and high-performance in...
A Comparitive Analysis of Improved 6t Sram Cell With Different Sram Cell
Design & Implementation of Subthreshold Memory Cell design based on the prima...
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Ad

Recently uploaded (20)

PPTX
Safety Seminar civil to be ensured for safe working.
PDF
PPT on Performance Review to get promotions
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
UNIT 4 Total Quality Management .pptx
PDF
Digital Logic Computer Design lecture notes
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PDF
Well-logging-methods_new................
DOCX
573137875-Attendance-Management-System-original
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPT
Project quality management in manufacturing
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
additive manufacturing of ss316l using mig welding
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
Safety Seminar civil to be ensured for safe working.
PPT on Performance Review to get promotions
CYBER-CRIMES AND SECURITY A guide to understanding
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Lecture Notes Electrical Wiring System Components
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
UNIT 4 Total Quality Management .pptx
Digital Logic Computer Design lecture notes
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Well-logging-methods_new................
573137875-Attendance-Management-System-original
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Project quality management in manufacturing
UNIT-1 - COAL BASED THERMAL POWER PLANTS
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Internet of Things (IOT) - A guide to understanding
additive manufacturing of ss316l using mig welding
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
bas. eng. economics group 4 presentation 1.pptx

SRAM BASED IN-MEMORY MATRIX VECTOR MULTIPLIER

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 691 SRAM BASED IN-MEMORY MATRIX VECTOR MULTIPLIER K.G.Venkata krishna 1, P. Hema naga sai surya kumar 2, S. Meghana 3, A. Reddy prasad reddy 4, G. Muni jayanth 5 1 Assistant Professor, Department of Electronics and Communication Engineering, Krishna University College of Engineering and Technology Krishna University, Machilipatnam Andhra Pradesh, India. . 2 U.G Student, Department of Electronics and Communication Engineering, Krishna University, Machilipatnam, Andhra Pradesh, India. 3 U.G Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam Andhra Pradesh, India. 4 U.G Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam Andhra Pradesh, India. 5 U.G Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam Andhra Pradesh, India. ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - The weights stored in the SRAM are turned into proportional voltages using a D/A converter, which ishowthe SRAM-based matrix-vector multiplier for in-memory computation functions. These voltages are subsequently multiplied by a switched-capacitorstageusinganm-bit digital input activation. Finally, charge sharing is used to gather the output voltages associated with the various multiplication outcomes along one column. The needed circuit size, calculation time, and power consumption grow linearly withthespecifiedarchitecture. For the energy usage in switches and capacitors, analytical formulae are provided. Additionally, the effect of manufacturing mismatch on the precision of analogue computing is looked at. Key Words: Analog Computation,HardwareAccelerator, In-Memory Computation, SRAM, DRAM 1.INTRODUCTION During computations, a lot of data is sent back and forth between the physically distinct memory and processorunits of standard Von-Neumann computing systems. It is necessary to reevaluate both the well-established charge- based memory technologies, such as SRAM, DRAM, and Flash, as well as the emerging resistance-based nonvolatile memory technologies in order to get around the limitations of the traditional Von-Neumann-based architectures, which enforce an assertive separation of the processing unit and the memory subsystem. It is becoming more and more obvious that switching to computing architectures with co-located logic and memory is necessary for application domains like artificial intelligence (AI). IMC, a unique non-Von Neumann computing paradigm, uses the physical characteristics and dynamical state of charging resistance-based memory devices to conduct certain computations directly in the memory. An IMC-based system may be used to accomplish a variety of computing tasks, including logical operations, arithmetic operations, and even certain machine learning activities. 1.1 Motivation The need for low-power integrated circuits has greatly increased over the past several years as a result of the increasing expansion of battery-operated devices including wireless communication units, portable entertainment devices, and implementable bio-medical chips. SRAM will eventually account for more than 60% of SoCs, predicts the International Technology Roadmap for Semiconductors (ITRS). The problem of consuming power and space is significantly solved when the technology scales by greatly increasing the transistor density in the SRAM units. 1.2 Objective The in-memory matrix-vector multiplier built on SRAM has as its primary goal a reduction in the amount of time required to complete computations. Performance may be improved and power consumption can be decreased by utilising SRAM technology. 2. LITERATURE SURVEY 2.1 Static Random Access Memory. In SoCs, embedded SRAMs may take up the bulk of the chip space. Modern scaled-downtechnologies'increasingprocess spreads and non-catastrophic defect-relatedvulnerability to external factors mightjeopardiseSRAMcells'stability,which is measured by their low Static Noise Margin (SNM). In a cell
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 692 whose SNM is sufficiently tiny that it might mistakenly flip under the worst operating conditions, a Stability Fault (SF) can occur. The study was done on a thorough SRAM SNM sensitivity analysis and it pinpointedthemaincausesofpoor SNM. A weak Cell Fault Model was presented based on the findings, which may be used in fault simulations to simulate an SRAM cell with a weakened SNM. The SNM of the freshly revised load-less 4T SRAM cell was also given an analytical expression. Several sorts of flaws in the cell's pull-up route may go undetected while reading a 6T SRAM cell with bit lines recharged to VDD. These flaws may result in the SFs. Two fully working SRAMtest chips—anasynchronousSRAM (CMOS 0.18 m technology) and a synchronous SRAM (CMOS 0.13 m technology)—are created for the verificationofthese methods. This approach offers better fault coverage and flexibility than the DRT, shorter test times, and no high- temperature needs. Regular SRAM March Tests have been demonstrated to have a very low detection sensitivity for SRAM cells with possible SFs. The pass/fail threshold's programmabilityenablestrackingofprocesschangesand/or changes to the quality standards without the need for post- silicon design updates. Fig-1: The block diagram for SRAM. 2.2 SRAM Block Structure An example of the fundamental SRAM block structure is shown in the above graphic.A wordlinefromWL0-WLN-1is chosen by a row decoder that is gated by the timing block after decoding the X row address bits. An additional Z- decoder activates the accessed page in the case of an SRAM array with N rows and M bits set up in a page-like fashion. Word- or bit-oriented memories are also possible. Each address in a bit-oriented memory may access a single bit. 2.3 SRAM Cell The essential elements of anySRAMusedtostorebinarydata are memory cells. Two cross-coupled inverters that create a latch and access transistors make up a standard SRAM cell. Access transistors provide for read-only and write-only cell access as well as cell isolationin the unaccessedstate.Aslong as the cell is powered, an SRAM cell must have non- destructive read access, writecapability, and unlimitedstore (or data retention) duration. Memory cells are organized hierarchically into cores, which may then be further subdivided into blocks and arrays based on the system performance and power needs. A resistive load four- transistor (4T) SRAM cell, a six-transistor (6T) CMOS SRAM cell, and a load-less 4T SRAM cell are three of the most contemporary SRAM cells. A smaller cell increases the amount of bits per unit area and lowers the cost per bit. Because the related capacitances are less with smaller cells, speed and power consumption can be indirectly improved. Fig-2: SRAM Cell. 2.4 Designed by 8T-SRAM as ADOT The schematic for a typical 8T bit-cell may be found here. A decoupled read port is made up of two extra transistors in addition to the well-known 6T-SRAM bit-cell. The write word-line (WWL) must be enabled and the write BLs (WBLs/WBLBs) must be driventogroundorVDD,depending on the bit that has to be saved. The read BL (RBL) must be recharged to VDD and the read WL (RWL) must be activated in order to read a value from the cell. Keep in mind that the source-line (SL) is grounded
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 693 Fig-3: Computing Dot-Product with 4-Bitweight using an 8T-SRAM Memory Array. 3. PROPOSED SYSTEM Demand for fast speed, low power, and low noise systems is quite strong. Static Random-Access memory (SRAM) can be utilized for several purposes. The dominant matrix-vector operations, accordingtotheideaofin-memorycomputingfor neural network applications, are carried out in the memory itself. The precision of analogue MAC operationsisaproblem for in-memory computing. By running MAC operations on a regular SRAM, the accuracy barrier is overcome. The initial step in the strategy is to achieve linearly scalable computing accuracy in terms of time, power, and area. Fig-4: Block diagram of the proposed system. 3.1 Transmission Gate A transmission gate (TG) is an analogue gate, similar to a relay, that may be controlled by a control signal with nearly any voltage potential to conduct or block current in either direction. It is a CMOS-based switch in which PMOS transmits a strong 1 but a poor 0, and NMOS transmits a powerful 0 but a weak 1. Both NMOS and PMOS function at the same time. Two field-effect transistors (FET) make up a transmission gate, however unlike conventional discrete field-effect transistors, the substrate terminal (bulk) is not internally linked to the source terminal in Fig. 3.2. The drain and source terminals of the two transistors, an n-channel MOSFET and a p-channel MOSFET, are linked together to form a parallel connection. A NOT gate (inverter) links their gate terminals together to produce the control terminal. Fig-5: Transmission gate. 3.2 Switched Capacitor A switched capacitor (SC) is an electrical circuit that carries charges into and out of capacitors in responsetotheopening and closing of electronic switches. The switches are often controlled by non-overlapping clock signals so that not all switches close at once. Switched-capacitor filters are those that use these components as opposed to exact resistorsand rely solely on the ratios between capacitances and the switching frequency. As a result, they are far more appropriate for use in integrated circuits than precisely defined resistors and capacitors, which are more expensive to build. Fig-6: Circuit with switched capacitors.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 694 SC circuits are generally constructed using the complementary CMOS (CMOS) process and implemented utilising metal oxide semiconductor (MOS) technology, including MOS-capacitors and MOS field-effect transistor (MOSFET) switches. Pulse code modulation (PCM) codec- filters, analogue to digital converter (ADC) chips, mixed signal integrated circuits, and PCM digital telephony are a few examples of common uses for MOS SC circuits. Fig-7: Capacitor resistor with switch. Fig-8: Circuit diagram for a switched leave capacitor. 3.3 Analog Multiplication. The voltage that is proportionate to the weight must thenbe analogly multiplied with the input as the following step. Similar to the weight Wn, it is expected that the input Xn is represented in SMR as a nx-bit fixed-point number. So, by performing an XOR operation between the respective signs of the weight (bn sign) and input (i n sign), it is possible to determine the sign of the multiplicationresultSnresultright away. As a consequence, the Sn result may be used to determine the Vpre recharge voltage. Figure displays an example of a transistor-level implementation of an SRAM-based 3-bit signed IMCU and the related circuit implementation. The recharge voltage selection step and the corresponding circuitry can be skipped in the event of an unsigned multiplication. A multibit fixed-point multiplication of an input Xin by a weight Wn can also be restated as a sum of nx binary products if the distributive law is applied. As a result, while processing each bit of the input individually, the multiplicationmaybedoneconsecutivelyin nx multiply and add stages. The best way to accomplish this in hardware is to change the control signals on the switches of the MSB capacitor Cnw, which is charged to Vw, n at the ncyc, w-th cycle. Despite having many similarities to operational amplifiers, analogue multiplier circuits are far more prone to noise and offset voltage-related issues since these mistakes can compound. Phase-related issues may be very complicated when working with high-frequency transmissions. Wide- range general-purpose analogue multipliers are far more difficult to manufacture than operational amplifiers, and they are frequently made utilizing specialized technologies and laser trimming, just as high-performance amplifierslike instrumentation amplifiers. Since they are quite expensive, they are often only employed in circuits where they are absolutely necessary. Fig-9: IMCU implementation at the transistor level using SRAM cells.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 695 4. SOFTWARE REQUIREMENTS A set of tools for designing integrated circuits is called Tanner EDA. With these tools,youmayenterschematics, run SPICE simulations, create physical designs (such chip layouts), verify design rules (DRC), and do layout versus schematic (LVS) comparisons. Fig:-10: The tool's name. 4.1 Design Tools Three different tools are  S-edit  T-SPICE L- edit  A schematic capture tools  the SPICE simulation engine integrated with S- edit - the physical design tool 5. RESULTS Fig-11: Schematic view The schematic perspective is depicted in the above diagram. They have revealed An Sram's internal relationships as well as its structural details. Fig-12: Output waveforms. The SRAM's waveforms are seen in the above graphic. When we divide a wave into its many parameters. Fig-13: Power and Time Results The effect of weight and input quantizationonthemaximum power, average power, and multiply operation time. Due to the fact that every extra weight bit requires a new set of capacitors, switches, and clock cycles, delay and power consumption both rise linearly. As a result, their product, which is energy consumption, exhibitsa squaredependency. Nevertheless, for the input bits nx, the scaling vs energy stays perfectly linear since pipelining simply necessitates three more cycles of operation for the circuit, with no additional hardware needed inside the IMCU. 3. CONCLUSIONS The idea of in-memory computing for neural network applications has been motivated by the time and energy costs involved with data transportation. This method makes use of specific physical characteristics of memory technologies to conduct the dominant matrix-vector operations in-place, or in the memory itself. Our method is the first to achieve computing accuracythatgrowslinearlyin time, power, and area, despite the fact that there are several SRAM-based matrix-vector multiplication engines in the literature.
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 10 Issue: 05 | May 2023 www.irjet.net p-ISSN: 2395-0072 © 2023, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 696 The precision of the analogue MAC operations is the fundamental obstacletoin-memorycomputing.Thearea and power needed can be decreased by switching to 14 nm technology from 45 nm technology. The SRAM-based multibit in-memory matrix vector multiplier (IMMVM), which has the potential todramatically increase the speed and power efficiency of a variety of computing workloads, is a promising technology. Matrix- vector multiplication (MVM) may be carried out by the IMMVM architecture directly in thememoryarray,obviating the requirement to transfer data back and forth between memory and the processing unit. This can increase overall system effectiveness and lessen the data flow bottleneck. REFERENCES [1] 1.B. Keeth and R. J. Baker, DRAM Circuit Design: A Tutorial, 1st ed. Hoboken, 9NJ, USA: Wiley, 2000. [2] P. F. Ferguson, X. Haurie, and G. C. Temes, “A highly linear low-power 10-bit DAC for GSM,” in Proc. IEEE Custom Integer. Circuits Conf., May 2000, pp. 261–264. [3] M. Le Gallo et al., “Mixed-precision in-memory computing,” NatureElectron., vol. 1, no. 4, pp. 246–253, Apr. 2018. [4] W. C. Jeong et al., “True 7 nm platform technology featuring smallest FinFET and smallest SRAM cell by EUV, special constructs and 3rdgeneration single diffusion break,” in Proc. IEEE Symp. VLSI Technol., Jun. 2018. BIOGRAPHIES K.G.VENKATA KRISHNA, Assistant Professor, Krishna University College of Engineering and Technology Krishna University, Machilipatnam. PARASA HEMA NAGA SAI SURYA KUMAR, Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam. SARIKOKKU MEGHANA, Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam. ATURI REDDY PRASAD REDDY, Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam. GODUGUCHINTHA MUNI JAYANTH, Student of Department of Electronics and Communication Engineering, Krishna University, Machilipatnam.