SlideShare a Scribd company logo
Understanding LSTM
and
its Diagrams
Shi Yan
-
Slides written by Park JeeHyun
22 DEC 2017
Understanding LSTM and its diagrams
• Theoretically the naively connected neural network,
so called recurrent neural network, can work.
But in practice, it suffers from two problems:
• vanishing gradient and
• exploding gradient, which make it unusable.
• Then later, LSTM (long short term memory) was invented
to solve this issue by explicitly introducing a memory
unit, called the cell into the network.
Understanding LSTM and its diagrams
Understanding LSTM and its diagrams
• Memory pipe
• Forget valve
• New memory valve
• Generate new output
• Output valve
Memory pipe
• An element-wise multiplication
• if you multiply the old memory C_t-1 with a vector that is close to 0, that m
eans you want to forget most of the old memory.
• You let the old memory goes through, if your forget valve equals 1.
• A piece-wise summation
• New memory and the old memory will merge by this operation.
• How much new memory should be added to the old memory is
controlled by another valve, below.
Memory pipe : forget valve
• An element-wise multiplication
• if you multiply the old memory C_t-1 with a vector that is close to 0, that m
eans you want to forget most of the old memory.
• You let the old memory goes through, if your forget valve equals 1.
Memory pipe : new memory valve
• A piece-wise summation
• New memory and the old memory will merge by this operation.
• How much new memory should be added to the old memory is
controlled by another valve, below.
Generate new output : output valve
• This step has an output valve that is controlled by the new
memory, the previous output h_t-1, the input X_t and a bias
vector.
• This valve controls how much new memory should output to
the next LSTM unit.
Annex : LSTM diagram
Annex : LSTM diagram
References
• “Understanding LSTM and its diagrams” by Shi Yan
• https://medium.com/mlreview/understanding-lstm-and-its-
diagrams-37e2f46f1714
• “Understanding LSTM Networks” from colah's blog
• http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Understanding lstm and its diagrams

More Related Content

PPTX
PPTX
Natural language Processing.pptx
PPTX
LLaMA 2.pptx
PPTX
YOLO v1
PDF
Rnn and lstm
PDF
Large Language Models Are Reasoning Teachers
PPT
Real-Time Scheduling Algorithms
Natural language Processing.pptx
LLaMA 2.pptx
YOLO v1
Rnn and lstm
Large Language Models Are Reasoning Teachers
Real-Time Scheduling Algorithms

What's hot (20)

PPTX
RNN-LSTM.pptx
PDF
Realtime systems chapter 1
PPTX
Pwm(Pulse Width Modulation) presentation
PPT
Design of cmos based ring oscillator
PDF
LSTM Based Sentiment Analysis
PDF
PR-217: EfficientDet: Scalable and Efficient Object Detection
PPTX
Introduction to FPGAs
PPT
chap 18 multicore computers
PDF
Design and Simulation Low power SRAM Circuits
PPTX
Recurrent Neural Network
PDF
Unit 1 intro-embedded
PDF
Algorithms and Data Structure 2018
PPT
Recent Advances in Natural Language Processing
PDF
Embedded Systems Power Management
PPTX
Large-scale Video Classification with Convolutional Neural Network
PPT
CFD - OpenFOAM
PPTX
Deep learning: Overfitting , underfitting, and regularization
PDF
1.2V Analog I/O library for TSMC 65nm technology
PPTX
Parallel Processors (SIMD)
RNN-LSTM.pptx
Realtime systems chapter 1
Pwm(Pulse Width Modulation) presentation
Design of cmos based ring oscillator
LSTM Based Sentiment Analysis
PR-217: EfficientDet: Scalable and Efficient Object Detection
Introduction to FPGAs
chap 18 multicore computers
Design and Simulation Low power SRAM Circuits
Recurrent Neural Network
Unit 1 intro-embedded
Algorithms and Data Structure 2018
Recent Advances in Natural Language Processing
Embedded Systems Power Management
Large-scale Video Classification with Convolutional Neural Network
CFD - OpenFOAM
Deep learning: Overfitting , underfitting, and regularization
1.2V Analog I/O library for TSMC 65nm technology
Parallel Processors (SIMD)
Ad

More from JEE HYUN PARK (9)

PPTX
keti companion classifier
PPTX
[Paper review] BERT
PPTX
Kcc201728apr2017 170828235330
PPTX
neural based_context_representation_learning_for_dialog_act_classification
PPTX
a deep reinforced model for abstractive summarization
PPTX
Understanding GloVe
PPTX
Historical Finance Data
PPTX
KCC2017 28APR2017
PPTX
Short-Term Load Forecasting of Australian National Electricity Market by Hier...
keti companion classifier
[Paper review] BERT
Kcc201728apr2017 170828235330
neural based_context_representation_learning_for_dialog_act_classification
a deep reinforced model for abstractive summarization
Understanding GloVe
Historical Finance Data
KCC2017 28APR2017
Short-Term Load Forecasting of Australian National Electricity Market by Hier...
Ad

Recently uploaded (20)

PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PDF
composite construction of structures.pdf
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PPTX
Geodesy 1.pptx...............................................
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPT
Project quality management in manufacturing
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
additive manufacturing of ss316l using mig welding
PDF
PPT on Performance Review to get promotions
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
DOCX
573137875-Attendance-Management-System-original
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
UNIT 4 Total Quality Management .pptx
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
Welding lecture in detail for understanding
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
composite construction of structures.pdf
Operating System & Kernel Study Guide-1 - converted.pdf
Geodesy 1.pptx...............................................
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Project quality management in manufacturing
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
additive manufacturing of ss316l using mig welding
PPT on Performance Review to get promotions
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
573137875-Attendance-Management-System-original
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
UNIT 4 Total Quality Management .pptx
R24 SURVEYING LAB MANUAL for civil enggi
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
CH1 Production IntroductoryConcepts.pptx
Welding lecture in detail for understanding
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx

Understanding lstm and its diagrams

  • 1. Understanding LSTM and its Diagrams Shi Yan - Slides written by Park JeeHyun 22 DEC 2017
  • 2. Understanding LSTM and its diagrams • Theoretically the naively connected neural network, so called recurrent neural network, can work. But in practice, it suffers from two problems: • vanishing gradient and • exploding gradient, which make it unusable. • Then later, LSTM (long short term memory) was invented to solve this issue by explicitly introducing a memory unit, called the cell into the network.
  • 3. Understanding LSTM and its diagrams
  • 4. Understanding LSTM and its diagrams • Memory pipe • Forget valve • New memory valve • Generate new output • Output valve
  • 5. Memory pipe • An element-wise multiplication • if you multiply the old memory C_t-1 with a vector that is close to 0, that m eans you want to forget most of the old memory. • You let the old memory goes through, if your forget valve equals 1. • A piece-wise summation • New memory and the old memory will merge by this operation. • How much new memory should be added to the old memory is controlled by another valve, below.
  • 6. Memory pipe : forget valve • An element-wise multiplication • if you multiply the old memory C_t-1 with a vector that is close to 0, that m eans you want to forget most of the old memory. • You let the old memory goes through, if your forget valve equals 1.
  • 7. Memory pipe : new memory valve • A piece-wise summation • New memory and the old memory will merge by this operation. • How much new memory should be added to the old memory is controlled by another valve, below.
  • 8. Generate new output : output valve • This step has an output valve that is controlled by the new memory, the previous output h_t-1, the input X_t and a bias vector. • This valve controls how much new memory should output to the next LSTM unit.
  • 9. Annex : LSTM diagram
  • 10. Annex : LSTM diagram
  • 11. References • “Understanding LSTM and its diagrams” by Shi Yan • https://medium.com/mlreview/understanding-lstm-and-its- diagrams-37e2f46f1714 • “Understanding LSTM Networks” from colah's blog • http://colah.github.io/posts/2015-08-Understanding-LSTMs/