SlideShare a Scribd company logo
8
Most read
12
Most read
13
Most read
Auto-Encoding Variational Bayes
Diederik P. Kingma, Max Welling
Machine Learning Group Universiteit van Amsterdam
ICLR 2014 conference submission, Cited by 4655
May 23, 2019
SMC AI Research Center
Kyuri Kim
Introduction
1.1 Supervised learning vs. Unsupervised learning
Supervised learning Ex. Classification
Regression
Object Detection
Semantic Segmentation
Image captioning
:
Data: (x, y)
x is data, y is label
Learn a function to map x → y
Unsupervised learning
Ex. Clustering
Dimensionality Reduction
Density estimation
Feature Learning
:
Data: (x)
Just data, no label
Learn something underlying
hidden structure of the data
1-d Density estimation 2-d Density estimation
Ex.
Image Detection
Ex.
2
Introduction
1.2 Auto Encoder (Feature Learning)
1. Unsupervised Learning
2. ML density estimation
3. Manifold Learning
4. Generative model learning
In Auto Encoder Training:
In Trained Auto Encoder :
Encoding Decoding
Latent Variable
𝝌 𝒙
L (𝝌, 𝒚)
Reconstruction Error
Decoder는 최소한의 학습 데이터는 생성해 낼 수 있고,
Encoder는 최소한의 학습 데이터는 latent vector로 표현할 수 있다.
Minimize
𝒁
3
𝑥)
Introduction
1.3 Generative Model
Given training data, generate new samples from same distribution
Ex. Variational Auto encoders (VAE), Generative Adversarial Network(GAN)
Generated samples ~ 𝑝 𝑚𝑜𝑑𝑒𝑙(𝑥)Training data ~ 𝑝 𝑑𝑎𝑡𝑎(𝑥)
Want to learn 𝒑 𝒎𝒐𝒅𝒆𝒍 𝒙 similar to ~ 𝒑 𝒅𝒂𝒕𝒂(𝒙)
Probability density function
4
Introduction
1.4 Generative Model Network
Generative model taxonomy
Ian Goodfellow, "NIPS 2016 Tutorial: Generative Adversarial Networks"
5
Experiments
2.1 Variational Auto-Encoder
Target Data
𝝌
Latent Variable
𝑧
P(𝑥|𝑧)P(𝑧)
Sample from true prior Sample from true conditional
Decoder Network
How to Train the Model?
6
Maximum likelihood
estimation
Experiments
2.2 Variation Inference
Variation InferenceTarget Data
Generator
𝒈 𝜽(.)
𝝌
Latent Variable
𝑝 (𝑧|𝑥) ≈ 𝑞 𝜙(𝑧|𝑥)~𝑧
sampling 𝑧 from 𝑝(𝑧|𝑥)
Z를 정규분포에서 Sampling하는 것 보다 x와 유의미하게 Sample이 나올 수 있는 확률 분포 𝑝(𝑧|𝑥)로부터
Sampling. 그러나 𝑝(𝑧|𝑥)가 무엇인지 알지 못하므로, 우리가 알고 있는 확률분포 중 하나를 택하여 𝑞 𝜙(𝑧|𝑥)
그것의 parameter 값을 조정, 𝑝(𝑧|𝑥)과 유사하게 만든다.
7
𝑧
Experiments
2.3 Variational Auto-Encoder
(1) Definition of VAE
Variational Inference를 Auto Encoder의 구조를 통해 구현한 Generate Model.
(2) Structure of VAE When z is deterministic value
On the respects of probability
Where z is in a distribution
X가 주어졌을 때 z의 확률 Variational Inference
𝑧
𝒈 𝜽(.)𝒒∅(.)
𝝌𝝌
𝑝(𝑧|𝑥) ≈ 𝑝(𝑧)
8
Experiments
2.4 ELBO(Evidence Lower Bound)
ELBO(Evidence Lower Bound)
Variational
Inference
Jensen’s Inequality
9
Experiments
2.4 ELBO(Evidence Lower Bound)
ELBO(Evidence Lower Bound) KL term
두 확률분포 간의 거리 ≥ 0 10
Experiments
2.4 ELBO(Evidence Lower Bound)
KL term
①
③
②
11
Experiments
2.4 ELBO(Evidence Lower Bound)
𝒈 𝜽(𝒙|𝒛)𝒒∅(𝒛|𝒙)
Encoder
Inference Network
Decoder
Generation Network
sampling
Reconstruction error Regularization
12
Experiments
2.5 Reparameterization trick
mean
Reconstruction
error
sampling
Backpropagation Impossible → Reparameterization trick
13
Experiments
2.5 Reparameterization trick
정규분포
14
Result & Conclusion
http://dpkingma.com/sgvb_mnist_demo/demo.html
Z1
Degree of smile
Z2
Head pose
Z1
Z2
15
Appendix
- Auto Encoder “어떤 방식으로 pre train하였는가?”
16

More Related Content

PDF
Auto encoding-variational-bayes
PPTX
[Paper Reading] Attention is All You Need
PDF
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
PDF
Self-Attention with Linear Complexity
PDF
Introduction to Autoencoders
PPTX
Word embedding
PDF
Deep Generative Models
PDF
Attention is All You Need (Transformer)
Auto encoding-variational-bayes
[Paper Reading] Attention is All You Need
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Self-Attention with Linear Complexity
Introduction to Autoencoders
Word embedding
Deep Generative Models
Attention is All You Need (Transformer)

What's hot (20)

PDF
[기초개념] Graph Convolutional Network (GCN)
PDF
Introduction to Diffusion Models
PPTX
PDF
Natural Language Processing
PDF
Introduction to Deep Learning
PPTX
Natural language processing and transformer models
PPTX
PDF
An introduction to the Transformers architecture and BERT
PPTX
Introduction to Transformer Model
PDF
Lecture 4: Transformers (Full Stack Deep Learning - Spring 2021)
PDF
Transformer Introduction (Seminar Material)
PPTX
Dynamic programming Basics
PPTX
Attention Is All You Need
PPTX
Deep belief network.pptx
PPTX
PDF
Autoencoders
PPTX
Introduction For seq2seq(sequence to sequence) and RNN
PDF
TensorFlow and Keras: An Overview
PDF
Module 8: Natural language processing Pt 1
PDF
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
[기초개념] Graph Convolutional Network (GCN)
Introduction to Diffusion Models
Natural Language Processing
Introduction to Deep Learning
Natural language processing and transformer models
An introduction to the Transformers architecture and BERT
Introduction to Transformer Model
Lecture 4: Transformers (Full Stack Deep Learning - Spring 2021)
Transformer Introduction (Seminar Material)
Dynamic programming Basics
Attention Is All You Need
Deep belief network.pptx
Autoencoders
Introduction For seq2seq(sequence to sequence) and RNN
TensorFlow and Keras: An Overview
Module 8: Natural language processing Pt 1
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
Ad

Similar to Auto-encoding variational bayes (20)

PDF
Introduction to Variational Auto Encoder
PDF
Visualizing the Model Selection Process
PDF
A simple framework for contrastive learning of visual representations
PPT
Software tookits for machine learning and graphical models
PDF
Camp IT: Making the World More Efficient Using AI & Machine Learning
PDF
Deep image generating models
PPTX
FUNCTION APPROXIMATION
PPTX
Module4_GAN.pptxgdgdijehejejjejejejhehjdd
PPTX
GAN Deep Learning Approaches to Image Processing Applications (1).pptx
PDF
imageclassification-160206090009.pdf
PPTX
Image classification with Deep Neural Networks
PPTX
Neural networks
PDF
[PR12] PR-036 Learning to Remember Rare Events
PDF
Machine Learning Live
PDF
Lecture4 - Machine Learning
PPTX
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
PPTX
Variational Auto Encoder and the Math Behind
PPTX
Deep learning from a novice perspective
PDF
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
PDF
Layering Based Network Intrusion Detection System to Enhance Network Attacks ...
Introduction to Variational Auto Encoder
Visualizing the Model Selection Process
A simple framework for contrastive learning of visual representations
Software tookits for machine learning and graphical models
Camp IT: Making the World More Efficient Using AI & Machine Learning
Deep image generating models
FUNCTION APPROXIMATION
Module4_GAN.pptxgdgdijehejejjejejejhehjdd
GAN Deep Learning Approaches to Image Processing Applications (1).pptx
imageclassification-160206090009.pdf
Image classification with Deep Neural Networks
Neural networks
[PR12] PR-036 Learning to Remember Rare Events
Machine Learning Live
Lecture4 - Machine Learning
Piotr Mirowski - Review Autoencoders (Deep Learning) - CIUUK14
Variational Auto Encoder and the Math Behind
Deep learning from a novice perspective
AN ANN APPROACH FOR NETWORK INTRUSION DETECTION USING ENTROPY BASED FEATURE S...
Layering Based Network Intrusion Detection System to Enhance Network Attacks ...
Ad

More from Kyuri Kim (7)

PDF
GPT-Series.pdf
PDF
BERT- Pre-training of Deep Bidirectional Transformers for Language Understand...
PDF
Generative adversarial networks
PDF
Future semantic segmentation with convolutional LSTM
PDF
Exploring uncertainty measures in deep networks for sclerosis
PDF
Convolutional neural network based metal artifact reduction in x ray computed...
PDF
Automated bone metastasis detection
GPT-Series.pdf
BERT- Pre-training of Deep Bidirectional Transformers for Language Understand...
Generative adversarial networks
Future semantic segmentation with convolutional LSTM
Exploring uncertainty measures in deep networks for sclerosis
Convolutional neural network based metal artifact reduction in x ray computed...
Automated bone metastasis detection

Recently uploaded (20)

PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
Geodesy 1.pptx...............................................
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
additive manufacturing of ss316l using mig welding
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
DOCX
573137875-Attendance-Management-System-original
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
web development for engineering and engineering
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Digital Logic Computer Design lecture notes
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
Construction Project Organization Group 2.pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Sustainable Sites - Green Building Construction
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
bas. eng. economics group 4 presentation 1.pptx
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Geodesy 1.pptx...............................................
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
additive manufacturing of ss316l using mig welding
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
573137875-Attendance-Management-System-original
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
web development for engineering and engineering
Operating System & Kernel Study Guide-1 - converted.pdf
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Digital Logic Computer Design lecture notes
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Construction Project Organization Group 2.pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Sustainable Sites - Green Building Construction
OOP with Java - Java Introduction (Basics)
bas. eng. economics group 4 presentation 1.pptx
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...

Auto-encoding variational bayes

  • 1. Auto-Encoding Variational Bayes Diederik P. Kingma, Max Welling Machine Learning Group Universiteit van Amsterdam ICLR 2014 conference submission, Cited by 4655 May 23, 2019 SMC AI Research Center Kyuri Kim
  • 2. Introduction 1.1 Supervised learning vs. Unsupervised learning Supervised learning Ex. Classification Regression Object Detection Semantic Segmentation Image captioning : Data: (x, y) x is data, y is label Learn a function to map x → y Unsupervised learning Ex. Clustering Dimensionality Reduction Density estimation Feature Learning : Data: (x) Just data, no label Learn something underlying hidden structure of the data 1-d Density estimation 2-d Density estimation Ex. Image Detection Ex. 2
  • 3. Introduction 1.2 Auto Encoder (Feature Learning) 1. Unsupervised Learning 2. ML density estimation 3. Manifold Learning 4. Generative model learning In Auto Encoder Training: In Trained Auto Encoder : Encoding Decoding Latent Variable 𝝌 𝒙 L (𝝌, 𝒚) Reconstruction Error Decoder는 최소한의 학습 데이터는 생성해 낼 수 있고, Encoder는 최소한의 학습 데이터는 latent vector로 표현할 수 있다. Minimize 𝒁 3 𝑥)
  • 4. Introduction 1.3 Generative Model Given training data, generate new samples from same distribution Ex. Variational Auto encoders (VAE), Generative Adversarial Network(GAN) Generated samples ~ 𝑝 𝑚𝑜𝑑𝑒𝑙(𝑥)Training data ~ 𝑝 𝑑𝑎𝑡𝑎(𝑥) Want to learn 𝒑 𝒎𝒐𝒅𝒆𝒍 𝒙 similar to ~ 𝒑 𝒅𝒂𝒕𝒂(𝒙) Probability density function 4
  • 5. Introduction 1.4 Generative Model Network Generative model taxonomy Ian Goodfellow, "NIPS 2016 Tutorial: Generative Adversarial Networks" 5
  • 6. Experiments 2.1 Variational Auto-Encoder Target Data 𝝌 Latent Variable 𝑧 P(𝑥|𝑧)P(𝑧) Sample from true prior Sample from true conditional Decoder Network How to Train the Model? 6 Maximum likelihood estimation
  • 7. Experiments 2.2 Variation Inference Variation InferenceTarget Data Generator 𝒈 𝜽(.) 𝝌 Latent Variable 𝑝 (𝑧|𝑥) ≈ 𝑞 𝜙(𝑧|𝑥)~𝑧 sampling 𝑧 from 𝑝(𝑧|𝑥) Z를 정규분포에서 Sampling하는 것 보다 x와 유의미하게 Sample이 나올 수 있는 확률 분포 𝑝(𝑧|𝑥)로부터 Sampling. 그러나 𝑝(𝑧|𝑥)가 무엇인지 알지 못하므로, 우리가 알고 있는 확률분포 중 하나를 택하여 𝑞 𝜙(𝑧|𝑥) 그것의 parameter 값을 조정, 𝑝(𝑧|𝑥)과 유사하게 만든다. 7 𝑧
  • 8. Experiments 2.3 Variational Auto-Encoder (1) Definition of VAE Variational Inference를 Auto Encoder의 구조를 통해 구현한 Generate Model. (2) Structure of VAE When z is deterministic value On the respects of probability Where z is in a distribution X가 주어졌을 때 z의 확률 Variational Inference 𝑧 𝒈 𝜽(.)𝒒∅(.) 𝝌𝝌 𝑝(𝑧|𝑥) ≈ 𝑝(𝑧) 8
  • 9. Experiments 2.4 ELBO(Evidence Lower Bound) ELBO(Evidence Lower Bound) Variational Inference Jensen’s Inequality 9
  • 10. Experiments 2.4 ELBO(Evidence Lower Bound) ELBO(Evidence Lower Bound) KL term 두 확률분포 간의 거리 ≥ 0 10
  • 11. Experiments 2.4 ELBO(Evidence Lower Bound) KL term ① ③ ② 11
  • 12. Experiments 2.4 ELBO(Evidence Lower Bound) 𝒈 𝜽(𝒙|𝒛)𝒒∅(𝒛|𝒙) Encoder Inference Network Decoder Generation Network sampling Reconstruction error Regularization 12
  • 16. Appendix - Auto Encoder “어떤 방식으로 pre train하였는가?” 16