SlideShare a Scribd company logo
Algorithmic	Intelligence	Lab	Seminar:
On	Unifying	Deep	Generative	Models
2017.06.21.
Sangwoo	Mo
1/21
Overview
• On	Unifying	Deep	Generative	Models (arXiv,	2	Jun	2017)
• Author:		Zhiting	Hu,	Zichao	Yang,	Ruslan	Salakhutdinov,	Eric	P.	Xing
• Contribution
1. Establish	formal	connection between	GAN	and	VAE
2. Enables	to	exchange	ideas	across	models	in	principled	way
(apply	ideas	in	VAE	to	GAN,	and	ideas	in	GAN	to	VAE)
2/21
Table	of	Contents
• Bridging	the	Gap
• ADA (Adversarial	Domain	Adaptation)
• GAN (Generative	Adversarial	Network)
• VAE (Variational	Autoencoder)
• WS (Wake	Sleep	Algorithm)
• Applications
• IWGAN (Importance	Weighted	GAN)
• AAVAE (Adversary	Activated	VAE)
• Experiments
• Conclusion
3/21
Table	of	Contents
• Bridging	the	Gap
• ADA (Adversarial	Domain	Adaptation)
• GAN (Generative	Adversarial	Network)
• VAE (Variational	Autoencoder)
• WS (Wake	Sleep	Algorithm)
• Applications
• IWGAN (Importance	Weighted	GAN)
• AAVAE (Adversary	Activated	VAE)
• Experiments
• Conclusion
4/21
ADA	(Adversarial	Domain	Adaptation)
• Goal: Transfer	knowledge	from	source	domain	to	target	domain
• =>		Learn	feature	extractor whose	output	cannot	be	distinguished
by	a	discriminator	of	source	and	target	domains
• Notations
• 𝒛:		data	(either	from	source	or	target	domain)
• 𝒚 ∈ {0,1}:		domain	indicator	(source:	1,	target:	0)
• 𝒙 = 𝐺,(𝑧):		feature	(𝐺,:		feature	extractor)
5/21
ADA	(Adversarial	Domain	Adaptation)
• More	Notations
• 𝒑(𝒚):		prior	distribution	(assume	uniform)
• 𝒑(𝒛|𝒚):		data	distribution	(for	given	domain)
• 𝒑 𝜽(𝒙|𝒚):		implicit feature	distribution
• 𝒒 𝝓(𝒚|𝒙): discriminator	of	domains
• 𝒒 𝝓
𝒓
𝒚 𝒙 = 𝑞7(1 − 𝑦|𝑥)
• 𝑫 𝝓 𝒙 = 𝑞7(𝑦 = 1|𝑥)
6/21
ADA	(Adversarial	Domain	Adaptation)
• The	discriminator	should	guess	the	true	domain,
and	the	feature	extractor	should	fool	the	discriminator
• Thus,	the	objective	function	is
(omitted	supervised	learning	part	of	feature	extractor)
7/21
GAN	(Generative	Adversarial	Network)
• GAN	can	be	seen	as	a	special	case	of	ADA
• Let	real	data	to	be	source,	and	generated	data	to	be	target
• Note	that	𝑝=(𝑥) is	parametrized	by	𝜃,	while	𝑝?@A@(𝑥) is	fixed
(code	space	and	generator	is	degenerated	for	𝑦 = 1)
• Here,	ADA	objective	is	identical	to	GAN	(unsaturated version)
8/21
GAN	(Generative	Adversarial	Network)
• GAN	objective	=	𝐾𝐿(𝑝, 𝑥 𝑦 ||𝑞D 𝑥 𝑦 ) − 𝐽𝑆𝐷(𝑝=||𝑝?@A@)
• Let	𝑦 as	visible	and	𝑥 as	latent
• Then	it	is	variational	inference where	𝑞D 𝑥 𝑦 is	posterior
• Since	𝑞D 𝑥 𝑦 ∝ 𝑝,I
𝑥 =
J
K
(𝑝= 𝑥 + 𝑝?@A@ 𝑥 ),	𝑝= goes	to	𝑝?@A@
• Remark	that	it	is	reverse	KL, thus	occurs	mode	collapse problem
9/21
GAN	(Generative	Adversarial	Network)
• InfoGAN:	additionally	recover	the	latent	code	𝑧
• Simply	introduce	extra	conditional	𝑝M(𝑧|𝑥, 𝑦)
• Then	the	objective	is
where 𝑞D 𝑥 𝑧, 𝑦 ∝ 𝑞MI
𝑧 𝑥, 𝑦 	𝑞7I
D
𝑦 𝑥 	𝑝,I
(𝑥)
10/21
VAE	(Variational	Autoencoder)
• Assume	VAE	has	optimal (degenerated)	discriminator	𝑞∗(𝑦|𝑥)
• VAE	detects	every	false	data,	and	only	learns	from real	data
• The	original	objective	is
•
and	identical	to
11/21
Compare	GAN	and	VAE
• In	summary
• InfoGAN	objective: 𝐾𝐿(𝑝, 𝑥 𝑧, 𝑦 ||𝑞D 𝑥 𝑧, 𝑦 )
• VAE	objective: 𝐾𝐿(𝑞D 𝑧, 𝑦 𝑥 ||𝑝, 𝑧, 𝑦 𝑥 )
• Remark	that	(1)	position of	𝑝 and	𝑞 are	reversed,	and
(2)	hidden/visible variables	𝑥 and	𝑦, 𝑧 are	inverted
• VAE	minimizes	KL	->	smoothed	output
• GAN	minimizes	reverse	KL	->	mode	collapse	
=>	VAE/GAN	joint	model
12/21
WS	(Wake	Sleep	Algorithm)
• Classic	wake-sleep	algorithm	is
• VAE	=	wake	step
• Let	ℎ be	𝑧 and	𝜆 be	𝜂.	VAE	objective	is	𝑝,,	but	also	optimize	𝑞M
• GAN	=	sleep	step
• Let	ℎ be	𝑦 and	𝜆 be	𝜙.	GAN	objective	is	𝑞7,	but	also	optimize	𝑝,
13/21
Table	of	Contents
• Bridging	the	Gap
• ADA (Adversarial	Domain	Adaptation)
• GAN (Generative	Adversarial	Network)
• VAE (Variational	Autoencoder)
• WS (Wake	Sleep	Algorithm)
• Applications
• IWGAN (Importance	Weighted	GAN)
• AAVAE (Adversary	Activated	VAE)
• Experiments
• Conclusion
14/21
IWGAN	(Importance	Weighted	GAN)
• Importance	Weighted	GAN
• In	practice,	just	assign	weights	for	each	samples	in	mini-batch
15/21
AAVAE	(Adversary	Activated	VAE)
• Adversary	Activated	VAE
• Motivation:	Utilize	fake	samples
• =>	Use	discriminator	network	𝑞7(𝑦|𝑥) instead	of	optimal	𝑞∗(𝑦|𝑥)
• Objective
16/21
AAVAE	(Adversary	Activated	VAE)
• However,	(18)	is	intractable	since	𝑝, 𝑥 𝑧, 𝑦 = 1 = 𝑝?@A@(𝑥) is
an	implicit	distribution	(cannot	estimate	likelihood)
• In	practice,	AAVAE	use	binary	classifier	same	as	GAN
• In	my	opinion,	it	is	just	a	GAN	variant	using	different	𝐺 objective
17/21
Table	of	Contents
• Bridging	the	Gap
• ADA (Adversarial	Domain	Adaptation)
• GAN (Generative	Adversarial	Network)
• VAE (Variational	Autoencoder)
• WS (Wake	Sleep	Algorithm)
• Applications
• IWGAN (Importance	Weighted	GAN)
• AAVAE (Adversary	Activated	VAE)
• Experiments
• Conclusion
18/21
Experiment	Results
19/21
Table	of	Contents
• Bridging	the	Gap
• ADA (Adversarial	Domain	Adaptation)
• GAN (Generative	Adversarial	Network)
• VAE (Variational	Autoencoder)
• WS (Wake	Sleep	Algorithm)
• Applications
• IWGAN (Importance	Weighted	GAN)
• AAVAE (Adversary	Activated	VAE)
• Experiments
• Conclusion
20/21
Conclusion	&	Discussions
• Conclusion
• Traditional	models	usually	distinguish	visible/latent	variables
• However,	it	may	not	necessary	to	make	clear	boundary	between	
visible/latent	and	generator/discriminator
• GAN	and	VAE	can	be	thought	as	(in	some	sense)	dual
• Research	Directions
• Generalize	framework	to	connect	to	other	learning	paradigms
e.g.	Reinforcement	Learning,	Energy-based	model,	etc.
21/21
Appendix
22/21
Appendix:	ADA	objective	=	GAN	objective
23/21
Appendix:	Reverse	KL	divergence
24/21
Appendix:	Proof	of	Lemma	1.
25/21
Appendix:	Proof	of	Lemma	2.
26/21

More Related Content

PPTX
Vitaly Bondar: Are GANs dead or alive? (UA)
PDF
Sparse inverse covariance estimation using skggm
PDF
Open-source tools for generating and analyzing large materials data sets
PDF
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
PDF
Low rank models for recommender systems with limited preference information
PDF
A beginner's guide to Style Transfer and recent trends
PPTX
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
PDF
Explicit Density Models
Vitaly Bondar: Are GANs dead or alive? (UA)
Sparse inverse covariance estimation using skggm
Open-source tools for generating and analyzing large materials data sets
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
Low rank models for recommender systems with limited preference information
A beginner's guide to Style Transfer and recent trends
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
Explicit Density Models

Similar to On Unifying Deep Generative Models (20)

PDF
Generative Adversarial Networks 2
PDF
Learning In Nonstationary Environments: Perspectives And Applications. Part2:...
PPT
Sequential Action Patterns in Collaborative Ontology Engineering Projects: A ...
PPTX
Image generation using Aritificial intellegence and Generative Adversarial Ne...
PDF
00 Automatic Mental Health Classification in Online Settings and Language Emb...
PDF
IEEE Datamining 2016 Title and Abstract
PDF
6666666666666666666666666666666666666.pdf
PPTX
GDC2019 - SEED - Towards Deep Generative Models in Game Development
PDF
An Extensive Review on Generative Adversarial Networks GAN’s
PPTX
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
PPTX
GENERATIVE GRAPH CONVOLUTIONAL NETWORK FOR GROWING GRAPHS.pptx
PDF
Sound Empirical Evidence in Software Testing
PDF
Comparative Analysis of GANs and VAEs in Generating High-Quality Images: A Ca...
PDF
BOOSTING ADVERSARIAL ATTACKS WITH MOMENTUM - Tianyu Pang and Chao Du, THU - D...
PDF
Icml2018 naver review
PPTX
Talk@rmit 09112017
PDF
IEEE Fuzzy system Title and Abstract 2016
PDF
AUGMENTING MISSING SENSOR DATA FOR ROBUST HUMAN ACTIVITY RECOGNITION
PDF
AUGMENTING MISSING SENSOR DATA FOR ROBUST HUMAN ACTIVITY RECOGNITION
PDF
From Bugs to Decision Support - Selected Research Highlights
Generative Adversarial Networks 2
Learning In Nonstationary Environments: Perspectives And Applications. Part2:...
Sequential Action Patterns in Collaborative Ontology Engineering Projects: A ...
Image generation using Aritificial intellegence and Generative Adversarial Ne...
00 Automatic Mental Health Classification in Online Settings and Language Emb...
IEEE Datamining 2016 Title and Abstract
6666666666666666666666666666666666666.pdf
GDC2019 - SEED - Towards Deep Generative Models in Game Development
An Extensive Review on Generative Adversarial Networks GAN’s
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
GENERATIVE GRAPH CONVOLUTIONAL NETWORK FOR GROWING GRAPHS.pptx
Sound Empirical Evidence in Software Testing
Comparative Analysis of GANs and VAEs in Generating High-Quality Images: A Ca...
BOOSTING ADVERSARIAL ATTACKS WITH MOMENTUM - Tianyu Pang and Chao Du, THU - D...
Icml2018 naver review
Talk@rmit 09112017
IEEE Fuzzy system Title and Abstract 2016
AUGMENTING MISSING SENSOR DATA FOR ROBUST HUMAN ACTIVITY RECOGNITION
AUGMENTING MISSING SENSOR DATA FOR ROBUST HUMAN ACTIVITY RECOGNITION
From Bugs to Decision Support - Selected Research Highlights
Ad

More from Sangwoo Mo (20)

PDF
Brief History of Visual Representation Learning
PDF
Learning Visual Representations from Uncurated Data
PDF
Hyperbolic Deep Reinforcement Learning
PDF
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
PDF
Self-supervised Learning Lecture Note
PDF
Deep Learning Theory Seminar (Chap 3, part 2)
PDF
Deep Learning Theory Seminar (Chap 1-2, part 1)
PDF
Introduction to Diffusion Models
PDF
Object-Region Video Transformers
PDF
Deep Implicit Layers: Learning Structured Problems with Neural Networks
PDF
Learning Theory 101 ...and Towards Learning the Flat Minima
PDF
Sharpness-aware minimization (SAM)
PDF
Score-Based Generative Modeling through Stochastic Differential Equations
PDF
Self-Attention with Linear Complexity
PDF
Meta-Learning with Implicit Gradients
PDF
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
PDF
Generative Models for General Audiences
PDF
Bayesian Model-Agnostic Meta-Learning
PDF
Deep Learning for Natural Language Processing
PDF
Domain Transfer and Adaptation Survey
Brief History of Visual Representation Learning
Learning Visual Representations from Uncurated Data
Hyperbolic Deep Reinforcement Learning
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
Self-supervised Learning Lecture Note
Deep Learning Theory Seminar (Chap 3, part 2)
Deep Learning Theory Seminar (Chap 1-2, part 1)
Introduction to Diffusion Models
Object-Region Video Transformers
Deep Implicit Layers: Learning Structured Problems with Neural Networks
Learning Theory 101 ...and Towards Learning the Flat Minima
Sharpness-aware minimization (SAM)
Score-Based Generative Modeling through Stochastic Differential Equations
Self-Attention with Linear Complexity
Meta-Learning with Implicit Gradients
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Generative Models for General Audiences
Bayesian Model-Agnostic Meta-Learning
Deep Learning for Natural Language Processing
Domain Transfer and Adaptation Survey
Ad

Recently uploaded (20)

PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
project resource management chapter-09.pdf
PPTX
A Presentation on Touch Screen Technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
August Patch Tuesday
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
MIND Revenue Release Quarter 2 2025 Press Release
project resource management chapter-09.pdf
A Presentation on Touch Screen Technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Univ-Connecticut-ChatGPT-Presentaion.pdf
August Patch Tuesday
Assigned Numbers - 2025 - Bluetooth® Document
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Encapsulation_ Review paper, used for researhc scholars
A comparative analysis of optical character recognition models for extracting...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
DP Operators-handbook-extract for the Mautical Institute
Digital-Transformation-Roadmap-for-Companies.pptx
Group 1 Presentation -Planning and Decision Making .pptx
A comparative study of natural language inference in Swahili using monolingua...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
gpt5_lecture_notes_comprehensive_20250812015547.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf

On Unifying Deep Generative Models