SlideShare a Scribd company logo
Generation:
Deep Generative Models
What is Generative Model?
• Generative model learns the distribution of data without label
• Create new data & Modify existing data
• Image/video/language/speech generation
• Data augmentation & semi-supervised learning
• Data privacy (e.g., public release of medical dataset)
What is Generative Model?
• Generative model learns the distribution of data without label
• Unsupervised representation learning
• Learning “good” representation with unlabeled data
• Design an auxiliary task (hence often called self-supervised learning)
• Generative model is a popular approach for unsupervised learning
Major Breakthroughs in Deep Generative Models
1980 1985 1990 2000 2005 2010 2015
1985 2006
Boltzmann machine (1985)
• By G. Hinton et al.
• Undirected graphical model
• Computationally expensive
Helmholtz machine
(1986)
• Directed graphical
model
Contrastivedivergence(1989)
• G. Hinton et.al
• Easy method for training RBM
1986
Deep Boltzmann machine (2009)
• Undirected deep generative
model consists of stacks of RBM
• Layerwise training followed by
joint learning
Restricted Boltzmann
machine (1986)
• Bipartite version of BM
1995
Variational Autoencoder (2013)
• By Durk Kingma et al.
• Easy NN like back-propagation learning
in deep generative model
Greedilylayer-wisepre-training(2006)
• Deep Belief Networks
• Major breakthrough in learning
deep generative model Generative Adversarial Network
(2014)
• Large scale image generative model
G. Hinton, S. Ruslan D. Kingma, M. Welling I. GoodfellowG. Hinton, T. Sejnowski P. Smolensky G. Hinton, R. Neal
• Hierarchical feature learning• Restricted Boltzmann Machine • Contrastive Divergence • Variatianal Autoencoder
2002 2009 2013 2014 2015
Ladder Network (2015)
• Performance breakthrough in
Semi-supervised learning
Approaches for Generative Models
1. Flow-based (autoregressive) model
• Pros: exactly compute the probability of the data (many applications)
• Cons: slow inference (autoregressive) or low quality (non-autoregressive)
Autoregressive (e.g., PixelCNN)
Non-autoregressive (e.g., Normalizing Flow)
Approaches for Generative Models
2. Variational autoencoder (VAE)
• Pros: stable training & theoretical properties (lower bound of likelihood)
• Cons: known to produce blurry outputs1
1. Recent methods combine VAE and other methods, e.g., IAF-VAE (+ flow) or WAE (+ GAN) to improve the performance
Blurry!
Approaches for Generative Models
3. Generative adversarial network (GAN)
• Pros: good performance (most SOTA models are based on GAN)
• Cons: hard to train (alternating two networks leads instability)
Application: Image Generation
• BigGAN
Application: Image Generation
• StyleGAN (https://www.youtube.com/watch?v=kSLJriaOumA)
Application: Image-to-Image Translation
• pix2pix (paired)
Application: Image-to-Image Translation
• CycleGAN (unpaired)
Application: Image-to-Image Translation
• StarGAN (multi-domain)
Application: Image-to-Image Translation
• MUNIT (diverse output)
Application: Image-to-Image Translation
• InstaGAN (shape modification) (from our lab)
Application: Emoji Generation
• DTN (create personal avatar)
Application: Semantic Manipulation
• pix2pixHD (https://www.youtube.com/watch?v=3AIpPlzM_qs)
Application: Pose Guided Generation
• PG2 (change pose of person)
Application: Cloth Extraction
• PixelDTGAN (extract cloth from image)
Application: Text-to-Image Synthesis
• Reed et al.
Application: Text-to-Image Synthesis
• Hong et al. (control location with bounding box)
Application: Video-to-Video Translation
• vid2vid (paired) (https://www.youtube.com/watch?v=HCqXJth9t_k)
Application: Video-to-Video Translation
• everybody dance now (with pose) (https://www.youtube.com/watch?v=PCBTZh41Ris)
Application: Video-to-Video Translation
• Recycle-GAN (unpaired) (https://www.youtube.com/watch?v=F51RCdDIuUw)
Application: Data Augmentation
• DAGAN (augment data to improve neural network performance)
Application: Anomaly Detection
• AnoGAN (find anomaly from given data)

More Related Content

PPTX
Fine tuning large LMs
PDF
Stable Diffusion path
PDF
Generative Models and ChatGPT
PDF
Latent diffusions vs DALL-E v2
PDF
Landscape of AI/ML in 2023
PPTX
Diffusion models beat gans on image synthesis
PDF
Cs231n 2017 lecture13 Generative Model
PDF
Tutorial on Deep Generative Models
Fine tuning large LMs
Stable Diffusion path
Generative Models and ChatGPT
Latent diffusions vs DALL-E v2
Landscape of AI/ML in 2023
Diffusion models beat gans on image synthesis
Cs231n 2017 lecture13 Generative Model
Tutorial on Deep Generative Models

What's hot (20)

PDF
PR-409: Denoising Diffusion Probabilistic Models
PDF
LanGCHAIN Framework
PDF
GPT-2: Language Models are Unsupervised Multitask Learners
PDF
Computer Vision
PDF
Large Language Models Bootcamp
PPTX
Generative models
PDF
Introduction to Diffusion Models
PPTX
Ensemble methods
PDF
Machine Learning Pipelines
PPTX
Few shot learning/ one shot learning/ machine learning
PPTX
Explainable Machine Learning (Explainable ML)
PDF
And then there were ... Large Language Models
PDF
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
PDF
ChatGPT Evaluation for NLP
PDF
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
PPTX
1.Introduction to deep learning
PDF
Explainable AI
PPTX
PDF
Exploring Generating AI with Diffusion Models
PDF
Generative AI: Past, Present, and Future – A Practitioner's Perspective
PR-409: Denoising Diffusion Probabilistic Models
LanGCHAIN Framework
GPT-2: Language Models are Unsupervised Multitask Learners
Computer Vision
Large Language Models Bootcamp
Generative models
Introduction to Diffusion Models
Ensemble methods
Machine Learning Pipelines
Few shot learning/ one shot learning/ machine learning
Explainable Machine Learning (Explainable ML)
And then there were ... Large Language Models
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ChatGPT Evaluation for NLP
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
1.Introduction to deep learning
Explainable AI
Exploring Generating AI with Diffusion Models
Generative AI: Past, Present, and Future – A Practitioner's Perspective

Similar to Generative Models for General Audiences (20)

PPTX
Introduction to Generative Models.pptx
PDF
Deep Generative Modelling
PDF
Deep Generative Modelling (updated)
PPTX
100-Concepts-of-AI with Anupama Kate .pptx
PDF
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
PDF
Deep image generating models
PDF
Deep Generative Models
PPTX
Introduction to Generative AI refers to a subset of artificial intelligence
PDF
Lec 1-2 ssdsdffffsssssfsdfsdfstGenAI.pdf
PDF
The Success of Deep Generative Models
PDF
GENAI GENAI GENAI GENAIGENAI GENAI GENAI GENAI
PDF
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
PPTX
Exploring the Foundations and Applications of Generative Artificial Intellige...
PPTX
GDC2019 - SEED - Towards Deep Generative Models in Game Development
PDF
Understanding Generative Model_ A Comprehensive Guide for Training Data.docx.pdf
PDF
Recent Trends in Deep Learning
PDF
TensorFlow London: Progressive Growing of GANs for increased stability, quali...
DOCX
What Are Generative Al Models? A Deep Dive Blog.docx
PDF
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
PPTX
Gnerative AI presidency Module1_L1_L2.pptx
Introduction to Generative Models.pptx
Deep Generative Modelling
Deep Generative Modelling (updated)
100-Concepts-of-AI with Anupama Kate .pptx
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep image generating models
Deep Generative Models
Introduction to Generative AI refers to a subset of artificial intelligence
Lec 1-2 ssdsdffffsssssfsdfsdfstGenAI.pdf
The Success of Deep Generative Models
GENAI GENAI GENAI GENAIGENAI GENAI GENAI GENAI
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Exploring the Foundations and Applications of Generative Artificial Intellige...
GDC2019 - SEED - Towards Deep Generative Models in Game Development
Understanding Generative Model_ A Comprehensive Guide for Training Data.docx.pdf
Recent Trends in Deep Learning
TensorFlow London: Progressive Growing of GANs for increased stability, quali...
What Are Generative Al Models? A Deep Dive Blog.docx
Alberto Massidda - Scenes from a memory - Codemotion Rome 2019
Gnerative AI presidency Module1_L1_L2.pptx

More from Sangwoo Mo (20)

PDF
Brief History of Visual Representation Learning
PDF
Learning Visual Representations from Uncurated Data
PDF
Hyperbolic Deep Reinforcement Learning
PDF
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
PDF
Self-supervised Learning Lecture Note
PDF
Deep Learning Theory Seminar (Chap 3, part 2)
PDF
Deep Learning Theory Seminar (Chap 1-2, part 1)
PDF
Object-Region Video Transformers
PDF
Deep Implicit Layers: Learning Structured Problems with Neural Networks
PDF
Learning Theory 101 ...and Towards Learning the Flat Minima
PDF
Sharpness-aware minimization (SAM)
PDF
Explicit Density Models
PDF
Score-Based Generative Modeling through Stochastic Differential Equations
PDF
Self-Attention with Linear Complexity
PDF
Meta-Learning with Implicit Gradients
PDF
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
PDF
Bayesian Model-Agnostic Meta-Learning
PDF
Deep Learning for Natural Language Processing
PDF
Domain Transfer and Adaptation Survey
PDF
Neural Processes
Brief History of Visual Representation Learning
Learning Visual Representations from Uncurated Data
Hyperbolic Deep Reinforcement Learning
A Unified Framework for Computer Vision Tasks: (Conditional) Generative Model...
Self-supervised Learning Lecture Note
Deep Learning Theory Seminar (Chap 3, part 2)
Deep Learning Theory Seminar (Chap 1-2, part 1)
Object-Region Video Transformers
Deep Implicit Layers: Learning Structured Problems with Neural Networks
Learning Theory 101 ...and Towards Learning the Flat Minima
Sharpness-aware minimization (SAM)
Explicit Density Models
Score-Based Generative Modeling through Stochastic Differential Equations
Self-Attention with Linear Complexity
Meta-Learning with Implicit Gradients
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Bayesian Model-Agnostic Meta-Learning
Deep Learning for Natural Language Processing
Domain Transfer and Adaptation Survey
Neural Processes

Recently uploaded (20)

PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Approach and Philosophy of On baking technology
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Cloud computing and distributed systems.
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Electronic commerce courselecture one. Pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Review of recent advances in non-invasive hemoglobin estimation
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
NewMind AI Weekly Chronicles - August'25 Week I
NewMind AI Monthly Chronicles - July 2025
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Approach and Philosophy of On baking technology
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Modernizing your data center with Dell and AMD
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Cloud computing and distributed systems.
Network Security Unit 5.pdf for BCA BBA.
Electronic commerce courselecture one. Pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Diabetes mellitus diagnosis method based random forest with bat algorithm
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Building Integrated photovoltaic BIPV_UPV.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy

Generative Models for General Audiences