SlideShare a Scribd company logo
Deep Unsupervised Learning using
Nonequlibrium Thermodynamics
Tran Quoc Hoan
@k09hthaduonght.wordpress.com/
14 December 2015, Paper Alert, Hasegawa lab., Tokyo
The University of Tokyo
Jascha Sohl-Dickstein, Eric A. Weiss, Niru Maheswaranathan, Surya Ganguli
Proceedings of the 32nd International Conference on Machine Learning, 2015
Abstract
Deep Unsupervised Learning using Nonequilibrium Thermodynamics 2
“…The essential idea, inspired by non-equilibrium statistical
physics, is to systematically and slowly destroy structure in
a data distribution through an iterative forward diffusion
process. We then learn a reverse diffusion process
that restores structure in data, yielding a highly flexible
and tractable generative model of the data…”
Outline
3
- The promise of deep unsupervised learning
• Motivation
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
- Diffusion processes and time reversal
• Physical intuition
- Derivation and experimental results
• Diffusion probabilistic model
Deep Unsupervised Learning
4
- Novel modalities
• Unknown features/labels
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
- Ex. disease part in medical image
• Expensive labels
• Unpredictable tasks / one shot learning
- Exploratory data analysis
https://www.ceessentials.net/article40.html
Physical Intuition
5
- Destroy structure in data
• Diffusion processes and time reversal
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
- Carefully characterize the destruction
- Learn how to reverse time
Observation 1: Diffusion Destroy Structure
6
Data distribution
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Uniform distribution
Uniform distributionData distribution
(Observation)

Diffusion destroys structure
(Recover structure)

Recover data distribution by starting from uniform
distribution and running dynamics backwards
Observation 2: Microscopic Diffusion
7
• Time reversible
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
https://www.youtube.com/watch?v=cDcprgWiQEY
• Brownian motion
• Position updates are small
Gaussians (both forwards and
backwards in time)
Diffusion-based Probabilistic Models
8
• Destroy all structure in data distribution using
diffusion process
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
• Learn reversal of diffusion process
- Estimate function for mean and covariance of each
step in the reverse diffusion process (Ex. binomial rate
for binary data)
• Reverse diffusion process is the model of the data
Diffusion-based Probabilistic Models
9
• Algorithm
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
• Deep convolutional network: universal function
approximatior
• Multiplying distributions: inputation, denoising,
computing posteriors
Destroy by Diffusion Process
10
Data

distribution
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Forward

diffusion
Noise

distribution
Temporal diffusion rate
Destroy by Gaussian Process
11
Data

distribution
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Forward

diffusion
Noise

distribution
Decay towards origin Add small noise
Reversal Gaussian Diffusion Process
12
Data

distribution
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Reverse

diffusion
Noise

distribution
Learned drift and covariance functions
Case Study: Swiss Roll
13Deep Unsupervised Learning using Nonequilibrium Thermodynamics
True model
Inference model
Training the reverse diffusion
14Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Model probability
Annealed importance sampling
Training the reverse diffusion
15Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Log likelihood
Jensen’s inequality
Training the reverse diffusion
16Deep Unsupervised Learning using Nonequilibrium Thermodynamics
…do some algebra…
Training the reverse diffusion
17Deep Unsupervised Learning using Nonequilibrium Thermodynamics
…for Gaussian diffusion process…
Training
unsupervised learning becomes regression problem
Training the reverse diffusion
18Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Setting the diffusion rate
• For Binomial diffusion (erase constant fraction of stimulus
variance each step)
• For Gaussian diffusion
t
1
t = (T t + 1) 1
= small constant (prevent over-fitting)
Training t
Multiplying Distributions
19Deep Unsupervised Learning using Nonequilibrium Thermodynamics
• Required to compute posterior distribution
- Missing data (inpainting)
- Corrupted data (denoising)
• Difficult and expensive using competing techniques
- Ex. VAE, GSNs, NADEs, most graphical models
Interested in
Acts as small perturbation to diffusion process
Multiplying Distributions
20Deep Unsupervised Learning using Nonequilibrium Thermodynamics
• Modified marginal distributions
Interested in
Acts as small perturbation to diffusion process
Multiplying Distributions
21Deep Unsupervised Learning using Nonequilibrium Thermodynamics
• Modified diffusion steps
Equilibrium 

condition
Normalized
Multiplying Distributions
22Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Reversal gaussian Diffusion Process
Interested in
Acts as small perturbation to diffusion process
Small perturbation affects only mean
Deep Network as Approximator for Images
23Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Multi-scale convolution
24Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Downsample
Convolve
Upsample
Sum
Applied to CIFAR-10
25Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Training data Samples from Generative
Adversarial [Goodfellow
et al, 2014]
Samples from
diffusion model
Applied to CIFAR-10
26Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Samples from
DRAW

[Gregor et al, 2015]
Samples from Generative
Adversarial [Goodfellow
et al, 2014]
Samples from
diffusion model
Applied to Dead Leaves
27Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Training data
Samples from
[Theis et al, 2012]

Log likelihood 1.24
bits/pixel
Samples from
diffusion model

Log likelihood 1.49
bits/pixel
Applied to Inpainting
28Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Table App.1
29Deep Unsupervised Learning using Nonequilibrium Thermodynamics
References
30Deep Unsupervised Learning using Nonequilibrium Thermodynamics
h"p://jmlr.org/proceedings/papers/v37/sohl-dickstein15.html	
h"p://videolectures.net/
icml2015_sohl_dickstein_deep_unsupervised_learning/	
h"p://www.inference.vc/icml-paper-unsupervised-learning-by-
inverEng-diffusion-processes/

More Related Content

PDF
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
PDF
End-to-End Object Detection with Transformers
PPTX
[Paper Reading] Attention is All You Need
PDF
Emerging Properties in Self-Supervised Vision Transformers
PDF
Object Detection with Transformers
PPTX
Diabetes Mellitus
PPTX
Hypertension
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Unde...
End-to-End Object Detection with Transformers
[Paper Reading] Attention is All You Need
Emerging Properties in Self-Supervised Vision Transformers
Object Detection with Transformers
Diabetes Mellitus
Hypertension

What's hot (20)

PPTX
Generative Adversarial Networks (GANs)
PPTX
Human Pose Estimation by Deep Learning
PDF
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
PDF
【CVPR 2019】Second-order Attention Network for Single Image Super-Resolution
PDF
Generative adversarial networks
PDF
Generative Models for General Audiences
PPTX
Attention Is All You Need
PDF
Introduction to Generative Adversarial Networks (GANs)
PDF
[DL輪読会]Energy-based generative adversarial networks
PPTX
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
PPTX
Optimization in Deep Learning
PPTX
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
PPTX
Introduction to deep learning
PPTX
Artificial Intelligence, Machine Learning and Deep Learning
PPTX
Transfer Learning and Fine-tuning Deep Neural Networks
PDF
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
PDF
PR-409: Denoising Diffusion Probabilistic Models
PDF
[DL輪読会] Spectral Norm Regularization for Improving the Generalizability of De...
PDF
GANs and Applications
PPTX
Object Detection using Deep Neural Networks
Generative Adversarial Networks (GANs)
Human Pose Estimation by Deep Learning
004 20151116 deep_unsupervisedlearningusingnonequlibriumthermodynamics
【CVPR 2019】Second-order Attention Network for Single Image Super-Resolution
Generative adversarial networks
Generative Models for General Audiences
Attention Is All You Need
Introduction to Generative Adversarial Networks (GANs)
[DL輪読会]Energy-based generative adversarial networks
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
Optimization in Deep Learning
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
Introduction to deep learning
Artificial Intelligence, Machine Learning and Deep Learning
Transfer Learning and Fine-tuning Deep Neural Networks
PR-231: A Simple Framework for Contrastive Learning of Visual Representations
PR-409: Denoising Diffusion Probabilistic Models
[DL輪読会] Spectral Norm Regularization for Improving the Generalizability of De...
GANs and Applications
Object Detection using Deep Neural Networks
Ad

Viewers also liked (20)

PDF
006 20151207 draws - Deep Recurrent Attentive Writer
PDF
005 20151130 adversary_networks
PDF
018 20160902 Machine Learning Framework for Analysis of Transport through Com...
PDF
013_20160328_Topological_Measurement_Of_Protein_Compressibility
PDF
017_20160826 Thermodynamics Of Stochastic Turing Machines
PDF
Approximate Inference (Chapter 10, PRML Reading)
PDF
Tutorial of topological data analysis part 3(Mapper algorithm)
PDF
Tutorial of topological_data_analysis_part_1(basic)
PDF
016_20160722 Molecular Circuits For Dynamic Noise Filtering
PDF
015_20160422 Controlling Synchronous Patterns In Complex Networks
PDF
008 20151221 Return of Frustrating Easy Domain Adaptation
PPTX
003 20151109 nn_faster_andfaster
PDF
010_20160216_Variational Gaussian Process
PDF
009_20150201_Structural Inference for Uncertain Networks
PDF
011_20160321_Topological_data_analysis_of_contagion_map
PDF
PRML Reading Chapter 11 - Sampling Method
PDF
002 20151019 interconnected_network
PDF
Topological data analysis
PDF
Part 4. newton & kung fu
PPTX
Force And Laws Of Motion
006 20151207 draws - Deep Recurrent Attentive Writer
005 20151130 adversary_networks
018 20160902 Machine Learning Framework for Analysis of Transport through Com...
013_20160328_Topological_Measurement_Of_Protein_Compressibility
017_20160826 Thermodynamics Of Stochastic Turing Machines
Approximate Inference (Chapter 10, PRML Reading)
Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological_data_analysis_part_1(basic)
016_20160722 Molecular Circuits For Dynamic Noise Filtering
015_20160422 Controlling Synchronous Patterns In Complex Networks
008 20151221 Return of Frustrating Easy Domain Adaptation
003 20151109 nn_faster_andfaster
010_20160216_Variational Gaussian Process
009_20150201_Structural Inference for Uncertain Networks
011_20160321_Topological_data_analysis_of_contagion_map
PRML Reading Chapter 11 - Sampling Method
002 20151019 interconnected_network
Topological data analysis
Part 4. newton & kung fu
Force And Laws Of Motion
Ad

Similar to 007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics (20)

PDF
Online machine learning in Streaming Applications
PPTX
The Art Of Backpropagation
PPTX
Computational Giants_nhom.pptx
PDF
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
PDF
DEF CON 24 - Clarence Chio - machine duping 101
PPTX
Deep Learning in Recommender Systems - RecSys Summer School 2017
PPTX
Variational continual learning
PPTX
Computer Design Concepts for Machine Learning
PPTX
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
PPTX
Deeplearning
PDF
Entity embeddings for categorical data
PDF
Deep learning
PPTX
Intro to Deep Reinforcement Learning
PDF
Introduction to Neural Network
PDF
Machine learning and climate and weather research
PDF
Ch 1-1 introduction
PDF
How Can Machine Learning Help Your Research Forward?
PPT
Large Scale Data Mining using Genetics-Based Machine Learning
PPTX
Supervised learning for IOT IN Vellore Institute of Technology
PDF
Continuous control with deep reinforcement learning (DDPG)
Online machine learning in Streaming Applications
The Art Of Backpropagation
Computational Giants_nhom.pptx
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
DEF CON 24 - Clarence Chio - machine duping 101
Deep Learning in Recommender Systems - RecSys Summer School 2017
Variational continual learning
Computer Design Concepts for Machine Learning
Using Feature Grouping as a Stochastic Regularizer for High Dimensional Noisy...
Deeplearning
Entity embeddings for categorical data
Deep learning
Intro to Deep Reinforcement Learning
Introduction to Neural Network
Machine learning and climate and weather research
Ch 1-1 introduction
How Can Machine Learning Help Your Research Forward?
Large Scale Data Mining using Genetics-Based Machine Learning
Supervised learning for IOT IN Vellore Institute of Technology
Continuous control with deep reinforcement learning (DDPG)

More from Ha Phuong (7)

PDF
QTML2021 UAP Quantum Feature Map
PDF
CCS2019-opological time-series analysis with delay-variant embedding
PDF
SIAM-AG21-Topological Persistence Machine of Phase Transition
PDF
001 20151005 ranking_nodesingrowingnetwork
PDF
Deep Learning And Business Models (VNITC 2015-09-13)
PDF
Prediction io–final 2014-jp-handout
PDF
A Study on Privacy Level in Publishing Data of Smart Tap Network
QTML2021 UAP Quantum Feature Map
CCS2019-opological time-series analysis with delay-variant embedding
SIAM-AG21-Topological Persistence Machine of Phase Transition
001 20151005 ranking_nodesingrowingnetwork
Deep Learning And Business Models (VNITC 2015-09-13)
Prediction io–final 2014-jp-handout
A Study on Privacy Level in Publishing Data of Smart Tap Network

Recently uploaded (20)

PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPT
Project quality management in manufacturing
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
PDF
Well-logging-methods_new................
PDF
PPT on Performance Review to get promotions
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
OOP with Java - Java Introduction (Basics)
PDF
composite construction of structures.pdf
PPTX
Geodesy 1.pptx...............................................
PPT
Mechanical Engineering MATERIALS Selection
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Project quality management in manufacturing
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Lecture Notes Electrical Wiring System Components
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
CH1 Production IntroductoryConcepts.pptx
bas. eng. economics group 4 presentation 1.pptx
Well-logging-methods_new................
PPT on Performance Review to get promotions
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
OOP with Java - Java Introduction (Basics)
composite construction of structures.pdf
Geodesy 1.pptx...............................................
Mechanical Engineering MATERIALS Selection
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
UNIT-1 - COAL BASED THERMAL POWER PLANTS
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk

007 20151214 Deep Unsupervised Learning using Nonequlibrium Thermodynamics

  • 1. Deep Unsupervised Learning using Nonequlibrium Thermodynamics Tran Quoc Hoan @k09hthaduonght.wordpress.com/ 14 December 2015, Paper Alert, Hasegawa lab., Tokyo The University of Tokyo Jascha Sohl-Dickstein, Eric A. Weiss, Niru Maheswaranathan, Surya Ganguli Proceedings of the 32nd International Conference on Machine Learning, 2015
  • 2. Abstract Deep Unsupervised Learning using Nonequilibrium Thermodynamics 2 “…The essential idea, inspired by non-equilibrium statistical physics, is to systematically and slowly destroy structure in a data distribution through an iterative forward diffusion process. We then learn a reverse diffusion process that restores structure in data, yielding a highly flexible and tractable generative model of the data…”
  • 3. Outline 3 - The promise of deep unsupervised learning • Motivation Deep Unsupervised Learning using Nonequilibrium Thermodynamics - Diffusion processes and time reversal • Physical intuition - Derivation and experimental results • Diffusion probabilistic model
  • 4. Deep Unsupervised Learning 4 - Novel modalities • Unknown features/labels Deep Unsupervised Learning using Nonequilibrium Thermodynamics - Ex. disease part in medical image • Expensive labels • Unpredictable tasks / one shot learning - Exploratory data analysis https://www.ceessentials.net/article40.html
  • 5. Physical Intuition 5 - Destroy structure in data • Diffusion processes and time reversal Deep Unsupervised Learning using Nonequilibrium Thermodynamics - Carefully characterize the destruction - Learn how to reverse time
  • 6. Observation 1: Diffusion Destroy Structure 6 Data distribution Deep Unsupervised Learning using Nonequilibrium Thermodynamics Uniform distribution Uniform distributionData distribution (Observation)
 Diffusion destroys structure (Recover structure)
 Recover data distribution by starting from uniform distribution and running dynamics backwards
  • 7. Observation 2: Microscopic Diffusion 7 • Time reversible Deep Unsupervised Learning using Nonequilibrium Thermodynamics https://www.youtube.com/watch?v=cDcprgWiQEY • Brownian motion • Position updates are small Gaussians (both forwards and backwards in time)
  • 8. Diffusion-based Probabilistic Models 8 • Destroy all structure in data distribution using diffusion process Deep Unsupervised Learning using Nonequilibrium Thermodynamics • Learn reversal of diffusion process - Estimate function for mean and covariance of each step in the reverse diffusion process (Ex. binomial rate for binary data) • Reverse diffusion process is the model of the data
  • 9. Diffusion-based Probabilistic Models 9 • Algorithm Deep Unsupervised Learning using Nonequilibrium Thermodynamics • Deep convolutional network: universal function approximatior • Multiplying distributions: inputation, denoising, computing posteriors
  • 10. Destroy by Diffusion Process 10 Data
 distribution Deep Unsupervised Learning using Nonequilibrium Thermodynamics Forward
 diffusion Noise
 distribution Temporal diffusion rate
  • 11. Destroy by Gaussian Process 11 Data
 distribution Deep Unsupervised Learning using Nonequilibrium Thermodynamics Forward
 diffusion Noise
 distribution Decay towards origin Add small noise
  • 12. Reversal Gaussian Diffusion Process 12 Data
 distribution Deep Unsupervised Learning using Nonequilibrium Thermodynamics Reverse
 diffusion Noise
 distribution Learned drift and covariance functions
  • 13. Case Study: Swiss Roll 13Deep Unsupervised Learning using Nonequilibrium Thermodynamics True model Inference model
  • 14. Training the reverse diffusion 14Deep Unsupervised Learning using Nonequilibrium Thermodynamics Model probability Annealed importance sampling
  • 15. Training the reverse diffusion 15Deep Unsupervised Learning using Nonequilibrium Thermodynamics Log likelihood Jensen’s inequality
  • 16. Training the reverse diffusion 16Deep Unsupervised Learning using Nonequilibrium Thermodynamics …do some algebra…
  • 17. Training the reverse diffusion 17Deep Unsupervised Learning using Nonequilibrium Thermodynamics …for Gaussian diffusion process… Training unsupervised learning becomes regression problem
  • 18. Training the reverse diffusion 18Deep Unsupervised Learning using Nonequilibrium Thermodynamics Setting the diffusion rate • For Binomial diffusion (erase constant fraction of stimulus variance each step) • For Gaussian diffusion t 1 t = (T t + 1) 1 = small constant (prevent over-fitting) Training t
  • 19. Multiplying Distributions 19Deep Unsupervised Learning using Nonequilibrium Thermodynamics • Required to compute posterior distribution - Missing data (inpainting) - Corrupted data (denoising) • Difficult and expensive using competing techniques - Ex. VAE, GSNs, NADEs, most graphical models Interested in Acts as small perturbation to diffusion process
  • 20. Multiplying Distributions 20Deep Unsupervised Learning using Nonequilibrium Thermodynamics • Modified marginal distributions Interested in Acts as small perturbation to diffusion process
  • 21. Multiplying Distributions 21Deep Unsupervised Learning using Nonequilibrium Thermodynamics • Modified diffusion steps Equilibrium 
 condition Normalized
  • 22. Multiplying Distributions 22Deep Unsupervised Learning using Nonequilibrium Thermodynamics Reversal gaussian Diffusion Process Interested in Acts as small perturbation to diffusion process Small perturbation affects only mean
  • 23. Deep Network as Approximator for Images 23Deep Unsupervised Learning using Nonequilibrium Thermodynamics
  • 24. Multi-scale convolution 24Deep Unsupervised Learning using Nonequilibrium Thermodynamics Downsample Convolve Upsample Sum
  • 25. Applied to CIFAR-10 25Deep Unsupervised Learning using Nonequilibrium Thermodynamics Training data Samples from Generative Adversarial [Goodfellow et al, 2014] Samples from diffusion model
  • 26. Applied to CIFAR-10 26Deep Unsupervised Learning using Nonequilibrium Thermodynamics Samples from DRAW
 [Gregor et al, 2015] Samples from Generative Adversarial [Goodfellow et al, 2014] Samples from diffusion model
  • 27. Applied to Dead Leaves 27Deep Unsupervised Learning using Nonequilibrium Thermodynamics Training data Samples from [Theis et al, 2012]
 Log likelihood 1.24 bits/pixel Samples from diffusion model
 Log likelihood 1.49 bits/pixel
  • 28. Applied to Inpainting 28Deep Unsupervised Learning using Nonequilibrium Thermodynamics
  • 29. Table App.1 29Deep Unsupervised Learning using Nonequilibrium Thermodynamics
  • 30. References 30Deep Unsupervised Learning using Nonequilibrium Thermodynamics h"p://jmlr.org/proceedings/papers/v37/sohl-dickstein15.html h"p://videolectures.net/ icml2015_sohl_dickstein_deep_unsupervised_learning/ h"p://www.inference.vc/icml-paper-unsupervised-learning-by- inverEng-diffusion-processes/