SlideShare a Scribd company logo
Let's Paint a Picasso:
A look at Generative Adversarial Networks (GAN)
and its applications.
Sreya Ghosh (Ph.D.)
Data Scientist II, Merchandising
Wayfair
Portrait of Edmond Belamy
Artist: Obvious
Year: 2018
Auction House: Christie’s
Price: $432,500
Generative Adversarial Networks (GANS)
Generative: Generate new data based on some learned features.
Adversarial: A game-theory derived cost function
Network: Deep Neural Networks
Yann LeCun : “Adversarial training is the coolest thing since sliced bread.”
Generative Adversarial Nets.
Ian J. Goodfellow , Jean Pouget-Abadie , Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair , Aaron
Courville, Yoshua Bengio
University of Montreal, NeuralIPS, 2014
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and its Applications - Sreya Ghosh
Deep Neural Networks
AutoEncoders
Encoder: This is the part of the network that compresses the input into a latent-
space representation. It can be represented by an encoding function h=f(x).
Decoder: This part aims to reconstruct the input from the latent space
representation. It can be represented by a decoding function r=g(h).
Generative Model
● Noise from a simple distribution like a Gaussian/Uniform Distribution is used to
represent Latent Features.
● The semantic meaning of the latent features is learnt via a neural network.
Discriminative Model
● Generator alone will just create
random noise. Conceptually, the
discriminator in GAN provides
guidance to the generator on what
to create.
● Discriminator predicts the label of
images.
○ Fake images generated from the
Generator are 0
○ Images from the distribution is 1
● The idea is to bring the accuracy
of the Discriminator to 50%
GAN Architecture
The Cost Function
Discriminator: Cross-Entropy
Generator:
GAN:
Performance
Types of GANs
Deep Convolutional GANs (DCGANs): The first improvement over GAN. This
network studied and found optimal hyper-parameters.
Conditional GANs (cGANs): These are GANs that use extra label information.
This results in better quality images and being able to control to an extent how
generated images will look.
Wasserstein GANs : Change the loss function to include the Wasserstein
distance. As a result, WassGANs have loss functions that correlate with image
quality. Also, training stability improves and is not as dependent on the
architecture.
Super Resolution GAN (SRGAN)
● Super-resolution GAN applies a deep
network in combination with an
adversary network to produce higher
resolution images.
● During the training, a high-resolution
image (HR) is downsampled to a low-
resolution image (LR).
● A GAN generator upsamples LR
images to super-resolution images
(SR).
● A discriminator to distinguish the HR
images and backpropagate the GAN
loss to train the discriminator and the
generator.
StackGAN: Generative Adversarial Text to Image
Synthesis
● Generate photo-realistic images
conditioned on text descriptions.
● The Stage-I GAN sketches the
primitive shape and colors of the
object based on the given text
description.
● The Stage-II GAN takes Stage-I
results and text descriptions as
inputs, and generates high-
resolution images with photo-
realistic details.
● Conditioning Augmentation
technique encourages
smoothness in the latent
conditioning manifold.
Generative Visual Manipulation on the Natural Image
Manifold
● We then define a class of image
editing operations, and constrain
their output to lie on that learned
manifold at all times.
● The model automatically adjusts
the output keeping all edits as
realistic as possible.
● The presented method can further
be used for changing one image
to look like the other, as well as
generating novel imagery from
scratch based on user's scribbles.
Attribute2Image: Conditional Image Generation from
Visual Attributes
● The image as a composite of
foreground and background
● A layered generative model
with disentangled latent
variables that can be learned
end-to-end using a
variational auto-encoder.
● Generating realistic and
diverse samples with
disentangled latent
representations.
Pose Guided Person Image Generation
Trying to paint Modern Art
● Scraped 1500 images from Google
images using search terms paintings
of faces.
● Augmented the data-set using flipped
images.
● All images were resized to
112x112x3.
● DC-GAN was used to generate the
images
● 20,000 epochs.
Challenges
● The networks try to take successive steps
to minimize a non-convex objective and
end up in an oscillating process rather
than decreasing the underlying true
objective.
● The generator can accidentally start to
produce several copies of exactly the
same image.
● GANs can sometimes be far-sighted and
fail to differentiate the number of particular
objects that should occur at a location.
● GANs sometime are not capable of
differentiating between front and back
view.
Future Work with GAN
● Find effective solutions to challenges, different cost functions that ensure
Nash equilibrium.
● Drug discovery and potential medical applications.
● Application in NLP, especially language modelling.
● Speech generation.
● Cataloging and Image based search.
Citations:
1.Generative Adversarial Nets. Ian J. Goodfellow , Jean Pouget-Abadie , Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil
Ozair , Aaron Courville, Yoshua Bengio, University of Montreal, NeuralIPS, 2014.
2. GAN — What is Generative Adversary Networks GAN? Jonathan Hui. 06-2018
3. GAN — Some cool applications of GANs. Jonathan Hui. 06-2018
4.Generative Adversarial Networks (GANs) — A Beginner’s Guide. Owen Carey. 2018
5. DCGAN: Generate the images with Deep Convolutional GAN. Keisuke Umezawa. 2018
6. https://github.com/eriklindernoren/Keras-GAN

More Related Content

PPTX
Image colorization
PDF
Visual hull construction from semitransparent coloured silhouettes
PDF
Visual Hull Construction from Semitransparent Coloured Silhouettes
PDF
Visual Hull Construction from Semitransparent Coloured Silhouettes
PPTX
Artist Assistant AI(AAA)
PDF
Performance Anaysis for Imaging System
PDF
Comparison of thresholding methods
PPTX
On constructing z dimensional Image By DIBR Synthesized Images
Image colorization
Visual hull construction from semitransparent coloured silhouettes
Visual Hull Construction from Semitransparent Coloured Silhouettes
Visual Hull Construction from Semitransparent Coloured Silhouettes
Artist Assistant AI(AAA)
Performance Anaysis for Imaging System
Comparison of thresholding methods
On constructing z dimensional Image By DIBR Synthesized Images

Similar to Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and its Applications - Sreya Ghosh (20)

PPTX
brief Introduction to Different Kinds of GANs
PDF
Google Dev Group Yangon (2020) AI Talk (Creative AI in Action)
PDF
PDF
Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...
PPTX
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
PDF
PDF
An Introduction To Generative Adversarial Networks
PDF
Top Blockchain Development Services | Build Your Blockchain Today
PPTX
Generative Adversarial Network (GAN) for Image Synthesis
PDF
11_gan.pdf
PDF
EuroSciPy 2019 - GANs: Theory and Applications
PPTX
Generative advesarial networks technical seminar
PPTX
Generative adversarial networks
PPTX
Generative Adversarial Networks (GANs)
PDF
GAN - Theory and Applications
PPTX
Generative adversarial networks in computer vision
PPTX
A friendly introduction to GANs
PDF
An Extensive Review on Generative Adversarial Networks GAN’s
PDF
Introduction to Generative Adversarial Networks (GANs)
PDF
Generative Adversarial Networks and Their Applications
brief Introduction to Different Kinds of GANs
Google Dev Group Yangon (2020) AI Talk (Creative AI in Action)
Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...
A (Very) Gentle Introduction to Generative Adversarial Networks (a.k.a GANs)
An Introduction To Generative Adversarial Networks
Top Blockchain Development Services | Build Your Blockchain Today
Generative Adversarial Network (GAN) for Image Synthesis
11_gan.pdf
EuroSciPy 2019 - GANs: Theory and Applications
Generative advesarial networks technical seminar
Generative adversarial networks
Generative Adversarial Networks (GANs)
GAN - Theory and Applications
Generative adversarial networks in computer vision
A friendly introduction to GANs
An Extensive Review on Generative Adversarial Networks GAN’s
Introduction to Generative Adversarial Networks (GANs)
Generative Adversarial Networks and Their Applications
Ad

More from Catalina Arango (9)

PPTX
Intelligent Enterprise: How to Create the Customer Experience of the Future? ...
PDF
It's All About the Data - Tia Dubuisson
PPTX
Transforming an Organization using Data Science - Successes & Challenges - Da...
PPTX
Natural Language Processing - Principles and Practice - Gracie Diaz
PPTX
Digital Data Transformation
PPTX
The right JOIN: Merchants + Data
PDF
Biomedical Signal Extraction for Computer-assisted Clinical Decision Making -...
PPTX
The Power of Topology - Colleen Farrelly - WiDS Miami 2018
PPTX
How AI is Transforming Industry - Dr. Monica DeZulueta - WiDS Miami 2018
Intelligent Enterprise: How to Create the Customer Experience of the Future? ...
It's All About the Data - Tia Dubuisson
Transforming an Organization using Data Science - Successes & Challenges - Da...
Natural Language Processing - Principles and Practice - Gracie Diaz
Digital Data Transformation
The right JOIN: Merchants + Data
Biomedical Signal Extraction for Computer-assisted Clinical Decision Making -...
The Power of Topology - Colleen Farrelly - WiDS Miami 2018
How AI is Transforming Industry - Dr. Monica DeZulueta - WiDS Miami 2018
Ad

Recently uploaded (20)

PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Introduction to the R Programming Language
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Business Analytics and business intelligence.pdf
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
[EN] Industrial Machine Downtime Prediction
Introduction to the R Programming Language
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Reliability_Chapter_ presentation 1221.5784
oil_refinery_comprehensive_20250804084928 (1).pptx
Supervised vs unsupervised machine learning algorithms
1_Introduction to advance data techniques.pptx
Qualitative Qantitative and Mixed Methods.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
STERILIZATION AND DISINFECTION-1.ppthhhbx
Business Analytics and business intelligence.pdf
Galatica Smart Energy Infrastructure Startup Pitch Deck
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Database Infoormation System (DBIS).pptx
Business Ppt On Nestle.pptx huunnnhhgfvu

Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and its Applications - Sreya Ghosh

  • 1. Let's Paint a Picasso: A look at Generative Adversarial Networks (GAN) and its applications. Sreya Ghosh (Ph.D.) Data Scientist II, Merchandising Wayfair
  • 2. Portrait of Edmond Belamy Artist: Obvious Year: 2018 Auction House: Christie’s Price: $432,500
  • 3. Generative Adversarial Networks (GANS) Generative: Generate new data based on some learned features. Adversarial: A game-theory derived cost function Network: Deep Neural Networks Yann LeCun : “Adversarial training is the coolest thing since sliced bread.” Generative Adversarial Nets. Ian J. Goodfellow , Jean Pouget-Abadie , Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair , Aaron Courville, Yoshua Bengio University of Montreal, NeuralIPS, 2014
  • 6. AutoEncoders Encoder: This is the part of the network that compresses the input into a latent- space representation. It can be represented by an encoding function h=f(x). Decoder: This part aims to reconstruct the input from the latent space representation. It can be represented by a decoding function r=g(h).
  • 7. Generative Model ● Noise from a simple distribution like a Gaussian/Uniform Distribution is used to represent Latent Features. ● The semantic meaning of the latent features is learnt via a neural network.
  • 8. Discriminative Model ● Generator alone will just create random noise. Conceptually, the discriminator in GAN provides guidance to the generator on what to create. ● Discriminator predicts the label of images. ○ Fake images generated from the Generator are 0 ○ Images from the distribution is 1 ● The idea is to bring the accuracy of the Discriminator to 50%
  • 10. The Cost Function Discriminator: Cross-Entropy Generator: GAN:
  • 12. Types of GANs Deep Convolutional GANs (DCGANs): The first improvement over GAN. This network studied and found optimal hyper-parameters. Conditional GANs (cGANs): These are GANs that use extra label information. This results in better quality images and being able to control to an extent how generated images will look. Wasserstein GANs : Change the loss function to include the Wasserstein distance. As a result, WassGANs have loss functions that correlate with image quality. Also, training stability improves and is not as dependent on the architecture.
  • 13. Super Resolution GAN (SRGAN) ● Super-resolution GAN applies a deep network in combination with an adversary network to produce higher resolution images. ● During the training, a high-resolution image (HR) is downsampled to a low- resolution image (LR). ● A GAN generator upsamples LR images to super-resolution images (SR). ● A discriminator to distinguish the HR images and backpropagate the GAN loss to train the discriminator and the generator.
  • 14. StackGAN: Generative Adversarial Text to Image Synthesis ● Generate photo-realistic images conditioned on text descriptions. ● The Stage-I GAN sketches the primitive shape and colors of the object based on the given text description. ● The Stage-II GAN takes Stage-I results and text descriptions as inputs, and generates high- resolution images with photo- realistic details. ● Conditioning Augmentation technique encourages smoothness in the latent conditioning manifold.
  • 15. Generative Visual Manipulation on the Natural Image Manifold ● We then define a class of image editing operations, and constrain their output to lie on that learned manifold at all times. ● The model automatically adjusts the output keeping all edits as realistic as possible. ● The presented method can further be used for changing one image to look like the other, as well as generating novel imagery from scratch based on user's scribbles.
  • 16. Attribute2Image: Conditional Image Generation from Visual Attributes ● The image as a composite of foreground and background ● A layered generative model with disentangled latent variables that can be learned end-to-end using a variational auto-encoder. ● Generating realistic and diverse samples with disentangled latent representations.
  • 17. Pose Guided Person Image Generation
  • 18. Trying to paint Modern Art ● Scraped 1500 images from Google images using search terms paintings of faces. ● Augmented the data-set using flipped images. ● All images were resized to 112x112x3. ● DC-GAN was used to generate the images ● 20,000 epochs.
  • 19. Challenges ● The networks try to take successive steps to minimize a non-convex objective and end up in an oscillating process rather than decreasing the underlying true objective. ● The generator can accidentally start to produce several copies of exactly the same image. ● GANs can sometimes be far-sighted and fail to differentiate the number of particular objects that should occur at a location. ● GANs sometime are not capable of differentiating between front and back view.
  • 20. Future Work with GAN ● Find effective solutions to challenges, different cost functions that ensure Nash equilibrium. ● Drug discovery and potential medical applications. ● Application in NLP, especially language modelling. ● Speech generation. ● Cataloging and Image based search.
  • 21. Citations: 1.Generative Adversarial Nets. Ian J. Goodfellow , Jean Pouget-Abadie , Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair , Aaron Courville, Yoshua Bengio, University of Montreal, NeuralIPS, 2014. 2. GAN — What is Generative Adversary Networks GAN? Jonathan Hui. 06-2018 3. GAN — Some cool applications of GANs. Jonathan Hui. 06-2018 4.Generative Adversarial Networks (GANs) — A Beginner’s Guide. Owen Carey. 2018 5. DCGAN: Generate the images with Deep Convolutional GAN. Keisuke Umezawa. 2018 6. https://github.com/eriklindernoren/Keras-GAN