Introduction to Neural Networks + Art

Introduction to Neural
Networks + Art
Nandita Naik

Day 2: Session 2
Agenda
1. Can ML Create Images?
2. What is Style Transfer and
how does it work?
3. What is DeepDream and how
does it work?
4. Can ML write poetry or
compose music?
a. How?
Machine Learning for Art

Suppose we have a neural network which
does facial recognition.

Image Recognizer in Action
Each mini-image corresponds to
an edge.
lines
https://www.slideshare.net/roelofp/python-for-image-understanding-deep-learning-with-convolutional-neural-nets
parts of the
face
entire face

Using a method like image
recognition, we can generate
images.

From white noise, a model similar to a face recognition
model generates faces.

Image Generator in Action
https://www.slideshare.net/roelofp/python-for-image-understanding-deep-learning-with-convolutional-neural-nets

Text-image synthesis
Another Example of Image Generation:
Generating Images from a Description

Generate an Image from the Description Makes it as close to any “real” image

Questions?
● GANs
● How an image recognition network can be
repurposed into an image generating network
● Text-image synthesis
● Embeddings

What is style transfer?
Content image
Style image
Merged image

Style Transfer Between Images
Original Image Reference Image Style-Transfered
Original Image

How Does Style Transfer Work?
Input: Two images S and C: Image S provides the style
and image C provides the content.
A neural network extracts the style of S and the content
of C. (How? We’ll go into these terms later.)
Then it merges the two to create an image with the style
of S and the content of C.

What is content?
What can you see in
the picture?

What is content?
What can you see in
the picture?
- Wolf
- Mountain
- clouds

What is style?
Think of something
common across all
hidden layers,
such as colors, texture,
brush strokes

How do content & style extraction
work?

Image Recognition Neural Network
“rabbit”1 n
content content
Correlation between the weights at
different layers is an indicator of what
features the network thinks is most
important.
style

Then we merge the content
and style.
How?

How does merging work?
1. Start with white noise (call it “our_image”)

1. Start with white noise (call it “our_image”)
2. Run our_image and contentimage through
content extractor
our_image
content
extractor
Content of our_image
(ex. white noise)
Content of
content_image
(ex. bunny)
content
image

2. Run our_image and contentimage through content
extractor
3. Loss = difference between content of contentimage
and content of our_image
Content of
contentimage
Content of our_image
content loss

1. Do the exact same for style.
2. So we have two loss functions, content loss and
style loss.
3. Use gradient descent to minimize these
4. The image that minimizes the content loss and the
style loss is the style transferred image

Deep Dream : Convert Images
Into (Trippy) Art

Deep Dream: Creating Dogs When There are None

Can anyone guess how this is
created?

Finding visual patterns and emphasizing
them.
What is the computer doing?

How does it work?
Think about running an image recognition network backwards.
What individual neurons output are patterns, and a confidence level
So then the original image is modified to boost the confidence level for the
output neurons
Normally, we would fix the input and change the weights. In this
case, we’re fixing the weights and changing the input.

input
“rabbit”1 n
Patterns!
(with a confidence level)

Creating Music
These use something called a recurrent neural network, which is a
neural network that can remember what happened previously.
Train it on music previously generated.
Recommended: Project Magenta, https://deepjazz.io/. Pretty famous
on Soundcloud.

Creating Literature
A recurrent neural network that talks like Shakespeare!
Input a bunch of words, and ask it to generate the
words that come right after.

PANDARUS:
Alas, I think he shall be come approached and the day
When little srain would be attain'd into being never fed,
And who is but a chain and subjects of his death,
I should not sleep.
Second Senator:
They are away this miseries, produced upon my soul,
Breaking and strongly should be buried, when I perish
The earth and thoughts of many states.
DUKE VINCENTIO:
Well, your wit is in the care of side and that.
Second Lord:
They would be ruled after this chamber, and
my fair nues begun out of the fact, to be conveyed,
Whose noble souls I'll have the heart of the wars.
Clown:
Come, sir, I will make did behold your worship.
VIOLA:
I'll drink it.

Recap
Generative adversarial networks: two networks against each other, one
which generates and one which discriminates
Style transfer: extract content, style, calculate content loss, style loss,
optimize
DeepDream: network goes, “I found a pattern! Let me change the original
image so I am more confident that my pattern exists.”
AI+Music and AI+Literature: use an RNN which can remember what
happened previously

Introduction to Neural Networks + Art

More Related Content

Similar to Introduction to Neural Networks + Art (20)

Recently uploaded (20)

Introduction to Neural Networks + Art

Editor's Notes