TensorFlow For IITians
Ashish Agarwal, Google
Ashish Bansal, Capital One
December 2017
Libraries Galore
A Quick Comparison
Source: https://svds.com/getting-started-deep-learning/
#1 Machine Learning repository on GitHub
TensorFlow: Democratize ML
❖ An open-source machine learning platform for everyone
❖ Fast, flexible, and production-ready
❖ Scales from research to production
Talk Overview
❖ TensorFlow Introduction
➢ Execution model
➢ Writing your first model
❖ Image models
❖ Sequence models
❖ TensorFlow Deployment
Dataflow Graph: Operations
Nodes are Operations
[Figure: dataflow graph with operation nodes Mul, Add, and Xent connecting inputs, weights, biases, and labels]
inputs * weights + biases
Dataflow Graphs: Tensors
Tensors flow along edges
[Figure: the same graph, with tensors flowing along the edges between Mul, Add, and Xent]
m = inputs * weights
a = m + biases
Dataflow Graph: Placeholders
Values "fed" at execution time
[Figure: the inputs node replaced by a placeholder node]
inputs = tf.placeholder(tf.float32)
Dataflow Graph: Variables
Variables represent persistent mutable state
[Figure: the weights node replaced by a Variable node]
weights = tf.Variable(0.3)
Interface to runtime: Sessions
sess = tf.Session(...)
sess.run("f:0", feed_dict={"b": ...})
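As a minimal sketch of this interface (the tiny graph here is illustrative, matching the "f"/"b" names on the slide):

import tensorflow as tf

# Build a tiny graph: f = b * 2, with b fed at run time.
b = tf.placeholder(tf.float32, name="b")
f = tf.multiply(b, 2.0, name="f")

sess = tf.Session()
# Fetch the tensor "f:0" by name, feeding a value for b.
print(sess.run("f:0", feed_dict={b: 3.0}))  # prints 6.0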
Example: Linear Regression using Gradient Descent

import tensorflow as tf

# Input (x)
x = tf.placeholder(tf.float32)

# Model parameters
W = tf.Variable([.3], dtype=tf.float32)
b = tf.Variable([-.3], dtype=tf.float32)

# Prediction
prediction = W * x + b

# Label (y) and loss
y = tf.placeholder(tf.float32)
loss = tf.reduce_sum(tf.square(prediction - y))

# Initialization, gradients and SGD nodes
initializer = tf.global_variables_initializer()
optimizer = tf.train.GradientDescentOptimizer(0.01)
training_step = optimizer.minimize(loss)

# Training data
x_train = ...
y_train = ...

# Runtime setup
sess = tf.Session()

# Initialization and training loop
sess.run(initializer)
for i in range(1000):
    sess.run(training_step,
             feed_dict={x: x_train, y: y_train})
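After the loop, the fitted parameters can be read back out of the session. A small sketch, assuming for illustration that x_train and y_train were [1, 2, 3, 4] and [0, -1, -2, -3]:

# Inspect the trained parameters and final loss.
curr_W, curr_b, curr_loss = sess.run(
    [W, b, loss], feed_dict={x: x_train, y: y_train})
print("W: %s b: %s loss: %s" % (curr_W, curr_b, curr_loss))
# With the example data above, W trends toward -1 and b toward 1.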
Example: Linear Logistic Regression

Starting from the linear regression code above, only the prediction and loss change (y now holds 0/1 class labels):

logits = W * x + b
prediction = tf.sigmoid(logits)
loss = tf.reduce_sum(
    tf.nn.sigmoid_cross_entropy_with_logits(
        labels=y,
        logits=logits))
Example: Linear Regression using tf.layers

import tensorflow as tf

# Input (x)
x = tf.placeholder(tf.float32, shape=[4, 1])

# Prediction: a dense layer replaces the hand-written W * x + b
prediction = tf.layers.dense(x, units=1,
                             use_bias=True, activation=None)

# Label (y) and loss
y = tf.placeholder(tf.float32, shape=[4, 1])
loss = tf.reduce_sum(tf.square(prediction - y))
Example: Non-linear Regression

Stack dense layers with a non-linearity to go beyond a linear model (see the sketch below):

l1 = tf.layers.dense(x, units=100,
                     use_bias=True, activation=tf.nn.relu)
l2 = tf.layers.dense(l1, ...)
...
prediction = tf.layers.dense(ln, ...)
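Putting the layered version together, here is a minimal runnable sketch; the synthetic data, layer sizes, and hyperparameters are our illustrative assumptions, not from the slides:

import numpy as np
import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, 1])
y = tf.placeholder(tf.float32, shape=[None, 1])

# Two hidden ReLU layers make the model non-linear.
l1 = tf.layers.dense(x, units=100, activation=tf.nn.relu)
l2 = tf.layers.dense(l1, units=100, activation=tf.nn.relu)
prediction = tf.layers.dense(l2, units=1)

loss = tf.reduce_mean(tf.square(prediction - y))
train_step = tf.train.GradientDescentOptimizer(0.01).minimize(loss)

# Fit y = x^2 on synthetic data.
x_train = np.random.uniform(-1, 1, size=(256, 1))
y_train = x_train ** 2

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(1000):
        sess.run(train_step, feed_dict={x: x_train, y: y_train})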
TensorFlow for IITians
Convolutions
output = tf.layers.conv2d(
input,
filters=2,
kernel_size=(4, 4),
strides=(1, 1))
Credit: @martin_gorner’s slides
Downsampling: Avg/Max Pooling
output = tf.layers.max_pooling2d(
input,
pool_size=(4, 4),
strides=2)
Credit: @martin_gorner’s slides
Regularization: Dropouts
output = tf.layers.dropout(
    input, rate=0.25)  # note: tf.layers.dropout takes the drop rate, not keep_prob
Credit: @martin_gorner’s slides
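One caveat worth spelling out: in tf.layers.dropout the rate argument is the fraction of units dropped (tf.nn.dropout takes keep_prob instead), and dropout should be disabled at inference time via the training flag. A small sketch:

import tensorflow as tf

input = tf.placeholder(tf.float32, shape=[None, 1024])
is_training = tf.placeholder(tf.bool)
# rate=0.25 drops 25% of activations; dropout is a no-op when training=False.
output = tf.layers.dropout(input, rate=0.25, training=is_training)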
Putting it together: AlexNet
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky, Ilya Sutskever and Geoffrey E. Hinton
Putting it together: AlexNet
# Convolution and pooling layers.
conv1 = tf.layers.conv2d(input,
    filters=64, kernel_size=[11, 11], strides=4)
pool1 = tf.layers.max_pooling2d(conv1,
    pool_size=[3, 3], strides=2)
conv2 = tf.layers.conv2d(pool1,
    filters=192, kernel_size=[5, 5])
pool2 = tf.layers.max_pooling2d(conv2,
    pool_size=[3, 3], strides=2)
conv3 = tf.layers.conv2d(pool2,
    filters=384, kernel_size=[3, 3])
conv4 = tf.layers.conv2d(conv3,
    filters=384, kernel_size=[3, 3])
conv5 = tf.layers.conv2d(conv4,
    filters=256, kernel_size=[3, 3])
pool5 = tf.layers.max_pooling2d(conv5,
    pool_size=[3, 3], strides=2)
reshaped_pool5 = tf.reshape(pool5, [-1, 5 * 5 * 256])

# Fully connected layers with dropout.
fc6 = tf.layers.dense(reshaped_pool5, units=4096)
drp6 = tf.layers.dropout(fc6, rate=0.5)
fc7 = tf.layers.dense(drp6, units=4096)
drp7 = tf.layers.dropout(fc7, rate=0.5)
fc8 = tf.layers.dense(drp7, units=1000,
    activation=None)

# Calculating the loss (mean over the batch).
loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits(
        labels=labels, logits=fc8))
Batch Normalization: Reduce Internal Covariate Shift
output = tf.layers.batch_normalization(input)
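A sketch of the usual TF 1.x wiring around this call (the toy model here is our assumption): batch normalization maintains moving averages that are updated through the UPDATE_OPS collection, so the train op should depend on them.

import tensorflow as tf

input = tf.placeholder(tf.float32, shape=[None, 10])
labels = tf.placeholder(tf.float32, shape=[None, 1])
is_training = tf.placeholder(tf.bool)

# Moving averages are updated only while training=True.
normalized = tf.layers.batch_normalization(input, training=is_training)
prediction = tf.layers.dense(normalized, units=1)
loss = tf.reduce_mean(tf.square(prediction - labels))

# Make the train step run the moving-average updates as well.
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
    train_step = tf.train.GradientDescentOptimizer(0.01).minimize(loss)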
Multi-scale and 1x1 convolutions
"Going Deeper with Convolutions"
Christian Szegedy, Wei Liu, Yangqing Jia, et al.
The Inception Architecture (GoogLeNet, 2015)
model = tf.keras.applications.InceptionV3(...)
model.train_on_batch(...)
model.predict(...)
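Note that train_on_batch requires the model to be compiled first. A hedged sketch (the dummy data, shapes, and optimizer choice are ours, just to show the call signatures):

import numpy as np
import tensorflow as tf

model = tf.keras.applications.InceptionV3(weights=None, classes=1000)
model.compile(optimizer='adam', loss='categorical_crossentropy')

# Dummy batch: 8 images of 299x299x3, one-hot labels over 1000 classes.
images = np.random.rand(8, 299, 299, 3).astype('float32')
labels = tf.keras.utils.to_categorical(np.random.randint(1000, size=8), 1000)
model.train_on_batch(images, labels)
probs = model.predict(images)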
Skip Connections: Residual Blocks
Deep Residual Learning for Image Recognition,
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
ResNet Architecture
model = tf.keras.applications.ResNet50(...)
model.train_on_batch(...)
model.predict(...)
Data Augmentation: Adversarial Examples
[Figure: retina scans, healthy vs. diseased (hemorrhages); severity grades: No DR, Mild DR, Moderate DR, Severe DR, Proliferative DR]
Sequence Prediction with RNNs
"IITs are the best" → "आईआईटी सबसे अच्छे हैं" (English to Hindi)
Looping constructs: RNN
Credits: Olah’s “Understanding LSTMs”
LSTM: Long Short-Term Memory
Credits: Olah’s “Understanding LSTMs”
Simple RNN
def rnn(cell, input_list, initial_state):
state = initial_state
outputs = []
for inp in input_list:
output, state = cell(inp, state)
outputs.append(output)
return outputs, state
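The cell here is any callable mapping (input, state) to (output, state). A sketch using a TF 1.x LSTM cell with the rnn() above; the batch size, input size, and number of steps are illustrative:

batch_size, input_size, num_steps = 32, 64, 10
input_list = [tf.placeholder(tf.float32, [batch_size, input_size])
              for _ in range(num_steps)]
cell = tf.nn.rnn_cell.BasicLSTMCell(num_units=128)
initial_state = cell.zero_state(batch_size, tf.float32)
outputs, final_state = rnn(cell, input_list, initial_state)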
Dynamic RNN

# 8-layer LSTM with residual connections;
# each layer is placed on a separate GPU.
# (The cell wrappers live in tf.contrib.rnn in TF 1.x.)
cell = MultiRNNCell(
    [DeviceWrapper(ResidualWrapper(LSTMCell(num_units=512)),
                   device='/gpu:%d' % i)
     for i in range(8)])
outputs, states = dynamic_rnn(cell, inputs, sequence_length)
Language Models (Unsupervised)
Inputs: IITs  are  the  best
Labels: are   the  best  <EOS>
Credit: @martin_gorner’s slides
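In code, the labels for a language model are simply the input tokens shifted by one position. A tiny sketch:

tokens = ["IITs", "are", "the", "best", "<EOS>"]
inputs = tokens[:-1]   # IITs are the best
labels = tokens[1:]    # are the best <EOS>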
Deep Literature
Supervised Translation Models
Input sentence:  IITs are the best <EOS>
Target sentence: आईआईटी सबसे अच्छे हैं

Sequence to Sequence Learning with Neural Networks
Sutskever, Vinyals, Le

[Figure: encoder reads the input sentence; decoder emits the target sentence token by token]
Neural Machine Translation Model

[Figure: GNMT architecture: stacked encoder LSTMs and decoder LSTMs, 8 layers each, spread across Gpu1..Gpu8, with residual (+) connections between layers, an attention module linking decoder to encoder, and a softmax producing output tokens Y1, Y2, ..., </s> from inputs X1, X2, ..., </s>]

The same diagram is annotated step by step across the slides:
- Go Deep! 8 layers of LSTMs
- Residual Connections between layers
- Bidirectional LSTM in the encoder
- Attention connecting the decoder to encoder states
Project Magenta
[Figure: recurrent cell A mapping input Xt to output ht]
TensorFlow for IITians
Deploying TensorFlow Applications (a.k.a. Inference)
Deployment Options and Choices at a Glance
Python / C App
Custom API Serving
The basic pattern: run inference in the context of an app that serves an API.
This app can easily be built in C/C++ or Python.
It can be deployed on-prem or in the cloud.
An alternate path is to use tensorflow-serving
Pipeline: Train a model → Save the model & export to protobuf → Load lookups/embeddings → Load the model → Create a tf.Session and call run()
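As a sketch of the save-then-load steps (the tensor names, path, and the earlier sess/x/prediction are carried over from the linear regression example as assumptions; tf.saved_model.simple_save appeared in later TF 1.x releases):

# Export the trained graph to a SavedModel protobuf.
tf.saved_model.simple_save(sess, "/tmp/model",
                           inputs={"x": x},
                           outputs={"prediction": prediction})

# In the serving app: load the model and run inference.
with tf.Session(graph=tf.Graph()) as serving_sess:
    tf.saved_model.loader.load(serving_sess, ["serve"], "/tmp/model")
    result = serving_sess.run("prediction:0", feed_dict={"x:0": [[1.0]]})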
Cloud-Based Serving
The first two steps (training and saving the model) are the same.
Option 1: Run the web/API server on a cloud compute instance. Works across Azure/AWS/Google Cloud.
Option 2: [Google Cloud] Deploy the saved model to Cloud ML Engine.
Option 3: [AWS] Through SageMaker (Python 2.7 only) or custom EC2/Lambda.
Option 4: [Azure] Nothing TensorFlow-specific visible; looks like plain ol' compute.
Mobile Deployment
Preparing a model for mobile requires additional steps:
- Consider reducing model size by removing nodes not needed for inference
- Quantization: https://www.tensorflow.org/performance/quantization
- Usually embedded in an app
- TensorFlow Lite and TensorFlow Mobile
Q: Do you really need it in an app?
A similar process applies to Raspberry Pi.
Thank You!
Backup/Extra slides
Putting it together: Convolutional Network
[Architecture: 28 x 28 input → conv2d → max_pooling2d → conv2d → max_pooling2d → dense → dropout → dense → logits; loss: softmax_cross_entropy_with_logits(labels, logits)]
Putting it together: Deep MNIST
# First convolutional layer.
x = tf.layers.conv2d(input, filters=32,
    kernel_size=5, activation=tf.nn.relu)
# Pooling layer: down-samples by 2X.
x = tf.layers.max_pooling2d(x, pool_size=2,
    strides=2)
# Second convolutional layer.
x = tf.layers.conv2d(x, filters=64,
    kernel_size=5, activation=tf.nn.relu)
# Second pooling layer: down-samples by 2X.
x = tf.layers.max_pooling2d(x, pool_size=2,
    strides=2)
# Flatten the last three dimensions to allow
# applying FC layers.
x = tf.layers.flatten(x)
# Fully connected layer with 1024 units.
x = tf.layers.dense(x, units=1024,
    activation=tf.nn.relu)
# Dropout (rate is the fraction of units dropped).
drop_rate = tf.placeholder(tf.float32)
x = tf.layers.dropout(x, rate=drop_rate)
# Fully connected layer with 10 units.
logits = tf.layers.dense(x, units=10)
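To train this network, hook the logits into the loss from the architecture slide. A brief sketch; the one-hot labels placeholder and the Adam learning rate are our assumptions:

labels = tf.placeholder(tf.float32, shape=[None, 10])
loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits(
        labels=labels, logits=logits))
train_step = tf.train.AdamOptimizer(1e-4).minimize(loss)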
Long Short-Term Memory (LSTMs):
Make Your Memory Cells Differentiable
[Hochreiter & Schmidhuber, 1997]
[Figure: memory cell M with input X and output Y, guarded by WRITE?, READ?, and FORGET? gates (W, R, F), each implemented as a sigmoid]
TensorFlow software stack:
- Canned Estimators: models in a box
- Estimator / Keras Model: train and evaluate models
- Layers: build models
- Frontends: Python, C++, ...
- TensorFlow Distributed Execution Engine (CPU, GPU, Android, iOS, ...)
Editor's Notes

#3: [Ashish Bansal will intro] Theano is dead. TF is the de-facto standard.
#4: Intro by Ashish Bansal.
#12: The runtime can be remote and distributed.
#15: Python frontend for graph generation.
#29: 16.4%, compared to the second-best result of 26.17% (mAP@5), ILSVRC 2012. Now p@1 ~85%, p@5 ~2%?
#32: Multi-scale convolutions; dimension reduction using 1x1 convolutions.
#37: Diabetic retinopathy detection using retina scans.
#41: Python: reduce; Haskell: foldl.
#51: In Google's Project Magenta, an open-source project, we use ML to look at text and images and analyze what's in them, which is extremely useful for our products and users. Here we are just trying to guess what the next note should be, by looking at 10K songs. You can train your networks on music from Radiohead to Bach. All we're trying to do is predict the next note via a recurrent neural network, all written in TensorFlow.
#53: This is Ashish Bansal's section.
#54: What does deployment mean? In this context, we are talking about inference, not training.
#62: The key idea behind LSTMs is to take that model and make it differentiable by replacing the gates with sigmoids. An LSTM cell has an input, which you multiply by a value gated by a sigmoid, which either squashes it or lets it flow into the memory. It has an output, which equivalently will be multiplied by a value between 0 and 1 depending on the sigmoid. And a 'forget gate' controls whether the data remains in that memory cell. All of these gates are controlled by pieces of the neural network that connect to them, so you can backpropagate through them just like anything else and learn the rules governing the opening and closing of those gates. This is a very powerful paradigm in general: turn logic into something differentiable you can backprop through. We'll re-use this later.