SlideShare a Scribd company logo
Introduction to Deep Learning
with Microsoft Cognitive Toolkit
Deep Learning with CNTK
Deep learning at Microsoft
• Microsoft Cognitive Services
• Skype Translator
• Cortana
• Bing
• Bing Ads
• Augmented Reality
• Microsoft Research
deep learning at Microsoft
• Microsoft Cognitive Services
• Skype Translator
• Cortana
• Bing
• HoloLens
• Microsoft Research
Deep Learning with CNTK
Deep Learning with CNTK
Microsoft’s historic
speech breakthrough
• Microsoft 2016 research system for
conversational speech recognition
• 5.9% word-error rate
• enabled by CNTK’s multi-server scalability
[W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer, A. Stolcke,
D. Yu, G. Zweig: “Achieving Human Parity in Conversational
Speech Recognition,” https://arxiv.org/abs/1610.05256]
Machine Learning in a Nutshell
Machine
learning
algorithm
Model
Application
Data
Contains
patterns
Finds
patterns
Recognizes
patterns
Provides new data to
see if it matches
known patterns
Deploy
chosen
model
Chosen
Model
Apply
learning
algorithm
to data
Candidate
Model
The Machine Learning Process
Prepared
Data
Apply pre-
processing
to data
Iterate to find the
best model
Data
Preprocessing
Modules
Iterate until data
is ready
Preprocessing
Modules
Machine
Learning
Algorithms
Applications
The goal:
Smarter
applications
Raw
Data
Raw
Data
Choose
data
Terminology
Training data
The prepared data used to
create a model
Creating a model is called
training a model
Supervised learning
The value you want to
predict is in the training
data
The data is labeled
Unsupervised learning
The value you want to predict is
not in the training data
The data is unlabeled
The most common
approach
Styles of Machine Learning Algorithms Examples
Decision tree Neural network Bayesian K-means
P(A) P(B|A)
P(B)
P(A|B) =
Deep learning
uses this
What is deep learning
--- Yoshua Bengio
Machine learning is a way to try to make machines intelligent by allowing computers
to learn from examples about the world around us or about some specific aspect of
it.
Deep learning is an approach to machine learning, particular among all the machine
learning methods in that it is inspired by some of the things we know about the
brain. It’s trying to make computers learn multiple levels of abstraction and
representation, which is presumably what makes these systems so successful
Reinforcement learning is a type of machine learning where the learner doesn’t get
to know what a human would do in this context. The learner only gets to see if the
actions were good or bad after a long set of actions. A lot of the recent progress in
this area is in things like playing games, but reinforcement learning probably is going
to be very important for things like self-driving cars.
Deep learning fundamentals
• Model Generalization
• Network Architecture
• Activation function
• Regularization
• Model Training
• Loss functions
• Parameter gradient computation with backpropagation
• Gradient descent algorithms
DL is trying to make computers learn multiple levels of
abstraction and representation
Real
Neural
Network
Artificial
Neural
Network
Logistic regression
Features Weights
1.21 x 0.12
-2.2 x -3.4
-0.32 x 1.11
-1.29 x -0.94
1.4 x -1.2
Multiply features by weights
dot product
Sum them up,
add a bias term
Pass result through
the logistic function
0.94
A single-layer neural network
(also known as doing multiple logistic regressions at the same time)
Features
0.12
0.92
0.33
0.02
0.99
0.42
0.12
0.92
0.33
0.02
0.99
0.42
Another vector!
What might we
do with this?
Weight
matrix
Input
vector
Bias
vector(pretend the weights are different)
Multi-Layer Perceptrons
Network depth
Result of one
layer becomes
features to next
These diagrams are
getting hard to read
Let’s make each
individual input and
logistic regression into
a circle
How do we figure out the weights?
Start with random weights
Try out the model
Determine the error
Figure out how responsible
each weight is for the error
Punish each weight in
proportion to its crime
Truth:
Forward propagate to make prediction
Backward propagate errors
CNTK expresses (nearly) arbitrary
neural networks by composing simple
building blocks into complex
computational networks, supporting
relevant network types and applications
Microsoft Cognitive Toolkit
• Microsoft’s open-source deep-learning toolkit
• https://github.com/Microsoft/CNTK
• Created by Microsoft Speech researchers (Dong Yu et al.) in 2012,
“Computational Network Toolkit”
• On GitHub since Jan 2016 under MIT license
• Renamed from CNTK to “Cognitive Toolkit”
• Community contributions e.g. from MIT, Stanford and NVidia
Microsoft Cognitive Toolkit
CNTK - Other Benefits
• Python and C++ API
• Mostly implemented in C++
• Low level + high level Python API
• Extensibility
• User functions and learners in pure Python
• Readers
• Distributed, highly efficient built-in data readers
• Details: https://docs.microsoft.com/en-us/cognitive-toolkit/reasons-to-switch-from-tensorflow-
to-cntk
Microsoft Cognitive Toolkit
Anatomy of a CNTK training job
Script configure and executes through CNTK Python APIs…
trainer
• SGD
(momentum,
Adam, …)
• minibatching
reader
• minibatch source
• task-specific
deserializer
• automatic
randomization
• distributed
reading
corpus model
network
• model function
• criterion function
• CPU/GPU
execution engine
• packing, padding
Terms to remember: Neural Networks/Deep Networks
• Backpropagation
• Forward Pass
• Loss Function
• Backward pass
• Weight Adjustment
• Hidden layer – Neither an input nor and output layer
• Activation Function & Activation Value
• Activation Matrix (CNN)
• Stochastic Gradient Descent
• Convolutional Layers
• Auto Feature Detection
• Weights
• Bias
• Regularization
CNTK – Jupyter Notebook
MNIST Data – Recognize Digits
1. Logistic Regression
2. Multi Layer Perceptron
3. Convolution Neural Networks
Machine Learning – Logistic Regression
Deep Learning – Multi Level Perceptron
Deep Learning – CNN
Thank You
@ashishjaiman
cntkhelp@microsoft.com
https://github.com/Microsoft/CNTK
Introduction To CNTK
notebooks.azure.com/library/cntkbeta2

More Related Content

PDF
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
PPTX
Deep Learning with Microsoft Cognitive Toolkit
PPTX
V like Velocity, Predicting in Real-Time with Azure ML
PDF
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
PPTX
Deep Learning on Qubole Data Platform
PDF
Metta Innovations - Introdução ao Deep Learning aplicado a vídeo analytics
PDF
Deeplearning on Hadoop @OSCON 2014
PDF
Kaz Sato, Evangelist, Google at MLconf ATL 2016
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Deep Learning with Microsoft Cognitive Toolkit
V like Velocity, Predicting in Real-Time with Azure ML
Deep Learning, Microsoft Cognitive Toolkit (CNTK) and Azure Machine Learning ...
Deep Learning on Qubole Data Platform
Metta Innovations - Introdução ao Deep Learning aplicado a vídeo analytics
Deeplearning on Hadoop @OSCON 2014
Kaz Sato, Evangelist, Google at MLconf ATL 2016

What's hot (20)

PDF
GDG-Shanghai 2017 TensorFlow Summit Recap
PDF
Dato Keynote
PDF
Approximate "Now" is Better Than Accurate "Later"
PDF
Introduction To TensorFlow
PDF
Introducing TensorFlow: The game changer in building "intelligent" applications
PDF
A Tale of Three Deep Learning Frameworks: TensorFlow, Keras, & PyTorch with B...
PDF
Large Scale Deep Learning with TensorFlow
PPTX
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
PDF
CI/CD for Machine Learning with Daniel Kobran
PPTX
Big data app meetup 2016-06-15
PPTX
Deep learning with tensorflow
PDF
Smaller and Easier: Machine Learning on Embedded Things
PDF
Practical Deep Learning
PDF
On-device machine learning: TensorFlow on Android
PDF
Practical Digital Image Processing 1
PDF
Deep Learning with MXNet - Dmitry Larko
PPTX
Hadoop Summit 2014 - San Jose - Introduction to Deep Learning on Hadoop
PDF
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
PDF
Deep learning with TensorFlow
PDF
[Research] azure ml anatomy of a machine learning service - Sharat Chikkerur
GDG-Shanghai 2017 TensorFlow Summit Recap
Dato Keynote
Approximate "Now" is Better Than Accurate "Later"
Introduction To TensorFlow
Introducing TensorFlow: The game changer in building "intelligent" applications
A Tale of Three Deep Learning Frameworks: TensorFlow, Keras, & PyTorch with B...
Large Scale Deep Learning with TensorFlow
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
CI/CD for Machine Learning with Daniel Kobran
Big data app meetup 2016-06-15
Deep learning with tensorflow
Smaller and Easier: Machine Learning on Embedded Things
Practical Deep Learning
On-device machine learning: TensorFlow on Android
Practical Digital Image Processing 1
Deep Learning with MXNet - Dmitry Larko
Hadoop Summit 2014 - San Jose - Introduction to Deep Learning on Hadoop
Machine & Deep Learning: Practical Deployments and Best Practices for the Nex...
Deep learning with TensorFlow
[Research] azure ml anatomy of a machine learning service - Sharat Chikkerur
Ad

Similar to Deep Learning with CNTK (20)

PPTX
Deep Learning Explained
PPTX
Cognitive Toolkit - Deep Learning framework from Microsoft
PPTX
Overview of Machine Learning and its Applications
PPTX
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
PPTX
ppt on introduction to Machine learning tools
PPTX
Deep learning Introduction and Basics
PPT
DEEP LEARNING PPT aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PDF
01_introduction.pdfbnmelllleitrthnjjjkkk
PPTX
Muhammad Usman Akhtar | Ph.D Scholar | Wuhan University | School of Co...
PPTX
Machine Learning GDSC DCE Darbhanga.pptx
PDF
01_introduction to machine learning algorithms and basics .pdf
PDF
Practical deepllearningv1
PDF
Deep learning: Cutting through the Myths and Hype
PDF
Introduction to Deep Learning: Concepts, Architectures, and Applications
PDF
01_introduction_ML.pdf
PPTX
INTRO TO ML.pptx
PDF
Deep learning - a primer
PDF
Deep learning - a primer
PDF
Deep Learning
PPTX
Deep Learning Tutorial
Deep Learning Explained
Cognitive Toolkit - Deep Learning framework from Microsoft
Overview of Machine Learning and its Applications
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
ppt on introduction to Machine learning tools
Deep learning Introduction and Basics
DEEP LEARNING PPT aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
01_introduction.pdfbnmelllleitrthnjjjkkk
Muhammad Usman Akhtar | Ph.D Scholar | Wuhan University | School of Co...
Machine Learning GDSC DCE Darbhanga.pptx
01_introduction to machine learning algorithms and basics .pdf
Practical deepllearningv1
Deep learning: Cutting through the Myths and Hype
Introduction to Deep Learning: Concepts, Architectures, and Applications
01_introduction_ML.pdf
INTRO TO ML.pptx
Deep learning - a primer
Deep learning - a primer
Deep Learning
Deep Learning Tutorial
Ad

Recently uploaded (20)

PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PPT
Teaching material agriculture food technology
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Unlocking AI with Model Context Protocol (MCP)
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Chapter 3 Spatial Domain Image Processing.pdf
Encapsulation theory and applications.pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Review of recent advances in non-invasive hemoglobin estimation
20250228 LYD VKU AI Blended-Learning.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Electronic commerce courselecture one. Pdf
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Spectral efficient network and resource selection model in 5G networks
Teaching material agriculture food technology
The AUB Centre for AI in Media Proposal.docx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

Deep Learning with CNTK

  • 1. Introduction to Deep Learning with Microsoft Cognitive Toolkit
  • 3. Deep learning at Microsoft • Microsoft Cognitive Services • Skype Translator • Cortana • Bing • Bing Ads • Augmented Reality • Microsoft Research
  • 4. deep learning at Microsoft • Microsoft Cognitive Services • Skype Translator • Cortana • Bing • HoloLens • Microsoft Research
  • 7. Microsoft’s historic speech breakthrough • Microsoft 2016 research system for conversational speech recognition • 5.9% word-error rate • enabled by CNTK’s multi-server scalability [W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer, A. Stolcke, D. Yu, G. Zweig: “Achieving Human Parity in Conversational Speech Recognition,” https://arxiv.org/abs/1610.05256]
  • 8. Machine Learning in a Nutshell Machine learning algorithm Model Application Data Contains patterns Finds patterns Recognizes patterns Provides new data to see if it matches known patterns
  • 9. Deploy chosen model Chosen Model Apply learning algorithm to data Candidate Model The Machine Learning Process Prepared Data Apply pre- processing to data Iterate to find the best model Data Preprocessing Modules Iterate until data is ready Preprocessing Modules Machine Learning Algorithms Applications The goal: Smarter applications Raw Data Raw Data Choose data
  • 10. Terminology Training data The prepared data used to create a model Creating a model is called training a model Supervised learning The value you want to predict is in the training data The data is labeled Unsupervised learning The value you want to predict is not in the training data The data is unlabeled The most common approach
  • 11. Styles of Machine Learning Algorithms Examples Decision tree Neural network Bayesian K-means P(A) P(B|A) P(B) P(A|B) = Deep learning uses this
  • 12. What is deep learning --- Yoshua Bengio Machine learning is a way to try to make machines intelligent by allowing computers to learn from examples about the world around us or about some specific aspect of it. Deep learning is an approach to machine learning, particular among all the machine learning methods in that it is inspired by some of the things we know about the brain. It’s trying to make computers learn multiple levels of abstraction and representation, which is presumably what makes these systems so successful Reinforcement learning is a type of machine learning where the learner doesn’t get to know what a human would do in this context. The learner only gets to see if the actions were good or bad after a long set of actions. A lot of the recent progress in this area is in things like playing games, but reinforcement learning probably is going to be very important for things like self-driving cars.
  • 13. Deep learning fundamentals • Model Generalization • Network Architecture • Activation function • Regularization • Model Training • Loss functions • Parameter gradient computation with backpropagation • Gradient descent algorithms DL is trying to make computers learn multiple levels of abstraction and representation
  • 16. Logistic regression Features Weights 1.21 x 0.12 -2.2 x -3.4 -0.32 x 1.11 -1.29 x -0.94 1.4 x -1.2 Multiply features by weights dot product Sum them up, add a bias term Pass result through the logistic function 0.94
  • 17. A single-layer neural network (also known as doing multiple logistic regressions at the same time) Features 0.12 0.92 0.33 0.02 0.99 0.42 0.12 0.92 0.33 0.02 0.99 0.42 Another vector! What might we do with this? Weight matrix Input vector Bias vector(pretend the weights are different)
  • 18. Multi-Layer Perceptrons Network depth Result of one layer becomes features to next
  • 19. These diagrams are getting hard to read Let’s make each individual input and logistic regression into a circle
  • 20. How do we figure out the weights? Start with random weights Try out the model Determine the error Figure out how responsible each weight is for the error Punish each weight in proportion to its crime Truth: Forward propagate to make prediction Backward propagate errors
  • 21. CNTK expresses (nearly) arbitrary neural networks by composing simple building blocks into complex computational networks, supporting relevant network types and applications Microsoft Cognitive Toolkit
  • 22. • Microsoft’s open-source deep-learning toolkit • https://github.com/Microsoft/CNTK • Created by Microsoft Speech researchers (Dong Yu et al.) in 2012, “Computational Network Toolkit” • On GitHub since Jan 2016 under MIT license • Renamed from CNTK to “Cognitive Toolkit” • Community contributions e.g. from MIT, Stanford and NVidia Microsoft Cognitive Toolkit
  • 23. CNTK - Other Benefits • Python and C++ API • Mostly implemented in C++ • Low level + high level Python API • Extensibility • User functions and learners in pure Python • Readers • Distributed, highly efficient built-in data readers • Details: https://docs.microsoft.com/en-us/cognitive-toolkit/reasons-to-switch-from-tensorflow- to-cntk Microsoft Cognitive Toolkit
  • 24. Anatomy of a CNTK training job Script configure and executes through CNTK Python APIs… trainer • SGD (momentum, Adam, …) • minibatching reader • minibatch source • task-specific deserializer • automatic randomization • distributed reading corpus model network • model function • criterion function • CPU/GPU execution engine • packing, padding
  • 25. Terms to remember: Neural Networks/Deep Networks • Backpropagation • Forward Pass • Loss Function • Backward pass • Weight Adjustment • Hidden layer – Neither an input nor and output layer • Activation Function & Activation Value • Activation Matrix (CNN) • Stochastic Gradient Descent • Convolutional Layers • Auto Feature Detection • Weights • Bias • Regularization
  • 26. CNTK – Jupyter Notebook MNIST Data – Recognize Digits 1. Logistic Regression 2. Multi Layer Perceptron 3. Convolution Neural Networks
  • 27. Machine Learning – Logistic Regression
  • 28. Deep Learning – Multi Level Perceptron