SlideShare a Scribd company logo
Fast deep learning
at your fingertips
Faster deep learning solutions
from training to inference using Intel® Deep Learning SDK
Nir Lotan
Machine Learning Product Manager
Advanced Analytics, Intel
Chief Data Scientist
Advanced Analytics, Intel
Dr. Amitai Armon
Deep learning is Everywhere AT
Intel
ManufacturingProcessor Design Sales & Marketing
Health Analytics AI ProductsPerceptual Computing
Visual understanding Natural Language Processing Speech recognition
Deep neural networks are solving real life cognitive tasks
person
Sed ut perspiciatis unde omnis iste
natus error sit voluptatem
accusantium doloremque
laudantium, totam rem aperiam,
eaque ipsa quae ab illo inventore
veritatis et quasi architecto beatae
vitae dicta sunt explicabo. Nemo
enim ipsam voluptatem quia volu
eaque ipsa quae ab illo inventore
veritatis et quasi architecto beatae
vitae dicta sunt explicabo. Nemo
enim ipsam voluptatem quia volu
DEEP
LEARNING
Model is inspired by a multi-layer network of neurons
Network Topology
DEEP
LEARNING
DEEP LEARNING steps
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create “Deep
neural net” math
model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
deep learning
todayis not really accessible…
…and can be overwhelming
Intel® deep
learning SDKEasily develop and deploy deep learning
solutions,
using Intel® Architecture & Popular
frameworks
Intel® Deep Learning SDK - Workflow
Data
Prep.
Build a
Model
Model Training
Training Inference
Compress
ion
Visualizat
ions
Algorith
mic
Features
Multi-
Node
Model
Optimizer
Inference
Engine
Our Vision: Democratize deep
learningAllow every Data scientist and Developer to easily deploy Open Sourced Deep
Learning Frameworks optimized for Intel® Architecture - delivering end-to-end
capabilities, a rich user experience, and tools to boost productivity.
Plug &
Train
Maximiz
e
perform
Producti
vity
tools
Accelerat
e
deploym
Plug &
Train
Plug & Train - An easy to use
installer
Install on Linux CentOS/Ubunto or Mac
Install from Linux, Mac or Windows
Use the tool remotely via Chrome browser from any platform.
Maximize
performance
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems,
components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated
purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance . *Other names and brands may be property of others
Configurations:
2 socket system with Intel® Xeon Processor E5-2699 v4 (22 Cores, 2.2 GHz,), 128 GB memory, Red Hat* Enterprise Linux 6.7, BVLC Caffe, Intel Optimized Caffe framework, Intel® MKL 11.3.3, Intel® MKL 2017
Intel® Xeon Phi™ Processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM), 128 GB memory, Red Hat* Enterprise Linux 6.7, Intel® Optimized Caffe framework, Intel® MKL 2017
All numbers measured without taking data manipulation into account.
0
5
10
15
20
25
30
Intel® Xeon® E5-2699 v4 Intel® Xeon® E5-2699 v4 Intel® Xeon® E5-2699 v4 Intel® Xeon Phi 7250
Out-of-the-box +Intel MKL 11.3.3 +Intel MKL 2017 +Intel MKL 2017
Performancespeedup
Caffe/AlexNet single node training performance
2.1x
2x
training with Intel-Optimized
Frameworks
5.8x
24x
12x
Example: Deep-learning training with Intel-Optimized Caffe* on Intel® Xeon® Processor E5 v4 and Intel® Xeon Phi™
Kubernetes
Multi-node training
Jupyter notebooksBrowser
service
DLSDK
service
service
Node 3Node 1
Container Container
DLSDK
Container
Data (File System)
Node 2
Container Container
DLSDK
Container
Data (File System)
Container
Data (File
System)
…
Performance boost with distributed training
Productivity
tools
Step by Step Wizard
Productivity tools
Interactive Notebook
MODEL VISUALIZATION MODEL COMRESSION
Accelerate
deployment
• Optimize:
• Imports trained models from all popular DL
framework regardless of training HW
• Model Canonicalization, Compression and
Quantization
• Deploy:
• One API across all Intel HW and systems
• Friendly Inference solution: (low footprint,
easy API, control meeting Functional Safety)
• Optimizes Inference execution per target
hardware under-the-hood
Ease of use + Embedded friendly + Extra performance boost
1
2
Trained Model
Optimize
Compress
Quantize
Inference Engine
1
2
CPU GEN FPGA …
Enable full utilization of IA Inference while abstracting HW from developers
Intel Deep Learning Deployment Tool
Use Cases
Faster deep learning solutions from training to inference - Amitai Armon & Nir Lotan, Intel
This is our
dataset:
“Hands”
“Not-Hands”
Thanks to the Intel “Hands in VR” team for sharing the use case and dataset
Demo
http://software.intel.com/deep-learning-sdk/
Faster deep learning solutions from training to inference - Amitai Armon & Nir Lotan, Intel
Deep Learning
Evolution
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
Reminder: Deep-Learning is leading today’s AI
Achieved breakthroughs in visual understanding and in natural language processing
Illustrating the deep-learning training and inference process
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
New AI Methods try to cope with additional
challenges
Illustrating the training and inference process
GAN
(1) Labeled data is scarce and more labeled samples need to be generated (e.g. using GAN)
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
New AI Methods try to cope with additional
challenges(2) Some tasks are learned better through trial-and-error than through examples
Illustrating the training and inference process
RL
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
New AI Methods try to cope with additional
challenges(3) Deployment at the edge requires lower memory footprint and faster inference
Illustrating the training and inference process
Model
compression &
low precision
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
New AI Methods try to cope with additional
challenges(4) Using the model often requires adapting it to a new type of data (transfer learning)
Illustrating the training and inference process
Transfer
learning
New AI Methods try to cope with additional
challenges=> There are multiple ongoing evolvements in AI, and we should be ready
Illustrating the training and inference process
Step 1: Training
(In Data Center – Over Hours/Days/Weeks)
Person
Lots of labeled
input data
Output:
Trained Model
Create model
Step 2: Inference
(End point or Data Center - Instantaneous)
New input from
camera and
sensors
Output:
Classification
Trained neural
network model
97% person
2% traffic light
Trained
Model
GANRL
Model
compression &
low precision
Transfer
learning
Download, use, and provide
feedback
or search for: Intel Deep Learning SDK
http://software.intel.com/deep-learning-sdk/
If someone has specific use case
they want to try with us – let me know
Thank you!

More Related Content

PDF
Build end-to-end solutions with BlueMix, Avi Vizel & Ziv Dai, IBM
PDF
Keynote: Trends in Modern Application Development - Gilly Dekel, IBM
PDF
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
PPTX
Trends in Cloud and Mobile Computing - Alain Azagury, IBM
PPTX
Using Modern Tools and Technologies to Improve Your Software Architecture
PPTX
Disruptive Technologies
PPTX
Azure Digital Twins 2.0
PPTX
CodeValue Architecture Next 2018 - Executive track dilemmas and solutions in...
Build end-to-end solutions with BlueMix, Avi Vizel & Ziv Dai, IBM
Keynote: Trends in Modern Application Development - Gilly Dekel, IBM
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
Trends in Cloud and Mobile Computing - Alain Azagury, IBM
Using Modern Tools and Technologies to Improve Your Software Architecture
Disruptive Technologies
Azure Digital Twins 2.0
CodeValue Architecture Next 2018 - Executive track dilemmas and solutions in...

What's hot (20)

PDF
Intro to Vertex AI, unified MLOps platform for Data Scientists & ML Engineers
PDF
Yi Wang, Tech Lead of AI Platform, Baidu, at MLconf 2017
PPTX
Smart building mendix azure influx / smart City / IoT
PPTX
#PCMVision: VMware NSX - Transforming Security
 
PPTX
Developing Digital Twins
PDF
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
PDF
Azure and Predix
PPTX
Architecting IoT solutions with Microsoft Azure
PDF
Java Application Modernization Patterns and Stories from the IBM Garage
PDF
IoT Architecture - Are Traditional Architectures Good Enough or do we Need Ne...
PPTX
Mendix Cloud Hosting on CloudFoundry
PPTX
Sl boston 05_12_15_ener_noc_final_public
PPTX
16370 cics project opening and project update f
PPTX
Windows 10 IoT-Core to Azure IoT Suite
PDF
Using the Actor Model with Domain-Driven Design (DDD) in Reactive Systems - w...
PPTX
Tectonic Summit 2016: Preparing for Cloud Native
PPTX
Netflix - 40 msec
PPTX
Splunk for Developers
PPTX
PDF
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Intro to Vertex AI, unified MLOps platform for Data Scientists & ML Engineers
Yi Wang, Tech Lead of AI Platform, Baidu, at MLconf 2017
Smart building mendix azure influx / smart City / IoT
#PCMVision: VMware NSX - Transforming Security
 
Developing Digital Twins
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
Azure and Predix
Architecting IoT solutions with Microsoft Azure
Java Application Modernization Patterns and Stories from the IBM Garage
IoT Architecture - Are Traditional Architectures Good Enough or do we Need Ne...
Mendix Cloud Hosting on CloudFoundry
Sl boston 05_12_15_ener_noc_final_public
16370 cics project opening and project update f
Windows 10 IoT-Core to Azure IoT Suite
Using the Actor Model with Domain-Driven Design (DDD) in Reactive Systems - w...
Tectonic Summit 2016: Preparing for Cloud Native
Netflix - 40 msec
Splunk for Developers
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Ad

Similar to Faster deep learning solutions from training to inference - Amitai Armon & Nir Lotan, Intel (20)

PDF
Faster deep learning solutions from training to inference - Michele Tameni - ...
PDF
Intel Powered AI Applications for Telco
PDF
Intel's Machine Learning Strategy
PDF
Accelerate Machine Learning Software on Intel Architecture
PDF
AIDC Summit LA- Hands-on Training
PPT
Enabling a hardware accelerated deep learning data science experience for Apa...
PDF
FPGAs and Machine Learning
PDF
Intel® AI: AI Lab at Intel
PDF
Accelerating AI from the Cloud to the Edge
PDF
Accelerate Your AI Today
PDF
Accelerating AI Adoption with Partners
PDF
Microsoft Build 2019- Intel AI Workshop
PDF
State-Of-The Art Machine Learning Algorithms and How They Are Affected By Nea...
PDF
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
PPTX
Introduction to Deep Learning
PDF
Workstations powered by Intel can play a vital role in CPU-intensive AI devel...
PDF
Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...
PDF
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
PDF
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
PDF
Machine Learning Challenges and Opportunities in Education, Industry, and Res...
Faster deep learning solutions from training to inference - Michele Tameni - ...
Intel Powered AI Applications for Telco
Intel's Machine Learning Strategy
Accelerate Machine Learning Software on Intel Architecture
AIDC Summit LA- Hands-on Training
Enabling a hardware accelerated deep learning data science experience for Apa...
FPGAs and Machine Learning
Intel® AI: AI Lab at Intel
Accelerating AI from the Cloud to the Edge
Accelerate Your AI Today
Accelerating AI Adoption with Partners
Microsoft Build 2019- Intel AI Workshop
State-Of-The Art Machine Learning Algorithms and How They Are Affected By Nea...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
Introduction to Deep Learning
Workstations powered by Intel can play a vital role in CPU-intensive AI devel...
Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
Machine Learning Challenges and Opportunities in Education, Industry, and Res...
Ad

More from Codemotion Tel Aviv (20)

PDF
Angular is one fire(base)! - Shmuela Jacobs
PDF
Demystifying docker networking black magic - Lorenzo Fontana, Kiratech
PDF
Facts about multithreading that'll keep you up at night - Guy Bar on, Vonage
PDF
Master the Art of the AST (and Take Control of Your JS!) - Yonatan Mevorach, ...
PDF
Unleash the power of angular Reactive Forms - Nir Kaufman, 500Tech
PDF
Can we build an Azure IoT controlled device in less than 40 minutes that cost...
PDF
Actors and Microservices - Can two walk together? - Rotem Hermon, Gigya
PDF
My Minecraft Smart Home: Prototyping the internet of uncanny things - Sascha ...
PDF
Distributed Systems explained (with NodeJS) - Bruno Bossola, JUG Torino
PDF
Containerised ASP.NET Core apps with Kubernetes
PDF
Fullstack DDD with ASP.NET Core and Anguar 2 - Ronald Harmsen, NForza
PDF
The Art of Decomposing Monoliths - Kfir Bloch, Wix
PDF
SOA Lessons Learnt (or Microservices done Better) - Sean Farmar, Particular S...
PDF
S3, Cassandra or Outer Space? Dumping Time Series Data using Spark - Demi Ben...
PDF
Getting Physical with Web Bluetooth - Uri Shaked, BlackBerry
PDF
Web based virtual reality - Tanay Pant, Mozilla
PDF
Material Design Demytified - Ran Nachmany, Google
PDF
All the reasons for choosing react js that you didn't know about - Avi Marcus...
PDF
Mobile Security Attacks: A Glimpse from the Trenches - Yair Amit, Skycure
PPTX
C10k and beyond - Uri Shamay, Akamai
Angular is one fire(base)! - Shmuela Jacobs
Demystifying docker networking black magic - Lorenzo Fontana, Kiratech
Facts about multithreading that'll keep you up at night - Guy Bar on, Vonage
Master the Art of the AST (and Take Control of Your JS!) - Yonatan Mevorach, ...
Unleash the power of angular Reactive Forms - Nir Kaufman, 500Tech
Can we build an Azure IoT controlled device in less than 40 minutes that cost...
Actors and Microservices - Can two walk together? - Rotem Hermon, Gigya
My Minecraft Smart Home: Prototyping the internet of uncanny things - Sascha ...
Distributed Systems explained (with NodeJS) - Bruno Bossola, JUG Torino
Containerised ASP.NET Core apps with Kubernetes
Fullstack DDD with ASP.NET Core and Anguar 2 - Ronald Harmsen, NForza
The Art of Decomposing Monoliths - Kfir Bloch, Wix
SOA Lessons Learnt (or Microservices done Better) - Sean Farmar, Particular S...
S3, Cassandra or Outer Space? Dumping Time Series Data using Spark - Demi Ben...
Getting Physical with Web Bluetooth - Uri Shaked, BlackBerry
Web based virtual reality - Tanay Pant, Mozilla
Material Design Demytified - Ran Nachmany, Google
All the reasons for choosing react js that you didn't know about - Avi Marcus...
Mobile Security Attacks: A Glimpse from the Trenches - Yair Amit, Skycure
C10k and beyond - Uri Shamay, Akamai

Recently uploaded (20)

PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
PDF
Approach and Philosophy of On baking technology
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Machine learning based COVID-19 study performance prediction
PDF
KodekX | Application Modernization Development
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PPT
Teaching material agriculture food technology
PDF
Advanced IT Governance
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Modernizing your data center with Dell and AMD
PPTX
Big Data Technologies - Introduction.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
Approach and Philosophy of On baking technology
Unlocking AI with Model Context Protocol (MCP)
Machine learning based COVID-19 study performance prediction
KodekX | Application Modernization Development
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Understanding_Digital_Forensics_Presentation.pptx
Spectral efficient network and resource selection model in 5G networks
Teaching material agriculture food technology
Advanced IT Governance
The AUB Centre for AI in Media Proposal.docx
Advanced Soft Computing BINUS July 2025.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Modernizing your data center with Dell and AMD
Big Data Technologies - Introduction.pptx
Electronic commerce courselecture one. Pdf
NewMind AI Monthly Chronicles - July 2025
20250228 LYD VKU AI Blended-Learning.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Faster deep learning solutions from training to inference - Amitai Armon & Nir Lotan, Intel

  • 1. Fast deep learning at your fingertips Faster deep learning solutions from training to inference using Intel® Deep Learning SDK Nir Lotan Machine Learning Product Manager Advanced Analytics, Intel Chief Data Scientist Advanced Analytics, Intel Dr. Amitai Armon
  • 2. Deep learning is Everywhere AT Intel ManufacturingProcessor Design Sales & Marketing Health Analytics AI ProductsPerceptual Computing
  • 3. Visual understanding Natural Language Processing Speech recognition Deep neural networks are solving real life cognitive tasks person Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam, eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo. Nemo enim ipsam voluptatem quia volu eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo. Nemo enim ipsam voluptatem quia volu DEEP LEARNING
  • 4. Model is inspired by a multi-layer network of neurons Network Topology DEEP LEARNING
  • 5. DEEP LEARNING steps Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create “Deep neural net” math model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model
  • 6. deep learning todayis not really accessible… …and can be overwhelming
  • 7. Intel® deep learning SDKEasily develop and deploy deep learning solutions, using Intel® Architecture & Popular frameworks
  • 8. Intel® Deep Learning SDK - Workflow Data Prep. Build a Model Model Training Training Inference Compress ion Visualizat ions Algorith mic Features Multi- Node Model Optimizer Inference Engine
  • 9. Our Vision: Democratize deep learningAllow every Data scientist and Developer to easily deploy Open Sourced Deep Learning Frameworks optimized for Intel® Architecture - delivering end-to-end capabilities, a rich user experience, and tools to boost productivity. Plug & Train Maximiz e perform Producti vity tools Accelerat e deploym
  • 11. Plug & Train - An easy to use installer Install on Linux CentOS/Ubunto or Mac Install from Linux, Mac or Windows Use the tool remotely via Chrome browser from any platform.
  • 13. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance . *Other names and brands may be property of others Configurations: 2 socket system with Intel® Xeon Processor E5-2699 v4 (22 Cores, 2.2 GHz,), 128 GB memory, Red Hat* Enterprise Linux 6.7, BVLC Caffe, Intel Optimized Caffe framework, Intel® MKL 11.3.3, Intel® MKL 2017 Intel® Xeon Phi™ Processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM), 128 GB memory, Red Hat* Enterprise Linux 6.7, Intel® Optimized Caffe framework, Intel® MKL 2017 All numbers measured without taking data manipulation into account. 0 5 10 15 20 25 30 Intel® Xeon® E5-2699 v4 Intel® Xeon® E5-2699 v4 Intel® Xeon® E5-2699 v4 Intel® Xeon Phi 7250 Out-of-the-box +Intel MKL 11.3.3 +Intel MKL 2017 +Intel MKL 2017 Performancespeedup Caffe/AlexNet single node training performance 2.1x 2x training with Intel-Optimized Frameworks 5.8x 24x 12x Example: Deep-learning training with Intel-Optimized Caffe* on Intel® Xeon® Processor E5 v4 and Intel® Xeon Phi™
  • 14. Kubernetes Multi-node training Jupyter notebooksBrowser service DLSDK service service Node 3Node 1 Container Container DLSDK Container Data (File System) Node 2 Container Container DLSDK Container Data (File System) Container Data (File System) … Performance boost with distributed training
  • 16. Step by Step Wizard Productivity tools Interactive Notebook MODEL VISUALIZATION MODEL COMRESSION
  • 18. • Optimize: • Imports trained models from all popular DL framework regardless of training HW • Model Canonicalization, Compression and Quantization • Deploy: • One API across all Intel HW and systems • Friendly Inference solution: (low footprint, easy API, control meeting Functional Safety) • Optimizes Inference execution per target hardware under-the-hood Ease of use + Embedded friendly + Extra performance boost 1 2 Trained Model Optimize Compress Quantize Inference Engine 1 2 CPU GEN FPGA … Enable full utilization of IA Inference while abstracting HW from developers Intel Deep Learning Deployment Tool
  • 21. This is our dataset: “Hands” “Not-Hands” Thanks to the Intel “Hands in VR” team for sharing the use case and dataset
  • 25. Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model Reminder: Deep-Learning is leading today’s AI Achieved breakthroughs in visual understanding and in natural language processing Illustrating the deep-learning training and inference process
  • 26. Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model New AI Methods try to cope with additional challenges Illustrating the training and inference process GAN (1) Labeled data is scarce and more labeled samples need to be generated (e.g. using GAN)
  • 27. Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model New AI Methods try to cope with additional challenges(2) Some tasks are learned better through trial-and-error than through examples Illustrating the training and inference process RL
  • 28. Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model New AI Methods try to cope with additional challenges(3) Deployment at the edge requires lower memory footprint and faster inference Illustrating the training and inference process Model compression & low precision
  • 29. Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model New AI Methods try to cope with additional challenges(4) Using the model often requires adapting it to a new type of data (transfer learning) Illustrating the training and inference process Transfer learning
  • 30. New AI Methods try to cope with additional challenges=> There are multiple ongoing evolvements in AI, and we should be ready Illustrating the training and inference process Step 1: Training (In Data Center – Over Hours/Days/Weeks) Person Lots of labeled input data Output: Trained Model Create model Step 2: Inference (End point or Data Center - Instantaneous) New input from camera and sensors Output: Classification Trained neural network model 97% person 2% traffic light Trained Model GANRL Model compression & low precision Transfer learning
  • 31. Download, use, and provide feedback or search for: Intel Deep Learning SDK http://software.intel.com/deep-learning-sdk/ If someone has specific use case they want to try with us – let me know Thank you!