SlideShare a Scribd company logo
High performance applications
using OpenVINO™
1
Yury Gorbachev
Internet of Things Group
Why?
• DL Training and inference are essentially very different domains
• Frameworks contain too much for training
• Hardware requirements are very different (100W is pretty normal)
• Performance goals are different (batch size, latency vs. throughput)
• Environment is different (development vs. deployment)
• Hard to find solution that does DL inference properly
Internet of Things Group 3
What is proper Deep Learning inference?
• Highest performance characteristics
• Inference/watt/$ is major concern, best possible performance is a must
• Minimal footprint
• Memory, binary size, execution overhead
• Absolute minimum of dependencies
• Cross platform portability
• Backward compatibility and predictable maintenance
Internet of Things Group 4
Also
• CV application is not just Deep Learning
• “Classical” components widely used
• OpenCV is undisputable champion of the CV world
• Need to satisfy deployment requirements as well
• Performance, footprint, legal cleanness, etc.
Internet of Things Group 5
OpenVINO™ Toolkit for best CV/DL applications
• Development toolkit for high performance CV and DL inference
• Solution for application designers
• No training/research overhead or specifics, minimal footprint, highly portable code
• Set of libraries to solve CV/DL deployment problems
• Fastest OpenCV build
• Deep Learning Inference Engine
• Provides access to all Intel accelerators and heterogeneous execution model
• Intel CPU, integrated GPU
• Vision Processing Unit (VPU) and FPGA
Internet of Things Group
OpenVINO™ vs. Computer Vision
Input
Object
Person
Face
Emotion
Gesture
Text
…
Custom Components CV/non-DL Components
Direct Coding Solution
API Solution API Solution
Custom Code
VPU GPU CPUFPGA VPU GPU CPU
DL Components
Computer Vision Pipeline
DL Inference Engine
VPU GPU CPUFPGA
OpenVINO
™
Internet of Things Group 7
OpenCV: OpenVINO vs. Open Source
• Most performant and fine tuned build
• SSE, AVX2 vectorization & TBB/OMP parallelism
• GPU offload via Transparent API
• DL Inference Engine by default for OpenCV DNN
• Legally clean
• Each build is checked with IP Protection tools, safe for production
• Additional algorithms from Intel in binary form
• Face Detection and analysis libraries
Internet of Things Group 8
Deep Learning Inference Engine
• Pure inference oriented solution (no training included)
• Superior performance on Intel platform, highly optimized
• Minimal memory use
• No framework required in runtime
• Support for CPU, GPU, FPGA, Movidius
• Heterogeneous execution support
• Cross-platform portability
Internet of Things Group 9
Deep Learning performance using OpenVINO/CPU
3.6
3.5
2.7
3.0
Core™-i5 6500@2.9 GHZ
Internet of Things Group 10
Deep Learning performance using OpenVINO/GPU
3.6
4.0
3.0
4.9
5.7
Core™-i5 6500@2.9 GHZ
Internet of Things Group 11
Inference memory footprint on CPU
7.8
3.0
3.1
3.4
4.5
1.6
Core™-i5 6500@2.9 GHZ
Internet of Things Group 12
Pre-trained Models
Video/Image
Pre-trained Models
Video/Image
Once in Design Time
IR
User Application +
Inference Engine
Model
Optimizer
User Application +
Framework
Model
Model
With Deep Learning Frameworks
With OpenVINO™ DL Inference Engine
Internet of Things Group
Deep Learning Inference Engine (IE)
DL Inference Engine API
Deep Learning application
IR
Model
Optimizer
Design time
CPU Plugin
MKL-
DNN
Heterogeneous Execution Engine
C++
layers
GPU Plugin
clDNN
Movidius Plugin
Custom OCL layers
FPGA Plugin
MVNC DLA
Custom layers
+
3
4
3 Framework independent lightweight internal representation
4 Customizations in C++ and OpenCL languages
2
Heterogeneous network execution across accelerators
1
2
1 Single API solution across accelerators
Internet of Things Group 14
Easier deployment
• Accurate against original framework
• Direct replacement of original framework calls
• No retraining/fine-tuning required
• Unified support for multiple OSes
• Linux and Windows are equally supported and performant
• Encapsulates basic preprocessing
• Mean subtraction / normalization integrated into model
Internet of Things Group 15
Portability across platforms
• Single API across platforms
• No need to change SDKs and application codes depending on targets
• Consistent set of layers and accurate results across targets
• Verified against reference model/framework
• Heterogeneous execution for missing pieces
• CPU fallback whenever needed
Internet of Things Group 16
Additional portability benefits
• Intel has rapidly developing hardware set
• New SoCs and architectures are evolving
• No need to wait for HW itself or emulator
• Design app for existing targets first and move to new ones easy
• (Bonus) Check algos on fastest, deploy on most suitable!
Create application using
Inference Engine API
Design and validate on
CPU/GPU clusters
Deploy on Movidius/FPGA
Targets
Internet of Things Group 17
Customization possibilities
• OpenVINO™ is partially binary product right now
• Possible to implement and add own layers
• New topologies are easy to support
• Most of the layers are delivered as a source
• Check how it is done for known topologies
• Base your implementation on those for faster TTM
• MKLDNN for CPU and clDNN for GPU are already available in source form
• Possible to check implementation there
Internet of Things Group 18
Open Model Zoo
• Free reference models for Deep Learning Inference Engine
• Object Detection (Face, People, Vehicles, etc.)
• Object Analysis (Facial analysis, Head Pose, Vehicle attributes)
• Superior performance on Intel
• Core™ i5 CPU: SSD 300 (6 fps) vs. People Detection Model (68 fps)
Significant reduction in development efforts, no dataset & training needed
Internet of Things Group 19
Samples
• Basic samples to facilitate API understanding
• Classification, object detection, segmentation
• Target selection via command line
• Extended samples using Model Zoo
• Face analysis, Security camera sample
• Interworking between Media SDK, OpenCV, DL IE
• Automated public models downloader script

More Related Content

PPTX
Application Portfolio Management
PDF
Introduction to Quantum Computing
PPTX
Introduction to Enterprise architecture and the steps to perform an Enterpris...
PPTX
Basic stat analysis using excel
PPTX
Unit 1-PROBLEM SOLVING AND PYTHON PROGRAMMING R2021.pptx
PPTX
The Future of Quantum Computing
PDF
Generative AI: Past, Present, and Future – A Practitioner's Perspective
PDF
Convolutional Neural Networks (CNN)
Application Portfolio Management
Introduction to Quantum Computing
Introduction to Enterprise architecture and the steps to perform an Enterpris...
Basic stat analysis using excel
Unit 1-PROBLEM SOLVING AND PYTHON PROGRAMMING R2021.pptx
The Future of Quantum Computing
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Convolutional Neural Networks (CNN)

What's hot (20)

PDF
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
PPTX
Develop and optimize CV/DL applications with Intel OpenVINO toolkit
PPTX
TensorRT survey
PDF
Introduction to Kubernetes and Google Container Engine (GKE)
PDF
Intro To Convolutional Neural Networks
PPTX
Introduction to Spiking Neural Networks: From a Computational Neuroscience pe...
PPTX
Deep Learning - CNN and RNN
PPTX
Introduction to CNN
PDF
Google Cloud Networking Deep Dive
PPTX
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
PPTX
Getting started with TensorFlow
PPTX
Deep Learning - Convolutional Neural Networks - Architectural Zoo
DOCX
Case study on deep learning
PPT
Action Recognition (Thesis presentation)
PPTX
ONNX - The Lingua Franca of Deep Learning
PDF
Gitlab, GitOps & ArgoCD
PPTX
An AI accelerator ASIC architecture
PPTX
Capsule networks
PDF
Deep Learning for Video: Action Recognition (UPC 2018)
PDF
Convolutional Neural Network Models - Deep Learning
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
Develop and optimize CV/DL applications with Intel OpenVINO toolkit
TensorRT survey
Introduction to Kubernetes and Google Container Engine (GKE)
Intro To Convolutional Neural Networks
Introduction to Spiking Neural Networks: From a Computational Neuroscience pe...
Deep Learning - CNN and RNN
Introduction to CNN
Google Cloud Networking Deep Dive
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Getting started with TensorFlow
Deep Learning - Convolutional Neural Networks - Architectural Zoo
Case study on deep learning
Action Recognition (Thesis presentation)
ONNX - The Lingua Franca of Deep Learning
Gitlab, GitOps & ArgoCD
An AI accelerator ASIC architecture
Capsule networks
Deep Learning for Video: Action Recognition (UPC 2018)
Convolutional Neural Network Models - Deep Learning
Ad

Similar to OpenVINO introduction (20)

PPTX
Enabling Cross-platform Deep Learning Applications with Intel OpenVINO™
PDF
Early Successes Debugging with TotalView on the Intel Xeon Phi Coprocessor
PPTX
Sundance's presentation at B:RAI 2020
PDF
iSense Java Summit 2017 - Microservices in action at the Dutch National Police
PDF
Hpc lunch and learn
PDF
Devoxx PL 2018 - Microservices in action at the Dutch National Police
PDF
Coding Secure Infrastructure in the Cloud using the PIE framework
PDF
Get There meetup March 2018 - Microservices in action at the Dutch National P...
PDF
Dublin JUG February 2018 - Microservices in action at the Dutch National Police
PDF
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
PDF
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
PDF
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
PPTX
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
PDF
ITCamp 2011 - Cristian Lefter - SQL Server code-name Denali
PDF
JavaZone 2017 - Microservices in action at the Dutch National Police
PDF
OpenValue meetup October 2017 - Microservices in action at the Dutch National...
PDF
Microservices in action at the Dutch National Police
PPTX
Amazon Deep Learning
PDF
SolidFire + Platform9: Simply Faster OpenStack
PDF
Microservices in action at the Dutch National Police - Bert Jan Schrijver - C...
Enabling Cross-platform Deep Learning Applications with Intel OpenVINO™
Early Successes Debugging with TotalView on the Intel Xeon Phi Coprocessor
Sundance's presentation at B:RAI 2020
iSense Java Summit 2017 - Microservices in action at the Dutch National Police
Hpc lunch and learn
Devoxx PL 2018 - Microservices in action at the Dutch National Police
Coding Secure Infrastructure in the Cloud using the PIE framework
Get There meetup March 2018 - Microservices in action at the Dutch National P...
Dublin JUG February 2018 - Microservices in action at the Dutch National Police
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
ITCamp 2011 - Cristian Lefter - SQL Server code-name Denali
JavaZone 2017 - Microservices in action at the Dutch National Police
OpenValue meetup October 2017 - Microservices in action at the Dutch National...
Microservices in action at the Dutch National Police
Amazon Deep Learning
SolidFire + Platform9: Simply Faster OpenStack
Microservices in action at the Dutch National Police - Bert Jan Schrijver - C...
Ad

Recently uploaded (20)

PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Cloud computing and distributed systems.
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
cuic standard and advanced reporting.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Reach Out and Touch Someone: Haptics and Empathic Computing
Network Security Unit 5.pdf for BCA BBA.
Chapter 3 Spatial Domain Image Processing.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Cloud computing and distributed systems.
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
cuic standard and advanced reporting.pdf
Encapsulation theory and applications.pdf
Unlocking AI with Model Context Protocol (MCP)
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Agricultural_Statistics_at_a_Glance_2022_0.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Digital-Transformation-Roadmap-for-Companies.pptx
MYSQL Presentation for SQL database connectivity
Understanding_Digital_Forensics_Presentation.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...

OpenVINO introduction

  • 1. High performance applications using OpenVINO™ 1 Yury Gorbachev
  • 2. Internet of Things Group Why? • DL Training and inference are essentially very different domains • Frameworks contain too much for training • Hardware requirements are very different (100W is pretty normal) • Performance goals are different (batch size, latency vs. throughput) • Environment is different (development vs. deployment) • Hard to find solution that does DL inference properly
  • 3. Internet of Things Group 3 What is proper Deep Learning inference? • Highest performance characteristics • Inference/watt/$ is major concern, best possible performance is a must • Minimal footprint • Memory, binary size, execution overhead • Absolute minimum of dependencies • Cross platform portability • Backward compatibility and predictable maintenance
  • 4. Internet of Things Group 4 Also • CV application is not just Deep Learning • “Classical” components widely used • OpenCV is undisputable champion of the CV world • Need to satisfy deployment requirements as well • Performance, footprint, legal cleanness, etc.
  • 5. Internet of Things Group 5 OpenVINO™ Toolkit for best CV/DL applications • Development toolkit for high performance CV and DL inference • Solution for application designers • No training/research overhead or specifics, minimal footprint, highly portable code • Set of libraries to solve CV/DL deployment problems • Fastest OpenCV build • Deep Learning Inference Engine • Provides access to all Intel accelerators and heterogeneous execution model • Intel CPU, integrated GPU • Vision Processing Unit (VPU) and FPGA
  • 6. Internet of Things Group OpenVINO™ vs. Computer Vision Input Object Person Face Emotion Gesture Text … Custom Components CV/non-DL Components Direct Coding Solution API Solution API Solution Custom Code VPU GPU CPUFPGA VPU GPU CPU DL Components Computer Vision Pipeline DL Inference Engine VPU GPU CPUFPGA OpenVINO ™
  • 7. Internet of Things Group 7 OpenCV: OpenVINO vs. Open Source • Most performant and fine tuned build • SSE, AVX2 vectorization & TBB/OMP parallelism • GPU offload via Transparent API • DL Inference Engine by default for OpenCV DNN • Legally clean • Each build is checked with IP Protection tools, safe for production • Additional algorithms from Intel in binary form • Face Detection and analysis libraries
  • 8. Internet of Things Group 8 Deep Learning Inference Engine • Pure inference oriented solution (no training included) • Superior performance on Intel platform, highly optimized • Minimal memory use • No framework required in runtime • Support for CPU, GPU, FPGA, Movidius • Heterogeneous execution support • Cross-platform portability
  • 9. Internet of Things Group 9 Deep Learning performance using OpenVINO/CPU 3.6 3.5 2.7 3.0 Core™-i5 6500@2.9 GHZ
  • 10. Internet of Things Group 10 Deep Learning performance using OpenVINO/GPU 3.6 4.0 3.0 4.9 5.7 Core™-i5 6500@2.9 GHZ
  • 11. Internet of Things Group 11 Inference memory footprint on CPU 7.8 3.0 3.1 3.4 4.5 1.6 Core™-i5 6500@2.9 GHZ
  • 12. Internet of Things Group 12 Pre-trained Models Video/Image Pre-trained Models Video/Image Once in Design Time IR User Application + Inference Engine Model Optimizer User Application + Framework Model Model With Deep Learning Frameworks With OpenVINO™ DL Inference Engine
  • 13. Internet of Things Group Deep Learning Inference Engine (IE) DL Inference Engine API Deep Learning application IR Model Optimizer Design time CPU Plugin MKL- DNN Heterogeneous Execution Engine C++ layers GPU Plugin clDNN Movidius Plugin Custom OCL layers FPGA Plugin MVNC DLA Custom layers + 3 4 3 Framework independent lightweight internal representation 4 Customizations in C++ and OpenCL languages 2 Heterogeneous network execution across accelerators 1 2 1 Single API solution across accelerators
  • 14. Internet of Things Group 14 Easier deployment • Accurate against original framework • Direct replacement of original framework calls • No retraining/fine-tuning required • Unified support for multiple OSes • Linux and Windows are equally supported and performant • Encapsulates basic preprocessing • Mean subtraction / normalization integrated into model
  • 15. Internet of Things Group 15 Portability across platforms • Single API across platforms • No need to change SDKs and application codes depending on targets • Consistent set of layers and accurate results across targets • Verified against reference model/framework • Heterogeneous execution for missing pieces • CPU fallback whenever needed
  • 16. Internet of Things Group 16 Additional portability benefits • Intel has rapidly developing hardware set • New SoCs and architectures are evolving • No need to wait for HW itself or emulator • Design app for existing targets first and move to new ones easy • (Bonus) Check algos on fastest, deploy on most suitable! Create application using Inference Engine API Design and validate on CPU/GPU clusters Deploy on Movidius/FPGA Targets
  • 17. Internet of Things Group 17 Customization possibilities • OpenVINO™ is partially binary product right now • Possible to implement and add own layers • New topologies are easy to support • Most of the layers are delivered as a source • Check how it is done for known topologies • Base your implementation on those for faster TTM • MKLDNN for CPU and clDNN for GPU are already available in source form • Possible to check implementation there
  • 18. Internet of Things Group 18 Open Model Zoo • Free reference models for Deep Learning Inference Engine • Object Detection (Face, People, Vehicles, etc.) • Object Analysis (Facial analysis, Head Pose, Vehicle attributes) • Superior performance on Intel • Core™ i5 CPU: SSD 300 (6 fps) vs. People Detection Model (68 fps) Significant reduction in development efforts, no dataset & training needed
  • 19. Internet of Things Group 19 Samples • Basic samples to facilitate API understanding • Classification, object detection, segmentation • Target selection via command line • Extended samples using Model Zoo • Face analysis, Security camera sample • Interworking between Media SDK, OpenCV, DL IE • Automated public models downloader script