SlideShare a Scribd company logo
6
Most read
16
Most read
20
Most read
© 2020 Xilinx
Vitis and Vitis AI:
Application Acceleration from
Cloud to Edge
Vinod Kathail
Fellow and Chief Architect SW Tools
© 2020 Xilinx
Industry Trends
Heterogeneous
Compute
Cloud to Edge AI Proliferation
Key challenge:
Programming &
integration of accelerators
Key challenge:
Need for retargetability
Key challenge:
Efficient ML acceleration and
integration in the whole application
© 2020 Xilinx
Heterogeneous
Compute
Cloud to Edge AI Proliferation
© 2020 Xilinx
Vitis Unified
Software Platform
© 2020 Xilinx
From Hardware to Software Programmable
5
© 2020 Xilinx
Vitis: Unified Software Platform
© 2020 Xilinx
Anatomy of an Accelerated Application
7
AXI (or PCIe)
CPU
Host Application
Drivers
XRT
XRT/OpenCL API
FPGA
Accelerated Functions
DMA Engine
Global Memory
AXI Interfaces
User
Application
Code
Acceleration
Platform
© 2020 Xilinx
Platform-Based Development Approach
Development is performed in the context of a platform
• A pre-configured system containing I/O, status monitoring, and lifecycle management
Standardized interfaces allow for automated composition of user functionality
© 2020 Xilinx
Vitis: Comprehensive Development Tool Suite
9
Build
Run
System level
Simulation
ARM
Compiler
AIE
Compiler
Vitis HLSHost CPU
System Compile/Link
Xilinx Runtime Library (XRT)
Device/Card Device/CardDevice/Card
Analyze
Host
Application Libraries
Application
C/C++
Target
PlatformRTL
Debug & Performance
Analysis
© 2020 Xilinx
Kernel Drivers
User Space Library
Kernel Drivers
OCL XMA AI C++ Python
XOCL ZOCL
Ultrascale
PCI-e Device
MPSoC/ACAP
PCI-e Device
MPSoC/
ACAP Device
MB scheduler ARM Scheduler ARM Scheduler
˃ Platform- and OS-independent APIs for
Device management
Memory management and data transfers
Accelerator execution management
˃ OpenCL wrappers, media frameworks,
and domain-specific APIs built on top of
base APIs
˃ Open source and available on GitHub
Xilinx Runtime (XRT)
© 2020 Xilinx
Open Source, Standards Based Libraries
11
https://github.com/Xilinx/Vitis_Libraries
Vision & Image Finance Data Analytics &
Database
Data Compression Data Security
Math Linear Algebra Statistics DSP Data Management
Domain-Specific Libraries
Common Libraries
400+ functions across multiple libraries for performance-optimized out-of-the-box acceleration
Library
Docs
Tests
Examples
Bench-
marks
© 2020 Xilinx
Vitis Vision Libraries
12
© 2020 Xilinx
Deploy: Embedded, Single Server, Scale-out
© 2020 Xilinx
Vitis drivers & runtime (XRT)
Frameworks
Vitis AI
Development
Kit
Vitis AI
Models
DSA
AI Quantizer AI CompilerAI Optimizer AI Profiler
LSTM DPU MLP DPUCNN DPU
60+ pretrained, optimized
reference models
Performance
improvement up to 10-20x
Tensor based ISA for true
software programmability
DSA – Domain Specific Architecture
DPU – DNN Processing Unit
Vitis AI: Real Time AI Inference Acceleration
© 2020 Xilinx
ACAP
Server
CPU
PCI
Express
FPGA
AI Engine
CPU
Cores
Video Analytics
Management
Computer
Vision
Primary
Detector (AI)
Object
Tracker
Secondary
Classifiers (AI)
Video
Decode
AI Embedded in Apps, Rarely the Whole App
© 2020 Xilinx
Example: Whole Application Acceleration
>>
16
Input
Image
2-3 days to integrate and test using Vitis and Vitis+AI Libraries
0
100
200
300
Tiny Yolo v3 Resnet-50 GoogleNet
Frames/sec
Speedup over CPU
CPU FPGA
Implemented either in FPGA or CPU Always in FPGA
© 2020 Xilinx
Example: Defect Detection
Low Latency: End-to-end < 250ms
>>
17
© 2020 Xilinx
Shell
Hardware
Developers
Application
Software Developers
AI and Data Scientists
(iterations in minutes)
Embedded
Developers
Putting it All Together
© 2020 Xilinx
Summary
19
Unified Software Platform
 Cloud to edge, software and AI
 Comprehensive tools, runtime, libraries and models
Standards, Open Source
 Participating in open source
 Use of standard environments & APIs
© 2020 Xilinx
Additional Resources
Please visit the following sites for more information
Vitis Unified SW Platform
• https://www.xilinx.com/products/design-
tools/vitis.html
Vitis Libraries
• https://www.xilinx.com/products/design-
tools/vitis/vitis-libraries.html
• https://github.com/Xilinx/Vitis_Libraries/
Vitis AI
• https://www.xilinx.com/products/design-
tools/vitis/vitis-ai.html
Visit the Avnet-Xilinx booth to see the following demonstrations
in action:
• Face Detection, Pedestrian Detection, Pose Estimation,
Machine Learning and more
• Hardware families include the Zynq Ultrascale+ and Versal
AI Core
• Demonstration platforms include our SmartCamera+,
Ultra96, and UltraZed
• Xilinx and Avnet staff will be available to answer any
questions
2020 Embedded Vision Summit
• Vitis and Vitis AI: Application Acceleration from Cloud to
Edge
• September 17, 2020, 11:00-11:30AM PDT
20

More Related Content

PDF
FPGA Hardware Accelerator for Machine Learning
PDF
Introducing the Vitis Unified Software Platform for Programming FPGAs
PDF
GPU - Basic Working
PPTX
Introduction to FPGA acceleration
PDF
Hardware Acceleration for Machine Learning
PDF
What is Deep Learning | Deep Learning Simplified | Deep Learning Tutorial | E...
PDF
Deep learning: Hardware Landscape
PDF
TensorFlow Lite for mobile & IoT
FPGA Hardware Accelerator for Machine Learning
Introducing the Vitis Unified Software Platform for Programming FPGAs
GPU - Basic Working
Introduction to FPGA acceleration
Hardware Acceleration for Machine Learning
What is Deep Learning | Deep Learning Simplified | Deep Learning Tutorial | E...
Deep learning: Hardware Landscape
TensorFlow Lite for mobile & IoT

What's hot (20)

PPTX
Introduction to Keras
PDF
POWER10 innovations for HPC
PPTX
Tensor Processing Unit (TPU)
PDF
Architecture of TPU, GPU and CPU
PPTX
RISC-V Introduction
PDF
GPU - An Introduction
PPTX
Understanding eBPF in a Hurry!
PPTX
Hands on OpenCL
PPTX
What is TensorFlow? | Introduction to TensorFlow | TensorFlow Tutorial For Be...
PDF
Introduction to TensorFlow Lite
PDF
Introduction to OpenCL
PDF
HPC Application Profiling and Analysis
PDF
FPGAs and Machine Learning
PPTX
Nvidia (History, GPU Architecture and New Pascal Architecture)
PDF
Project ACRN: SR-IOV implementation
PDF
On-device ML with TFLite
PDF
BPF Internals (eBPF)
PDF
Accelerating Virtual Machine Access with the Storage Performance Development ...
PDF
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
PDF
DPDK in Containers Hands-on Lab
Introduction to Keras
POWER10 innovations for HPC
Tensor Processing Unit (TPU)
Architecture of TPU, GPU and CPU
RISC-V Introduction
GPU - An Introduction
Understanding eBPF in a Hurry!
Hands on OpenCL
What is TensorFlow? | Introduction to TensorFlow | TensorFlow Tutorial For Be...
Introduction to TensorFlow Lite
Introduction to OpenCL
HPC Application Profiling and Analysis
FPGAs and Machine Learning
Nvidia (History, GPU Architecture and New Pascal Architecture)
Project ACRN: SR-IOV implementation
On-device ML with TFLite
BPF Internals (eBPF)
Accelerating Virtual Machine Access with the Storage Performance Development ...
“Making Edge AI Inference Programming Easier and Flexible,” a Presentation fr...
DPDK in Containers Hands-on Lab
Ad

Similar to “Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presentation from Xilinx (20)

PDF
HiPEAC 2019 Workshop - Vision Processing
PDF
System Design on Zynq using SDSoC
PDF
Xilinx Edge Compute using Power 9 /OpenPOWER systems
PDF
SYCL 2020 Specification
PDF
“Solving Tomorrow’s AI Problems Today with Cadence’s Newest Processor,” a Pre...
PDF
[2017년 5월 정기세미나] IBM에서 바라보는 OpenStack 이야기
PDF
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
PPTX
Migrating from VMs to Kubernetes using HashiCorp Consul Service on Azure
PDF
Optimize your CI/CD with GitLab and AWS
PPTX
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
PPTX
StampedeCon 2015 Keynote
PDF
SCADA a gyakorlatban - Accenture Industry X.0 Meetup
PDF
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
PPTX
Cisco Connect 2018 Indonesia - software-defined access-a transformational ap...
PDF
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
PDF
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
PDF
“Streamlining Development of Edge AI Applications,” a Presentation from NVIDIA
PDF
MIPI DevCon Seoul 2018: Next Generation Verification Process for Automotive a...
PDF
Innovate everywhere - SUSE edge
PDF
“A Platform Approach to Developing Networked Visual AI Systems,” a Presentati...
HiPEAC 2019 Workshop - Vision Processing
System Design on Zynq using SDSoC
Xilinx Edge Compute using Power 9 /OpenPOWER systems
SYCL 2020 Specification
“Solving Tomorrow’s AI Problems Today with Cadence’s Newest Processor,” a Pre...
[2017년 5월 정기세미나] IBM에서 바라보는 OpenStack 이야기
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
Migrating from VMs to Kubernetes using HashiCorp Consul Service on Azure
Optimize your CI/CD with GitLab and AWS
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
StampedeCon 2015 Keynote
SCADA a gyakorlatban - Accenture Industry X.0 Meetup
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Cisco Connect 2018 Indonesia - software-defined access-a transformational ap...
HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to ...
Applying MBSE to the Industrial IoT: Using SysML with Connext DDS and Simulink
“Streamlining Development of Edge AI Applications,” a Presentation from NVIDIA
MIPI DevCon Seoul 2018: Next Generation Verification Process for Automotive a...
Innovate everywhere - SUSE edge
“A Platform Approach to Developing Networked Visual AI Systems,” a Presentati...
Ad

More from Edge AI and Vision Alliance (20)

PDF
“Visual Search: Fine-grained Recognition with Embedding Models for the Edge,”...
PDF
“Optimizing Real-time SLAM Performance for Autonomous Robots with GPU Acceler...
PDF
“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applicat...
PDF
“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
PDF
“Quantization Techniques for Efficient Deployment of Large Language Models: A...
PDF
“Introduction to Data Types for AI: Trade-Offs and Trends,” a Presentation fr...
PDF
“Introduction to Radar and Its Use for Machine Perception,” a Presentation fr...
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
PDF
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
PDF
“Beyond the Demo: Turning Computer Vision Prototypes into Scalable, Cost-effe...
PDF
“Running Accelerated CNNs on Low-power Microcontrollers Using Arm Ethos-U55, ...
PDF
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
PDF
“A Re-imagination of Embedded Vision System Design,” a Presentation from Imag...
PDF
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
PDF
“Evolving Inference Processor Software Stacks to Support LLMs,” a Presentatio...
PDF
“Efficiently Registering Depth and RGB Images,” a Presentation from eInfochips
PDF
“How to Right-size and Future-proof a Container-first Edge AI Infrastructure,...
“Visual Search: Fine-grained Recognition with Embedding Models for the Edge,”...
“Optimizing Real-time SLAM Performance for Autonomous Robots with GPU Acceler...
“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applicat...
“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
“Quantization Techniques for Efficient Deployment of Large Language Models: A...
“Introduction to Data Types for AI: Trade-Offs and Trends,” a Presentation fr...
“Introduction to Radar and Its Use for Machine Perception,” a Presentation fr...
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...
“Beyond the Demo: Turning Computer Vision Prototypes into Scalable, Cost-effe...
“Running Accelerated CNNs on Low-power Microcontrollers Using Arm Ethos-U55, ...
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
“A Re-imagination of Embedded Vision System Design,” a Presentation from Imag...
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
“Evolving Inference Processor Software Stacks to Support LLMs,” a Presentatio...
“Efficiently Registering Depth and RGB Images,” a Presentation from eInfochips
“How to Right-size and Future-proof a Container-first Edge AI Infrastructure,...

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Electronic commerce courselecture one. Pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Encapsulation theory and applications.pdf
PDF
KodekX | Application Modernization Development
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Spectroscopy.pptx food analysis technology
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
cuic standard and advanced reporting.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPT
Teaching material agriculture food technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Electronic commerce courselecture one. Pdf
Encapsulation_ Review paper, used for researhc scholars
Encapsulation theory and applications.pdf
KodekX | Application Modernization Development
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Spectroscopy.pptx food analysis technology
MYSQL Presentation for SQL database connectivity
Chapter 3 Spatial Domain Image Processing.pdf
Machine learning based COVID-19 study performance prediction
Dropbox Q2 2025 Financial Results & Investor Presentation
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
cuic standard and advanced reporting.pdf
Unlocking AI with Model Context Protocol (MCP)
Diabetes mellitus diagnosis method based random forest with bat algorithm
Teaching material agriculture food technology
Understanding_Digital_Forensics_Presentation.pptx
Programs and apps: productivity, graphics, security and other tools

“Vitis and Vitis AI: Application Acceleration from Cloud to Edge,” a Presentation from Xilinx

  • 1. © 2020 Xilinx Vitis and Vitis AI: Application Acceleration from Cloud to Edge Vinod Kathail Fellow and Chief Architect SW Tools
  • 2. © 2020 Xilinx Industry Trends Heterogeneous Compute Cloud to Edge AI Proliferation Key challenge: Programming & integration of accelerators Key challenge: Need for retargetability Key challenge: Efficient ML acceleration and integration in the whole application
  • 4. © 2020 Xilinx Vitis Unified Software Platform
  • 5. © 2020 Xilinx From Hardware to Software Programmable 5
  • 6. © 2020 Xilinx Vitis: Unified Software Platform
  • 7. © 2020 Xilinx Anatomy of an Accelerated Application 7 AXI (or PCIe) CPU Host Application Drivers XRT XRT/OpenCL API FPGA Accelerated Functions DMA Engine Global Memory AXI Interfaces User Application Code Acceleration Platform
  • 8. © 2020 Xilinx Platform-Based Development Approach Development is performed in the context of a platform • A pre-configured system containing I/O, status monitoring, and lifecycle management Standardized interfaces allow for automated composition of user functionality
  • 9. © 2020 Xilinx Vitis: Comprehensive Development Tool Suite 9 Build Run System level Simulation ARM Compiler AIE Compiler Vitis HLSHost CPU System Compile/Link Xilinx Runtime Library (XRT) Device/Card Device/CardDevice/Card Analyze Host Application Libraries Application C/C++ Target PlatformRTL Debug & Performance Analysis
  • 10. © 2020 Xilinx Kernel Drivers User Space Library Kernel Drivers OCL XMA AI C++ Python XOCL ZOCL Ultrascale PCI-e Device MPSoC/ACAP PCI-e Device MPSoC/ ACAP Device MB scheduler ARM Scheduler ARM Scheduler ˃ Platform- and OS-independent APIs for Device management Memory management and data transfers Accelerator execution management ˃ OpenCL wrappers, media frameworks, and domain-specific APIs built on top of base APIs ˃ Open source and available on GitHub Xilinx Runtime (XRT)
  • 11. © 2020 Xilinx Open Source, Standards Based Libraries 11 https://github.com/Xilinx/Vitis_Libraries Vision & Image Finance Data Analytics & Database Data Compression Data Security Math Linear Algebra Statistics DSP Data Management Domain-Specific Libraries Common Libraries 400+ functions across multiple libraries for performance-optimized out-of-the-box acceleration Library Docs Tests Examples Bench- marks
  • 12. © 2020 Xilinx Vitis Vision Libraries 12
  • 13. © 2020 Xilinx Deploy: Embedded, Single Server, Scale-out
  • 14. © 2020 Xilinx Vitis drivers & runtime (XRT) Frameworks Vitis AI Development Kit Vitis AI Models DSA AI Quantizer AI CompilerAI Optimizer AI Profiler LSTM DPU MLP DPUCNN DPU 60+ pretrained, optimized reference models Performance improvement up to 10-20x Tensor based ISA for true software programmability DSA – Domain Specific Architecture DPU – DNN Processing Unit Vitis AI: Real Time AI Inference Acceleration
  • 15. © 2020 Xilinx ACAP Server CPU PCI Express FPGA AI Engine CPU Cores Video Analytics Management Computer Vision Primary Detector (AI) Object Tracker Secondary Classifiers (AI) Video Decode AI Embedded in Apps, Rarely the Whole App
  • 16. © 2020 Xilinx Example: Whole Application Acceleration >> 16 Input Image 2-3 days to integrate and test using Vitis and Vitis+AI Libraries 0 100 200 300 Tiny Yolo v3 Resnet-50 GoogleNet Frames/sec Speedup over CPU CPU FPGA Implemented either in FPGA or CPU Always in FPGA
  • 17. © 2020 Xilinx Example: Defect Detection Low Latency: End-to-end < 250ms >> 17
  • 18. © 2020 Xilinx Shell Hardware Developers Application Software Developers AI and Data Scientists (iterations in minutes) Embedded Developers Putting it All Together
  • 19. © 2020 Xilinx Summary 19 Unified Software Platform  Cloud to edge, software and AI  Comprehensive tools, runtime, libraries and models Standards, Open Source  Participating in open source  Use of standard environments & APIs
  • 20. © 2020 Xilinx Additional Resources Please visit the following sites for more information Vitis Unified SW Platform • https://www.xilinx.com/products/design- tools/vitis.html Vitis Libraries • https://www.xilinx.com/products/design- tools/vitis/vitis-libraries.html • https://github.com/Xilinx/Vitis_Libraries/ Vitis AI • https://www.xilinx.com/products/design- tools/vitis/vitis-ai.html Visit the Avnet-Xilinx booth to see the following demonstrations in action: • Face Detection, Pedestrian Detection, Pose Estimation, Machine Learning and more • Hardware families include the Zynq Ultrascale+ and Versal AI Core • Demonstration platforms include our SmartCamera+, Ultra96, and UltraZed • Xilinx and Avnet staff will be available to answer any questions 2020 Embedded Vision Summit • Vitis and Vitis AI: Application Acceleration from Cloud to Edge • September 17, 2020, 11:00-11:30AM PDT 20