SlideShare a Scribd company logo
connect ONNX to every deep learning accelerator
https://onnc.ai
https://onnc.ai
executablility
CPU GPU DSP DLA
https://onnc.ai
traditional compiler
heterogeneous
architecture system (HSA)
single architecture
PetriNet(1) CFG and DFG
target
programming
model
IR type ONNX IR
(multiple outputs)
three address code
(single output)
physical
feature
depends on
operand opcode
https://onnc.ai
Assumption of target systems
• Accelerators are more effective than processors
• Processors are more flexible than accelerators
• If the communication cost is less than the computation
cost, than the task will reside in accelerator
• All tasks start from the top level processor
CPU DSP DLA
flexible effective
https://onnc.ai
CPU
DSP
DLA CONV
load
store
load
store
CONV
CONV
compulsory spill
compulsory spill
cost effective
flexible
Compulsory Spill
is easy to implement in the other compiler framework
https://onnc.ai
CPU
DSP
DLA CONV
load
store
load
store
X
store
load
Y
store
load
memory spill
eliminate them
in compiler
Memory Spill
is what we already have in every compiler framework
https://onnc.ai
Z
CPU
DSP
DLA CONV
load
store
load
store
X
store
load store
Y
load store
load
W
store
load
Z
Z
operator spill
Operator Spill
is totally new and required for every accelerators
https://onnc.ai
What a compiler should do
when an operator spill occurs?
1. push the operator to upper device
2. split the operator
3. give up this compilation and retry
In many cases, option 3 is the only possible solution
https://onnc.ai
traditional compiler
heterogeneous
architecture system (HSA)
single architecture
ITERATIVE sequential
target
compilation
model
Lattice
D
BA C
Add D, PassManager will add A and B
automatically
A B D C
topologic sort
retry
https://onnc.ai
traditional compiler
Limited DLA
save 377% in avg.
paging systemMemory
constraint
randnet_
manual/t
est2
CaffeNet LeNet yolo9000 AlexNet
R-CNN-
ilsvrc13
yolov1
FlickrStyl
eCaffeNe
t
VGG_ILSV
RC_19_la
yer
VGG_ILSV
RC_16_la
yer
yolov2-
tiny
yolov1-
tiny
Ratio (origin size / new size) 361.25% 263.58% 120.83% 615.86% 312.82% 263.34% 1079.96% 264.32% 554.49% 494.60% 443.97% 408.18%
0.00%
200.00%
400.00%
600.00%
800.00%
1000.00%
1200.00%
Ratio (origin size / new size)
https://onnc.ai
Connect to both
LLVM and ASIC
No porting effort for LLVM compiler
Support complex ASIC design
https://onnc.ai
for porters
for developers
for testers
Projects reside in
https://repo.onnc.ai
The Regression project
The Umbrella project
https://onnc.ai
How to contribute
https://onnc.ai
I have a question I have a wish
Ask questions in the
mailing list
Is the wish specific?
Is it a long wish?
yes
Make
an issue
no
no
yes
https://onnc.ai
Current Status
0.9.1
1.0.0
~8/24
Next release
often release; fast Iterate
(3~4 weeks a release interval)
https://onnc.ai
Give me 罐罐 and
Stars, please
https://repo.onnc.ai

More Related Content

PDF
Ostech war story using mainline linux for an android tv bsp
PDF
Evaluating GPU programming Models for the LUMI Supercomputer
PDF
Utilizing AMD GPUs: Tuning, programming models, and roadmap
PDF
RISC-V 30908 patra
PPT
GEM - GNU C Compiler Extensions Framework
PDF
Las16 200 - firmware summit - ras what is it- why do we need it
PPTX
An Open Discussion of RISC-V BitManip, trends, and comparisons _ Claire
PDF
LAS16-TR03: Upstreaming 201
Ostech war story using mainline linux for an android tv bsp
Evaluating GPU programming Models for the LUMI Supercomputer
Utilizing AMD GPUs: Tuning, programming models, and roadmap
RISC-V 30908 patra
GEM - GNU C Compiler Extensions Framework
Las16 200 - firmware summit - ras what is it- why do we need it
An Open Discussion of RISC-V BitManip, trends, and comparisons _ Claire
LAS16-TR03: Upstreaming 201

What's hot (20)

PPTX
Online test program generator for RISC-V processors
PDF
LAS16-405:OpenDataPlane: Software Defined Dataplane leader
PDF
LAS16-207: Bus scaling QoS
PDF
Andes open cl for RISC-V
PDF
Challenges in GPU compilers
PPTX
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
PDF
Q4.11: NEON Intrinsics
PDF
BKK16-302: Android Optimizing Compiler: New Member Assimilation Guide
PDF
Programming Languages & Tools for Higher Performance & Productivity
PDF
LAS16-TR02: Upstreaming 101
PDF
Pragmatic optimization in modern programming - modern computer architecture c...
PDF
LAS16-403: GDB Linux Kernel Awareness
PPT
DUSK - Develop at Userland Install into Kernel
PDF
LAS16-400: Mini Conference 3 AOSP (Session 1)
PDF
eBPF Debugging Infrastructure - Current Techniques
PDF
LAS16-101: Efficient kernel backporting
PPTX
RISC-V assembly
PDF
Klessydra-T: Designing Configurable Vector Co-Processors for Multi-Threaded E...
PDF
LAS16-TR06: Remoteproc & rpmsg development
PPTX
Architecture Exploration of RISC-V Processor and Comparison with ARM Cortex-A53
Online test program generator for RISC-V processors
LAS16-405:OpenDataPlane: Software Defined Dataplane leader
LAS16-207: Bus scaling QoS
Andes open cl for RISC-V
Challenges in GPU compilers
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Q4.11: NEON Intrinsics
BKK16-302: Android Optimizing Compiler: New Member Assimilation Guide
Programming Languages & Tools for Higher Performance & Productivity
LAS16-TR02: Upstreaming 101
Pragmatic optimization in modern programming - modern computer architecture c...
LAS16-403: GDB Linux Kernel Awareness
DUSK - Develop at Userland Install into Kernel
LAS16-400: Mini Conference 3 AOSP (Session 1)
eBPF Debugging Infrastructure - Current Techniques
LAS16-101: Efficient kernel backporting
RISC-V assembly
Klessydra-T: Designing Configurable Vector Co-Processors for Multi-Threaded E...
LAS16-TR06: Remoteproc & rpmsg development
Architecture Exploration of RISC-V Processor and Comparison with ARM Cortex-A53
Ad

Similar to ONNC - 0.9.1 release (20)

PDF
Preparing to program Aurora at Exascale - Early experiences and future direct...
PDF
XPDDS18: CPUFreq in Xen on ARM - Oleksandr Tyshchenko, EPAM Systems
PPTX
OFI Overview 2019 Webinar
PPT
Choosing the right processor
PPT
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
PDF
Heterogeneous multiprocessing on androd and i.mx7
PDF
Asymmetric Multiprocessing - Kynetics ELC 2018 portland
PDF
AMP Kynetics - ELC 2018 Portland
PDF
Exploring the Programming Models for the LUMI Supercomputer
PDF
6 open capi_meetup_in_japan_final
PPTX
Crusoe processor
PDF
CAPI and OpenCAPI Hardware acceleration enablement
PDF
00 opencapi acceleration framework yonglu_ver2
PDF
RISC V in Spacer
PDF
LAS16-210: Hardware Assisted Tracing on ARM with CoreSight and OpenCSD
PPTX
Cockatrice: A Hardware Design Environment with Elixir
PDF
XPDDS17: Keynote: Shared Coprocessor Framework on ARM - Oleksandr Andrushchen...
PDF
Learn more about the tremendous value Open Data Plane brings to NFV
PPTX
Add sale davinci
PDF
Development of Signal Processing Algorithms using OpenCL for FPGA based Archi...
Preparing to program Aurora at Exascale - Early experiences and future direct...
XPDDS18: CPUFreq in Xen on ARM - Oleksandr Tyshchenko, EPAM Systems
OFI Overview 2019 Webinar
Choosing the right processor
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Heterogeneous multiprocessing on androd and i.mx7
Asymmetric Multiprocessing - Kynetics ELC 2018 portland
AMP Kynetics - ELC 2018 Portland
Exploring the Programming Models for the LUMI Supercomputer
6 open capi_meetup_in_japan_final
Crusoe processor
CAPI and OpenCAPI Hardware acceleration enablement
00 opencapi acceleration framework yonglu_ver2
RISC V in Spacer
LAS16-210: Hardware Assisted Tracing on ARM with CoreSight and OpenCSD
Cockatrice: A Hardware Design Environment with Elixir
XPDDS17: Keynote: Shared Coprocessor Framework on ARM - Oleksandr Andrushchen...
Learn more about the tremendous value Open Data Plane brings to NFV
Add sale davinci
Development of Signal Processing Algorithms using OpenCL for FPGA based Archi...
Ad

Recently uploaded (20)

PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
MYSQL Presentation for SQL database connectivity
PDF
cuic standard and advanced reporting.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Advanced IT Governance
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPT
Teaching material agriculture food technology
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PPTX
Cloud computing and distributed systems.
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
MYSQL Presentation for SQL database connectivity
cuic standard and advanced reporting.pdf
Machine learning based COVID-19 study performance prediction
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
Empathic Computing: Creating Shared Understanding
Understanding_Digital_Forensics_Presentation.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Advanced Soft Computing BINUS July 2025.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Advanced IT Governance
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Teaching material agriculture food technology
Chapter 3 Spatial Domain Image Processing.pdf
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
GamePlan Trading System Review: Professional Trader's Honest Take
Cloud computing and distributed systems.
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

ONNC - 0.9.1 release