SlideShare a Scribd company logo
Towards Better DL
Frameworks
Yangqing Jia
Research Lead on AI Platforms, Facebook
Source: XKCD, [Girshick et al. CVPR 2014]
• Researchers: "I will need to reproduce the ResNet
paper."
• Companies: "I need to apply DL to drive cars."
The Needs
Two sides of the same coin
• A grad student driven project
• Started by doing one job really well: image
classification
• Adopted by industry participants
• Popular deep learning framework run by a non-
profit.
Democratizing Deep Learning w/
CaffeGetting AlexNet running in 10 mins
http://caffe.berkeleyvision.org/
What makes a better DL
library?
???
"MAPS"
!!!
"MAPS"
-
Scalability
Scalability
Run fast, run far
“How do I train on

multiple GPUs and machines?”
- Probably the most question we got from Caffe users
Scalability
Run fast, run far
L1 L2 L3 L3b L2b L1b U3 U2 U1
Scalability
Run fast, run far
L1 L2 L3 L3b L2b L1b U3 U2 U1R3 R2 R1
Scalability
Run fast, run far
L1 L2 L3 L3b L2b L1b U3 U2 U1R3 R2 R1
L1 L2 L3 L3b L2b L1b U3 U2 U1R3 R2 R1
Scalability
Run fast, run far
L1 L2 L3 L3b L2b L1b
U3 U2 U1R3 R2 R1
L1 L2 L3 L3b L2b L1b
U3 U2 U1R3 R2 R1
The Return of MPI
"I'm your father", said Allreduce.
Allreduce
Tree based - O(MlogN)
Ring based - O(M)
etc.
Scalability
Sitting on top of giants
... and many more
"MAPS"
-
Portability
Portable System
Cloud, Mobile, IoT, Cars, Drones, Coffee makers
AI Math and Algorithms
Deployment Platforms
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
Portable System
Cloud, Mobile, IoT, Cars, Drones, Coffee makers
Model
auto predictor =

caffe2::Predictor(model_file)
public class Predictor implements

Caffe2ModelInterface;
Still, a lot of thoughts needed
• Limited computation
• Battery life is a thing
• Our models may be luxurious
• Ecosystem less developed
Portable System Challenges
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
"MAPS"
-
Augmented Comp Patterns
Augmented Comp Patterns
Forget about float dense math, the world is bigger
• Quantized Computation
• Sparse Math Libraries
• Model Compression
• Rethinking Existing Operations
Quantized Computation
Forget about float, the world is bigger
8 23
5 10
16
8
float
fp16
fixed16
fixed8
Quantized Computation
Forget about float, the world is bigger
float add
fp16 add
fixed16 add
fixed8 add
0.9
0.4
0.05
0.03
float mul
fp16 mul
fixed8 mul
4.0
1.0
0.2
Why?
Source: Nvidia https://devblogs.nvidia.com/parallelforall/mixed-precision-programming-cuda-8/
Rethinking Existing Operations
ResNEXT is coming to town
gconv gconv
g g g g g g g g g ..
.
g
g g g g g g g g g ..
.
g
AlexNet Group
Conv
ResNext
Augmented Math Challenges
Forget about float, the world is bigger
• Solutions
• Eigen fp16
• CuDNN
• NNPack
• gemmlowp
• Challenges
• Seamless
conversion?
• Model training?
• Performance tuning?
• ...
"MAPS"
-
Modularity
A Repeated Pattern
Many key components in deep learning
are

reusable

across frameworks.
In 2013 it used to be...
Caffe Torch Theano ...
Unix Philosophy?
Applications
Caffe,Torch,TF, MXNet, etc...
Core Math
Eigen

CuDNN
NNPack

THNN

MKL
Comms
NCCL

MPI

ZeroMQ

Redis

...
Low Level
CUDA
OpenGL
OpenCL
Vulkan

...
Compilers
DataBases
LevelDB

RocksDB

Hadoop
Amazon S3
your old disk
or, "UnFramework"
Modular
Designs
MAPS for a good framework
Augmented
Mathematics
Portable
System
Scalability
Interface to
Existing
Toolkits
Efficient

Mobile
Runtimes
Tuned

Collective
Primitives
Optimized

Math
Libraries
+

Flexible Framework Design
No Silver Bullet?
There is no silver bullet
Industry:
Stability
Scale & speed
Data Integration
Relatively Fixed
Research:

Flexible
Fast Iteration
Debuggable
Relatively bare-
Caffe Torch
TheanoTensorFlowD4J etc.
There is no silver bullet
Industry:
Stability
Scale & speed
Data Integration
Relatively Fixed
Research:

Flexible
Fast Iteration
Debuggable
Relatively bare-
Caffe Torch
“In open source, we feel strongly that

to really do something well,

you have to get a lot of people involved.”
— Linus Torvalds
Thank you!
Towards Better Deep Learning Frameworks
Yangqing Jia, Research Lead on AI Platforms, Facebook

More Related Content

PDF
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
PDF
Soumith Chintala at AI Frontiers: A Dynamic View of the Deep Learning World
PDF
Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
PPTX
The deep learning tour - Q1 2017
PDF
Recent developments in Deep Learning
PPTX
Deep learning on mobile - 2019 Practitioner's Guide
PDF
Improving Hardware Efficiency for DNN Applications
PDF
Deep learning with TensorFlow
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
Soumith Chintala at AI Frontiers: A Dynamic View of the Deep Learning World
Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
The deep learning tour - Q1 2017
Recent developments in Deep Learning
Deep learning on mobile - 2019 Practitioner's Guide
Improving Hardware Efficiency for DNN Applications
Deep learning with TensorFlow

What's hot (20)

PPTX
Deep Learning Frameworks Using Spark on YARN by Vartika Singh
PPTX
Android and Deep Learning
PDF
"How to Test and Validate an Automated Driving System," a Presentation from M...
PPTX
Intel and Amazon - Powering your innovation together.
PDF
GTC Taiwan 2017 企業端深度學習與人工智慧應用
PPTX
Deep learning on mobile
PDF
Distributed Deep Learning on Spark
PPTX
GPU and Deep learning best practices
PDF
Fascinating Tales of a Strange Tomorrow
PDF
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
PDF
GTC Taiwan 2017 在 Google Cloud 當中使用 GPU 進行效能最佳化
PDF
Intro to Machine Learning for GPUs
PDF
CNN Quantization
PDF
Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
PDF
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
PPTX
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
PDF
Introducing google’s mobile nets
PDF
Deep Dive on Deep Learning (June 2018)
PPTX
Narayanan Sundaram, Research Scientist, Intel Labs at MLconf SF - 11/13/15
PDF
Metaflow: The ML Infrastructure at Netflix
Deep Learning Frameworks Using Spark on YARN by Vartika Singh
Android and Deep Learning
"How to Test and Validate an Automated Driving System," a Presentation from M...
Intel and Amazon - Powering your innovation together.
GTC Taiwan 2017 企業端深度學習與人工智慧應用
Deep learning on mobile
Distributed Deep Learning on Spark
GPU and Deep learning best practices
Fascinating Tales of a Strange Tomorrow
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
GTC Taiwan 2017 在 Google Cloud 當中使用 GPU 進行效能最佳化
Intro to Machine Learning for GPUs
CNN Quantization
Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Introducing google’s mobile nets
Deep Dive on Deep Learning (June 2018)
Narayanan Sundaram, Research Scientist, Intel Labs at MLconf SF - 11/13/15
Metaflow: The ML Infrastructure at Netflix
Ad

Viewers also liked (20)

PDF
Alex Smola at AI Frontiers: Scalable Deep Learning Using MXNet
PDF
Rajat Monga at AI Frontiers: Deep Learning with TensorFlow
PDF
Intelligent Chatbot on WeChat
PDF
Nikko Ström at AI Frontiers: Deep Learning in Alexa
PDF
Scaling Deep Learning with MXNet
PDF
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
PDF
Charles Fan at AI Frontiers: The New Era of AI Plus
PDF
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
PDF
Adam Coates at AI Frontiers: AI for 100 Million People with Deep Learning
PDF
Andres Rodriguez at AI Frontiers: Catalyzing Deep Learning's Impact in the En...
PDF
Junli Gu at AI Frontiers: Autonomous Driving Revolution
PDF
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
PDF
Jeff Dean at AI Frontiers: Trends and Developments in Deep Learning Research
PDF
Liu Ren at AI Frontiers: Sensor-aware Augmented Reality
PDF
Jisheng Wang at AI Frontiers: Deep Learning in Security
PDF
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
PDF
James Manyika at AI Frontiers: A Future That Works: Automation, Employment, a...
PDF
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
PDF
Alex Smola, Director of Machine Learning, AWS/Amazon, at MLconf SF 2016
PDF
Rajat_Monga -Resume V04
Alex Smola at AI Frontiers: Scalable Deep Learning Using MXNet
Rajat Monga at AI Frontiers: Deep Learning with TensorFlow
Intelligent Chatbot on WeChat
Nikko Ström at AI Frontiers: Deep Learning in Alexa
Scaling Deep Learning with MXNet
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
Charles Fan at AI Frontiers: The New Era of AI Plus
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Adam Coates at AI Frontiers: AI for 100 Million People with Deep Learning
Andres Rodriguez at AI Frontiers: Catalyzing Deep Learning's Impact in the En...
Junli Gu at AI Frontiers: Autonomous Driving Revolution
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
Jeff Dean at AI Frontiers: Trends and Developments in Deep Learning Research
Liu Ren at AI Frontiers: Sensor-aware Augmented Reality
Jisheng Wang at AI Frontiers: Deep Learning in Security
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
James Manyika at AI Frontiers: A Future That Works: Automation, Employment, a...
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
Alex Smola, Director of Machine Learning, AWS/Amazon, at MLconf SF 2016
Rajat_Monga -Resume V04
Ad

Similar to Yangqing Jia at AI Frontiers: Towards Better DL Frameworks (20)

PDF
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
PDF
Cytoscape: Now and Future
PDF
Latest Developments in H2O
PDF
AI & Machine Learning Pipelines with Knative
PDF
Cloud: From Unmanned Data Center to Algorithmic Economy using Openstack
PPTX
Machine Learning for Smarter Apps - Jacksonville Meetup
PPT
Coffee script throwdown
PDF
Using BigBench to compare Hive and Spark (Long version)
PDF
Machine learning model to production
PPTX
Session 01 - Introduction to Java
PDF
Bringing Deep Learning into production
PPT
Big Graph Analytics on Neo4j with Apache Spark
PDF
Low Latency Polyglot Model Scoring using Apache Apex
PDF
How a Small Team Scales Instagram
PDF
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
PDF
Hadoop made fast - Why Virtual Reality Needed Stream Processing to Survive
PDF
Practical virtual network functions with Snabb (SDN Barcelona VI)
PDF
Ncku csie talk about Spark
PDF
NVIDIA Rapids presentation
PDF
Rapids: Data Science on GPUs
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Cytoscape: Now and Future
Latest Developments in H2O
AI & Machine Learning Pipelines with Knative
Cloud: From Unmanned Data Center to Algorithmic Economy using Openstack
Machine Learning for Smarter Apps - Jacksonville Meetup
Coffee script throwdown
Using BigBench to compare Hive and Spark (Long version)
Machine learning model to production
Session 01 - Introduction to Java
Bringing Deep Learning into production
Big Graph Analytics on Neo4j with Apache Spark
Low Latency Polyglot Model Scoring using Apache Apex
How a Small Team Scales Instagram
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
Hadoop made fast - Why Virtual Reality Needed Stream Processing to Survive
Practical virtual network functions with Snabb (SDN Barcelona VI)
Ncku csie talk about Spark
NVIDIA Rapids presentation
Rapids: Data Science on GPUs

More from AI Frontiers (20)

PPTX
Divya Jain at AI Frontiers : Video Summarization
PPTX
Training at AI Frontiers 2018 - LaiOffer Data Session: How Spark Speedup AI
PDF
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 1: Heuristi...
PDF
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
PDF
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-lecture 2: Incremen...
PDF
Training at AI Frontiers 2018 - Udacity: Enhancing NLP with Deep Neural Networks
PDF
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 3: Any-Angl...
PDF
Training at AI Frontiers 2018 - Lukasz Kaiser: Sequence to Sequence Learning ...
PDF
Percy Liang at AI Frontiers : Pushing the Limits of Machine Learning
PDF
Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission
PDF
Mark Moore at AI Frontiers : Uber Elevate
PPTX
Mario Munich at AI Frontiers : Consumer robotics: embedding affordable AI in ...
PPTX
Arnaud Thiercelin at AI Frontiers : AI in the Sky
PPTX
Anima Anandkumar at AI Frontiers : Modern ML : Deep, distributed, Multi-dimen...
PPTX
Wei Xu at AI Frontiers : Language Learning in an Interactive and Embodied Set...
PPTX
Sumit Gupta at AI Frontiers : AI for Enterprise
PPTX
Yuandong Tian at AI Frontiers : Planning in Reinforcement Learning
PPTX
Alex Ermolaev at AI Frontiers : Major Applications of AI in Healthcare
PPTX
Long Lin at AI Frontiers : AI in Gaming
PDF
Melissa Goldman at AI Frontiers : AI & Finance
Divya Jain at AI Frontiers : Video Summarization
Training at AI Frontiers 2018 - LaiOffer Data Session: How Spark Speedup AI
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 1: Heuristi...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-lecture 2: Incremen...
Training at AI Frontiers 2018 - Udacity: Enhancing NLP with Deep Neural Networks
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 3: Any-Angl...
Training at AI Frontiers 2018 - Lukasz Kaiser: Sequence to Sequence Learning ...
Percy Liang at AI Frontiers : Pushing the Limits of Machine Learning
Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission
Mark Moore at AI Frontiers : Uber Elevate
Mario Munich at AI Frontiers : Consumer robotics: embedding affordable AI in ...
Arnaud Thiercelin at AI Frontiers : AI in the Sky
Anima Anandkumar at AI Frontiers : Modern ML : Deep, distributed, Multi-dimen...
Wei Xu at AI Frontiers : Language Learning in an Interactive and Embodied Set...
Sumit Gupta at AI Frontiers : AI for Enterprise
Yuandong Tian at AI Frontiers : Planning in Reinforcement Learning
Alex Ermolaev at AI Frontiers : Major Applications of AI in Healthcare
Long Lin at AI Frontiers : AI in Gaming
Melissa Goldman at AI Frontiers : AI & Finance

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Electronic commerce courselecture one. Pdf
PPTX
A Presentation on Artificial Intelligence
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Big Data Technologies - Introduction.pptx
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Approach and Philosophy of On baking technology
CIFDAQ's Market Insight: SEC Turns Pro Crypto
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Modernizing your data center with Dell and AMD
Review of recent advances in non-invasive hemoglobin estimation
Chapter 3 Spatial Domain Image Processing.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Electronic commerce courselecture one. Pdf
A Presentation on Artificial Intelligence
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
MYSQL Presentation for SQL database connectivity
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Dropbox Q2 2025 Financial Results & Investor Presentation
Big Data Technologies - Introduction.pptx
Bridging biosciences and deep learning for revolutionary discoveries: a compr...

Yangqing Jia at AI Frontiers: Towards Better DL Frameworks