Using MXNet to Train and Deploy your Deep Learning Model

Using Apache MXNet to Train and Deploy your Deep Learning Model
Qing Lan
PPMC Member of Apache MXNet
Track: Machine Learning

Agenda
• Introduction to Deep Learning
• Introduction to Apache MXNet
• Train your model with MXNet
• Use MXNet for predictions
• Start Learning Apache MXNet
• Apache MXNet: Now and Future

Neural network
[Figure: input layer, hidden layers, output layer. Many more…]
• Non-linear
• Hierarchical feature learning
• Scalable architecture
• Computationally intensive

Training neural networks
[Figure: training loop. Forward pass: Input Data → Neural Network → Output → Loss. Backward pass: Back Propagate → Update Weights.]
The forward and backward passes repeat across multiple epochs; each epoch goes through the entire training dataset.
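
The loop on this slide (forward pass, loss, backward pass, weight update, repeated over epochs) can be sketched in plain Python without any framework. This is only an illustration under stated assumptions: the one-weight linear model, the squared-error loss, and the names `train`, `data`, `lr`, and `epochs` are hypothetical and do not come from the talk or from the MXNet API.

```python
# Minimal sketch of a training loop: fit y = w * x + b by gradient descent.
# Model, loss, and all names here are illustrative assumptions, not MXNet code.

def train(data, lr=0.05, epochs=200):
    w, b = 0.0, 0.0                       # weights to be learned
    history = []                          # mean loss per epoch
    for _ in range(epochs):               # each epoch visits the entire dataset
        epoch_loss = 0.0
        for x, y in data:
            pred = w * x + b              # forward pass
            err = pred - y
            epoch_loss += 0.5 * err ** 2  # loss: squared error
            grad_w = err * x              # backward pass: d(loss)/dw
            grad_b = err                  # backward pass: d(loss)/db
            w -= lr * grad_w              # update weights
            b -= lr * grad_b
        history.append(epoch_loss / len(data))
    return w, b, history


# Toy dataset generated from y = 2x + 1.
data = [(x, 2.0 * x + 1.0) for x in (-1.0, -0.5, 0.0, 0.5, 1.0)]
w, b, history = train(data)
print(f"w={w:.3f}, b={b:.3f}, first loss={history[0]:.4f}, last loss={history[-1]:.6f}")
```

A framework such as MXNet automates the backward pass and the update step; the structure of the loop stays the same.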

Apache MXNet - Background
● Framework for building, training, and deploying Deep Neural Nets
● Apache (incubating) open source project
● Created by academia (CMU and UW)
● Adopted by AWS as DNN framework of choice, Nov 2016
http://mxnet.apache.org

Apache MXNet for Training
• Simple and Powerful API: Gluon
• Data Science compatibility: NumPy support
• Distributed Training: Horovod, PSLite, BytePS
• Training speed improvements:
  • GPU: cuDNN and float16 support (NVIDIA AMP)
  • CPU: Intel MKL-DNN

Current Deep Learning scenario
• Prototype code is hard to maintain
• Setting up a baseline for different workloads is hard
• Pre-trained models are hard to obtain
• Models trained in Python cannot be easily deployed to production systems

MXNet Community: Gluon Toolkits
• Carefully designed API for versatile needs
• Implementation for state-of-the-art models
• One-command download of hundreds of pre-trained models
• Easy model export and deployment in C++, Java, and Scala, with support for control flow and model quantization
• Gluon Toolkits
  • GluonCV: Computer Vision
  • GluonNLP: Natural Language Processing
  • GluonTS: Probabilistic Time Series Modeling
  • Deep Graph Library

GluonCV: A Vision Toolkit
• Scripts for reproducing SOTA results
• State-of-the-Art pretrained Models
• Easy Deployment
[Figure: example tasks: Classification, Detection, Semantic Segmentation, Instance Segmentation, Pose Estimation]

GluonNLP: A Natural Language Toolkit
• Data Processing APIs
  • Data API with support for multiprocessing, batching, vocabulary loading, tokenizing…
• Embedding Methods (~500 Pretrained)
  • Word2Vec, GloVe, FastText, ELMo, BERT, RoBERTa…
• Sequence Sampler
  • Beam Search, Random Sampling
• Models
  • Encoder/Decoder, AWD-LSTM, Transformer, Transformer-XL

Apache MXNet for Inference
• Train in one language, deploy in many:
  • JVM: Java, Scala, Clojure
  • Other languages: C++, R
• Model support
  • Gluon Model Zoo
  • Open Neural Network Exchange (ONNX) models
  • Keras Model*
• Model Deployment: MXNet Model Server

Gluon Model Zoo
• CV: 194 models
• NLP: 450 models

Multi-Model Server
• Low latency, high throughput
• Language agnostic: Python/Java
• Model loading at runtime
• Serving multiple models
• Highly customizable (use plugins)

How can I make a start?
• Book: Dive into Deep Learning (CHN: 动手学深度学习)
  • MXNet Community project
  • Comprehensive coverage of Deep Learning
  • Includes code to practice in MXNet
• Course: STAT 157 Introduction to Deep Learning
  • UC Berkeley, Spring 2019
  • Taught by Mu Li and Alexander Smola

Apache MXNet: Future plan (2.0)
• Full NumPy operator support
• Gluon usability improvement
• Accelerator support
• TVM: operator integration
• TVM: Relay IR integration (Experimental)

Contribute to Apache MXNet
● GitHub: https://github.com/apache/incubator-mxnet
● Subscribe to our developer mailing list: dev@mxnet.incubator.apache.org
● Slack Channel: https://the-asf.slack.com and go to #mxnet

Thank you!
Qing Lan
PPMC Member of Apache MXNet
lanking@apache.org
