SlideShare a Scribd company logo
© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Julien Simon, Principal Technical Evangelist, AWS
@julsimon
DEVit, Thessaloniki, 20/05/2017
Scalable Deep Learning on AWS
using Apache MXNet
Agenda
•  AI: The Story So Far
•  Applications of Deep Learning
•  Apache MXNet Overview
•  Apache MXNet API
•  Code and Demos
•  Tools and Resources
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
• Machine Learning is now a commodity, but still no HAL in sight
• Traditional Machine Learning doesn’t work well with problems
where features can’t be explicitly defined
• So what about solving tasks that are easy for people to
perform, but hard to describe formally?
• Is there a way to get informal knowledge into a computer?
Where is HAL?
•  Universal approximation machine
•  Through training, a neural network
discovers features automatically
•  Not new technology!
•  Perceptron - Rosenblatt, 1958
image recognition, 20x20 pixels
•  Backpropagation - Werbos, 1975
•  They failed back then because:
•  Data sets were too small
•  Solving large problems with fully connected networks
required too much memory and computing power,
aka the Curse of Dimensionality
Neural Networks, Revisited
Everything is digital: large data sets are available
•  Imagenet: 14M+ labeled images - http://www.image-net.org/
•  YouTube-8M: 7M+ labeled videos - https://research.google.com/youtube8m/
•  AWS public data sets - https://aws.amazon.com/public-datasets/
The parallel computing power of GPUs make training possible
•  Simard (2005), Ciresan (2011)
•  State of the art networks have hundreds of layers
•  Baidu’s Chinese speech recognition: 4TB of training data, +/- 10 Exaflops
Cloud scalability and elasticity make training affordable
•  Grab a lot of resources for fast training, then release them
•  Using a DL model is lightweight: you can do it on a Raspberry Pi
Why It’s Different This Time
Applications of Deep Learning
ImageNet Large Scale Visual Recognition Challenge (ILSVRC)
Same breed?
Humans: 5,1%
Amazon Echo
https://medium.com/@julsimon/create-your-own-basquiat-with-deep-learning-for-much-less-than-110-million-314aa07c9ba8
Apache MXNet Overview
Apache MXNet
Programmable Portable High Performance
Near linear scaling
across hundreds of GPUs
Highly efficient
models for mobile
and IoT
Simple syntax,
multiple languages
Most Open Best On AWS
Optimized for
deep learning on
AWS
Accepted into the
Apache Incubator
0!
4!
8!
12!
16!
1! 2! 4! 8! 16!
Ideal
Inception v3
Resnet
Alexnet
91%
Efficiency
Multi-GPU Scaling With MXNet
Ideal
Inception v3
Resnet
Alexnet
88%
Efficiency
0!
64!
128!
192!
256!
1! 2! 4! 8! 16! 32! 64! 128! 256!
Multi-Machine Scaling With MXNet
Apache MXNet API
Demo #1 – Training MXNet on MNIST
https://medium.com/@julsimon/training-mxnet-part-1-mnist-6f0dc4210c62
https://github.com/juliensimon/aws/tree/master/mxnet/mnist
Demo #2 – Object Detection on a Raspberry Pi
https://medium.com/@julsimon/an-introduction-to-the-mxnet-api-part-6-fcdd7521ae87
GoPiGo Arduino Yùn AWS IoT
MQTT@CallMeJohnnyPi
One-Click GPU or CPU
Deep Learning
AWS Deep Learning AMI
Up to~40k CUDA cores
Apache MXNet
TensorFlow
Theano
Caffe
Torch
Keras
Pre-configured CUDA drivers,
MKL
Anaconda, Python3
Ubuntu and Amazon Linux
+ CloudFormation template
+ Container Image
Additional Resources
MXNet Resources
•  MXNet Blog Post | AWS Endorsement
•  Read up on MXNet and Learn More: mxnet.io
•  MXNet Github Repo
•  MXNet Recommender Systems Talk | Leo Dirac
AWS Resources
•  Deep Learning AMI |Amazon Linux
•  Deep Learning AMI | Ubuntu
•  CloudFormation Template Instructions
•  Deep Learning Benchmark
•  MXNet on Lambda
•  MXNet on ECS/Docker
Thank You!
Julien Simon, Principal Technical Evangelist, AWS
@julsimon

More Related Content

PDF
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
PDF
Scalable Deep Learning on AWS with Apache MXNet
PDF
AI on a Pi
PDF
Fascinating Tales of a Strange Tomorrow
PPTX
Introduction to Deep Learning (September 2017)
PPTX
AI on a PI
PDF
Building serverless applications (April 2018)
PDF
Fascinating Tales of a Strange Tomorrow
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
Scalable Deep Learning on AWS with Apache MXNet
AI on a Pi
Fascinating Tales of a Strange Tomorrow
Introduction to Deep Learning (September 2017)
AI on a PI
Building serverless applications (April 2018)
Fascinating Tales of a Strange Tomorrow

What's hot (14)

PDF
New AI/ML services at AWS re:Invent 2017
PDF
IoT & Deep Learning on AWS
PPTX
Amazon AI (October 2017)
PDF
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
PDF
Getting started with Machine Learning
PDF
Getting started with machine learning | Mike Fowler
PPTX
Parking: DeepLens to the rescue
PDF
Inrastructure as Code
PPTX
Advanced Machine Learning with Amazon SageMaker
PDF
AI Demo
PDF
Deep learning at supercomputing scale by Rangan Sukumar from Cray
PPT
Cloud Computing by Team Go Getters
PPTX
Aws intro 1
PDF
R과 Sagemaker를 활용한 딥러닝 어플리케이션 만들기
New AI/ML services at AWS re:Invent 2017
IoT & Deep Learning on AWS
Amazon AI (October 2017)
Cloud Academy & AWS: how we use Amazon Web Services for machine learning and ...
Getting started with Machine Learning
Getting started with machine learning | Mike Fowler
Parking: DeepLens to the rescue
Inrastructure as Code
Advanced Machine Learning with Amazon SageMaker
AI Demo
Deep learning at supercomputing scale by Rangan Sukumar from Cray
Cloud Computing by Team Go Getters
Aws intro 1
R과 Sagemaker를 활용한 딥러닝 어플리케이션 만들기
Ad

Similar to Scalable Deep Learning on AWS using Apache MXNet (May 2017) (20)

PDF
Deep Dive into Apache MXNet on AWS
PPTX
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
PDF
Deep Learning for Developers (December 2017)
PDF
Machine Learning on the Cloud with Apache MXNet
PDF
Deep Learning with Apache MXNet
PPTX
Emotion recognition in images: from idea to a model in production - Nordic DS...
PPTX
Deep Learning for Developers (expanded version, 12/2017)
PPTX
AI powered emotion recognition: From Inception to Production - Global AI Conf...
PPTX
AI powered emotion recognition: From Inception to Production - Global AI Conf...
PPTX
An Introduction to Deep Learning with Apache MXNet (November 2017)
PDF
Emotion Recognition in Images
PDF
DeepLearning001&ApacheMXNetWithSparkForInference-ACNA2018
PDF
MCL 322 Optimizing Training on Apache MXNet
PDF
Deep Dive on Deep Learning (June 2018)
PPTX
Optimizing training on Apache MXNet (January 2018)
PPTX
Tensorflow vs MxNet
PDF
Deep Learning for Developers (October 2017)
PPTX
An introduction to Deep Learning with Apache MXNet (November 2017)
PDF
Distributed Inference on Large Datasets Using Apache MXNet and Apache Spark ...
PDF
Distributed Inference with MXNet and Spark
Deep Dive into Apache MXNet on AWS
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Deep Learning for Developers (December 2017)
Machine Learning on the Cloud with Apache MXNet
Deep Learning with Apache MXNet
Emotion recognition in images: from idea to a model in production - Nordic DS...
Deep Learning for Developers (expanded version, 12/2017)
AI powered emotion recognition: From Inception to Production - Global AI Conf...
AI powered emotion recognition: From Inception to Production - Global AI Conf...
An Introduction to Deep Learning with Apache MXNet (November 2017)
Emotion Recognition in Images
DeepLearning001&ApacheMXNetWithSparkForInference-ACNA2018
MCL 322 Optimizing Training on Apache MXNet
Deep Dive on Deep Learning (June 2018)
Optimizing training on Apache MXNet (January 2018)
Tensorflow vs MxNet
Deep Learning for Developers (October 2017)
An introduction to Deep Learning with Apache MXNet (November 2017)
Distributed Inference on Large Datasets Using Apache MXNet and Apache Spark ...
Distributed Inference with MXNet and Spark
Ad

More from Julien SIMON (20)

PDF
Implementing high-quality and cost-effiient AI applications with small langua...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
PDF
Arcee AI - building and working with small language models (06/25)
PDF
deep_dive_multihead_latent_attention.pdf
PDF
Deep Dive: Model Distillation with DistillKit
PDF
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
PDF
Building High-Quality Domain-Specific Models with Mergekit
PDF
Tailoring Small Language Models for Enterprise Use Cases
PDF
Tailoring Small Language Models for Enterprise Use Cases
PDF
Julien Simon - Deep Dive: Compiling Deep Learning Models
PDF
Tailoring Small Language Models for Enterprise Use Cases
PDF
Julien Simon - Deep Dive - Optimizing LLM Inference
PDF
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
PDF
Julien Simon - Deep Dive - Quantizing LLMs
PDF
Julien Simon - Deep Dive - Model Merging
PDF
An introduction to computer vision with Hugging Face
PDF
Reinventing Deep Learning
 with Hugging Face Transformers
PDF
Building NLP applications with Transformers
PPTX
Building Machine Learning Models Automatically (June 2020)
Implementing high-quality and cost-effiient AI applications with small langua...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Trying to figure out MCP by actually building an app from scratch with open s...
Arcee AI - building and working with small language models (06/25)
deep_dive_multihead_latent_attention.pdf
Deep Dive: Model Distillation with DistillKit
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Building High-Quality Domain-Specific Models with Mergekit
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien Simon - Deep Dive: Compiling Deep Learning Models
Tailoring Small Language Models for Enterprise Use Cases
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien Simon - Deep Dive - Quantizing LLMs
Julien Simon - Deep Dive - Model Merging
An introduction to computer vision with Hugging Face
Reinventing Deep Learning
 with Hugging Face Transformers
Building NLP applications with Transformers
Building Machine Learning Models Automatically (June 2020)

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
sap open course for s4hana steps from ECC to s4
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Review of recent advances in non-invasive hemoglobin estimation
Unlocking AI with Model Context Protocol (MCP)
sap open course for s4hana steps from ECC to s4
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
20250228 LYD VKU AI Blended-Learning.pptx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Electronic commerce courselecture one. Pdf
Chapter 3 Spatial Domain Image Processing.pdf
cuic standard and advanced reporting.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Machine learning based COVID-19 study performance prediction
Network Security Unit 5.pdf for BCA BBA.
Dropbox Q2 2025 Financial Results & Investor Presentation
“AI and Expert System Decision Support & Business Intelligence Systems”
Agricultural_Statistics_at_a_Glance_2022_0.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?

Scalable Deep Learning on AWS using Apache MXNet (May 2017)

  • 1. © 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Julien Simon, Principal Technical Evangelist, AWS @julsimon DEVit, Thessaloniki, 20/05/2017 Scalable Deep Learning on AWS using Apache MXNet
  • 2. Agenda •  AI: The Story So Far •  Applications of Deep Learning •  Apache MXNet Overview •  Apache MXNet API •  Code and Demos •  Tools and Resources
  • 4. • Machine Learning is now a commodity, but still no HAL in sight • Traditional Machine Learning doesn’t work well with problems where features can’t be explicitly defined • So what about solving tasks that are easy for people to perform, but hard to describe formally? • Is there a way to get informal knowledge into a computer? Where is HAL?
  • 5. •  Universal approximation machine •  Through training, a neural network discovers features automatically •  Not new technology! •  Perceptron - Rosenblatt, 1958 image recognition, 20x20 pixels •  Backpropagation - Werbos, 1975 •  They failed back then because: •  Data sets were too small •  Solving large problems with fully connected networks required too much memory and computing power, aka the Curse of Dimensionality Neural Networks, Revisited
  • 6. Everything is digital: large data sets are available •  Imagenet: 14M+ labeled images - http://www.image-net.org/ •  YouTube-8M: 7M+ labeled videos - https://research.google.com/youtube8m/ •  AWS public data sets - https://aws.amazon.com/public-datasets/ The parallel computing power of GPUs make training possible •  Simard (2005), Ciresan (2011) •  State of the art networks have hundreds of layers •  Baidu’s Chinese speech recognition: 4TB of training data, +/- 10 Exaflops Cloud scalability and elasticity make training affordable •  Grab a lot of resources for fast training, then release them •  Using a DL model is lightweight: you can do it on a Raspberry Pi Why It’s Different This Time
  • 8. ImageNet Large Scale Visual Recognition Challenge (ILSVRC) Same breed? Humans: 5,1%
  • 12. Apache MXNet Programmable Portable High Performance Near linear scaling across hundreds of GPUs Highly efficient models for mobile and IoT Simple syntax, multiple languages Most Open Best On AWS Optimized for deep learning on AWS Accepted into the Apache Incubator
  • 13. 0! 4! 8! 12! 16! 1! 2! 4! 8! 16! Ideal Inception v3 Resnet Alexnet 91% Efficiency Multi-GPU Scaling With MXNet
  • 14. Ideal Inception v3 Resnet Alexnet 88% Efficiency 0! 64! 128! 192! 256! 1! 2! 4! 8! 16! 32! 64! 128! 256! Multi-Machine Scaling With MXNet
  • 16. Demo #1 – Training MXNet on MNIST https://medium.com/@julsimon/training-mxnet-part-1-mnist-6f0dc4210c62 https://github.com/juliensimon/aws/tree/master/mxnet/mnist
  • 17. Demo #2 – Object Detection on a Raspberry Pi https://medium.com/@julsimon/an-introduction-to-the-mxnet-api-part-6-fcdd7521ae87 GoPiGo Arduino Yùn AWS IoT MQTT@CallMeJohnnyPi
  • 18. One-Click GPU or CPU Deep Learning AWS Deep Learning AMI Up to~40k CUDA cores Apache MXNet TensorFlow Theano Caffe Torch Keras Pre-configured CUDA drivers, MKL Anaconda, Python3 Ubuntu and Amazon Linux + CloudFormation template + Container Image
  • 19. Additional Resources MXNet Resources •  MXNet Blog Post | AWS Endorsement •  Read up on MXNet and Learn More: mxnet.io •  MXNet Github Repo •  MXNet Recommender Systems Talk | Leo Dirac AWS Resources •  Deep Learning AMI |Amazon Linux •  Deep Learning AMI | Ubuntu •  CloudFormation Template Instructions •  Deep Learning Benchmark •  MXNet on Lambda •  MXNet on ECS/Docker
  • 20. Thank You! Julien Simon, Principal Technical Evangelist, AWS @julsimon