Large-Scale AI with Azure Container Service
Automated Software Development in Heterogeneous
GPU/CPU Environments for Seismic Modeling
2 x NVIDIA Tesla X2070
2 x (512 CORES, 6144 MB MEMORY SIZE)
...the software system that orchestrates the whole thing ... is called Borg, and it’s one of the best-kept secrets of Google’s rapid evolution into the most dominant force on the web
Azure N Series
2 x 2496 CORES
2 x 12288 MB MEMORY
• No need to deal with IT.
• Single entry point for the team to perform experiments.
• Scalability, based on the demands of training.
• Handling production SLAs for trained models.
ACS Engine: open source; community and innovation
ACS: managed by Azure; choice of Swarm, Mesos, Kubernetes
AKS: managed Kubernetes
Horizontal Pod Autoscaler (HPA)
kubectl autoscale deployment foo --min=4 --max=6 --cpu-percent=80
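To make the effect of those flags concrete, here is a minimal Python sketch of the scaling rule described in the Kubernetes documentation: the desired replica count is the current count scaled by the ratio of observed to target CPU utilization, rounded up, then clamped to the configured bounds. The function name and default arguments are illustrative assumptions that mirror the command above; the real controller also applies a tolerance band and stabilization windows, which this sketch leaves out.

```python
import math


def desired_replicas(current_replicas, current_cpu_percent,
                     target_cpu_percent=80, min_replicas=4, max_replicas=6):
    """Approximate the HPA decision for a CPU-utilization target.

    Hypothetical helper for illustration only; defaults mirror
    --min=4 --max=6 --cpu-percent=80 from the command above.
    """
    # Core rule: scale the replica count by observed/target utilization.
    raw = math.ceil(current_replicas * current_cpu_percent / target_cpu_percent)
    # Clamp to the bounds given by --min and --max.
    return max(min_replicas, min(max_replicas, raw))


if __name__ == "__main__":
    for cpu in (20, 100, 160):
        print(cpu, desired_replicas(4, cpu))
```

With four replicas running, the sketch scales out only when average CPU utilization exceeds the 80% target, and it never leaves the 4 to 6 range regardless of load.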
Node-level Autoscaling
• https://github.com/Microsoft/CNTK
Azure NC6 Virtual Machine: 1784 samples/s
Azure NC6 Virtual Machine Node: 1709 samples/s
CNTK training using CIFAR-10 (50,000 training images and 10,000 test images)
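As a rough worked example, the two throughput figures above can be turned into a relative gap and an approximate time per pass over the 50,000 training images. This is a back-of-the-envelope sketch in Python: the helper names are hypothetical, and it assumes throughput stays constant over a full epoch, which the slide does not state.

```python
TRAIN_IMAGES = 50_000  # CIFAR-10 training set size quoted on the slide


def seconds_per_epoch(samples_per_second, samples=TRAIN_IMAGES):
    """Time for one pass over the training set at a constant throughput."""
    return samples / samples_per_second


def relative_gap(faster, slower):
    """Fractional throughput difference, relative to the faster figure."""
    return (faster - slower) / faster


if __name__ == "__main__":
    for rate in (1784, 1709):
        print(f"{rate} samples/s: {seconds_per_epoch(rate):.1f} s per epoch")
    print(f"gap: {relative_gap(1784, 1709):.1%}")
```

Under that assumption the two measurements differ by roughly 4%, or a little over one second per epoch.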
BatchSize  Caffe  CNTK   MXNet  TensorFlow  Torch
86         339.9  265.2  274.7  882.7       358.0
128        327.9  236.9  245.2  853.0       335.8
256        311.4  217.5  229.6  818.2       315.7
512        301.5  217.9  217.6  796.2       307.0
1024       297.2  206.1  210.7  783.3       302.6
AlexNet on K80
http://dlbench.comp.hkbu.edu.hk/
