SlideShare a Scribd company logo
1WekaIO Confidential © 2017 All rights reserved.
Making Machine Learning
Compute Bound Again
Liran Zvibel, Co-founder and CEO
liran@weka.io
@liranzvibel
2WekaIO Confidential © 2017 All rights reserved.
THE STORAGE
DNA
WekaIO Introduction
WekaIO Matrix is the fastest, most scalable parallel file system for AI and
technical compute workloads that ensures your applications never wait for data.
WHO WE ARE
THE PARTNERS THE ACCOLADES
3WekaIO Confidential © 2017 All rights reserved.
We are using WekaIO technologies over InfiniBand to address the challenges of
data analytics at extreme scale in life sciences, particle physics, geosciences, and
other fields. That process is still ongoing but to-date we’ve already achieved some
promising results.
Michael Norman, Director of San Diego Supercomputer Center at UCSD
4WekaIO Confidential © 2017 All rights reserved.
5WekaIO Confidential © 2017 All rights reserved.
Future-thinking companies like WekaIO, complement our core
principle of accelerating research and discovery. The ability to run
more concurrent high performance genomic workloads will
significantly advance our time to discovery.
Nelson Kick, Manager of HPC Operations
6WekaIO Confidential © 2017 All rights reserved.
“Where can you find another non-
linear servo-mechanism weighing only
150 pounds and having great
adaptability, that can be produced so
cheaply by completely unskilled
labor?”
-- Albert Scott Crossfield, May 1954
https://quoteinvestigator.com/2016/02/01/computer/#note-12956-2
7WekaIO Confidential © 2017 All rights reserved.
"within a generation ... the problem of creating
'artificial intelligence' will substantially be solved”
(1967)
Marvin Minsky (1927 – 2016)
Co-Founder of MIT’s AI lab in 1959
The Journey Continues
Artificial Intelligence Demystified
Source: www.AnalyticsVidhya.com
8WekaIO Confidential © 2017 All rights reserved.
“Self driving vehicles will completely
revolutionize the way we get around in the
future, but also have the potential to literally
save 100k of lives…
Self driving vehicles let us think differently about
road safety… 94% of vehicle accidents are
caused by human error.
Why wouldn’t we do anything we can to bring
self driving vehicles technologies to our roads?”
-- Senator Gary Peters, November 2017
Making the Case for AI
9WekaIO Confidential © 2017 All rights reserved.
Deep Learning Requires Data
10WekaIO Confidential © 2017 All rights reserved.
Source: Andrew Ng, VP & Chief Scientist of Baidu; Co-Chairman and Co-
Founder of Coursera; and Adjunct Professor at Stanford University
Scale Drives Progress
11WekaIO Confidential © 2017 All rights reserved.
What has changed recently?
o Deep Learning algorithms have been
known since the ’80s
o Problem was with the computational
power
– GPUs and other accelerators made deep
learning practical
o “GPUs are the most efficient way to
take a compute bound problem, and
make it IO bound”
– Addison Snell, InterSect360 Research
o We will show you how to make sure
your Machine Learning workloads are
not IO bound!
12WekaIO Confidential © 2017 All rights reserved.
GPU Progress in One Year
nVidia P100 vs V100
0.0x
0.5x
1.0x
1.5x
2.0x
2.5x
3.0x
3.5x
PyTorch Tensor
Flow
Caffe2 Caffe MXNet
2.2x
2.5x
2.9x 3.0x 3.05x
Speedup
DGX-1 P100 DGX-1 V100
13WekaIO Confidential © 2017 All rights reserved.
Deep Learning Requirements
o Actually very close to HPC problems…
o Store a vast amount of data
– Effectively “stage” working set back on fast storage, for efficient
access
o High bandwidth, low latency
o Very good metadata performance – traverse files quickly
o Very high single host performance
o Support multiprotocol (S3, HDFS, SMB, NFS)
14WekaIO Confidential © 2017 All rights reserved.
Parallel FS over NVMe-oF
WekaIO Matrix Shared File System
VM
Hypervisor – KVM, VMWare
VM VM App
App App App
Fully Coherent POSIX File System That Delivers Local File System Performance
Distributed Coding, More Resilient at Scale, Fast Rebuilds, End-to-End DP
Instantaneous Snapshots, Clones, Tiering to S3, Partial File Rehydration
InfiniBand or Ethernet, Hyperconverged or Dedicated Storage Server
15WekaIO Confidential © 2017 All rights reserved.
Why parallel FS over NVMe-oF is needed
o Local copy architectures (e.g. Hadoop, or caching solutions) were developed when
1GbitE and HDDs were standard
o Modern networks on 100Gbit IB or Ethernet are 10x faster than SSD
o It is much easier to create distributed algorithms when locality is not important
o With right networking stack, shared storage is faster than local storage
0 20 40 60 80 100 120
100Gbit (EDR)
10Gbit (SDR)
SSD Write
SSD Read
Time it takes to Complete a 4KB Page Move
Microseconds
16WekaIO Confidential © 2017 All rights reserved.
Data Lake
S3
MatrixFS
Matrix Client
WekaIO Storage Solution for GPU Clusters
Ethernet or InfiniBand Network
Storage Servers
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
APP APP
GPU GPU
Unified Namespace
17WekaIO Confidential © 2017 All rights reserved.
Data Lake
S3
WekaIO Can Even Run in the GPU Cluster
Unified Namespace
Ethernet or InfiniBand Network
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
APP
APP
APP
APP
APP
APP
GPU
SSD
APP APP
GPU GPU
18WekaIO Confidential © 2017 All rights reserved.
AI Requires an Appropriate IA
INSTRUMENT
INGEST
CLEAN
TAG
EXPLORE
TRAIN
VALIDATE
S3
SMB
NFS
CPU Based
HDFS Interface
ARCHIVE &
CATALOG
GPU Based
Server Cluster
Growing Data
Archive
WekaIO
CPU
GPU
19WekaIO Confidential © 2017 All rights reserved.
Aggregate low latency operations – high throughput
[root@us2 imagenet]# time find . | wc -l
14092659
real 3m15.444s
user 0m8.333s
sys 0m4.397s
[root@us2 imagenet]# time ls|wc -l
21851
real 0m0.643s
user 0m0.084s
sys 0m0.005s
[root@us2 imagenet]# time ~/bin/fd . | wc -l
14092658
real 0m51.199s
user 1m44.484s
sys 0m46.463s
[https://github.com/sharkdp/fd ]
20WekaIO Confidential © 2017 All rights reserved.
Aggregate low latency operations – high throughput
o Higher performance than a local file system over NVMe devices
– caching is useless!
– Much lower latency for 1MB files, similar latency even for 4k IOs
o Fill line (100GbEEDR) with reads of small file (can fill line with 4k IO)
o Cluster scales as needed, can support 100s of GPU servers working
concurrently
21WekaIO Confidential © 2017 All rights reserved.
o Problem: Could not achieve the bandwidth
required to keep GPU cluster saturated
o Pain Point: Wasted cycle time ($$$$) on
very expensive GPU clusters.
o Test Platform: 10 Node HPE DL360 vs.
local disk and leading All-Flash NAS
o Result:
– WekaIO – over 2x faster than Local Disk
– WekaIO – 6x faster than All-flash NAS
WekaIO is over 2x Faster than Local Disk
GPU Client
Actual measured data at an autonomous vehicle training installation
0
1,000
2,000
3,000
4,000
5,000
6,000
7,000
8,000
WekaIO Local Disk SSD All Flash NAS
MB/sec
1MB Read Performance (Single GPU Client)
22WekaIO Confidential © 2017 All rights reserved.
Comparing WekaIO and GPFS over NVMe
0
0.5
1
1.5
2
2.5
3
60 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200
SPEC 2014 Results – Comparing Netapp FAS and IBM Spectrum Scale 5.0
IBM Spectrum Scale NetApp WekaIO
Latency(milisecond)
WekaIO does 2x
the workload of IBM
Spectrum Scale on
standard Whitebox
Servers
23WekaIO Confidential © 2017 All rights reserved.
Tiering
o HDD integration provides optimal
economic model
o Parallel FS algorithms over the HDDs
similar to traditional solutions
o Large files are chopped down into small
objects to achieve parallelism
– Can rehydrate and modify partial files
o Stage next training set to NVMe while
current training run works – make sure
GPUs are always utilized
o Data demoted to object storage remains
on SSD as cache until space is required
COLD
DATA
24WekaIO Confidential © 2017 All rights reserved.
Fully Distributed Snapshots
o 4K granularity
o Instantaneous and no impact on performance
o Supports clones (writable snapshots)
o Support file system wide or file-based snaps
o Redirect-on-write based snaps
25WekaIO Confidential © 2017 All rights reserved.
Snap-to-obj DR/Backup/burst Functionality
Follow-on snapshot will be saved in differential manner
Enabled use cases:
• Pause/resume : Allows a cluster to be shutdown when not needed,
supporting cloud elasticity
• Backup : the original cluster is not needed in order to access the data
• DR : enabled via geo-replicated object storage, or tiering to the cloud
• Cloud bursting : launch a cluster in the cloud based on a snap saved
on the object storage
The ability to
coherently save a
complete snapshot to
the object storage
Rehydrated
cluster may be
of different size
The original
cluster can now
shut off and the
data is safe
26WekaIO Confidential © 2017 All rights reserved.
Snapshot File System for Infrastructure Elasticity
Amazon
Cloud
Matrix On-Prem
Matrix Global Namespace
Matrix Global Namespace
Matrix
Cluster
27WekaIO Confidential © 2017 All rights reserved.
Cloud Bursting
o EC2 cluster can be formed based on S3 snap data
o The original on-premises cluster can continue running and
take snaps
o The EC2 cluster can run concurrently and take snaps
o Each snap that is pushed to S3 can be linked back to the
other system
o The data is viewed via the namespace using the .snapshots
directory and data can be merged
o A hybrid-cloud solution that is “sticky” to the public cloud.
Other solutions require network transfers that are transient
o The “data gravity” is in AWS rather than on-prem. A follow up
burst is “cheaper”
28WekaIO Confidential © 2017 All rights reserved.
Simple, Intuitive GUI
29WekaIO Confidential © 2017 All rights reserved.
Simplifying Machine Learning Workflows
APP APP
GPU GPU
APP APP
GPU GPU
o Full line bandwidth to a single GPU
– GPUs are kept fully utilized
– Faster deep learning Wall time
– Or same results can be obtained with less GPUs
EDR
o No need to copy data to local disk on GPU server
– Higher performance than local disk due to parallel access
– Faster time to first epoch
o Centralized storage for easy management
o Run in AWS for on-demand access to P3 GPUs
o Burst to AWS for elastic scaling of GPU resources
>>>>>>>
On-premises AWS Cloud
30WekaIO Confidential © 2017 All rights reserved.
Scalable High Performance Storage Infrastructure
Easy to manage
• Single namespace with easy
data sharing
• Simple intuitive GUI
• Installs in minutes
• Dynamic capacity scaling
• Easy performance scaling
• Multi-protocol access
- NFS/SMB/HDFS/REST(S3)
• Cloud analytics for easy
maintenance and control
• Policy driven auto. tiering
Flexible
• Supports Ethernet or
InfiniBand
• Public, private or hybrid
Cloud deployment
• Annual subscription with no
maintenance contracts
• Flexible configurations
- Hyperconverged
- Dedicated servers
• Data lake to lower cost
HDDs via S3, Swift or NFS
on -prem or public cloud
Feature Rich
• Highest Performance NVMe
- 150 usec 4k latency
- Up to 6GB/sec per client
- Lower latency than a local
FS over NVMe under load
• High Performance SATA
- Leverage existing infra
• Configurable resiliency N+2
to N+4
- Multiple concurrent failures
- scales to 1000s of servers
• Native-perf. 4k granular
snapshots and clones
• Cloud bursting and DR
31WekaIO Confidential © 2017 All rights reserved.
32WekaIO Confidential © 2017 All rights reserved.
WekaIO over IB – Best Storage for Deep LearningGPU
INSTRUMENT
INGEST
CLEAN
TAG
EXPLORE
TRAIN
VALIDATE
S3
SMB
NFS
CPU Based
HDFS
Interface
ARCHIVE
&
CATALOG
GPU Based
Server
Cluster
Growing
Data
Archive
WekaIO
CPU
GPU
Best Workflow support for AI workloads
• Natively support Infiniband – default interconnect for GPU
servers and workloads (DGX-1, HPE 6500)
• Highest Single Client Performance – saturate EDR line,
lowest latency 4k IO. Low latency, high throughput
• Your GPU servers won’t idle anymore!
• Run GPU-Direct concurrent to storage, single network
• Scales to support any number of GPU servers sharing the
FS
• Single solution from ingest to archive. Native ability to
stage upcoming working set to SSD for next training job
• Parallel File System with NVMe-oF technology – IB
latency insignificant compared to SSD latency
• Shared file system with metadata performance of local FS
0 50 100 150
100Gbit (EDR)
10Gbit (SDR)
SSD Write
SSD Read
Microseconds
33WekaIO Confidential © 2017 All rights reserved.
Software Architecture
o Runs inside LXC container for
isolation
o SR-IOV to run network stack and
NVMe in user space
o Provides POSIX VFS through
lockless queues to WekaIO driver
o I/O stack bypasses kernel
o Scheduling and memory
management also bypass kernel
o Metadata split into many Buckets –
Buckets quickly migrate è no hot
spots
o Support, bare metal, container &
hypervisor
Clustering
Balancing
Failure Det.
& Comm.
IP Takeover
Applicatio
nApplicatio
nApplicatio
nApplication
Frontend
SSD Agent
H/W
User
Space
KernelWeka DriverTCP/IP Stack
Distributed
FS “Bucket”Distributed
FS “Bucket”Distributed
FS “Bucket”Distributed
FS “Bucket”Distributed
FS “Bucket”
Data
Placement
FS
Metadata
Tiering
Backend
Networking
NFS
S3
SMB
HDFS
34WekaIO Confidential © 2017 All rights reserved.
Snapshot File System as DR Strategy
Amazon
Region A
Matrix On-Prem
Amazon
Region B
File
System

More Related Content

PPTX
Cloud Computing For Beginners | Cloud Computing Explained | Cloud Computing T...
PPTX
Introduction to GCP (Google Cloud Platform)
PPTX
Azure Security Fundamentals
PPT
CLOUD COMPUTING AND STORAGE
PDF
SAP S/4HANA on AWS Tシャツモデル
PPTX
Cloud Security Architecture.pptx
PDF
Amazon S3を中心とするデータ分析のベストプラクティス
PPTX
Docker In Cloud
Cloud Computing For Beginners | Cloud Computing Explained | Cloud Computing T...
Introduction to GCP (Google Cloud Platform)
Azure Security Fundamentals
CLOUD COMPUTING AND STORAGE
SAP S/4HANA on AWS Tシャツモデル
Cloud Security Architecture.pptx
Amazon S3を中心とするデータ分析のベストプラクティス
Docker In Cloud

What's hot (20)

PDF
Azure Arc Overview from Microsoft
PDF
Cloud Migration
PDF
Cloud Migration Strategy and Best Practices
PDF
AWS Black Belt Online Seminar 2017 AWS X-Ray
PPTX
サーバ構築・デプロイが簡単に!Elastic beanstalk
PPTX
Cloud Security
PPTX
Cloud security and security architecture
PPTX
Google Cloud Platfrom
PPTX
Virtualization security
PPTX
Migration into a Cloud
PPTX
Google cloud
PPSX
On-premise to Microsoft Azure Cloud Migration.
PDF
AZ-900 Azure Fundamentals.pdf
PPTX
Azure Cosmos DB を使った高速分散アプリケーションの設計パターン
PPTX
Cloud Based Disaster Recovery (DRaaS)
PPTX
Snowflake Automated Deployments / CI/CD Pipelines
PDF
EC2のストレージどう使う? -Instance Storageを理解して高速IOを上手に活用!-
PDF
FedScoop Public Sector Innovation Summit DOD Enterprise DevSecOps Initiative ...
PDF
Cloud Security Top Threats
PPTX
Zero trust for everybody: 3 ways to get there fast
Azure Arc Overview from Microsoft
Cloud Migration
Cloud Migration Strategy and Best Practices
AWS Black Belt Online Seminar 2017 AWS X-Ray
サーバ構築・デプロイが簡単に!Elastic beanstalk
Cloud Security
Cloud security and security architecture
Google Cloud Platfrom
Virtualization security
Migration into a Cloud
Google cloud
On-premise to Microsoft Azure Cloud Migration.
AZ-900 Azure Fundamentals.pdf
Azure Cosmos DB を使った高速分散アプリケーションの設計パターン
Cloud Based Disaster Recovery (DRaaS)
Snowflake Automated Deployments / CI/CD Pipelines
EC2のストレージどう使う? -Instance Storageを理解して高速IOを上手に活用!-
FedScoop Public Sector Innovation Summit DOD Enterprise DevSecOps Initiative ...
Cloud Security Top Threats
Zero trust for everybody: 3 ways to get there fast
Ad

Similar to WekaIO: Making Machine Learning Compute Bound Again (20)

PPTX
Webinar: Three Reasons Why NAS is No Good for AI and Machine Learning
PDF
Weka_AI_datasheet_W07R4DS201807_Web_WEWE
PDF
Architecting a 35 PB distributed parallel file system for science
PDF
Building a High Performance Analytics Platform
PPTX
Storage and performance- Batch processing, Whiptail
PDF
The Consequences of Infinite Storage Bandwidth: Allen Samuels, SanDisk
PDF
NVMe over Fabric
PPTX
Ceph Community Talk on High-Performance Solid Sate Ceph
PDF
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
PDF
02.28.13 WANdisco ApacheCon 2013
PPTX
Deploying ssd in the data center 2014
PDF
Infoboom future-storage-aug2011-v3
PDF
Infoboom future-storage-aug2011-v3
PDF
High Performance Hardware for Data Analysis
PDF
High Performance Hardware for Data Analysis
PPTX
fpga2014-wjun.pptx
PPTX
Storage and performance, Whiptail
PDF
Revolutionary Storage for Modern Databases, Applications and Infrastrcture
PDF
Monster Trends in Enterprise Tech: The Flash Disruption and What it Means for...
PDF
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
Webinar: Three Reasons Why NAS is No Good for AI and Machine Learning
Weka_AI_datasheet_W07R4DS201807_Web_WEWE
Architecting a 35 PB distributed parallel file system for science
Building a High Performance Analytics Platform
Storage and performance- Batch processing, Whiptail
The Consequences of Infinite Storage Bandwidth: Allen Samuels, SanDisk
NVMe over Fabric
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
02.28.13 WANdisco ApacheCon 2013
Deploying ssd in the data center 2014
Infoboom future-storage-aug2011-v3
Infoboom future-storage-aug2011-v3
High Performance Hardware for Data Analysis
High Performance Hardware for Data Analysis
fpga2014-wjun.pptx
Storage and performance, Whiptail
Revolutionary Storage for Modern Databases, Applications and Infrastrcture
Monster Trends in Enterprise Tech: The Flash Disruption and What it Means for...
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ad

More from inside-BigData.com (20)

PDF
Major Market Shifts in IT
PDF
Preparing to program Aurora at Exascale - Early experiences and future direct...
PPTX
Transforming Private 5G Networks
PDF
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
PDF
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
PDF
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
PDF
HPC Impact: EDA Telemetry Neural Networks
PDF
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
PDF
Machine Learning for Weather Forecasts
PPTX
HPC AI Advisory Council Update
PDF
Fugaku Supercomputer joins fight against COVID-19
PDF
Energy Efficient Computing using Dynamic Tuning
PDF
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
PDF
State of ARM-based HPC
PDF
Versal Premium ACAP for Network and Cloud Acceleration
PDF
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
PDF
Scaling TCO in a Post Moore's Era
PDF
CUDA-Python and RAPIDS for blazing fast scientific computing
PDF
Introducing HPC with a Raspberry Pi Cluster
PDF
Overview of HPC Interconnects
Major Market Shifts in IT
Preparing to program Aurora at Exascale - Early experiences and future direct...
Transforming Private 5G Networks
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
HPC Impact: EDA Telemetry Neural Networks
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Machine Learning for Weather Forecasts
HPC AI Advisory Council Update
Fugaku Supercomputer joins fight against COVID-19
Energy Efficient Computing using Dynamic Tuning
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
State of ARM-based HPC
Versal Premium ACAP for Network and Cloud Acceleration
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Scaling TCO in a Post Moore's Era
CUDA-Python and RAPIDS for blazing fast scientific computing
Introducing HPC with a Raspberry Pi Cluster
Overview of HPC Interconnects

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
KodekX | Application Modernization Development
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Big Data Technologies - Introduction.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Electronic commerce courselecture one. Pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
cuic standard and advanced reporting.pdf
Programs and apps: productivity, graphics, security and other tools
Digital-Transformation-Roadmap-for-Companies.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KodekX | Application Modernization Development
Building Integrated photovoltaic BIPV_UPV.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Encapsulation_ Review paper, used for researhc scholars
Reach Out and Touch Someone: Haptics and Empathic Computing
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
Big Data Technologies - Introduction.pptx
Empathic Computing: Creating Shared Understanding
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
MYSQL Presentation for SQL database connectivity
Electronic commerce courselecture one. Pdf
Understanding_Digital_Forensics_Presentation.pptx
20250228 LYD VKU AI Blended-Learning.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx

WekaIO: Making Machine Learning Compute Bound Again

  • 1. 1WekaIO Confidential © 2017 All rights reserved. Making Machine Learning Compute Bound Again Liran Zvibel, Co-founder and CEO liran@weka.io @liranzvibel
  • 2. 2WekaIO Confidential © 2017 All rights reserved. THE STORAGE DNA WekaIO Introduction WekaIO Matrix is the fastest, most scalable parallel file system for AI and technical compute workloads that ensures your applications never wait for data. WHO WE ARE THE PARTNERS THE ACCOLADES
  • 3. 3WekaIO Confidential © 2017 All rights reserved. We are using WekaIO technologies over InfiniBand to address the challenges of data analytics at extreme scale in life sciences, particle physics, geosciences, and other fields. That process is still ongoing but to-date we’ve already achieved some promising results. Michael Norman, Director of San Diego Supercomputer Center at UCSD
  • 4. 4WekaIO Confidential © 2017 All rights reserved.
  • 5. 5WekaIO Confidential © 2017 All rights reserved. Future-thinking companies like WekaIO, complement our core principle of accelerating research and discovery. The ability to run more concurrent high performance genomic workloads will significantly advance our time to discovery. Nelson Kick, Manager of HPC Operations
  • 6. 6WekaIO Confidential © 2017 All rights reserved. “Where can you find another non- linear servo-mechanism weighing only 150 pounds and having great adaptability, that can be produced so cheaply by completely unskilled labor?” -- Albert Scott Crossfield, May 1954 https://quoteinvestigator.com/2016/02/01/computer/#note-12956-2
  • 7. 7WekaIO Confidential © 2017 All rights reserved. "within a generation ... the problem of creating 'artificial intelligence' will substantially be solved” (1967) Marvin Minsky (1927 – 2016) Co-Founder of MIT’s AI lab in 1959 The Journey Continues Artificial Intelligence Demystified Source: www.AnalyticsVidhya.com
  • 8. 8WekaIO Confidential © 2017 All rights reserved. “Self driving vehicles will completely revolutionize the way we get around in the future, but also have the potential to literally save 100k of lives… Self driving vehicles let us think differently about road safety… 94% of vehicle accidents are caused by human error. Why wouldn’t we do anything we can to bring self driving vehicles technologies to our roads?” -- Senator Gary Peters, November 2017 Making the Case for AI
  • 9. 9WekaIO Confidential © 2017 All rights reserved. Deep Learning Requires Data
  • 10. 10WekaIO Confidential © 2017 All rights reserved. Source: Andrew Ng, VP & Chief Scientist of Baidu; Co-Chairman and Co- Founder of Coursera; and Adjunct Professor at Stanford University Scale Drives Progress
  • 11. 11WekaIO Confidential © 2017 All rights reserved. What has changed recently? o Deep Learning algorithms have been known since the ’80s o Problem was with the computational power – GPUs and other accelerators made deep learning practical o “GPUs are the most efficient way to take a compute bound problem, and make it IO bound” – Addison Snell, InterSect360 Research o We will show you how to make sure your Machine Learning workloads are not IO bound!
  • 12. 12WekaIO Confidential © 2017 All rights reserved. GPU Progress in One Year nVidia P100 vs V100 0.0x 0.5x 1.0x 1.5x 2.0x 2.5x 3.0x 3.5x PyTorch Tensor Flow Caffe2 Caffe MXNet 2.2x 2.5x 2.9x 3.0x 3.05x Speedup DGX-1 P100 DGX-1 V100
  • 13. 13WekaIO Confidential © 2017 All rights reserved. Deep Learning Requirements o Actually very close to HPC problems… o Store a vast amount of data – Effectively “stage” working set back on fast storage, for efficient access o High bandwidth, low latency o Very good metadata performance – traverse files quickly o Very high single host performance o Support multiprotocol (S3, HDFS, SMB, NFS)
  • 14. 14WekaIO Confidential © 2017 All rights reserved. Parallel FS over NVMe-oF WekaIO Matrix Shared File System VM Hypervisor – KVM, VMWare VM VM App App App App Fully Coherent POSIX File System That Delivers Local File System Performance Distributed Coding, More Resilient at Scale, Fast Rebuilds, End-to-End DP Instantaneous Snapshots, Clones, Tiering to S3, Partial File Rehydration InfiniBand or Ethernet, Hyperconverged or Dedicated Storage Server
  • 15. 15WekaIO Confidential © 2017 All rights reserved. Why parallel FS over NVMe-oF is needed o Local copy architectures (e.g. Hadoop, or caching solutions) were developed when 1GbitE and HDDs were standard o Modern networks on 100Gbit IB or Ethernet are 10x faster than SSD o It is much easier to create distributed algorithms when locality is not important o With right networking stack, shared storage is faster than local storage 0 20 40 60 80 100 120 100Gbit (EDR) 10Gbit (SDR) SSD Write SSD Read Time it takes to Complete a 4KB Page Move Microseconds
  • 16. 16WekaIO Confidential © 2017 All rights reserved. Data Lake S3 MatrixFS Matrix Client WekaIO Storage Solution for GPU Clusters Ethernet or InfiniBand Network Storage Servers APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU APP APP GPU GPU Unified Namespace
  • 17. 17WekaIO Confidential © 2017 All rights reserved. Data Lake S3 WekaIO Can Even Run in the GPU Cluster Unified Namespace Ethernet or InfiniBand Network APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU APP APP APP APP APP APP GPU SSD APP APP GPU GPU
  • 18. 18WekaIO Confidential © 2017 All rights reserved. AI Requires an Appropriate IA INSTRUMENT INGEST CLEAN TAG EXPLORE TRAIN VALIDATE S3 SMB NFS CPU Based HDFS Interface ARCHIVE & CATALOG GPU Based Server Cluster Growing Data Archive WekaIO CPU GPU
  • 19. 19WekaIO Confidential © 2017 All rights reserved. Aggregate low latency operations – high throughput [root@us2 imagenet]# time find . | wc -l 14092659 real 3m15.444s user 0m8.333s sys 0m4.397s [root@us2 imagenet]# time ls|wc -l 21851 real 0m0.643s user 0m0.084s sys 0m0.005s [root@us2 imagenet]# time ~/bin/fd . | wc -l 14092658 real 0m51.199s user 1m44.484s sys 0m46.463s [https://github.com/sharkdp/fd ]
  • 20. 20WekaIO Confidential © 2017 All rights reserved. Aggregate low latency operations – high throughput o Higher performance than a local file system over NVMe devices – caching is useless! – Much lower latency for 1MB files, similar latency even for 4k IOs o Fill line (100GbEEDR) with reads of small file (can fill line with 4k IO) o Cluster scales as needed, can support 100s of GPU servers working concurrently
  • 21. 21WekaIO Confidential © 2017 All rights reserved. o Problem: Could not achieve the bandwidth required to keep GPU cluster saturated o Pain Point: Wasted cycle time ($$$$) on very expensive GPU clusters. o Test Platform: 10 Node HPE DL360 vs. local disk and leading All-Flash NAS o Result: – WekaIO – over 2x faster than Local Disk – WekaIO – 6x faster than All-flash NAS WekaIO is over 2x Faster than Local Disk GPU Client Actual measured data at an autonomous vehicle training installation 0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 WekaIO Local Disk SSD All Flash NAS MB/sec 1MB Read Performance (Single GPU Client)
  • 22. 22WekaIO Confidential © 2017 All rights reserved. Comparing WekaIO and GPFS over NVMe 0 0.5 1 1.5 2 2.5 3 60 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 SPEC 2014 Results – Comparing Netapp FAS and IBM Spectrum Scale 5.0 IBM Spectrum Scale NetApp WekaIO Latency(milisecond) WekaIO does 2x the workload of IBM Spectrum Scale on standard Whitebox Servers
  • 23. 23WekaIO Confidential © 2017 All rights reserved. Tiering o HDD integration provides optimal economic model o Parallel FS algorithms over the HDDs similar to traditional solutions o Large files are chopped down into small objects to achieve parallelism – Can rehydrate and modify partial files o Stage next training set to NVMe while current training run works – make sure GPUs are always utilized o Data demoted to object storage remains on SSD as cache until space is required COLD DATA
  • 24. 24WekaIO Confidential © 2017 All rights reserved. Fully Distributed Snapshots o 4K granularity o Instantaneous and no impact on performance o Supports clones (writable snapshots) o Support file system wide or file-based snaps o Redirect-on-write based snaps
  • 25. 25WekaIO Confidential © 2017 All rights reserved. Snap-to-obj DR/Backup/burst Functionality Follow-on snapshot will be saved in differential manner Enabled use cases: • Pause/resume : Allows a cluster to be shutdown when not needed, supporting cloud elasticity • Backup : the original cluster is not needed in order to access the data • DR : enabled via geo-replicated object storage, or tiering to the cloud • Cloud bursting : launch a cluster in the cloud based on a snap saved on the object storage The ability to coherently save a complete snapshot to the object storage Rehydrated cluster may be of different size The original cluster can now shut off and the data is safe
  • 26. 26WekaIO Confidential © 2017 All rights reserved. Snapshot File System for Infrastructure Elasticity Amazon Cloud Matrix On-Prem Matrix Global Namespace Matrix Global Namespace Matrix Cluster
  • 27. 27WekaIO Confidential © 2017 All rights reserved. Cloud Bursting o EC2 cluster can be formed based on S3 snap data o The original on-premises cluster can continue running and take snaps o The EC2 cluster can run concurrently and take snaps o Each snap that is pushed to S3 can be linked back to the other system o The data is viewed via the namespace using the .snapshots directory and data can be merged o A hybrid-cloud solution that is “sticky” to the public cloud. Other solutions require network transfers that are transient o The “data gravity” is in AWS rather than on-prem. A follow up burst is “cheaper”
  • 28. 28WekaIO Confidential © 2017 All rights reserved. Simple, Intuitive GUI
  • 29. 29WekaIO Confidential © 2017 All rights reserved. Simplifying Machine Learning Workflows APP APP GPU GPU APP APP GPU GPU o Full line bandwidth to a single GPU – GPUs are kept fully utilized – Faster deep learning Wall time – Or same results can be obtained with less GPUs EDR o No need to copy data to local disk on GPU server – Higher performance than local disk due to parallel access – Faster time to first epoch o Centralized storage for easy management o Run in AWS for on-demand access to P3 GPUs o Burst to AWS for elastic scaling of GPU resources >>>>>>> On-premises AWS Cloud
  • 30. 30WekaIO Confidential © 2017 All rights reserved. Scalable High Performance Storage Infrastructure Easy to manage • Single namespace with easy data sharing • Simple intuitive GUI • Installs in minutes • Dynamic capacity scaling • Easy performance scaling • Multi-protocol access - NFS/SMB/HDFS/REST(S3) • Cloud analytics for easy maintenance and control • Policy driven auto. tiering Flexible • Supports Ethernet or InfiniBand • Public, private or hybrid Cloud deployment • Annual subscription with no maintenance contracts • Flexible configurations - Hyperconverged - Dedicated servers • Data lake to lower cost HDDs via S3, Swift or NFS on -prem or public cloud Feature Rich • Highest Performance NVMe - 150 usec 4k latency - Up to 6GB/sec per client - Lower latency than a local FS over NVMe under load • High Performance SATA - Leverage existing infra • Configurable resiliency N+2 to N+4 - Multiple concurrent failures - scales to 1000s of servers • Native-perf. 4k granular snapshots and clones • Cloud bursting and DR
  • 31. 31WekaIO Confidential © 2017 All rights reserved.
  • 32. 32WekaIO Confidential © 2017 All rights reserved. WekaIO over IB – Best Storage for Deep LearningGPU INSTRUMENT INGEST CLEAN TAG EXPLORE TRAIN VALIDATE S3 SMB NFS CPU Based HDFS Interface ARCHIVE & CATALOG GPU Based Server Cluster Growing Data Archive WekaIO CPU GPU Best Workflow support for AI workloads • Natively support Infiniband – default interconnect for GPU servers and workloads (DGX-1, HPE 6500) • Highest Single Client Performance – saturate EDR line, lowest latency 4k IO. Low latency, high throughput • Your GPU servers won’t idle anymore! • Run GPU-Direct concurrent to storage, single network • Scales to support any number of GPU servers sharing the FS • Single solution from ingest to archive. Native ability to stage upcoming working set to SSD for next training job • Parallel File System with NVMe-oF technology – IB latency insignificant compared to SSD latency • Shared file system with metadata performance of local FS 0 50 100 150 100Gbit (EDR) 10Gbit (SDR) SSD Write SSD Read Microseconds
  • 33. 33WekaIO Confidential © 2017 All rights reserved. Software Architecture o Runs inside LXC container for isolation o SR-IOV to run network stack and NVMe in user space o Provides POSIX VFS through lockless queues to WekaIO driver o I/O stack bypasses kernel o Scheduling and memory management also bypass kernel o Metadata split into many Buckets – Buckets quickly migrate è no hot spots o Support, bare metal, container & hypervisor Clustering Balancing Failure Det. & Comm. IP Takeover Applicatio nApplicatio nApplicatio nApplication Frontend SSD Agent H/W User Space KernelWeka DriverTCP/IP Stack Distributed FS “Bucket”Distributed FS “Bucket”Distributed FS “Bucket”Distributed FS “Bucket”Distributed FS “Bucket” Data Placement FS Metadata Tiering Backend Networking NFS S3 SMB HDFS
  • 34. 34WekaIO Confidential © 2017 All rights reserved. Snapshot File System as DR Strategy Amazon Region A Matrix On-Prem Amazon Region B File System