SlideShare a Scribd company logo
Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs
DB Tech Showcase | Tokyo | September 20, 2018
© MapD 2018
Aaron Williams
VP of Global Community
@_arw_
aaron@mapd.com
/in/aaronwilliams/
/williamsaaron
slides: https://speakerdeck.com/mapd
© MapD 2018
3
Personas in
Analytics Lifecycle
(Illustrative)Business Analyst
Data Scientist
Data Engineer
IT Systems Admin
Data Scientist / Business Analyst
Data
Preparation
Data
Discovery
& Feature
Engineering
Model &
Validate
Predict
Operationalize
Monitoring &
Refinement
Evaluate
& Decide
GPUs
Friday, Sept 21
[db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』
[db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』
[db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』
[db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』
The Fastest Software
Designed for the Fastest Hardware
HARNESS GPUs
© MapD 2018 9
GPU ProcessingCPU Processing
40,000
Cores
20 Cores
*fictitious example
Latency Throughput
CPU
1 ns
per task
(1 task/ns) x (20 cores) =
20 tasks/ns
GPU
10 ns
per task
(0.1 task per ns) x (40,000 cores) =
4,000 task per ns
Latency: Time to do a task. | Throughput: Number of tasks per unit time.
© MapD 2018 10
* open source for single node
github.com/mapd/mapd-core
D E M O S
https://www.mapd.com/demos/
© MapD 2018
Advanced memory management
Three-tier caching to GPU RAM for speed and to SSDs for persistent storage
1
2
SSD or NVRAM STORAGE (L3)
250GB to 20TB
1-2 GB/sec
CPU RAM (L2)
32GB to 3TB
70-120 GB/sec
GPU RAM (L1)
24GB to 256GB
1000-6000 GB/sec
Hot Data
Speedup = 1500x to 5000x
Over Cold Data
Warm Data
Speedup = 35x to 120x
Over Cold Data
Cold Data
COMPUTE
LAYER
STORAGE
LAYER
Data Lake/Data Warehouse/System Of Record
© MapD 2018
MapD Core: Query Compilation with LLVM
© MapD 2018
MapD Immerse: Hybrid Rendering
EXTREME ANALYTICS:
EXTREME DATA & EXTREME EXPERIENCE
[db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』
© MapD 2018
TOP-TIER VENTURE BACKING
USED BY 100+ GLOBAL ORGS$37 MILLION
IN FUNDING
OPEN-SOURCE COMMUNITY
About MapD
17
• mapd.com/demos
Play with our demos - everything demo you
saw in this talk was live!
• mapd.cloud
Get a MapD instance in less than 60 seconds
• www.mapd.com/platform/downloads/
Download the Community Edition
• community.mapd.com
Ask questions and share your experiences
Next Steps
© MapD 2018
Aaron Williams
VP of Global Community
@_arw_
aaron@mapd.com
/in/aaronwilliams/
/williamsaaron
slides: https://speakerdeck.com/mapd
Thank you! Questions?

More Related Content

PDF
Supermap gis 10i(2020) ai gis technology v1.0
PPTX
ADF 3D Laser Scanning
DOCX
More on my LinkedIn Summary
PPTX
Time Series with Driverless AI - Marios Michailidis and Mathias Müller - H2O ...
PPSX
Capture and Use of Geo-Located Asset Information using Reality Capture Techno...
PPTX
Detecting Buildings in AHN2 LiDAR data with ArcGIS - Grontmij
PDF
Greenplum for Kubernetes PGConf india 2019
PDF
ICLR'2020 参加速報
Supermap gis 10i(2020) ai gis technology v1.0
ADF 3D Laser Scanning
More on my LinkedIn Summary
Time Series with Driverless AI - Marios Michailidis and Mathias Müller - H2O ...
Capture and Use of Geo-Located Asset Information using Reality Capture Techno...
Detecting Buildings in AHN2 LiDAR data with ArcGIS - Grontmij
Greenplum for Kubernetes PGConf india 2019
ICLR'2020 参加速報

What's hot (19)

PPTX
Jovian Data Amazon Final Version
PDF
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
PDF
PowerStream: Propelling Energy Innovation with Predictive Analytics
PDF
Introduction of super map gis 10i(2020) (1)
PDF
Digital Transformation & Solvency II Simulations for L&G: Optimizing, Acceler...
 
PPTX
Field Activity Planner - A cloud based digital energy platform
PDF
Why Open Source Works for DevOps Monitoring
PDF
Demonstration of super map ai gis technology
PDF
Developing Spatial Applications with CARTO for React v1.1
PDF
Large Scale Geospatial Indexing and Analysis on Apache Spark
PDF
Developing Spatial Applications with Google Maps and CARTO
PPTX
Walking in the Cloud: A New Paradigm in Geospatial World
PDF
Skyworks Aerial Systems
PDF
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
PDF
The Environment Agency - Improving Incident Response - Collaborative Working ...
Jovian Data Amazon Final Version
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
PowerStream: Propelling Energy Innovation with Predictive Analytics
Introduction of super map gis 10i(2020) (1)
Digital Transformation & Solvency II Simulations for L&G: Optimizing, Acceler...
 
Field Activity Planner - A cloud based digital energy platform
Why Open Source Works for DevOps Monitoring
Demonstration of super map ai gis technology
Developing Spatial Applications with CARTO for React v1.1
Large Scale Geospatial Indexing and Analysis on Apache Spark
Developing Spatial Applications with Google Maps and CARTO
Walking in the Cloud: A New Paradigm in Geospatial World
Skyworks Aerial Systems
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
The Environment Agency - Improving Incident Response - Collaborative Working ...
Ad

Similar to [db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』 (20)

PDF
DSDT Meetup January 2018
PDF
Dsdt meetup-january2018
PDF
Machine Learning & Data Science in the Age of the GPU: Smarter, Faster, Better
PDF
GTC Tel Aviv: Accelerate Analytics with a GPU Data Frame
PDF
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
PDF
SoCal Data Science Conference: Machine Learning & Data Science in the Age of ...
PDF
Data Con LA 2018 - How the Auto Industry Accelerates ML with Analytics by Aa...
PDF
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
PDF
GPU databases - How to use them and what the future holds
PDF
Accelerating analytics in a new era of data
PDF
GPU accelerated Large Scale Analytics
PDF
Accelerating Cyber Threat Detection With GPU
PDF
Fast data in times of crisis with GPU accelerated database QikkDB | Business ...
PDF
Introduction to SQream and the IoT environment
PPTX
Advanced Analytics in Banking, CITI
PPTX
Cloud Computing y Big Data, próxima frontera de la innovación
PDF
GOAI: GPU-Accelerated Data Science DataSciCon 2017
PPTX
Powering Real-Time Big Data Analytics with a Next-Gen GPU Database
PPTX
End to End Machine Learning Open Source Solution Presented in Cisco Developer...
PDF
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
DSDT Meetup January 2018
Dsdt meetup-january2018
Machine Learning & Data Science in the Age of the GPU: Smarter, Faster, Better
GTC Tel Aviv: Accelerate Analytics with a GPU Data Frame
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
SoCal Data Science Conference: Machine Learning & Data Science in the Age of ...
Data Con LA 2018 - How the Auto Industry Accelerates ML with Analytics by Aa...
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
GPU databases - How to use them and what the future holds
Accelerating analytics in a new era of data
GPU accelerated Large Scale Analytics
Accelerating Cyber Threat Detection With GPU
Fast data in times of crisis with GPU accelerated database QikkDB | Business ...
Introduction to SQream and the IoT environment
Advanced Analytics in Banking, CITI
Cloud Computing y Big Data, próxima frontera de la innovación
GOAI: GPU-Accelerated Data Science DataSciCon 2017
Powering Real-Time Big Data Analytics with a Next-Gen GPU Database
End to End Machine Learning Open Source Solution Presented in Cisco Developer...
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
Ad

More from Insight Technology, Inc. (20)

PDF
グラフデータベースは如何に自然言語を理解するか?
PDF
Docker and the Oracle Database
PDF
Great performance at scale~次期PostgreSQL12のパーティショニング性能の実力に迫る~
PDF
事例を通じて機械学習とは何かを説明する
PDF
仮想通貨ウォレットアプリで理解するデータストアとしてのブロックチェーン
PDF
MBAAで覚えるDBREの大事なおしごと
PDF
グラフデータベースは如何に自然言語を理解するか?
PDF
DBREから始めるデータベースプラットフォーム
PDF
SQL Server エンジニアのためのコンテナ入門
PDF
Lunch & Learn, AWS NoSQL Services
PDF
db tech showcase2019オープニングセッション @ 森田 俊哉
PDF
db tech showcase2019 オープニングセッション @ 石川 雅也
PDF
db tech showcase2019 オープニングセッション @ マイナー・アレン・パーカー
PPTX
難しいアプリケーション移行、手軽に試してみませんか?
PPTX
Attunityのソリューションと異種データベース・クラウド移行事例のご紹介
PPTX
そのデータベース、クラウドで使ってみませんか?
PPTX
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
PDF
複数DBのバックアップ・切り戻し運用手順が異なって大変?!運用性の大幅改善、その先に。。
PPTX
Attunity社のソリューションの日本国内外適用事例及びロードマップ紹介[ATTUNITY & インサイトテクノロジー IoT / Big Data フ...
PPTX
レガシーに埋もれたデータをリアルタイムでクラウドへ [ATTUNITY & インサイトテクノロジー IoT / Big Data フォーラム 2018]
グラフデータベースは如何に自然言語を理解するか?
Docker and the Oracle Database
Great performance at scale~次期PostgreSQL12のパーティショニング性能の実力に迫る~
事例を通じて機械学習とは何かを説明する
仮想通貨ウォレットアプリで理解するデータストアとしてのブロックチェーン
MBAAで覚えるDBREの大事なおしごと
グラフデータベースは如何に自然言語を理解するか?
DBREから始めるデータベースプラットフォーム
SQL Server エンジニアのためのコンテナ入門
Lunch & Learn, AWS NoSQL Services
db tech showcase2019オープニングセッション @ 森田 俊哉
db tech showcase2019 オープニングセッション @ 石川 雅也
db tech showcase2019 オープニングセッション @ マイナー・アレン・パーカー
難しいアプリケーション移行、手軽に試してみませんか?
Attunityのソリューションと異種データベース・クラウド移行事例のご紹介
そのデータベース、クラウドで使ってみませんか?
コモディティサーバー3台で作る高速処理 “ハイパー・コンバージド・データベース・インフラストラクチャー(HCDI)” システム『Insight Qube』...
複数DBのバックアップ・切り戻し運用手順が異なって大変?!運用性の大幅改善、その先に。。
Attunity社のソリューションの日本国内外適用事例及びロードマップ紹介[ATTUNITY & インサイトテクノロジー IoT / Big Data フ...
レガシーに埋もれたデータをリアルタイムでクラウドへ [ATTUNITY & インサイトテクノロジー IoT / Big Data フォーラム 2018]

Recently uploaded (20)

PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Cloud computing and distributed systems.
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Empathic Computing: Creating Shared Understanding
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Unlocking AI with Model Context Protocol (MCP)
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Big Data Technologies - Introduction.pptx
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Approach and Philosophy of On baking technology
Understanding_Digital_Forensics_Presentation.pptx
Encapsulation_ Review paper, used for researhc scholars
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Cloud computing and distributed systems.
NewMind AI Weekly Chronicles - August'25 Week I
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Empathic Computing: Creating Shared Understanding
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Diabetes mellitus diagnosis method based random forest with bat algorithm

[db tech showcase Tookyo 2018] #dbts2018 #B24 『Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs』

  • 1. Speed Meets Scale: Analyzing & Visualizing Billions of Data Points with GPUs DB Tech Showcase | Tokyo | September 20, 2018
  • 2. © MapD 2018 Aaron Williams VP of Global Community @_arw_ aaron@mapd.com /in/aaronwilliams/ /williamsaaron slides: https://speakerdeck.com/mapd
  • 3. © MapD 2018 3 Personas in Analytics Lifecycle (Illustrative)Business Analyst Data Scientist Data Engineer IT Systems Admin Data Scientist / Business Analyst Data Preparation Data Discovery & Feature Engineering Model & Validate Predict Operationalize Monitoring & Refinement Evaluate & Decide GPUs Friday, Sept 21
  • 8. The Fastest Software Designed for the Fastest Hardware HARNESS GPUs
  • 9. © MapD 2018 9 GPU ProcessingCPU Processing 40,000 Cores 20 Cores *fictitious example Latency Throughput CPU 1 ns per task (1 task/ns) x (20 cores) = 20 tasks/ns GPU 10 ns per task (0.1 task per ns) x (40,000 cores) = 4,000 task per ns Latency: Time to do a task. | Throughput: Number of tasks per unit time.
  • 10. © MapD 2018 10 * open source for single node github.com/mapd/mapd-core
  • 11. D E M O S https://www.mapd.com/demos/
  • 12. © MapD 2018 Advanced memory management Three-tier caching to GPU RAM for speed and to SSDs for persistent storage 1 2 SSD or NVRAM STORAGE (L3) 250GB to 20TB 1-2 GB/sec CPU RAM (L2) 32GB to 3TB 70-120 GB/sec GPU RAM (L1) 24GB to 256GB 1000-6000 GB/sec Hot Data Speedup = 1500x to 5000x Over Cold Data Warm Data Speedup = 35x to 120x Over Cold Data Cold Data COMPUTE LAYER STORAGE LAYER Data Lake/Data Warehouse/System Of Record
  • 13. © MapD 2018 MapD Core: Query Compilation with LLVM
  • 14. © MapD 2018 MapD Immerse: Hybrid Rendering
  • 15. EXTREME ANALYTICS: EXTREME DATA & EXTREME EXPERIENCE
  • 17. © MapD 2018 TOP-TIER VENTURE BACKING USED BY 100+ GLOBAL ORGS$37 MILLION IN FUNDING OPEN-SOURCE COMMUNITY About MapD 17
  • 18. • mapd.com/demos Play with our demos - everything demo you saw in this talk was live! • mapd.cloud Get a MapD instance in less than 60 seconds • www.mapd.com/platform/downloads/ Download the Community Edition • community.mapd.com Ask questions and share your experiences Next Steps
  • 19. © MapD 2018 Aaron Williams VP of Global Community @_arw_ aaron@mapd.com /in/aaronwilliams/ /williamsaaron slides: https://speakerdeck.com/mapd Thank you! Questions?