SlideShare a Scribd company logo
Victor Haydin
Head of R&D, ELEKS
Agenda
1. What is HPC?
2. Why does somebody need it?
3. How to do it?
What?
Definition
Wikipedia: “High-performance computing (HPC)
uses supercomputers and computer clusters to solve
advanced computation problems.
Today, computer systems approaching the teraflops-
region are counted as HPC-computers.”
Definition
   advanced
computation
   problems
Modeling and Simulation
Low-latency processing
Big Data
A.I.
Supercomputers
    Computer clusters
Teraflops performance
HPC systems comparison
100000000


10000000


  1000000


   100000


    10000

            HPC
     1000


      100


       10


        1
             CPU (Intel Ivy Bridge)   100xCPU   GPU (NVIDIA Kepler)   100xGPU   IBM Sequoia
Why?
Finances
Healthcare
Fluid- and Aerodynamics
Genetics
Computer Vision and Image Processing
How?
Disclaimer
Commodity Hardware
Specialized


    VS.
GPU-based
Example 1:
Financial Risk Analysis
Using Monte-Carlo method
On GPGPU
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Distribute
Run
Define
Store
Feed
Present
Survive
High-level architecture
Middleware
Worker
Example 2:
Image Search platform
Using local feature detection
On GPGPU
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
High-level architecture
Middleware
Load Balancing
Fast & Furious: building HPC solutions in a nutshell
Fast & Furious: building HPC solutions in a nutshell
Unicast
                                140

• Computation time – 1 second   120


• Sending time – 120 seconds!   100

                                 80
                                                               Unicast

• More workers – slower speed
                                 60

                                 40

                                 20

                                  0
                                      9 workers   18 workers
Multicast
                                140

• Computation time – 1 second   120


• Sending time –25 seconds      100

                                 80
                                              Unicast
                                              Multicast
• Almost 5 times faster
                                 60

                                 40

                                 20

                                  0
                                      1   2
Middleware
Worker
Fast & Furious: building HPC solutions in a nutshell
ERROR: CUDA ERROR CODE 30 (“UNKNOWN ERROR”)
Run same code on CPU and GPU
Kernel
CUDA_KERNEL foo(…)
{
     CUDA_DEFINE_PARAMS;
     // your code here
}

CUDA_CALL(threads, blocks, foo(…))
Generated code
// GPU mode                   // CPU mode
__global__ void foo (…)       void foo(…)
                              {
{
                                      // same code here
       // your code here      }
}
                              // LOOP OVER threads and blocks
foo<<<threads, blocks>>>(…)   {
                                     foo(…)
                              }
Pros & Cons
• Same code for CPU and   • Shared memory
  GPU                     • __syncthreads()
• Debugging
• Range checking
• No CUDA ERROR 30
Fast & Furious: building HPC solutions in a nutshell
@victor_haydin

linkedin.com/in/victorhaydin

victor.haydin@gmail.com
Got a question?
      Ask!

More Related Content

PPT
PDF
PDF
Cuda tutorial
PPTX
Compute Unified Device Architecture (CUDA) Programmimg
PDF
Chainer ui v0.3 and imagereport
PPTX
Nvprof um 1
PDF
A beginner’s guide to programming GPUs with CUDA
PDF
Kato Mivule: An Overview of CUDA for High Performance Computing
Cuda tutorial
Compute Unified Device Architecture (CUDA) Programmimg
Chainer ui v0.3 and imagereport
Nvprof um 1
A beginner’s guide to programming GPUs with CUDA
Kato Mivule: An Overview of CUDA for High Performance Computing

What's hot (13)

PPT
NVidia CUDA for Bruteforce Attacks - DefCamp 2012
PDF
Introduction to CUDA
PDF
How to Burn Multi-GPUs using CUDA stress test memo
PDF
GPU: Understanding CUDA
PDF
Nvidia cuda tutorial_no_nda_apr08
PPT
Introduction to parallel computing using CUDA
PPTX
Multi-GPU MapReduce on GPU Clusters
PDF
CUDA by Example : Atomics : Notes
PDF
Computing using GPUs
PDF
BINARY DATA ADVENTURES IN BROWSER JAVASCRIPT
PDF
Project ACRN GVT-d introduction and tutorial
PPTX
GPGPU programming with CUDA
NVidia CUDA for Bruteforce Attacks - DefCamp 2012
Introduction to CUDA
How to Burn Multi-GPUs using CUDA stress test memo
GPU: Understanding CUDA
Nvidia cuda tutorial_no_nda_apr08
Introduction to parallel computing using CUDA
Multi-GPU MapReduce on GPU Clusters
CUDA by Example : Atomics : Notes
Computing using GPUs
BINARY DATA ADVENTURES IN BROWSER JAVASCRIPT
Project ACRN GVT-d introduction and tutorial
GPGPU programming with CUDA
Ad

Viewers also liked (7)

PPT
HPC in healthcare
PPTX
Big Data HPC Convergence
PDF
Past, present, and future of HPC in life sciences
PDF
ODCA Board Best Practice: High Performance Computing at BMW
PDF
HPC Market Update from IDC
PDF
NVIDIA – Inventor of the GPU
PPTX
Vechicle accident prevention using eye bilnk sensor ppt
HPC in healthcare
Big Data HPC Convergence
Past, present, and future of HPC in life sciences
ODCA Board Best Practice: High Performance Computing at BMW
HPC Market Update from IDC
NVIDIA – Inventor of the GPU
Vechicle accident prevention using eye bilnk sensor ppt
Ad

Similar to Fast & Furious: building HPC solutions in a nutshell (20)

PPT
Current Trends in HPC
PDF
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
PDF
CUG2011 Introduction to GPU Computing
PPTX
iMinds The Conference: Jan Lemeire
PDF
AFDS 2011 Phil Rogers Keynote: “The Programmer’s Guide to the APU Galaxy.”
PDF
Pgopencl
PDF
PostgreSQL with OpenCL
PPTX
Introduction to heterogeneous_computing_for_hpc
PDF
Accelerating Real Time Applications on Heterogeneous Platforms
PDF
Cots moves to multicore: AMD
PDF
Mauricio breteernitiz hpc-exascale-iscte
PDF
The Rise of Parallel Computing
PDF
High Performance Computing: an Introduction for the Society of Actuaries
PDF
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
PPTX
IP Address Lookup By Using GPU
PPT
Vpu technology &gpgpu computing
PPT
Vpu technology &gpgpu computing
PPT
Vpu technology &gpgpu computing
PDF
Current Trends in HPC
[01][gpu 컴퓨팅을 위한 언어, 도구 및 api] miller languages tools
CUG2011 Introduction to GPU Computing
iMinds The Conference: Jan Lemeire
AFDS 2011 Phil Rogers Keynote: “The Programmer’s Guide to the APU Galaxy.”
Pgopencl
PostgreSQL with OpenCL
Introduction to heterogeneous_computing_for_hpc
Accelerating Real Time Applications on Heterogeneous Platforms
Cots moves to multicore: AMD
Mauricio breteernitiz hpc-exascale-iscte
The Rise of Parallel Computing
High Performance Computing: an Introduction for the Society of Actuaries
[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics
IP Address Lookup By Using GPU
Vpu technology &gpgpu computing
Vpu technology &gpgpu computing
Vpu technology &gpgpu computing

More from Victor Haydin (12)

PDF
IoT: future that has already happened
PDF
Marketing by nerds: how R&D actually works
PDF
How to write your database: the story about Event Store
PDF
Not Only Java [JDay Lviv 2013]
PDF
The Renaissance of C++
PPTX
Concurrency: how to shoot yourself in both feet. Simultaneously.
PPTX
Hadoop: the Big Answer to the Big Question of the Big Data
PPTX
Cloud Computing in a Nutshell
PPTX
Databases in .NET
PPTX
Distributed vcs basics + hg
PPTX
Web Development: Yesterday, Today, Tomorrow
PPTX
ASP.Net Core Services
IoT: future that has already happened
Marketing by nerds: how R&D actually works
How to write your database: the story about Event Store
Not Only Java [JDay Lviv 2013]
The Renaissance of C++
Concurrency: how to shoot yourself in both feet. Simultaneously.
Hadoop: the Big Answer to the Big Question of the Big Data
Cloud Computing in a Nutshell
Databases in .NET
Distributed vcs basics + hg
Web Development: Yesterday, Today, Tomorrow
ASP.Net Core Services

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Cloud computing and distributed systems.
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Machine learning based COVID-19 study performance prediction
PDF
Empathic Computing: Creating Shared Understanding
PDF
Approach and Philosophy of On baking technology
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
KodekX | Application Modernization Development
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
NewMind AI Weekly Chronicles - August'25 Week I
Cloud computing and distributed systems.
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
cuic standard and advanced reporting.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
sap open course for s4hana steps from ECC to s4
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
MIND Revenue Release Quarter 2 2025 Press Release
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Machine learning based COVID-19 study performance prediction
Empathic Computing: Creating Shared Understanding
Approach and Philosophy of On baking technology
20250228 LYD VKU AI Blended-Learning.pptx
Spectral efficient network and resource selection model in 5G networks
KodekX | Application Modernization Development

Fast & Furious: building HPC solutions in a nutshell