SlideShare a Scribd company logo
3
Most read
6
Most read
7
Most read
CUDAArchitecture
Prof. Shashikant V. Athawale
Assistant Professor | Computer Engineering
Department | AISSMS College of Engineering,
Kennedy Road, Pune , MH, India - 411001
Contents
❖ CUDAArchitecture
❖ Applications of CUDA
❖ Introduction to CUDA C-Write and launch CUDA C
kernels
❖ Manage GPU memory
❖ Manage communication and synchronization
❖ Parallel programming in CUDA- C.
Communication And Synchronization in
Thread
Communication And Synchronization in
Thread
CUDAArchitecture
CUDAArchitecture
Applications of CUDA
CUDA C : The Basics
❖ Based on industry-standard C
❖ A handful of language extensions to allow heterogeneous
programs
❖ Straightforward APIs to manage devices, memory, etc.
❖ Terminology:
➢ Host – The CPU and its memory (host memory)
➢ Device – The GPU and its memory (device memory)
Device
CUDA Kernels
GPU Memory Management
Data Transfer Directions Keywords
❖ cudaMemcpyHostToHost
❖ cudaMemcpyHostToDevice
❖ cudaMemcpyDeviceToHost
❖ cudaMemcpyDeviceToDevice
Parallel Programming in CUDA C
❖ CUDA brings data-parallel computing to the masses.
❖ CUDA is a scalable parallel programming model.
❖ Program runs on any number of processors without
recompiling.
Architecture Of Parallel CUDA Programming
CUDA Uses Extensive Multithreading
❖ CUDA threads express fine-grained data parallelism.
➢ Map threads to GPU threads.
➢ Virtualize the processors.
❖ CUDA thread blocks express coarse-grained parallelism.
➢ Blocks hold arrays of GPU threads, define shared
memory boundaries.
➢ Allow scaling between smaller and larger GPUs.
CUDA Uses Extensive Multithreading
❖ GPUs execute thousands of lightweight threads.
➢ In graphics, each thread computes one pixel.
➢ One CUDA thread computes one result (or several
results).
➢ Hardware multithreading & zero-overhead
scheduling.
Applications
❖ High bandwidth
❖ Visual computing
❖ High arithmetic intensity

More Related Content

PDF
Everything You Need To Know About ChatGPT
PDF
用十分鐘 向jserv學習作業系統設計
PPTX
Encryption ppt
PPTX
PDF
GAN in medical imaging
PPT
CCNA PPT
PPT
Human Rights Presentation
Everything You Need To Know About ChatGPT
用十分鐘 向jserv學習作業系統設計
Encryption ppt
GAN in medical imaging
CCNA PPT
Human Rights Presentation

What's hot (20)

PPTX
Cuda Architecture
PPT
Parallel computing
PPTX
High performance computing
PPTX
Parallel Programming
PDF
Lecture 1 introduction to parallel and distributed computing
PDF
Interconnection Network
PPTX
Computer architecture multi processor
DOCX
Parallel computing persentation
PPT
Parallel Computing
PDF
Introduction to Parallel Computing
PPT
Evaluation of morden computer & system attributes in ACA
PPT
system interconnect architectures in ACA
PPTX
Cachememory
PPTX
Flynn's Taxonomy
PDF
Distributed Operating System_1
PDF
Parallel programming model, language and compiler in ACA.
PPTX
Dichotomy of parallel computing platforms
PPTX
Superscalar Architecture_AIUB
PPT
Unit 4
PPTX
Lecture 3 threads
Cuda Architecture
Parallel computing
High performance computing
Parallel Programming
Lecture 1 introduction to parallel and distributed computing
Interconnection Network
Computer architecture multi processor
Parallel computing persentation
Parallel Computing
Introduction to Parallel Computing
Evaluation of morden computer & system attributes in ACA
system interconnect architectures in ACA
Cachememory
Flynn's Taxonomy
Distributed Operating System_1
Parallel programming model, language and compiler in ACA.
Dichotomy of parallel computing platforms
Superscalar Architecture_AIUB
Unit 4
Lecture 3 threads
Ad

Similar to CUDA Architecture (20)

PPTX
Compute Unified Device Architecture (CUDA) Programmimg
PDF
IIT ropar_CUDA_Report_Ankita Dewan
PDF
IIT ropar_CUDA_Report_Ankita Dewan
PDF
NASA Advanced Supercomputing (NAS) Division - Programming and Building HPC Ap...
PDF
CUDA by Example : The Final Countdown : Notes
PPTX
PDF
PPTX
PPTX
Pycon2014 GPU computing
PDF
Cuda materials
PDF
GPU - An Introduction
PDF
Introduction to GPU Programming
PDF
GPU Programming with Java
PDF
Cuda lab manual
PDF
Engineer Engineering Software
PDF
The Past, Present, and Future of OpenACC
PDF
Kato Mivule: An Overview of CUDA for High Performance Computing
PPT
Guide to heterogeneous system architecture (hsa)
PPT
Cuda intro
PDF
S0333 gtc2012-gmac-programming-cuda
Compute Unified Device Architecture (CUDA) Programmimg
IIT ropar_CUDA_Report_Ankita Dewan
IIT ropar_CUDA_Report_Ankita Dewan
NASA Advanced Supercomputing (NAS) Division - Programming and Building HPC Ap...
CUDA by Example : The Final Countdown : Notes
Pycon2014 GPU computing
Cuda materials
GPU - An Introduction
Introduction to GPU Programming
GPU Programming with Java
Cuda lab manual
Engineer Engineering Software
The Past, Present, and Future of OpenACC
Kato Mivule: An Overview of CUDA for High Performance Computing
Guide to heterogeneous system architecture (hsa)
Cuda intro
S0333 gtc2012-gmac-programming-cuda
Ad

More from Dr Shashikant Athawale (20)

PPT
multi threaded and distributed algorithms
PPT
Amortized analysis
PPT
Complexity theory
PPT
Divide and Conquer
PPT
Model and Design
PPT
Fundamental of Algorithms
PPT
Parallel Algorithms- Sorting and Graph
PPT
Analytical Models of Parallel Programs
PPT
Basic Communication
PPT
Parallel Processing Concepts
PPT
Parallel Processing Concepts
PPT
Dynamic programming
PPT
Parallel algorithms
PPT
Greedy method
PPT
Divide and conquer
PPT
Branch and bound
PPT
Asymptotic notation
PPT
String matching algorithms
PPTX
Advanced Wireless Technologies
multi threaded and distributed algorithms
Amortized analysis
Complexity theory
Divide and Conquer
Model and Design
Fundamental of Algorithms
Parallel Algorithms- Sorting and Graph
Analytical Models of Parallel Programs
Basic Communication
Parallel Processing Concepts
Parallel Processing Concepts
Dynamic programming
Parallel algorithms
Greedy method
Divide and conquer
Branch and bound
Asymptotic notation
String matching algorithms
Advanced Wireless Technologies

Recently uploaded (20)

DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Arduino robotics embedded978-1-4302-3184-4.pdf
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
Digital Logic Computer Design lecture notes
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
additive manufacturing of ss316l using mig welding
PPT
Project quality management in manufacturing
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Geodesy 1.pptx...............................................
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
web development for engineering and engineering
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
Construction Project Organization Group 2.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Arduino robotics embedded978-1-4302-3184-4.pdf
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Digital Logic Computer Design lecture notes
Model Code of Practice - Construction Work - 21102022 .pdf
additive manufacturing of ss316l using mig welding
Project quality management in manufacturing
Strings in CPP - Strings in C++ are sequences of characters used to store and...
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Internet of Things (IOT) - A guide to understanding
Geodesy 1.pptx...............................................
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
bas. eng. economics group 4 presentation 1.pptx
web development for engineering and engineering
Embodied AI: Ushering in the Next Era of Intelligent Systems
Construction Project Organization Group 2.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS

CUDA Architecture