IT301: INTRODUCTION TO
CUDA
By,
Ms. Thanmayee
Ad hoc Faculty,
Department of IT,
NITK, Surathkal
OUTLINE
● Introduction to GPU
● Evolution of GPU microarchitectures
● General Purpose GPU
● Introduction to CUDA
● CUDA Execution Model
● CUDA Memory Model
● Steps in GPU Execution
● Hello World Program
● CUDA Device Variables
● CUDA Programming examples
CPU vs GPU
● Need to understand how CPUs and GPUs differ:
− Simple calculations versus complex calculations
− Basic graphics versus 3D rendering and animation
− A few high-capacity cores versus many low-capacity cores
− Latency intolerance versus latency tolerance
− Task parallelism versus data parallelism
− Tens of threads versus tens of thousands of threads
Latency Hiding in GPU
● GPUs tolerate long memory latencies by keeping thousands of threads resident: while one group of threads waits on memory, the hardware switches to another group that is ready to execute.
General Purpose GPU : GPGPU
The dawn of GPGPU
● General-purpose computing on GPUs was far from easy back then
− even for those who knew graphics programming languages such as OpenGL!
− Developers had to map scientific calculations onto problems that could be represented by triangles and polygons.
Applications
● Machine learning – self-driving cars, the Watson AI supercomputer.
● Scientific applications such as genome sequencing and molecular simulations.
● Medical image processing.
● Image tagging on Facebook.
● Numerical weather prediction.
● Oil exploration.
● Movie making.
● Atmospheric simulation.
● Sequencing the novel coronavirus and the genomes of people afflicted with COVID-19.
CUDA – Compute Unified Device Architecture
● In 2003, a team of researchers led by Ian Buck unveiled Brook, the first widely adopted programming model to extend C with data-parallel constructs.
● Brook exposed the GPU as a general-purpose processor in a high-level language.
− Most importantly, Brook programs were:
● easier to write than hand-tuned GPU code, and
● seven times faster than similar existing code.
CUDA – Compute Unified Device Architecture
● NVIDIA invited Ian Buck to join the company.
− He began evolving a solution to run C seamlessly on the GPU.
− Putting the software and hardware together, NVIDIA unveiled CUDA in 2006 and launched it publicly in 2007.
− It was the world's first solution for general-purpose computing on GPUs.
− CUDA:
■ is a parallel computing architecture and programming model;
■ includes a C/C++ compiler, with support for OpenCL and DirectCompute as well.
General Structure of the GPU Program in CUDA
● Host Program – executed by the CPU.
− This is serial code.
− It sets up the parameters for GPU (kernel) execution.
● Kernel Program – executed in parallel by the SIMD cores (streaming processors) in the GPU.
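The host/kernel split described above can be sketched as follows; the kernel name, array, and launch configuration here are illustrative assumptions, not taken from the slides:

```cuda
#include <cstdio>

// Kernel program: executed in parallel by the GPU's streaming processors.
__global__ void scale(float *data, float factor, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // unique global thread index
    if (i < n) data[i] *= factor;                   // guard against excess threads
}

int main() {
    const int n = 1024;
    float host[n];
    for (int i = 0; i < n; ++i) host[i] = 1.0f;

    // Host program: serial code that sets up kernel execution.
    float *dev;
    cudaMalloc(&dev, n * sizeof(float));
    cudaMemcpy(dev, host, n * sizeof(float), cudaMemcpyHostToDevice);

    scale<<<4, 256>>>(dev, 2.0f, n);   // grid of 4 blocks x 256 threads = 1024 threads

    cudaMemcpy(host, dev, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(dev);
    printf("host[0] = %f\n", host[0]);
    return 0;
}
```

The serial host code allocates device memory, copies data across, and configures the launch; all parallelism lives inside the `__global__` kernel.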
Compiling CUDA Program:
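CUDA source files (`.cu`) are compiled with NVIDIA's `nvcc` driver, which separates device code (compiled to GPU machine code) from host code (handed to the host C++ compiler). A typical invocation, with a hypothetical file name:

```
nvcc -o scale scale.cu               # compile and link host + device code
nvcc -arch=sm_70 -o scale scale.cu   # optionally target a specific GPU architecture
./scale                              # run the resulting executable
```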
CUDA Execution Model
● Threads:
○ perform the computations; they run on the scalar processors (streaming processors) of the GPU.
○ Thousands are needed to achieve full efficiency.
● Blocks:
○ A block is a group of threads; it can contain from 1 up to 1024 threads.
○ Blocks are allotted to the streaming multiprocessors (SMs) of the GPU.
○ Multiple blocks can reside on one SM.
● Grid:
○ A grid is a group of blocks.
○ It holds the complete computation task; a grid corresponds to one kernel launch.
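The thread–block–grid hierarchy above maps directly onto the kernel launch syntax; a minimal sketch with assumed names (`myKernel`, the 8×256 configuration) chosen for illustration:

```cuda
__global__ void myKernel(int *out) {
    // Each thread derives a unique global index from its block and thread IDs.
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    out[i] = i;
}

// Launch a grid of 8 blocks, each with 256 threads
// (within the 1024-threads-per-block limit):
//
//     myKernel<<<8, 256>>>(devOut);   // 8 * 256 = 2048 threads in total
//
// The first launch parameter is the grid size (blocks), the second the
// block size (threads per block).
```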
Blocks in SMs
THANK YOU