OpenMP programming and
execution model
SoHPC, 2021
Outline
▪ Execution Model
▪ Memory Model
▪ Race Condition
▪ Parallel Construct
▪ Hello World
▪ If Clause
▪ Dynamic and Nested Regions
▪ Data Clauses
▪ Number of Threads
▪ Practical
Execution Model
• Thread-based Parallelism
o Initially there is a single master thread; at a designated point multiple threads are
created, forming a parallel region
• Compiler Directive Based
o Directives tell the compiler where these parallel regions are
o This means only minimal, incremental changes to the sequential code are needed
• Explicit Parallelism
• Fork-Join Model
Execution Model
• Dynamic Threads
o More than one parallel region
o Different number of threads
• Nested Parallelism
o Parallel region inside another parallel region.
Memory model
• All threads have access to the shared memory.
• Rule of thumb: One thread per core (or processor)
• Cache is private to each core/thread.
• Maintaining a consistent view of main memory across all the caches is called cache
coherency.
[Figure: four CPUs, each with its own private cache, connected to shared main memory and the I/O system]
Memory model
• Threads can share data with other threads, but also have
private data.
[Figure: three threads (Thread 1–3), each with a CPU and its own private data, all accessing common shared data]
Four different parts of the
memory:
• Code area
• Globals area
• Heap
• Stack
Heap: a large pool of memory; allocation and
deallocation must be managed explicitly; shared
by all threads.
Stack: each thread has its own; stores private
data; LIFO principle; fast; no explicit
deallocation needed, unlike the heap.
Race Condition
• Threads communicate through shared variables.
Uncoordinated access to these variables can lead to
undesired effects.
• If two threads update (write) a shared variable in the
same step of execution, the result depends on the
order in which the variable is accessed. This is called
a race condition.
• Suppose one processor has an updated result in its
private cache, and a second processor wants to access
that memory location: a read from main memory will
return the old value, because the updated data has not
yet been written back.
• Enforcing coherence can be time consuming; it is often
better to first change how the data is accessed.
Parallel Constructs
• The fundamental construct in
OpenMP.
• Creates a team of threads.
• Every thread executes the same
statements inside the parallel
region; at the end of the parallel
region there is an implicit barrier.
C/C++:
#pragma omp parallel [clauses]
{
    …
}

Fortran:
!$omp parallel [clauses]
…
!$omp end parallel
double A[1000];
omp_set_num_threads(4);
#pragma omp parallel
{
    int tid = omp_get_thread_num();
    foo(tid, A);
}
printf("All Done\n");

• omp_set_num_threads(4) creates a 4-thread parallel region
• tid runs from 0 to 3
• Each thread calls foo(tid, A): foo(0,A); foo(1,A); foo(2,A); foo(3,A);
• At the implicit barrier, threads wait for all threads to finish
before proceeding to printf("All Done\n");
Parallel Construct
• Clauses:
num_threads (integer-expression)
if (scalar_expression)
private (list)
shared (list)
default (shared | none)
firstprivate (list)
reduction (operator: list)
copyin (list)
Hello World
C - Serial:
#include<stdio.h>

int main(int argc, char**argv){
    printf("Hello world!\n");
}

C:
#include<stdio.h>
#include<omp.h>

int main(int argc, char**argv){
    #pragma omp parallel
    printf("Hello from thread %d out of %d\n",
           omp_get_thread_num(), omp_get_num_threads());
}
Hello World
Fortran - Serial:
program hello
implicit none
print *, 'Hello world!'
end program hello

Fortran:
program hello
use omp_lib
implicit none
!$omp parallel
print *, 'Hello from thread', &
    omp_get_thread_num(), &
    'out of', omp_get_num_threads()
!$omp end parallel
end program hello
If Clause
If Clause:
• Used to make the parallel region directive itself conditional.
• The region executes in parallel only if the expression is true
(e.g. a check that the data set is large enough to be worth the parallel overhead).
C/C++:
#pragma omp parallel if(n>100)
{
    …
}

Fortran:
!$omp parallel if(n>100)
…
!$omp end parallel
Dynamic Threads
Dynamic threads:
• Used to create parallel regions with a variable number of threads
• OpenMP runtime will decide the number of threads
• omp_set_dynamic(), OMP_DYNAMIC, omp_get_dynamic()
omp_set_dynamic(0);
omp_set_num_threads(10);
#pragma omp parallel
printf("Num threads in non-dynamic region is = %d\n", omp_get_num_threads());

omp_set_dynamic(1);
omp_set_num_threads(10);
#pragma omp parallel
printf("Num threads in dynamic region is = %d\n", omp_get_num_threads());
Nested Regions
Nested parallel regions:
• If a parallel directive is encountered
within another parallel directive, a
new team of threads will be created.
• omp_set_nested(), OMP_NESTED,
omp_get_nested()
• The requested number of threads applies to
each new region
• A nested parallel directive creates a team of
one thread unless nested parallelism is enabled
• Use the num_threads(n) clause or dynamic
threading to give regions different numbers of threads
Data Clauses
• Used in conjunction with several directives to control the
scoping of enclosed variables.
– default(shared|none): The default scope for all of the variables;
Fortran has more options.
– shared(list): Variable is shared by all threads in the team. All threads
can read or write to that variable.
C/C++: #pragma omp parallel default(none) shared(n)
Fortran: !$omp parallel default(none) shared(n)
– private(list): Each thread has a private copy of variable. It can only
be read or written by its own thread.
C/C++: #pragma omp parallel default(shared) private(tid)
Fortran: !$omp parallel default(shared) private(tid)
Example
C:
#include<stdio.h>
#include<omp.h>

int main(){
    int tid, nthreads;
    #pragma omp parallel private(tid), shared(nthreads)
    {
        tid = omp_get_thread_num();
        nthreads = omp_get_num_threads();
        printf("Hello from thread %d out of %d\n", tid, nthreads);
    }
}

Fortran:
program hello
use omp_lib
implicit none
integer tid, nthreads
!$omp parallel private(tid), shared(nthreads)
tid = omp_get_thread_num()
nthreads = omp_get_num_threads()
print*, 'Hello from thread', tid, 'out of', nthreads
!$omp end parallel
end program hello
• How do we decide which variables should be shared
and which private?
– Loop indices - private
– Loop temporaries - private
– Read-only variables - shared
– Main arrays - shared
• Most variables are shared by default
– C/C++: File scope, static variables
– Fortran: COMMON blocks, SAVE, MODULE
variables
– Both: dynamically allocated variables
• Variables declared inside the parallel region are
always private
Additional Data Clauses
– firstprivate(list): pre-initializes the private
variables with the value of the variable with the
same name before the parallel construct.

j = jstart;
#pragma omp parallel shared(arr), firstprivate(j)
{
    int tid = omp_get_thread_num();
    arr[tid] = tid + j;
}
for (int i=0; i<nthreads; i++) printf("%d, %d\n", i, arr[i]);

– lastprivate(list): on exiting the parallel region,
gives the private data the value of the last
iteration (as if executed sequentially).

– threadprivate(list): used to make global
file-scope variables (C/C++) or common
blocks (Fortran) private to each thread.

– copyin(list): copies the threadprivate
variables from the master thread to the team
threads.

#pragma omp parallel copyin(jstart)
{
    int tid = omp_get_thread_num();
    jstart = jstart + tid + 1;
    printf("%d, %d\n", tid, jstart);
}
printf("%d\n", jstart);
Runtime Functions
• Runtime Functions: for managing the parallel program
dynamically.
– omp_set_num_threads(n) - set the desired number of
threads
– omp_get_num_threads() - returns the current number
of threads
– omp_get_thread_num() - returns the id of this thread
– omp_in_parallel() – returns true (.true. in Fortran)
if called from inside a parallel region
C/C++: Add #include<omp.h>
Fortran: Add use omp_lib
Shell Variables
• Environment Variables: for controlling the
execution of parallel program at run-time.
– csh/tcsh: setenv OMP_NUM_THREADS n
– ksh/sh/bash: export OMP_NUM_THREADS=n
echo $OMP_NUM_THREADS
How many threads?
The number of threads in a parallel region is determined by:
▪ Setting of the OMP_NUM_THREADS environment
variable.
▪ Use of the omp_set_num_threads(n) library function.
▪ Use of num_threads(n) clause.
▪ The implementation default - usually the number of
CPUs/cores on a node
Threads are numbered from 0 (master thread) to n-1 where
n=the total number of threads.
Summary
• Parallel construct forks threads.
• There are several ways to determine the number of threads
per region.
• We can have dynamic and nested parallel regions.
• Variables must be defined as private or shared.
• One of the common problems is not declaring them properly.
This will lead to different results for different numbers of
threads.
• A program is said to be thread-safe if it produces the same
results for any number of threads.