PAR Final Lab Sol 2023 24Q1

Uploaded by

romeuesteve

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views3 pages

PAR Final Lab Sol 2023 24Q1

Uploaded by

romeuesteve

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

PAR Final Exam Laboratory Course 2023/24-Q1

January 17th , 2023

Problem 1: Lab 1 (2.5 points)

Let's assume the pi_omp code in Lab 1. Below we include the execution time and speedup scalability
plots obtained with the submit-strong-omp.sh script when setting np_MAX=40. Recall that this script
executes the parallel code using from 1 to np_MAX threads. We ask you to: Briey explain the reason why
the speed-up goes down abruptly when going from 20 to 21 OpenMP threads, and why the performance for
20 and 40 OpenMP threads is very similar.

Solution: No solution provided. Look at the Atenea survey of Questions laboratory 1 - Session 1 - third
question.
Problem 2: Lab 3 (2.5 points)

1. Assume the following task decomposition strategies for the dot_product code:
// Strategy 1 // Strategy 2
float result = 0.0; float result = 0.0;
#pragma omp parallel #pragma omp parallel
#pragma omp single #pragma omp single
#pragma omp taskloop reduction(+:result) #pragma omp taskgroup task_reduction(+: result)
for (int i = 0; i < n; ++i) { for (int i = 0; i < n; ++i) {
result += x[i] + y[i]; #pragma omp task in_reduction (+:result)
} result += x[i] + y[i];
}

We ask you to: Indicate which of the two strategies would obtain better scalability. Justify briey
your answer.
Solution:

Strategy 2 applies a task decomposition with task granularity much ner than Strategy 1. Observe
that in Strategy 2, tasks are created for every loop iteration, while in Strategy 1, tasks are created
with a bunch of consecutive iterations (taskloop behaviour). The work performed at each loop iteration
(accumulation on a variable with a sum up) do not justify the task creation at this level. We can arm
because in Laboratory 3, where for Mandelbrot set calculation (which involved a greater number of
operations), the Point strategy suered from a great task creation overhead, and Row strategy showed
better performance.
2. Take a look at the two versions of Modelfactors tables generated after executing the Mandelbrot appli-
cation parallelized using the Point strategy and taskloop pragma to create the explicit tasks.
We ask you to: Analyze the main dierences between the two executions and indicate which is the
optimization applied to Parallelization 2. Reason briey your answer.
Solution: No solution provided. Look at your laboratory deliverable and feedback
Figure 1: Modelfactors table for Parallelization 1

Figure 2: Modelfactors table for Parallelization 2

Problem 3: Lab 4 (2.5 points)

Consider the following sequential recursive version for the dot_product problem presented in the assign-
ment for Lab 4 :
#define N 512
#define MIN_SIZE 64

int iter_dot_product(int A, int B, int n);

int rec_dot_product(int A, int B, int n) {

int res1, res2=0;
if (n>MIN_SIZE) {
int n2 = n / 2;
res1 = rec_dot_product(A, B, n2);
res2 = rec_dot_product(A+n2, B+n2, n-n2);
}
else res1 = iter_dot_product(A, B, n);
return res1+res2;
}

void main() {
int result;
result = rec_dot_product(a, b, N);
}

We ask you to:

1. Assuming we only add the necessary OpenMP directives to parallelize the previous code following a
Recursive Task Decomposition using a Leaf strategy, indicate which would be the Maximum Instan-
taneous Parallelism we can achieve. Reason your answer. Note: Including the parallel code in your
answer is not mandatory.
Solution: The maximum instantaneous parallelism that we can achieve is 1. Observe that the
rec_dot_product function cannot return until res1 and res2 are calculated. So in the case
of a leaf strategy, as tasks are created sequentially, and have to nish execution before returning the
result, there won't be more than one task executing at any given moment.
2. Assuming we only add the necessary OpenMP directives to parallelize the previous code following a
Recursive Task Decomposition using a Tree strategy, indicate which would be the Maximum Instan-
taneous Parallelism we can achieve. Reason your answer. Note: Including the parallel code in your
answer is not mandatory.
Solution: The maximum instantaneous parallelism that we can achieve is 8, after three levels of
recursive calls and one task created per recursive call (two tasks per level). Note that after three
recursive levels the base case of the recursivity is achieved, when 8 tasks can be potentially executed
in parallel.
Problem 4: Lab 5 (2.5 points)
Assume a parallelization strategy for the Gauss-Seidel solver in Lab5 that uses a geometric block by rows data
decomposition. Remember that the code in the laboratory assignment included an argument, userparam, to
be used to determine the number of blocks each thread has to process.
1. Indicate why userparam impacts directly on the amount of parallelism obtained. Reason your answer.
Solution: No solution provided. Look at your laboratory deliverable and feedback.

2. Explain the reason why in Lab5 assignment you needed to implement your own synchronization between
threads.
Solution:

When working with implicit tasks, there is no way to specify dependencies using OpenMP clauses.
Consequently, all the implicit tasks will be executed simultaneously without taking care of dependencies
between blocks.

Par - 1 In-Term Exam - Course 2018/19-Q2
No ratings yet
Par - 1 In-Term Exam - Course 2018/19-Q2
9 pages
Par - 1 In-Term Exam - Course 2017/18-Q2
No ratings yet
Par - 1 In-Term Exam - Course 2017/18-Q2
7 pages
t2 2017 Key
No ratings yet
t2 2017 Key
7 pages
National University of Computer and Emerging Sciences, Lahore Campus
No ratings yet
National University of Computer and Emerging Sciences, Lahore Campus
9 pages
Advanced OpenMP Pitfalls & Solutions
No ratings yet
Advanced OpenMP Pitfalls & Solutions
52 pages
Par - 1 In-Term Exam - Course 2017/18-Q1: Matrix U J
No ratings yet
Par - 1 In-Term Exam - Course 2017/18-Q1: Matrix U J
11 pages
Mid Sem QP&Solution
No ratings yet
Mid Sem QP&Solution
7 pages
OpenMP Tasking for Developers
No ratings yet
OpenMP Tasking for Developers
21 pages
Sheet 5: (A) False Sharing in False Sharing Add
No ratings yet
Sheet 5: (A) False Sharing in False Sharing Add
2 pages
Viva Questions
No ratings yet
Viva Questions
15 pages
2022 Mid 1
No ratings yet
2022 Mid 1
4 pages
Problem Statement
No ratings yet
Problem Statement
2 pages
Assignment 4
No ratings yet
Assignment 4
5 pages
Parallel and Distributed Computing Lab Digital Assignment - 3
No ratings yet
Parallel and Distributed Computing Lab Digital Assignment - 3
10 pages
Excelente
No ratings yet
Excelente
64 pages
OpenMP Workshop Day 2
No ratings yet
OpenMP Workshop Day 2
155 pages
HPC Int I Retest Answer Key
No ratings yet
HPC Int I Retest Answer Key
10 pages
PC - Lab Manuall
No ratings yet
PC - Lab Manuall
15 pages
E 3 (Openmp - Iii) : Matrix Multiplication
No ratings yet
E 3 (Openmp - Iii) : Matrix Multiplication
10 pages
Parallel Assignment 3
No ratings yet
Parallel Assignment 3
9 pages
Name: Harshvardhan Singh Gahlaut Reg. No.: 19BCE2372 Slot: L41+L42
No ratings yet
Name: Harshvardhan Singh Gahlaut Reg. No.: 19BCE2372 Slot: L41+L42
3 pages
OpenMP Shared
No ratings yet
OpenMP Shared
28 pages
18-Assignment 1 - Solution
No ratings yet
18-Assignment 1 - Solution
12 pages
Sample - Code - Parallel - Cse6230 Fa14 04 Omp
No ratings yet
Sample - Code - Parallel - Cse6230 Fa14 04 Omp
51 pages
OpenMP Binary Tree Traversal Lab
No ratings yet
OpenMP Binary Tree Traversal Lab
10 pages
Lab 3
No ratings yet
Lab 3
23 pages
4 Performance.4x
No ratings yet
4 Performance.4x
14 pages
SWE2017 - Lab Assignment 1pages-7
No ratings yet
SWE2017 - Lab Assignment 1pages-7
5 pages
Open MP
No ratings yet
Open MP
59 pages
Worksharing and Parallel Loops
No ratings yet
Worksharing and Parallel Loops
23 pages
Module 4 - 4.6 - Understanding Shared Variables and Their Protection Mechanisms in OpenMP
No ratings yet
Module 4 - 4.6 - Understanding Shared Variables and Their Protection Mechanisms in OpenMP
5 pages
Lec7 - TLP Shared Memory and OpenMP
No ratings yet
Lec7 - TLP Shared Memory and OpenMP
45 pages
OpenMP and MPI Multiple Choice Questions (MCQS) For Exam Preparation
No ratings yet
OpenMP and MPI Multiple Choice Questions (MCQS) For Exam Preparation
13 pages
Solutions To Exercises On Parallelism and Concurrency
No ratings yet
Solutions To Exercises On Parallelism and Concurrency
5 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Parallel Algorithm for Pairwise Computation
No ratings yet
Parallel Algorithm for Pairwise Computation
2 pages
OpenMP Programming Exercises in C
100% (1)
OpenMP Programming Exercises in C
15 pages
Multithreading Seminar 4 Activities
No ratings yet
Multithreading Seminar 4 Activities
1 page
Task PDF
No ratings yet
Task PDF
1 page
Parallel Answers
No ratings yet
Parallel Answers
6 pages
(Serial)
No ratings yet
(Serial)
8 pages
OpenMP Tutorial
No ratings yet
OpenMP Tutorial
3 pages
Lab 2
No ratings yet
Lab 2
2 pages
HPC Programs
No ratings yet
HPC Programs
19 pages
Lecture 9-OpenMP Coclusion
No ratings yet
Lecture 9-OpenMP Coclusion
39 pages
Chapter 5
No ratings yet
Chapter 5
92 pages
OpenMP Guide for Parallel Computing
No ratings yet
OpenMP Guide for Parallel Computing
32 pages
OpenMP Basics for Multithreading
No ratings yet
OpenMP Basics for Multithreading
14 pages
Gauravkumar 221it027@it301 Lab2
No ratings yet
Gauravkumar 221it027@it301 Lab2
28 pages
OpenMP Performance Analysis
No ratings yet
OpenMP Performance Analysis
8 pages
Composable Multi-Threading For Python Libraries: Hsutter Wtichy
No ratings yet
Composable Multi-Threading For Python Libraries: Hsutter Wtichy
5 pages
Open MPLecture
No ratings yet
Open MPLecture
54 pages
Labquiz 3
No ratings yet
Labquiz 3
8 pages
Assignment No. 2 PDC 21L-1786
No ratings yet
Assignment No. 2 PDC 21L-1786
6 pages
Bert 2a Parallel Algorithms Parfor Quicksort Reduction Listranking Rootfinding Postordernumbering
No ratings yet
Bert 2a Parallel Algorithms Parfor Quicksort Reduction Listranking Rootfinding Postordernumbering
73 pages
Practice OpenMP
No ratings yet
Practice OpenMP
2 pages
Name: Castulo JR., Edwin B. Bse-Tle 3
No ratings yet
Name: Castulo JR., Edwin B. Bse-Tle 3
2 pages
Bainite Transformation in Medium Carbon Steel
No ratings yet
Bainite Transformation in Medium Carbon Steel
11 pages
For Anne Gregory by WB Yeats
No ratings yet
For Anne Gregory by WB Yeats
8 pages
How To Handle Neurotypicals Abel Abelson 2020 Independently Published 978
No ratings yet
How To Handle Neurotypicals Abel Abelson 2020 Independently Published 978
152 pages
F1 Housekeeping Schedule
No ratings yet
F1 Housekeeping Schedule
2 pages
SMS Life Sciences India Limited Financial Report
No ratings yet
SMS Life Sciences India Limited Financial Report
7 pages
The Encyclical Letter of St. Mark of Ephesus
100% (2)
The Encyclical Letter of St. Mark of Ephesus
11 pages
Thesis
No ratings yet
Thesis
15 pages
Lect 4
No ratings yet
Lect 4
14 pages
Science - October 2020 - Question Paper 1 Ans MS by Omotosho Tobiloba
100% (1)
Science - October 2020 - Question Paper 1 Ans MS by Omotosho Tobiloba
20 pages
Framework of Accounting
No ratings yet
Framework of Accounting
11 pages
Meg 5 em 2024 25
No ratings yet
Meg 5 em 2024 25
12 pages
HSK 4 Nouns 91 120
No ratings yet
HSK 4 Nouns 91 120
11 pages
Monday Tuesday Wednesday Thursday Friday: GRADES 1 To 12 Daily Lesson Log
No ratings yet
Monday Tuesday Wednesday Thursday Friday: GRADES 1 To 12 Daily Lesson Log
4 pages
Fetal Development Stages Explained
No ratings yet
Fetal Development Stages Explained
12 pages
SparesBulletins 2020 12-02-12!00!33 639 H1 MKM BSIIIA Parts Catalogue-1
100% (1)
SparesBulletins 2020 12-02-12!00!33 639 H1 MKM BSIIIA Parts Catalogue-1
372 pages
New Sama Racking & Shelving IN UAE
No ratings yet
New Sama Racking & Shelving IN UAE
8 pages
Candidate Guide - Imdaad
No ratings yet
Candidate Guide - Imdaad
3 pages
08.4 Expedition To The Haunted Vale - Master NPC and Military Forces Tables by Phillip Gladney (October, 2000)
No ratings yet
08.4 Expedition To The Haunted Vale - Master NPC and Military Forces Tables by Phillip Gladney (October, 2000)
2 pages
Ajoy Kumar Ghose v. State of Jharkhand and Another: Procedure Where Accused Is Not Discharged
No ratings yet
Ajoy Kumar Ghose v. State of Jharkhand and Another: Procedure Where Accused Is Not Discharged
1 page
A Journey Through The Cold War A Memoir of Containment and Coexistence Raymond L Garthoff PDF Download
100% (7)
A Journey Through The Cold War A Memoir of Containment and Coexistence Raymond L Garthoff PDF Download
87 pages
Revision Notes Social Science Class 6 Chapter 1 - Locating Places On Earth
No ratings yet
Revision Notes Social Science Class 6 Chapter 1 - Locating Places On Earth
2 pages
User Behavior Analytics Whitepaper
No ratings yet
User Behavior Analytics Whitepaper
7 pages
10 Best Scientific Calculator Find The Best Choice
No ratings yet
10 Best Scientific Calculator Find The Best Choice
1 page
FC Project
100% (1)
FC Project
27 pages
Palletiser Design Guide
100% (1)
Palletiser Design Guide
5 pages
Low FODMAP Diet Guide
No ratings yet
Low FODMAP Diet Guide
3 pages
Fake News Leaflet
No ratings yet
Fake News Leaflet
1 page
Starbucks CEO's Ethical Leadership
No ratings yet
Starbucks CEO's Ethical Leadership
14 pages
Part 5 Soil Report A
No ratings yet
Part 5 Soil Report A
15 pages

PAR Final Lab Sol 2023 24Q1

Uploaded by

PAR Final Lab Sol 2023 24Q1

Uploaded by

PAR  Final Exam Laboratory Course 2023/24-Q1

January 17th , 2023

Problem 1: Lab 1 (2.5 points)

Figure 2: Modelfactors table for Parallelization 2

Problem 3: Lab 4 (2.5 points)

int iter_dot_product(int *A, int *B, int n);

int rec_dot_product(int *A, int *B, int n) {

We ask you to:

You might also like

PAR Final Exam Laboratory Course 2023/24-Q1

int iter_dot_product(int A, int B, int n);

int rec_dot_product(int A, int B, int n) {