Quick Sort

Quick sort using Windows threads

unsigned __stdcall QuickSort(void *arg)
{
    qSortIndex *m = (qSortIndex *)arg;  // _beginthreadex passes the range as void*
    int p = m->lo;
    int r = m->hi;
    if (p < r)
    {
        qSortIndex s, t;
        HANDLE tH[2];
        int q = Partition(p, r);
        // Sort the left partition on a new thread
        s.lo = p; s.hi = q - 1;
        tH[0] = (HANDLE)_beginthreadex(NULL, 0, QuickSort, &s, 0, NULL);
        // Sort the right partition on a new thread
        t.lo = q + 1; t.hi = r;
        tH[1] = (HANDLE)_beginthreadex(NULL, 0, QuickSort, &t, 0, NULL);
        // Wait for both child threads, then release their handles
        WaitForMultipleObjects(2, tH, TRUE, INFINITE);
        CloseHandle(tH[0]);
        CloseHandle(tH[1]);
    }
    return 0;
}
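
For completeness, a hypothetical driver for this version might look like the sketch below. It is not part of the original listing: the global array A, its length N, the qSortIndex struct, and Partition are assumed to be defined as in the OpenMP program later in this document.

#include <windows.h>
#include <process.h>
#include <stdio.h>

// Hypothetical driver (a sketch): start the root QuickSort on its own
// thread and wait for the entire, recursively threaded, sort to finish.
int main(void)
{
    qSortIndex root = {0, N - 1};  // whole array A[0..N-1], assumed global
    HANDLE h = (HANDLE)_beginthreadex(NULL, 0, QuickSort, &root, 0, NULL);
    WaitForSingleObject(h, INFINITE);
    CloseHandle(h);

    for (int i = 0; i < N; i++)
        printf("%d ", A[i]);
    printf("\n");
    return 0;
}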

To convert the provided QuickSort function (which uses Windows threading via
_beginthreadex) to OpenMP, we need to focus on parallelizing the recursive calls to
QuickSort while respecting the structure of the algorithm.

Here's a step-by-step explanation of how to convert the code:

Key Changes:

1. Thread Management:
   o The original program uses _beginthreadex and WaitForMultipleObjects to create and synchronize threads. OpenMP abstracts away much of this complexity with #pragma omp parallel and #pragma omp task.
2. Recursive Parallelism:
   o In OpenMP, recursive functions are parallelized with tasks: the #pragma omp task directive creates a task (which may execute in parallel) for each recursive call.
3. Synchronization:
   o OpenMP's tasking model replaces explicit thread synchronization such as WaitForMultipleObjects with a single #pragma omp taskwait, which blocks the current task until its child tasks have finished.
OpenMP program
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

typedef struct {
int lo, hi;
} qSortIndex;

// Global array to be sorted, shared by Partition, QuickSort, and main
int A[] = {9, 7, 5, 11, 12, 2, 14, 3, 10, 6};
int N = sizeof(A) / sizeof(A[0]);

// Partition function (same as in the original code)
int Partition(int lo, int hi) {
    int pivot = A[hi]; // the last element of the range is the pivot
    int i = lo - 1;
    for (int j = lo; j < hi; j++) {
        if (A[j] < pivot) {
            i++;
            // Swap A[i] and A[j]
            int temp = A[i];
            A[i] = A[j];
            A[j] = temp;
        }
    }
    // Swap A[i+1] and A[hi] to move the pivot into place
    int temp = A[i + 1];
    A[i + 1] = A[hi];
    A[hi] = temp;
    return i + 1;
}

// OpenMP QuickSort function
void QuickSort(qSortIndex *m) {
    int p = m->lo;
    int r = m->hi;

    if (p < r) {
        int q = Partition(p, r);

        qSortIndex s1 = {p, q - 1};
        qSortIndex s2 = {q + 1, r};

        // Create tasks for the two recursive calls; only spawn a task
        // when the range is large enough to be worth the overhead
        #pragma omp task shared(s1) if (r - p > 100)
        QuickSort(&s1);

        #pragma omp task shared(s2) if (r - p > 100)
        QuickSort(&s2);

        // Wait for both tasks to finish (synchronization)
        #pragma omp taskwait
    }
}

int main() {
    // The global array A (N elements) is sorted in place

    // Set the OpenMP thread count (optional; the runtime picks a default)
    omp_set_num_threads(4);

    // Sort the whole array
    qSortIndex index = {0, N - 1};

    // Open a parallel region; a single thread starts the recursion and
    // the rest of the team executes the tasks it spawns
    #pragma omp parallel
    {
        #pragma omp single
        QuickSort(&index);
    }

    // Print the sorted array
    for (int i = 0; i < N; i++) {
        printf("%d ", A[i]);
    }
    printf("\n");
    return 0;
}

Explanation of Key Changes:

1. #pragma omp parallel:
   o The #pragma omp parallel directive opens a parallel region. QuickSort is called within that region, and only a single thread begins the recursion while the other threads in the team execute the tasks it creates.
2. Task Parallelism with #pragma omp task:
   o The recursive calls to QuickSort are wrapped in #pragma omp task directives, which allow the two sub-problems (left and right partitions) to execute in parallel. The clause if (r - p > 100) avoids creating tasks for small sub-arrays, where the overhead of managing parallel tasks would outweigh the benefit.
3. #pragma omp taskwait:
   o After creating the recursive tasks, #pragma omp taskwait makes the current task wait for the completion of both children (left and right sub-problems) before returning. This synchronizes the parallel tasks and ensures the sorting completes correctly.
4. Shared vs. Private Variables:
   o The qSortIndex structures s1 and s2 are listed in shared(s1) and shared(s2) so that each task reads the parent's copy directly rather than receiving a private snapshot; the taskwait above guarantees the parent's stack frame outlives the tasks that reference it.
5. Setting the Number of Threads:
   o The number of threads used by OpenMP can be set with omp_set_num_threads(), but this is optional; the runtime otherwise chooses a default and manages thread pool creation and task scheduling automatically (see the sketch after this list).
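
For instance, the team size the runtime actually granted can be checked from inside the parallel region. The sketch below assumes the QuickSort and qSortIndex definitions above; SortWithReport is an illustrative helper name. The standard OMP_NUM_THREADS environment variable sets the default, and omp_set_num_threads() in code overrides it.

#include <omp.h>
#include <stdio.h>

// A minimal sketch: report the team size before starting the sort.
// Replaces the parallel region in main above.
void SortWithReport(qSortIndex *index) {
    #pragma omp parallel
    {
        #pragma omp single
        {
            printf("sorting with %d threads\n", omp_get_num_threads());
            QuickSort(index);
        }
    }
}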

Partition Function:

The Partition function in the OpenMP version is the same as in the original code: it partitions the array around a pivot element (the last element of the range) and returns the pivot's final index. A worked trace is shown below.
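
To make the pivot placement concrete, here is a hand-checked trace of one call on the sample data, as a small demo main (a sketch assuming the global A, N, and Partition defined above):

#include <stdio.h>

// One Partition call on A = {9, 7, 5, 11, 12, 2, 14, 3, 10, 6}:
// pivot = A[9] = 6; the elements smaller than 6 (5, 2, 3) are swapped
// to the front as j scans lo..hi-1, then the pivot lands at index 3.
int main(void) {
    int q = Partition(0, N - 1);
    printf("pivot index: %d\n", q);    // prints: pivot index: 3
    for (int i = 0; i < N; i++)
        printf("%d ", A[i]);           // prints: 5 2 3 6 12 7 14 9 10 11
    printf("\n");
    return 0;
}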

Performance Considerations:

- The condition if (r - p > 100) prevents excessive task-creation overhead for small sub-arrays. QuickSort benefits most from parallelism when the sub-arrays are sufficiently large; for very small ranges, task overhead can hurt performance.
- OpenMP works best with larger datasets. For very small arrays, the overhead of managing threads may make the parallel version slower than a sequential QuickSort. (Note that the 10-element demo array never exceeds the threshold, so it is effectively sorted serially.)
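
A common refinement, shown as a sketch below rather than as part of the original code, is to fall back to a simple serial sort below the cutoff instead of merely suppressing task creation. InsertionSort and the CUTOFF constant are illustrative names:

#define CUTOFF 100

// Serial fallback for small ranges: insertion sort on A[lo..hi]
static void InsertionSort(int lo, int hi) {
    for (int j = lo + 1; j <= hi; j++) {
        int key = A[j];
        int i = j - 1;
        while (i >= lo && A[i] > key) {
            A[i + 1] = A[i];
            i--;
        }
        A[i + 1] = key;
    }
}

// Variant of the task-based QuickSort with an explicit serial cutoff.
// p, q, r are firstprivate in the tasks by default, so each task
// captures its own copy of the range bounds.
void QuickSortCutoff(int p, int r) {
    if (r - p <= CUTOFF) {
        InsertionSort(p, r);   // no pivoting or tasking below the cutoff
        return;
    }
    int q = Partition(p, r);
    #pragma omp task
    QuickSortCutoff(p, q - 1);
    #pragma omp task
    QuickSortCutoff(q + 1, r);
    #pragma omp taskwait
}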

Concurrent iterative version

To convert the given Windows-threaded QuickSort program to OpenMP, several aspects must be handled differently, because OpenMP abstracts thread management, synchronization, and communication between threads in a way that differs from Windows threading. Specifically, OpenMP provides constructs for parallel loops (#pragma omp parallel for) and task parallelism (#pragma omp task); we use the tasking constructs to handle the recursive calls and their synchronization.

Key Points:

1. Thread Pooling and Queue Management: In the original code, a thread pool is managed with a semaphore and a queue of pending recursive calls. In OpenMP there is no need to manage a thread pool or signal threads explicitly; the runtime handles thread management and synchronization.
2. Recursive Parallelism: The recursive calls of QuickSort are parallelized using OpenMP's tasking model (#pragma omp task).
3. Shared Variables: OpenMP handles data sharing between tasks, so we only need to choose the proper sharing or privatization for each variable.
4. Termination Conditions: The semaphore and event signaling are replaced by the natural base case of the recursion, which halts when sorting is done.

OpenMP program

#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

typedef struct {
int lo, hi;
} qSortIndex;

int A[] = {9, 7, 5, 11, 12, 2, 14, 3, 10, 6}; // Example array to sort
int N = sizeof(A) / sizeof(A[0]);

// Partition function for QuickSort (same as the original version)
int Partition(int lo, int hi) {
    int pivot = A[hi];
    int i = lo - 1;
    for (int j = lo; j < hi; j++) {
        if (A[j] < pivot) {
            i++;
            int temp = A[i];
            A[i] = A[j];
            A[j] = temp;
        }
    }
    int temp = A[i + 1];
    A[i + 1] = A[hi];
    A[hi] = temp;
    return i + 1;
}

// OpenMP QuickSort function
void QuickSort(qSortIndex *m) {
    int p = m->lo;
    int r = m->hi;

    if (p < r) {
        int q = Partition(p, r);

        qSortIndex s1 = {p, q - 1};
        qSortIndex s2 = {q + 1, r};

        // Create parallel tasks for the two recursive calls;
        // only spawn tasks for ranges large enough to be worthwhile
        #pragma omp task shared(s1) if (r - p > 100)
        QuickSort(&s1);

        #pragma omp task shared(s2) if (r - p > 100)
        QuickSort(&s2);

        // Ensure that both tasks are finished before continuing
        #pragma omp taskwait
    }
}

int main() {
    // Sort the whole global array in place
    qSortIndex index = {0, N - 1};

    // Set the OpenMP thread count (optional; the runtime picks a default)
    omp_set_num_threads(4);

    // Start QuickSort in parallel: one thread begins the recursion,
    // the rest of the team executes the tasks it spawns
    #pragma omp parallel
    {
        #pragma omp single
        QuickSort(&index);
    }

    // Print the sorted array
    for (int i = 0; i < N; i++) {
        printf("%d ", A[i]);
    }
    printf("\n");

    return 0;
}

Explanation of Key Changes:

1. OpenMP Parallel Region:
   o #pragma omp parallel opens a parallel region so that multiple threads are available to execute the recursive work.
   o #pragma omp single ensures that only one thread makes the first call to QuickSort; the rest of the team picks up the tasks it creates.
2. Task Parallelism:
   o The recursive calls to QuickSort are parallelized using #pragma omp task, which creates two tasks for the left and right sub-arrays. Each task may run in parallel when the range is sufficiently large (r - p > 100); for small ranges, task creation is suppressed to reduce overhead.
3. #pragma omp taskwait:
   o #pragma omp taskwait makes the parent task (the current instance of QuickSort) wait for the completion of both recursive tasks before continuing, so each partition is completely sorted before the function returns.
4. Termination Condition:
   o In the original Windows code, termination is signaled with a semaphore and an event. In OpenMP, the recursion terminates naturally when the base case is reached (p >= r), so no explicit synchronization or signaling is necessary.
5. Shared and Private Variables:
   o The qSortIndex structures are passed to the tasks with shared(s1) and shared(s2) so that each task accesses the parent's copy of the range to be sorted; the taskwait keeps those stack variables alive until the tasks finish.
6. Control of Thread Count:
   o The number of threads can be set with omp_set_num_threads(), but this is optional: the runtime otherwise chooses a default based on the available hardware resources.

Performance Considerations:

- Recursive Task Overhead: Creating OpenMP tasks for recursion has a cost, especially for small sub-arrays. The clause if (r - p > 100) mitigates this by preventing parallelization of very small ranges.
- Task Granularity: The threshold in if (r - p > 100) is a tuning knob. In practice, choose a value that balances task-creation overhead against available parallelism; a sketch of an alternative cutoff mechanism follows.
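
One alternative worth knowing is the final clause (a sketch assuming the same qSortIndex and Partition as above; CUTOFF is an illustrative constant). Unlike if, which only makes the single task undeferred, final marks the task so that it and all of its descendant tasks execute serially, pruning task bookkeeping for the whole subtree below the cutoff:

#define CUTOFF 100

// Variant using final(): once a sub-range is small, this task and every
// task created inside it become "included" tasks that run immediately
// and serially, so no further task-queue traffic occurs below the cutoff.
void QuickSortFinal(qSortIndex *m) {
    int p = m->lo;
    int r = m->hi;
    if (p < r) {
        int q = Partition(p, r);
        qSortIndex s1 = {p, q - 1};
        qSortIndex s2 = {q + 1, r};

        #pragma omp task shared(s1) final(r - p <= CUTOFF) mergeable
        QuickSortFinal(&s1);

        #pragma omp task shared(s2) final(r - p <= CUTOFF) mergeable
        QuickSortFinal(&s2);

        #pragma omp taskwait
    }
}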
