An Introduction to Parallel Programming
Chapter 5
Why Parallel Computing?
Roadmap
◼ Why we need ever-increasing performance.
◼ Why we’re building parallel systems.
◼ Why we need to write parallel programs.
◼ How do we write parallel programs?
◼ What we’ll be doing.
◼ Concurrent, parallel, distributed!
Changing times
◼ From 1986 to 2002, microprocessors took off like a rocket, increasing in performance by an average of 50% per year.
◼ Since then, the increase has dropped to about 20% per year.
An intelligent solution
◼ Instead of designing and building faster microprocessors,
put multiple processors on a single integrated circuit.
Now it’s up to the programmers
◼ Adding more processors doesn’t help
much if programmers aren’t aware of
them…
◼ … or don’t know how to use them.
◼ Serial programs don’t benefit from this
approach (in most cases).
Why we need ever-increasing
performance
◼ Computational power is increasing, but so
are our computation problems and needs.
◼ Problems we never dreamed of have been
solved because of past increases, such as
decoding the human genome.
◼ More complex problems are still waiting to
be solved.
◼ Examples: climate modeling, protein folding, drug discovery, energy research, and data analysis.
Why we’re building parallel
systems
◼ Up to now, performance increases have
been attributable to increasing density of
transistors.
◼ But there are inherent problems.
A little physics lesson
◼ Smaller transistors = faster processors.
◼ Faster processors = increased power
consumption.
◼ Increased power consumption = increased
heat.
◼ Increased heat = unreliable processors.
Solution
◼ Move away from single-core systems to
multicore processors.
◼ “core” = central processing unit (CPU)
◼ Introducing parallelism!!!
Why we need to write parallel
programs
◼ Running multiple instances of a serial
program often isn’t very useful.
◼ Think of running multiple instances of your favorite
game.
◼ What you really want is for
it to run faster.
Approaches to the serial problem
◼ Rewrite serial programs so that they’re
parallel.
◼ Write translation programs that automatically
convert serial programs into parallel programs.
◼ This is very difficult to do.
◼ Success has been limited.
More problems
◼ Some coding constructs can be recognized by an automatic program generator and converted into a parallel construct.
◼ However, it’s likely that the result will be a
very inefficient program.
◼ Sometimes the best parallel solution is to
step back and devise an entirely new
algorithm.
Example
◼ Compute n values and add them together.
◼ Serial solution:
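The code itself appeared only as an image on the original slide. Below is a minimal sketch of the idea in C; Compute_next_value is a hypothetical stand-in that simply returns the 24 example values listed on a later slide.

    #include <stdio.h>

    /* Hypothetical stand-in for the slide's Compute_next_value(...):
       returns the 24 example values used in the slides. */
    int Compute_next_value(int i) {
        static const int vals[24] = {1,4,3, 9,2,8, 5,1,1, 6,2,7,
                                     2,5,0, 4,1,8, 6,5,1, 2,3,9};
        return vals[i % 24];
    }

    int main(void) {
        int n = 24;      /* number of values to compute and add */
        int sum = 0;

        /* The serial solution: one loop computes and adds all n values. */
        for (int i = 0; i < n; i++) {
            int x = Compute_next_value(i);
            sum += x;
        }
        printf("sum = %d\n", sum);   /* prints sum = 95 */
        return 0;
    }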
Example (cont.)
◼ We have p cores, p much smaller than n.
◼ Each core performs a partial sum of
approximately n/p values.
◼ Each core uses its own private variables and executes this block of code independently of the other cores.
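The block of code was likewise shown as an image. Below is a minimal sketch of what each core would execute, assuming each core knows its rank (my_rank) and the total number of cores p, that p divides n evenly, and the same hypothetical Compute_next_value as above; the main here merely simulates the p cores one after another.

    #include <stdio.h>

    /* Same hypothetical stand-in as in the serial sketch. */
    int Compute_next_value(int i) {
        static const int vals[24] = {1,4,3, 9,2,8, 5,1,1, 6,2,7,
                                     2,5,0, 4,1,8, 6,5,1, 2,3,9};
        return vals[i % 24];
    }

    /* The block each core executes independently, using only its own
       private variables (my_rank, my_first_i, my_sum, ...). */
    int Partial_sum(int my_rank, int p, int n) {
        int my_n       = n / p;              /* roughly n/p values per core */
        int my_first_i = my_rank * my_n;
        int my_last_i  = my_first_i + my_n;
        int my_sum     = 0;

        for (int my_i = my_first_i; my_i < my_last_i; my_i++)
            my_sum += Compute_next_value(my_i);
        return my_sum;
    }

    int main(void) {
        int p = 8, n = 24;
        /* In a real parallel program every core would run Partial_sum at the
           same time; this loop only simulates them one after another. */
        for (int my_rank = 0; my_rank < p; my_rank++)
            printf("core %d: my_sum = %d\n",
                   my_rank, Partial_sum(my_rank, p, n));
        return 0;   /* prints the per-core sums 8, 19, 7, 15, 7, 13, 12, 14 */
    }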
Example (cont.)
◼ After each core completes execution of the code, its private variable my_sum contains the sum of the values computed by its calls to Compute_next_value.
◼ E.g., with 8 cores and n = 24, the calls to Compute_next_value return:
1,4,3, 9,2,8, 5,1,1, 6,2,7, 2,5,0, 4,1,8, 6,5,1, 2,3,9
Example (cont.)
◼ Once all the cores are done computing their private my_sum, they form a global sum by sending their results to a designated “master” core, which adds them to produce the final result.
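MPI is introduced later in the course; purely as a preview, here is a minimal sketch of this "everyone sends to the master" scheme in MPI, assuming each process already holds its partial result in my_sum (the placeholder value below is just for illustration).

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char* argv[]) {
        int my_rank, p;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);
        MPI_Comm_size(MPI_COMM_WORLD, &p);

        int my_sum = my_rank + 1;   /* placeholder partial sum for illustration */

        if (my_rank == 0) {
            /* Master core: receive every other core's partial sum and add it in. */
            int sum = my_sum;
            for (int source = 1; source < p; source++) {
                int value;
                MPI_Recv(&value, 1, MPI_INT, source, 0, MPI_COMM_WORLD,
                         MPI_STATUS_IGNORE);
                sum += value;
            }
            printf("global sum = %d\n", sum);
        } else {
            /* All other cores: send the partial sum to the master (core 0). */
            MPI_Send(&my_sum, 1, MPI_INT, 0, 0, MPI_COMM_WORLD);
        }

        MPI_Finalize();
        return 0;
    }

In practice MPI provides MPI_Reduce, which performs this combination for you, typically using the tree-style scheme described on the next slides.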
Example (cont.)
Core      0    1    2    3    4    5    6    7
my_sum    8   19    7   15    7   13   12   14

Global sum: 8 + 19 + 7 + 15 + 7 + 13 + 12 + 14 = 95

Core      0    1    2    3    4    5    6    7
my_sum   95   19    7   15    7   13   12   14
But wait!
There’s a much better way
to compute the global sum.
Better parallel algorithm
◼ Don’t make the master core do all the
work.
◼ Share it among the other cores.
◼ Pair the cores so that core 0 adds its result
with core 1’s result.
◼ Core 2 adds its result with core 3’s result,
etc.
◼ Work with odd and even numbered pairs of
cores.
Better parallel algorithm (cont.)
◼ Repeat the process, now with only the evenly ranked cores.
◼ Core 0 adds the result from core 2.
◼ Core 4 adds the result from core 6, etc.
◼ Now cores divisible by 4 repeat the
process, and so forth, until core 0 has the
final result.
Multiple cores forming a global
sum
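The original slide illustrated this with a diagram. In its place, a minimal sketch of the pairing scheme in C, using the per-core partial sums from the earlier table and assuming the number of cores is a power of two; an ordinary array stands in for the cores' private my_sum variables.

    #include <stdio.h>

    int main(void) {
        /* Per-core partial sums from the earlier table (8 cores). */
        int my_sum[8] = {8, 19, 7, 15, 7, 13, 12, 14};
        int p = 8;   /* number of cores; assumed to be a power of two */

        /* Tree-structured global sum: in each round, every core whose rank is
           a multiple of 2*stride adds in the value held by the core that is
           stride positions away. */
        for (int stride = 1; stride < p; stride *= 2)
            for (int core = 0; core < p; core += 2 * stride)
                my_sum[core] += my_sum[core + stride];

        printf("global sum = %d\n", my_sum[0]);   /* prints 95 */
        return 0;
    }

With 8 cores the outer loop runs for 3 rounds, which matches the "3 receives and 3 additions" counted on the next slide.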
Analysis
◼ In the first example, the master core
performs 7 receives and 7 additions.
◼ In the second example, the master core
performs 3 receives and 3 additions.
◼ The improvement is more than a factor of 2!
Analysis (cont.)
◼ The difference is more dramatic with a
larger number of cores.
◼ If we have 1000 cores:
◼ The first example would require the master to
perform 999 receives and 999 additions.
◼ The second example would only require 10 receives and 10 additions (⌈log2 1000⌉ = 10), since each round halves the number of cores that still hold a partial sum.
◼ That’s an improvement of almost a factor
of 100!
How do we write parallel
programs?
◼ Task parallelism
◼ Partition the various tasks carried out in solving the problem among the cores.
◼ Data parallelism
◼ Partition the data used in solving the problem
among the cores.
◼ Each core carries out similar operations on its part of the data (see the sketch after this list).
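OpenMP also appears later in the course; purely as a preview, a minimal data-parallel sketch in which the iterations of the summing loop (and hence the data) are partitioned among the threads, reusing the hypothetical Compute_next_value from the earlier sketches (compile with -fopenmp or your compiler's equivalent).

    #include <stdio.h>

    /* Same hypothetical stand-in as in the earlier sketches. */
    int Compute_next_value(int i) {
        static const int vals[24] = {1,4,3, 9,2,8, 5,1,1, 6,2,7,
                                     2,5,0, 4,1,8, 6,5,1, 2,3,9};
        return vals[i % 24];
    }

    int main(void) {
        int n = 24;
        int sum = 0;

        /* Data parallelism: the iterations are divided among the threads, each
           thread sums its own share, and the reduction clause combines the
           per-thread partial sums at the end. */
        #pragma omp parallel for reduction(+:sum)
        for (int i = 0; i < n; i++)
            sum += Compute_next_value(i);

        printf("sum = %d\n", sum);   /* prints sum = 95 */
        return 0;
    }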
Professor P
15 questions
300 exams
Professor P’s grading assistants
TA#1, TA#2, and TA#3
Division of work – data parallelism
◼ TA#1: 100 exams
◼ TA#2: 100 exams
◼ TA#3: 100 exams
Division of work – task parallelism
◼ TA#1: questions 1 - 5
◼ TA#2: questions 6 - 10
◼ TA#3: questions 11 - 15
Division of work – data parallelism
◼ In the sum example, each core adds up its own share of the n values (as in the partial-sum sketch above).
Division of work – task parallelism
◼ Tasks in the sum example:
1) Receiving
2) Addition
Coordination
◼ Cores usually need to coordinate their work.
◼ Communication – one or more cores send
their current partial sums to another core.
◼ Load balancing – share the work evenly among the cores so that no core is much more heavily loaded than the others.
◼ Synchronization – because each core works
at its own pace, make sure cores do not get
too far ahead of the rest.
What we’ll be doing
◼ Learning to write programs that are
explicitly parallel.
◼ Using the C language.
◼ Using three different extensions to C.
◼ Message-Passing Interface (MPI)
◼ Posix Threads (Pthreads)
◼ OpenMP
Types of parallel systems
◼ Shared-memory
◼ The cores can share access to the computer’s
memory.
◼ Coordinate the cores by having them examine and update shared memory locations (see the sketch after this list).
◼ Distributed-memory
◼ Each core has its own, private memory.
◼ The cores must communicate explicitly by
sending messages across a network.
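Pthreads is likewise introduced later; as a preview of the shared-memory style, a minimal sketch in which threads coordinate by updating a shared sum protected by a mutex (the thread count and values here are made up for illustration).

    #include <pthread.h>
    #include <stdio.h>

    #define NUM_THREADS 4

    int shared_sum = 0;                                     /* shared memory location */
    pthread_mutex_t sum_mutex = PTHREAD_MUTEX_INITIALIZER;  /* protects shared_sum */

    /* Each thread adds its own contribution to the shared sum; the mutex
       ensures the updates do not interfere with one another. */
    void* Add_contribution(void* arg) {
        int my_value = *(int*)arg;
        pthread_mutex_lock(&sum_mutex);
        shared_sum += my_value;
        pthread_mutex_unlock(&sum_mutex);
        return NULL;
    }

    int main(void) {
        pthread_t threads[NUM_THREADS];
        int values[NUM_THREADS] = {1, 2, 3, 4};

        for (int t = 0; t < NUM_THREADS; t++)
            pthread_create(&threads[t], NULL, Add_contribution, &values[t]);
        for (int t = 0; t < NUM_THREADS; t++)
            pthread_join(threads[t], NULL);

        printf("shared_sum = %d\n", shared_sum);   /* prints shared_sum = 10 */
        return 0;
    }

A distributed-memory version of the same idea would instead send the contributions as messages, as in the MPI sketch earlier.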
Types of parallel systems
◼ Shared-memory vs. distributed-memory
Terminology
◼ Concurrent computing – a program is one
in which multiple tasks can be in progress
at any instant.
◼ Parallel computing – a program is one in which multiple tasks cooperate closely to solve a problem.
◼ Distributed computing – a program is one that may need to cooperate with other programs to solve a problem.