High Performance Computing
LECTURE #3
Agenda
O Introduction
O Parallel Computing Platforms: logical & physical
O Logical Organization
  1- Control
  2- Communication
O Flynn Taxonomy
O 1- Control
1- Introduction
❖A computing platform includes a hardware architecture and a software
framework (including application frameworks), where the combination allows
software to run.
❖ Typical platforms include a
- computer architecture,
- operating system,
- programming languages and
- related user interface (run-time system libraries or graphical user interface).
2- Parallel Computing Platform
Parallel Computing Platform
Logical Organization
❖ Parallelism can be expressed at various levels of granularity.
❖ Computation / Communication Ratio:
◦ In parallel computing, granularity is a qualitative measure of the ratio of
computation to communication.
◦ Periods of computation are typically separated from periods of communication by
synchronization events.
Parallel Computing Platform
Logical Organization
Fine-grain Parallelism:
❖ Relatively small amounts of computational work are done between communication events.
❖ Low computation to communication ratio.
❖ Facilitates load balancing.
❖ Implies high communication overhead and less opportunity for performance enhancement.
❖ If granularity is too fine it is possible that the overhead required for communications and synchronization between tasks takes longer than the computation.
Coarse-grain Parallelism:
❖ Relatively large amounts of computational work are done between communication/synchronization events.
❖ High computation to communication ratio.
❖ Implies more opportunity for performance increase.
❖ Harder to load balance efficiently.
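As a concrete illustration, here is a minimal MPI sketch (an assumed example, not part of the slides): the coarse-grain version sums a large local block and communicates only once, while reducing after every element instead would make the job fine-grain, with one communication per addition.

/* Assumed example: coarse-grain vs. fine-grain decomposition of a vector
 * sum using MPI. Compile with an MPI compiler wrapper such as mpicc. */
#include <mpi.h>
#include <stdio.h>

#define N 1000000   /* assume N is divisible by the number of processes */

int main(int argc, char **argv) {
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Coarse grain: each process does a large amount of local computation
     * (summing its whole block) between communication events. */
    long chunk = N / size;
    double local = 0.0;
    for (long i = (long)rank * chunk; i < (long)(rank + 1) * chunk; i++)
        local += (double)i;

    /* A single collective communication / synchronization event. */
    double total = 0.0;
    MPI_Reduce(&local, &total, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);

    /* Fine grain (do NOT do this): calling MPI_Reduce once per element
     * would mean one communication per addition, i.e. a very low
     * computation-to-communication ratio and huge overhead. */

    if (rank == 0)
        printf("sum = %f\n", total);
    MPI_Finalize();
    return 0;
}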
Parallel Computing Platform
Logical Organization
Which is Best?
❖ The most efficient granularity is dependent on the algorithm and the
hardware environment in which it runs.
❖In most cases the overhead associated with communications and
synchronization is high relative to execution speed so it is advantageous to
have coarse granularity.
❖ Fine-grain parallelism can help reduce overheads due to load imbalance.
Parallel Computing Platform
Logical Organization
Task A logically discrete section of computational work. A task is typically a program
or program-like set of instructions that is executed by a processor.
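For illustration only, a minimal sketch (an assumed example, not from the slides) of a task expressed as a C function that a POSIX thread executes:

/* Assumed example: a "task" as a discrete unit of work handed to a thread. */
#include <pthread.h>
#include <stdio.h>

/* The task: a logically discrete section of computational work. */
static void *sum_task(void *arg) {
    long n = *(long *)arg;
    long s = 0;
    for (long i = 0; i < n; i++)
        s += i;
    printf("task done, sum = %ld\n", s);
    return NULL;
}

int main(void) {
    pthread_t worker;
    long n = 1000;
    pthread_create(&worker, NULL, sum_task, &n);  /* dispatch the task to a processor */
    pthread_join(worker, NULL);                   /* wait for it to finish */
    return 0;
}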
Models:
Flynn's Classical Taxonomy
❖Flynn’s classification scheme is based on the notion of a stream of information.
❖Two types of information flow into a processor: instructions and data
❖Each of these dimensions can have only one of two possible states: Single or
Multiple.
❖ One of the more widely used classifications of parallel computers, in use since 1966.
Flynn's Classical Taxonomy
The matrix below defines the 4 possible classifications according to Flynn:

                              Single Data (SD)    Multiple Data (MD)
  Single Instruction (SI)     SISD                SIMD
  Multiple Instruction (MI)   MISD                MIMD
Flynn's Taxonomy
Single Instruction, Single Data (SISD):
❖ A serial (non-parallel) computer
❖Single instruction: only one instruction stream is being acted on by the
CPU during any one clock cycle
❖Single data: only one data stream is being used as input during any one
clock cycle
❖ This is the oldest and, even today, the most common type of computer
❖ Examples: older generation mainframes, minicomputers and
workstations; most modern day PCs.
Flynn's Taxonomy
Single Instruction, Multiple Data (SIMD):
❖ Single instruction: All processing units execute the same instruction at any given clock cycle.
❖ Multiple data: Each processing unit can operate on a different data element.
❖ A single control unit dispatches the same instruction to various processors (that work on different data).
Flynn's Taxonomy
❖ Processor Arrays: ILLIAC IV, DAP Connection Machine CM-2, MasPar MP-1.
❖Vector Pipelines: IBM 9000, Cray X-MP, Y-MP & C90, Fujitsu VP, NEC SX-2,
Hitachi S820, ETA10
❖ Most modern computers, particularly those with graphics processing units (GPUs), employ SIMD instructions and execution units.
❖ Example:
for (i = 0; i < 1000; i++)
    c[i] = a[i] + b[i];
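Expanded into a complete program (a minimal sketch assuming a compiler with auto-vectorization, e.g. gcc -O3; not part of the original slides), the element-wise add is a textbook SIMD candidate because one instruction is applied to many data elements:

/* Assumed example: the element-wise add above as a self-contained program.
 * With optimization enabled, the compiler can map the loop to SIMD
 * instructions: a single add applied to several data elements at once. */
#include <stdio.h>

#define N 1000

int main(void) {
    float a[N], b[N], c[N];

    for (int i = 0; i < N; i++) {   /* initialize the input vectors */
        a[i] = (float)i;
        b[i] = 2.0f * (float)i;
    }

    /* Single instruction (add), multiple data (all elements of a and b). */
    for (int i = 0; i < N; i++)
        c[i] = a[i] + b[i];

    printf("c[42] = %f\n", c[42]);
    return 0;
}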
Your Turn !!!
Can you guess the SIMD drawbacks?
Flynn's Taxonomy
Multiple Instruction, Single Data (MISD):
❖A single data stream is fed into multiple processing
units.
❖Each processing unit operates on the data
independently via independent instruction streams.
❖Few actual examples of this class of parallel
computer have ever existed. One is the experimental
Carnegie-Mellon C.mmp computer (1971).
❖ Example: multiple cryptography algorithms attempting to crack a single coded message.
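To make the idea concrete, a minimal pthreads sketch (an assumed example, not from the slides): two independent instruction streams operate on the same single data stream, here one short coded message.

/* Assumed MISD-style sketch: several instruction streams, one data stream. */
#include <pthread.h>
#include <stdio.h>
#include <string.h>

static const char *message = "Uryyb";   /* the single coded message */

static void *try_rot13(void *arg) {     /* instruction stream 1 */
    char out[64];
    size_t n = strlen(message);
    for (size_t i = 0; i < n; i++) {
        char c = message[i];
        if (c >= 'A' && c <= 'Z')      c = 'A' + (c - 'A' + 13) % 26;
        else if (c >= 'a' && c <= 'z') c = 'a' + (c - 'a' + 13) % 26;
        out[i] = c;
    }
    out[n] = '\0';
    printf("ROT13 guess:   %s\n", out);
    return NULL;
}

static void *try_reverse(void *arg) {   /* instruction stream 2 */
    char out[64];
    size_t n = strlen(message);
    for (size_t i = 0; i < n; i++)
        out[i] = message[n - 1 - i];
    out[n] = '\0';
    printf("Reverse guess: %s\n", out);
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, try_rot13, NULL);
    pthread_create(&t2, NULL, try_reverse, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}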
Flynn's Taxonomy
Multiple Instruction, Multiple Data (MIMD):
❖Currently, the most common type of parallel computer. Most
modern computers fall into this category.
❖Multiple Instruction: every processor may be executing a
different instruction stream.
❖Multiple Data: every processor may be working with a
different data stream.
❖Examples: most current supercomputers, networked parallel
computer clusters and "grids", multi-processor SMP computers,
multi-core PCs.
❖ Note: many MIMD architectures also include SIMD execution sub-components.
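A minimal pthreads sketch (an assumed example, not from the slides): two threads execute different instruction streams on different data streams at the same time.

/* Assumed MIMD-style sketch: different instructions on different data. */
#include <pthread.h>
#include <stdio.h>

static void *sum_ints(void *arg) {          /* instruction stream 1 */
    long s = 0;
    for (long i = 0; i < 1000; i++) s += i; /* data stream 1: the integers */
    printf("sum = %ld\n", s);
    return NULL;
}

static void *count_vowels(void *arg) {      /* instruction stream 2 */
    const char *text = (const char *)arg;   /* data stream 2: a string */
    int count = 0;
    for (const char *p = text; *p; p++)
        if (*p == 'a' || *p == 'e' || *p == 'i' || *p == 'o' || *p == 'u')
            count++;
    printf("vowels = %d\n", count);
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, sum_ints, NULL);
    pthread_create(&t2, NULL, count_vowels, "multiple instruction multiple data");
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}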
Your Turn !!!
Compare SIMD and MIMD.