Parallel processors
Dr. Mrs. B. Janet,
Department of Computer Applications,
NIT, Trichy 15.
Computer Organization
SEQUENTIAL & PARALLEL PROCESSING
[Diagram: in SEQUENTIAL processing, a program's tasks (TASK 1, TASK 2, ...) run one after another on a single CPU to produce the result; in PARALLEL processing, the tasks (TASK 1, TASK 2, TASK 3) run simultaneously on multiple CPUs]
MASSIVELY PARALLEL COMPUTERS CAN HAVE THOUSANDS OF CPUs
Processor Designs
Pipelined ALU
Within operations
Across operations
Parallel ALUs
Parallel processors
Parallel Processor
Increase system performance by using multiple processors that can execute in parallel
Symmetric Multi-Processor
Cluster
Non-Uniform Memory Access (NUMA)
Pipelining
Instruction-level parallelism: overlapping the execution of instructions
Loop-level parallelism: executing iterations of a loop in parallel (see the sketch below)
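Loop-level parallelism is easy to make concrete. Below is a minimal sketch in C using OpenMP (standard OpenMP usage, not taken from these slides; compile with -fopenmp): since no iteration depends on another, the runtime may split them across processors.

#include <stdio.h>

#define N 1000000

int main(void) {
    static double a[N], b[N], c[N];     /* static: too large for the stack */

    for (int i = 0; i < N; i++) {
        a[i] = i;
        b[i] = 2.0 * i;
    }

    /* Iterations are independent, so OpenMP may distribute
       them across the available processors. */
    #pragma omp parallel for
    for (int i = 0; i < N; i++)
        c[i] = a[i] + b[i];

    printf("c[42] = %f\n", c[42]);      /* 126.000000 */
    return 0;
}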
Multiple Processor Organization
Types of Parallel Processor Systems
Single instruction, single data stream - SISD
Single instruction, multiple data stream - SIMD
Multiple instruction, single data stream - MISD
Multiple instruction, multiple data stream- MIMD
Single Instruction, Single Data Stream - SISD
Single processor
Single instruction stream
Data stored in single memory
[Diagram: uni-processor (SISD) organization: the CU sends the instruction stream (IS) to the PU, which exchanges the data stream (DS) with the MU]
CU - Control Unit
IS - Instruction Stream
PU - Processing Unit
DS - Data Stream
MU - Memory Unit
Single Instruction, Multiple Data Stream - SIMD
A single machine instruction controls the simultaneous execution of a number of processing elements on a lockstep basis
Each processing element has an associated data memory
Each instruction is executed on a different set of data by the different processors
Vector and array processors (see the sketch below)
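On mainstream CPUs, SIMD appears as vector instructions. A minimal sketch in C, assuming an x86 processor with SSE (the _mm_* intrinsics are Intel's, from immintrin.h, not from these slides): a single add instruction operates on four floats in lockstep.

#include <immintrin.h>
#include <stdio.h>

int main(void) {
    float a[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    float b[4] = {10.0f, 20.0f, 30.0f, 40.0f};
    float c[4];

    __m128 va = _mm_loadu_ps(a);        /* load four floats */
    __m128 vb = _mm_loadu_ps(b);
    __m128 vc = _mm_add_ps(va, vb);     /* one instruction, four additions */
    _mm_storeu_ps(c, vc);

    for (int i = 0; i < 4; i++)
        printf("%.1f ", c[i]);          /* 11.0 22.0 33.0 44.0 */
    printf("\n");
    return 0;
}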
Parallel Organizations - SIMD
[Diagram: a single control unit broadcasts the instruction stream to multiple processing units, each with its own local memory]
LM - Local Memory
Multiple Instruction, Single Data Stream - MISD
A sequence of data is transmitted to a set of processors
Each processor executes a different instruction sequence
Never been implemented
Multiple Instruction, Multiple Data Stream - MIMD
A set of processors simultaneously execute different instruction sequences on different sets of data (see the sketch below)
SMPs, clusters and NUMA systems
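A minimal MIMD-style sketch using POSIX threads in C (the task functions and data are illustrative assumptions, not from these slides; compile with -pthread): two threads run different instruction sequences on different sets of data.

#include <pthread.h>
#include <stdio.h>

static void *sum_task(void *arg) {     /* instruction stream 1 */
    int *data = arg, s = 0;
    for (int i = 0; i < 4; i++) s += data[i];
    printf("sum = %d\n", s);
    return NULL;
}

static void *max_task(void *arg) {     /* instruction stream 2 */
    int *data = arg, m = data[0];
    for (int i = 1; i < 4; i++) if (data[i] > m) m = data[i];
    printf("max = %d\n", m);
    return NULL;
}

int main(void) {
    int a[4] = {1, 2, 3, 4}, b[4] = {7, 5, 9, 2};
    pthread_t t1, t2;

    /* Different instruction streams on different data streams. */
    pthread_create(&t1, NULL, sum_task, a);
    pthread_create(&t2, NULL, max_task, b);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    return 0;
}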
MIMD - Overview
General purpose processors
Each can process all instructions necessary
Further classified by method of processor communication
Parallel Organizations - MIMD
Distributed Memory
Taxonomy of Parallel Processor Architectures
SMP
Multiple similar processors within the same computer, interconnected by a bus or switching arrangement.
The main problem is cache coherence
Symmetric Multiprocessors
A stand-alone computer with the following characteristics:
Two or more similar processors of comparable capacity
Processors share the same memory and I/O
Processors are connected by a bus or other internal connection such that memory access time is approximately the same for each processor
All processors share access to I/O, either through the same channels or through different channels giving paths to the same devices
All processors can perform the same functions (hence symmetric)
System controlled by an integrated operating system providing interaction between processors: thread scheduling, synchronisation, and interaction at the job, task, file and data element levels (see the sketch below)
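As a sketch of the shared-memory interaction and synchronisation an SMP operating system must support, the following C program (POSIX threads; the counter and thread count are illustrative, not from these slides; compile with -pthread) has several threads update one location in shared memory under a mutex.

#include <pthread.h>
#include <stdio.h>

#define NTHREADS 4
#define NITER 100000

static long counter = 0;                       /* shared memory */
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

static void *worker(void *arg) {
    (void)arg;
    for (int i = 0; i < NITER; i++) {
        pthread_mutex_lock(&lock);             /* synchronisation */
        counter++;
        pthread_mutex_unlock(&lock);
    }
    return NULL;
}

int main(void) {
    pthread_t t[NTHREADS];
    for (int i = 0; i < NTHREADS; i++)
        pthread_create(&t[i], NULL, worker, NULL);
    for (int i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    printf("counter = %ld\n", counter);        /* 400000 */
    return 0;
}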
SMP Advantages
Performance
If some work can be done in parallel
Availability
Since all processors can perform the same
functions, failure of a single processor does not
halt the system
Incremental growth
User can enhance performance by adding
additional processors
Scaling
Vendors can offer range of products based on
number of processors
Block Diagram of Tightly Coupled Multiprocessor
Tightly Coupled - SMP
Processors share memory
Communicate via that shared memory
Symmetric Multiprocessor (SMP)
Share single memory or pool
Shared bus to access memory
Memory access time to given area of memory
is approximately the same for each processor
Symmetric Multiprocessor Organization
IBM z990 Multiprocessor Structure
Chip Multiprocessing
More than one processor implemented on
a single chip
Multithreading and Chip Multiprocessors
Instruction stream divided into smaller streams (threads)
Executed in parallel
Wide variety of multithreading designs
Cluster
A group of interconnected whole computers working together as a unified computing resource.
Clusters
Alternative to SMP
High performance
High availability
Server applications
A group of interconnected whole computers working together as a unified resource
Illusion of being one machine
Each computer called a node
Cluster Benefits
Absolute scalability
Incremental scalability
High availability
Superior price/performance
Cluster Configurations - Standby Server, No Shared Disk
Cluster Configurations - Shared Disk
Cluster v. SMP
Both provide multiprocessor support to high-demand applications
Both are available commercially
SMP has been available far longer
SMP:
Easier to manage and control
Closer to single processor systems
Scheduling is main difference
Less physical space
Lower power consumption
Clustering:
Superior incremental & absolute scalability
Superior availability
Redundancy
NUMA
A shared-memory multiprocessor in which the access time from a given processor to a word in memory varies with the location of the memory word.
Nonuniform Memory Access (NUMA)
Alternative to SMP & clustering
Uniform memory access
All processors have access to all parts of memory
Using load & store
Access time to all regions of memory is the same
Access time to memory is the same for all processors
As used by SMP
Nonuniform memory access
All processors have access to all parts of memory
Using load & store
Access time differs depending on the region of memory
Different processors access different regions of memory at different speeds
Cache coherent NUMA
Cache coherence is maintained among the caches of the various processors
Significantly different from SMP and clusters
Motivation
SMP has a practical limit to the number of processors
Bus traffic limits this to between 16 and 64 processors
In clusters each node has own memory
Apps do not see large global memory
Coherence maintained by software not hardware
NUMA retains SMP flavour while giving large-scale multiprocessing
e.g. Silicon Graphics Origin NUMA: 1024 MIPS R10000 processors
Objective is to maintain a transparent system-wide memory while permitting multiprocessor nodes, each with its own bus or internal interconnection system (see the sketch below)
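On Linux, memory placement on a NUMA machine can be controlled explicitly. A minimal sketch using libnuma (a real Linux library, though not mentioned on these slides; compile with -lnuma): memory is allocated on a specific node, so that node's processors get the faster, local access times.

#include <numa.h>
#include <stdio.h>
#include <string.h>

int main(void) {
    if (numa_available() < 0) {
        fprintf(stderr, "NUMA not supported on this system\n");
        return 1;
    }

    printf("NUMA nodes: %d\n", numa_max_node() + 1);

    /* Allocate 1 MiB on node 0: local (fast) for node-0
       processors, remote (slower) for processors elsewhere. */
    size_t size = 1 << 20;
    char *buf = numa_alloc_onnode(size, 0);
    if (!buf) return 1;

    memset(buf, 0, size);                /* touch the pages */
    numa_free(buf, size);
    return 0;
}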
CC-NUMA Organization
Scalar Processor Approaches
Single-threaded scalar
Simple pipeline
No multithreading
Interleaved multithreaded scalar
Easiest multithreading to implement
Switch threads at each clock cycle
Pipeline stages kept close to fully occupied
Hardware needs to switch thread context between cycles
Blocked multithreaded scalar
Thread executed until latency event occurs
Would stop pipeline
Processor switches to another thread
Vector Computation
Maths problems involving physical processes present different difficulties for computation
Aerodynamics, seismology, meteorology
Continuous field simulation
High precision
Repeated floating-point calculations on large arrays of numbers
Supercomputers handle these types of problem
Hundreds of millions of flops
$10-15 million
Optimised for calculation rather than multitasking and I/O
Limited market
Research, government agencies, meteorology
Array processor
Alternative to supercomputer
Configured as peripherals to mainframes & minicomputers
Just run vector portion of problems
Vector Addition Example
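The slide's original figure is not reproduced here; as a stand-in, a minimal C sketch of the same computation: C = A + B element by element, which a vector processor performs with a single vector instruction rather than one scalar add per iteration.

#include <stdio.h>

#define N 8

int main(void) {
    double a[N], b[N], c[N];

    for (int i = 0; i < N; i++) {
        a[i] = i;
        b[i] = 10.0 * i;
    }

    /* A scalar processor issues one add per iteration; a vector
       machine performs this whole loop as one instruction:
       VADD C, A, B */
    for (int i = 0; i < N; i++)
        c[i] = a[i] + b[i];

    for (int i = 0; i < N; i++)
        printf("%.1f ", c[i]);
    printf("\n");
    return 0;
}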
Vector Processor
Processes vectors or arrays of data
Approaches to Vector Computation
Symmetric Multiprocessing to the Rescue
Multiple Processors
Multithreading
One instruction stream per slot
Multithreaded Processor
Replicates some components of the processor to execute multiple threads concurrently
Multithreading
Alleviates some of the memory latency problems
Still has problems
What if the red thread waits for data from memory on a cache miss? The yellow thread then waits unnecessarily (red and yellow are the thread slots in the slide's diagram)
Hyperthreading
More than one instruction stream per slot
SMP vs SMT
Having Multiple Cores
[Diagram comparing three organizations built from the same blocks (Arch. State, APIC, Processor Core, On-Die Cache, System Bus): Dual Processor - two complete processors, each with its own cache, on the system bus; HyperThreading - one processor core carrying two architectural states and APICs; Dual Core - two processor cores with on-die cache on a single chip]
Multicores
Two or more processors on the same chip
Each has an independent interface to the front-side bus
Both the OS and the applications must support thread-level parallelism (see the sketch below)
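A minimal sketch of application-level thread parallelism on a multicore machine, in C with POSIX threads (sysconf(_SC_NPROCESSORS_ONLN) is a standard POSIX/Linux facility, not from these slides; compile with -pthread): query the number of cores online and start one thread per core.

#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>

static void *work(void *arg) {
    long id = (long)arg;
    printf("thread %ld running\n", id);
    return NULL;
}

int main(void) {
    long ncores = sysconf(_SC_NPROCESSORS_ONLN);
    printf("cores online: %ld\n", ncores);

    pthread_t *t = malloc(ncores * sizeof *t);
    if (!t) return 1;

    /* One software thread per hardware core. */
    for (long i = 0; i < ncores; i++)
        pthread_create(&t[i], NULL, work, (void *)i);
    for (long i = 0; i < ncores; i++)
        pthread_join(t[i], NULL);

    free(t);
    return 0;
}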