Chapter 06

Pipelining and Parallel Processing


Agenda

• Pipelining: Introduction, Pipeline organization, Pipelining issues, Memory delays, Branch delays, Performance evaluation, The ARM processor
Pipelining:
• Pipelining is a particularly effective way of organizing concurrent activity in a computer system.
• Pipelining is a technique in which the execution of multiple instructions is overlapped.
• In a pipelined system, each segment consists of an input register followed by a combinational circuit.
• A pipeline is a series of stages, where some work is done at each stage. The work is not finished until it has passed through all stages.

Types of Pipelines
• Arithmetic pipeline
where the different stages of an arithmetic operation are handled along the stages of a pipeline. Arithmetic pipelines are used for floating-point operations and for multiplication of fixed-point numbers.
• Let A and B be mantissas (the significant digits of floating-point numbers), and a and b their exponents. Floating-point addition and subtraction is done in four steps:
1. Compare the exponents.
2. Align the mantissas.
3. Add or subtract the mantissas.
4. Produce (normalize) the result.
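The four steps above can be sketched in software. This is a minimal illustration using a toy decimal (mantissa, exponent) representation, not real floating-point hardware; the function name and representation are assumptions for the example.

```python
def fp_add(x, y):
    """Add two numbers given as (mantissa, exponent) pairs, value = m * 10**e."""
    (ma, ea), (mb, eb) = x, y
    # 1. Compare the exponents; make x the operand with the larger exponent.
    if ea < eb:
        (ma, ea), (mb, eb) = (mb, eb), (ma, ea)
    # 2. Align the mantissas: shift the smaller operand's mantissa right.
    mb = mb / (10 ** (ea - eb))
    # 3. Add the mantissas.
    m, e = ma + mb, ea
    # 4. Produce the result: normalize so the mantissa is below 10.
    while abs(m) >= 10:
        m /= 10
        e += 1
    return (m, e)

m, e = fp_add((9.5, 2), (8.0, 1))  # 950 + 80 = 1030
print(m, e)
```

In a hardware arithmetic pipeline, each of these four steps would be a separate stage, so a new pair of operands can enter step 1 while the previous pair is in step 2.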
Pipelining:
• Instruction pipeline
where the different stages of instruction fetch and execution are handled in a pipeline.
A stream of instructions can be executed by overlapping the fetch, decode, and execute phases of the instruction cycle.
An instruction pipeline reads instructions from memory while previous instructions are being executed in other segments of the pipeline.
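The overlap of fetch, decode, and execute phases can be visualized with a small illustrative simulation (the stage names and schedule format are assumptions for the example, not from the text):

```python
def pipeline_schedule(n_instructions, stages=("Fetch", "Decode", "Execute")):
    """Return, for each cycle, which instruction occupies each stage."""
    k = len(stages)
    total_cycles = k + n_instructions - 1  # k cycles to fill, then 1 per instruction
    schedule = []
    for cycle in range(total_cycles):
        row = {}
        for s, stage in enumerate(stages):
            i = cycle - s  # instruction index occupying this stage in this cycle
            if 0 <= i < n_instructions:
                row[stage] = f"I{i + 1}"
        schedule.append(row)
    return schedule

for cycle, row in enumerate(pipeline_schedule(4), start=1):
    print(f"Cycle {cycle}: {row}")
# 4 instructions finish in 3 + (4 - 1) = 6 cycles instead of 4 * 3 = 12.
```

The run shows why throughput improves: after the pipeline fills, one instruction completes every cycle.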
Pipeline organization
• Each stage of the pipeline is processing a different instruction.
• Interstage buffer B1 feeds the Decode stage with a newly fetched instruction.
• Interstage buffer B2 feeds the Compute stage with the two operands read from the register file, the source/destination register identifiers, the immediate value derived from the instruction, and the incremented PC value.
• Interstage buffer B3 holds the result of the ALU operation, which may be data to be written into the register file or an address that feeds the Memory stage.
• Interstage buffer B4 feeds the Write stage with a value to be written into the register file.
Pipelining Issues
• There are times when it is not possible to have a new instruction enter the pipeline in every cycle.
• Consider two instructions, Ij and Ij+1, where the destination register of instruction Ij is a source register for instruction Ij+1.
• If Ij+1 reads that register before Ij has written its result, the result of instruction Ij+1 would be incorrect, because the arithmetic operation would be performed using the old value of the register.
• Any condition that causes the pipeline to stall is called a hazard.
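The register overlap between Ij and Ij+1 described above (a read-after-write dependency) can be detected mechanically. A hedged sketch, where the (dest, src1, src2) instruction format is an assumption for illustration:

```python
def has_raw_hazard(ij, ij_next):
    """True if ij_next reads the register that ij writes (read-after-write)."""
    dest, _, _ = ij          # register written by Ij
    _, src1, src2 = ij_next  # registers read by Ij+1
    return dest in (src1, src2)

# Ij:   Add R2, R3, R4   (writes R2)
# Ij+1: Sub R5, R2, R6   (reads R2 -> hazard: pipeline must stall or forward)
print(has_raw_hazard(("R2", "R3", "R4"), ("R5", "R2", "R6")))  # True
print(has_raw_hazard(("R2", "R3", "R4"), ("R5", "R6", "R7")))  # False
```

Real hardware performs this comparison on register identifiers in the interstage buffers to decide when to stall.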
Memory delays
• Delays arising from memory accesses are another cause of pipeline stalls.
• They occur when the requested instruction or data is not found in the cache, resulting in a cache miss. A memory access may then take ten or more cycles.
• There is an additional type of memory-related stall that occurs when there is a data dependency involving a Load instruction:
Load R2, (R3)
Subtract R9, R2, #30

• The compiler can eliminate the one-cycle stall for this type of data dependency by reordering instructions to insert a useful instruction between the Load instruction and the instruction that depends on the data read from memory.
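The reordering idea can be checked with a small sketch. The program encoding and the added Add instruction are assumptions for illustration; the Load/Subtract pair is the example from the text.

```python
def count_load_use_stalls(program):
    """program: list of (op, dest, sources). Count one stall whenever a Load's
    destination register is read by the instruction immediately after it."""
    stalls = 0
    for prev, cur in zip(program, program[1:]):
        if prev[0] == "Load" and prev[1] in cur[2]:
            stalls += 1
    return stalls

original = [
    ("Load",     "R2", ["R3"]),
    ("Subtract", "R9", ["R2"]),  # depends on the Load -> one-cycle stall
    ("Add",      "R7", ["R4"]),  # independent instruction (assumed)
]
# Move the independent Add between the Load and the dependent Subtract:
reordered = [original[0], original[2], original[1]]

print(count_load_use_stalls(original))   # 1
print(count_load_use_stalls(reordered))  # 0
```

The reordered program does the same work, but the Load's one-cycle latency is now hidden behind the independent Add.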
Branch delays
• Branch instructions can alter the sequence of execution, but they must first be executed to determine whether and where to branch.
• We now consider the effect of branch instructions and the techniques that can be used to mitigate their impact on pipelined execution.

Unconditional Branches-
• The two-cycle delay incurred by an unconditional branch constitutes the branch penalty.
• With a two-cycle branch penalty, the relatively high frequency of branch instructions could increase the execution time of a program by as much as 40 percent.
• Reducing the branch penalty requires the branch target address to be computed earlier in the pipeline.
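The 40 percent figure above can be reproduced with a quick back-of-the-envelope calculation, assuming an ideal pipeline of one instruction per cycle and that roughly 20 percent of executed instructions are branches (the 20 percent branch frequency is an assumption for illustration):

```python
def execution_time_increase(branch_fraction, penalty_cycles):
    """Extra cycles per instruction, relative to the ideal 1 cycle/instruction."""
    return branch_fraction * penalty_cycles

print(execution_time_increase(0.20, 2))  # 0.4 -> 40 percent longer
```

Halving the penalty to one cycle (by deciding the branch earlier in the pipeline) halves this overhead to 20 percent.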
Branch delays
Conditional Branches-
• Consider a conditional branch instruction such as
Branch_if_[R5]=[R6] LOOP
• For pipelining, the branch condition must be tested as early as possible to limit the branch
penalty.
• Moving the branch decision to the Decode stage ensures a common branch penalty of only
one cycle for all branch instructions.
Performance Evaluation
• For a non-pipelined processor, the execution time, T, of a program that has a dynamic instruction count of N is given by
T = (N × S) / R
where S is the average number of clock cycles it takes to fetch and execute one instruction, and R is the clock rate in cycles per second.
• A useful performance indicator is the instruction throughput, which is the number of
instructions executed per second.
• Pipelining improves performance by overlapping the execution of successive instructions,
which increases instruction throughput even though an individual instruction is still executed
in the same number of cycles.
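A worked example of the formula T = (N × S) / R above, together with the instruction throughput R / S. The particular values of N, S, and R are illustrative assumptions, not from the text:

```python
def execution_time(n_instructions, cycles_per_instruction, clock_rate_hz):
    """T = (N * S) / R for a non-pipelined processor."""
    return (n_instructions * cycles_per_instruction) / clock_rate_hz

def throughput(clock_rate_hz, cycles_per_instruction):
    """Instructions executed per second: R / S."""
    return clock_rate_hz / cycles_per_instruction

N = 1_000_000       # dynamic instruction count (assumed)
S = 5               # average cycles per instruction, no overlap (assumed)
R = 1_000_000_000   # 1 GHz clock (assumed)

print(execution_time(N, S, R))  # 0.005 seconds
print(throughput(R, S))         # 200 million instructions per second
```

With ideal pipelining the effective S approaches 1, so throughput approaches R, even though each individual instruction still takes S cycles from fetch to completion.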
Performance Evaluation
1. Effects of Stalls and Penalties-
• The five-stage pipeline involves memory-access operations in the Fetch and Memory stages,
and ALU operations in the Compute stage.
• The operations with the longest delay dictate the cycle time, and hence the clock rate R.
• The compiler can improve performance by reducing the number of times that a Load
instruction is immediately followed by a dependent instruction.
• A stall is eliminated each time the compiler can safely move a nearby instruction to a position
between the Load instruction and the dependent instruction.
• The effect of cache misses on performance can be assessed by considering the frequency of
their occurrence.
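One common way to assess the effect of cache misses, sketched here under assumed numbers: the average cycles per instruction grow with the miss rate and the miss penalty. The specific model and all values are assumptions for illustration, not from the text.

```python
def average_cpi(base_cpi, mem_refs_per_instr, miss_rate, miss_penalty):
    """Average cycles per instruction including cache-miss stall cycles."""
    return base_cpi + mem_refs_per_instr * miss_rate * miss_penalty

# Assumed: base CPI of 1, 1.3 memory references per instruction
# (instruction fetch plus some loads/stores), 5 percent miss rate,
# 10-cycle miss penalty (the text notes a miss may take ten or more cycles).
print(average_cpi(1.0, 1.3, 0.05, 10))  # about 1.65 cycles per instruction
```

Even a modest miss rate noticeably inflates the average CPI, which is why miss frequency is the key quantity to measure.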

2. Number of Pipeline Stages-


• As the number of pipeline stages increases, more instructions are executed concurrently.
