
Lecture 3.1.3 (Instruction Pipeline)

The document discusses instruction pipelining, a technique that allows multiple instructions to execute simultaneously within a single processor, enhancing efficiency compared to non-pipelined execution. It outlines a four-stage pipeline architecture consisting of Instruction Fetch, Instruction Decode, Instruction Execute, and Write Back, detailing the functions of each stage and the performance metrics such as speed up, efficiency, and throughput. Additionally, it provides calculations for cycle time, execution time, and the conditions necessary for achieving optimal performance in pipelined architectures.

Uploaded by

amitkr.jaiswal736


University Institute of Engineering

Department of Computer Science & Engineering

COMPUTER ORGANIZATION & ARCHITECTURE

(23CST-204/23ITT-204)

ER. SHIKHA ATWAL

E11186

ASSISTANT PROFESSOR

BE-CSE
INSTRUCTION PIPELINE

●Instruction pipelining is a technique that implements a form of parallelism, called instruction-level parallelism, within a single processor.
●A pipelined processor does not wait until the previous instruction has executed completely.
●Rather, it fetches the next instruction and begins executing it.
●As a result, multiple instructions are in execution simultaneously, each in a different stage.
●Pipelined execution is more efficient than non-pipelined execution.
Four-Stage Pipeline

In a four-stage pipelined architecture, the execution of each instruction is completed in the following four stages:

1. Instruction Fetch (IF)
2. Instruction Decode (ID)
3. Instruction Execute (IE)
4. Write Back (WB)

To implement a four-stage pipeline,

●The hardware of the CPU is divided into four functional units.

●Each functional unit performs a dedicated task.
Stage-01:

At stage-01,
●The first functional unit performs instruction fetch.
●It fetches the instruction to be executed.
●The instruction is fetched from the code area of main memory into the instruction register.
Stage-02:

At stage-02,

●The second functional unit performs instruction decode.

●It decodes the instruction to be executed.
●The opcode held in the instruction buffer is decoded to identify the operation to be performed.
●The input data (operands) on which the instruction operates are fetched from the data area of memory.
Stage-03:

At stage-03,
●The third functional unit performs instruction execution.
●It executes the instruction.
●It performs the arithmetic, logical, or other specified operation on the operands and generates the result.

Stage-04:

At stage-04,
●The fourth functional unit performs write back.
●It writes the result obtained from executing the instruction back to main memory or to the registers.
Execution of Instruction Pipeline

In pipeline architecture,

●Instructions in the program execute in parallel.

●When one instruction goes from the nth stage to the (n+1)th stage, another
instruction goes from the (n-1)th stage to the nth stage.

In non-pipelined mode, these four stages are performed sequentially, one after the other, for each instruction.

[Figure: non-pipelined execution of stages S1 to S4]
In a 4-stage pipelined computer, successive stages are executed in an overlapped fashion.

[Figure: overlapped (pipelined) execution of stages S1 to S4]

Phase-Time Diagram
●A phase-time diagram shows the execution of instructions in the pipelined
architecture.
●The following diagram shows the execution of three instructions in a four-stage
pipeline architecture.
Time taken to execute three instructions in four stage pipelined architecture = 6
clock cycles.
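
A phase-time table of this kind can be regenerated with a short sketch. The Python below is illustrative only: the function name phase_time_diagram and the "--" marker for an idle stage are assumptions made for this example, and it assumes an ideal pipeline with no stalls, in which instruction i enters the first stage in clock cycle i.

```python
def phase_time_diagram(n, stages=("IF", "ID", "IE", "WB")):
    """Build a phase-time table for n instructions in an ideal pipeline.

    Assumes no stalls: one new instruction enters the first stage per cycle.
    Returns the total number of clock cycles and one row of cells per stage.
    """
    k = len(stages)
    total_cycles = k + n - 1
    rows = []
    for s, name in enumerate(stages):
        cells = []
        for cycle in range(total_cycles):
            i = cycle - s  # index of the instruction occupying stage s in this cycle
            cells.append(f"I{i + 1}" if 0 <= i < n else "--")
        rows.append((name, cells))
    return total_cycles, rows


total, rows = phase_time_diagram(3)
print("cycle  " + " ".join(f"{c + 1:>2}" for c in range(total)))
for name, cells in reversed(rows):  # last stage on top, as in a phase-time diagram
    print(f"{name:<6} " + " ".join(cells))
print("total clock cycles:", total)
```

For three instructions and four stages the table spans 6 clock cycles, matching the figure quoted above.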

NOTE
In a non-pipelined architecture, the time taken to execute three instructions would be
= 3 x Time taken to execute one instruction
= 3 x 4 clock cycles
= 12 clock cycles

Clearly, pipelined execution of instructions is more efficient than non-pipelined execution: here it needs 6 clock cycles instead of 12.

Performance of Pipelined Execution

The following parameters serve as criteria to estimate the performance of

pipelined execution:

●Speed Up
●Efficiency
●Throughput
1. Speed Up

Speed up gives an idea of "how much faster" pipelined execution is compared with non-pipelined execution. It is calculated as-

Speed up (S) = Non-pipelined execution time / Pipelined execution time
2. Efficiency

The efficiency of pipelined execution is calculated as-

Efficiency (η) = Speed up / Number of stages in the pipelined architecture

3. Throughput

Throughput is defined as the number of instructions executed per unit time. It is calculated as-

Throughput = Number of instructions executed / Total time taken
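
The three parameters can be tried out on the three-instruction example from the phase-time diagram. This is a minimal sketch, not a definitive implementation: the function names are hypothetical, and efficiency is taken here as speed up divided by the number of stages, the usual textbook definition, which is an assumption of this example.

```python
def speed_up(non_pipelined_time, pipelined_time):
    """How many times faster pipelined execution is than non-pipelined."""
    return non_pipelined_time / pipelined_time


def efficiency(speed_up_value, num_stages):
    """Assumed definition: speed up as a fraction of the number of stages."""
    return speed_up_value / num_stages


def throughput(num_instructions, total_time):
    """Instructions completed per unit of time."""
    return num_instructions / total_time


# Three instructions, four stages: 12 cycles non-pipelined, 6 cycles pipelined.
s = speed_up(12, 6)
print("speed up   =", s)                  # 2.0
print("efficiency =", efficiency(s, 4))   # 0.5, i.e. 50%
print("throughput =", throughput(3, 6), "instructions per clock cycle")  # 0.5
```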
Calculation of Important Parameters
Let us learn how to calculate certain important parameters of pipelined
architecture. Consider-
●A pipelined architecture consisting of k-stage pipeline
●Total number of instructions to be executed = n

Point-01: Calculating Cycle Time-

In pipelined architecture,

●There is a global clock that synchronizes the working of all the stages.
●Frequency of the clock is set such that all the stages are synchronized.
●At the beginning of each clock cycle, each stage reads the data from its register and processes it.
●Cycle time is the duration of one clock cycle.
There are two possible cases-

Case-01: All the stages offer the same delay-

If all the stages offer the same delay, then-

Cycle time = Delay offered by one stage, including the delay due to its register

Case-02: The stages do not all offer the same delay-

If the stages do not all offer the same delay, then-

Cycle time = Maximum delay offered by any stage, including the delay due to its register
Point-02: Calculating Frequency of Clock-

Frequency of the clock (f) = 1 / Cycle time
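
Points 01 and 02 can be sketched together in a few lines of Python. The stage delays (60, 50, 90, 80 ns) and the register delay (10 ns) below are made-up illustration values, and the sketch assumes every stage has the same register delay; the function names are hypothetical.

```python
def cycle_time_ns(stage_delays_ns, register_delay_ns=0.0):
    """Cycle time = slowest stage delay plus the (assumed uniform) register delay.

    When all stages offer the same delay, max() simply returns that delay,
    so the same function covers Case-01 and Case-02.
    """
    return max(stage_delays_ns) + register_delay_ns


def clock_frequency_mhz(cycle_ns):
    """f = 1 / cycle time; a cycle time in ns gives 1000 / cycle_ns in MHz."""
    return 1000.0 / cycle_ns


ct = cycle_time_ns([60, 50, 90, 80], register_delay_ns=10)
print("cycle time =", ct, "ns")                             # 100 ns
print("frequency  =", clock_frequency_mhz(ct), "MHz")       # 10.0 MHz
```

The slowest stage sets the clock for the whole pipeline, so the three faster stages sit idle for part of every cycle.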

Point-03: Calculating Non-Pipelined Execution Time-

In non-pipelined architecture,
• The instructions execute one after the other.
• The execution of a new instruction begins only after the previous instruction
has executed completely.
So, number of clock cycles taken by each instruction = k clock cycles

Thus, Non-pipelined execution time

= Total number of instructions x Time taken to execute one instruction
= n x k clock cycles
Point-04: Calculating Pipelined Execution Time-

In pipelined architecture,

●Multiple instructions execute in parallel.

●Number of clock cycles taken by the first instruction = k clock cycles
●After the first instruction has executed completely, one instruction completes per clock cycle.
●So, number of clock cycles taken by each remaining instruction = 1 clock cycle

Thus, Pipelined execution time

= Time taken to execute first instruction + Time taken to execute remaining
instructions
= 1 x k clock cycles + (n-1) x 1 clock cycle
= (k + n – 1) clock cycles
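
The two execution-time formulas, n x k and (k + n – 1), can be cross-checked with a small sketch. The function names are hypothetical, and the step-by-step count assumes an ideal pipeline in which one instruction enters per clock cycle and nothing stalls.

```python
def non_pipelined_cycles(n, k):
    """Each of the n instructions takes all k stages before the next starts."""
    return n * k


def pipelined_cycles(n, k):
    """First instruction takes k cycles, each remaining one adds 1 cycle."""
    return k + n - 1 if n > 0 else 0


def simulate_pipelined_cycles(n, k):
    """Count cycles by tracking when each instruction leaves the last stage."""
    finish = 0
    for i in range(n):
        start_cycle = i           # instruction i enters the first stage after i cycles
        finish = start_cycle + k  # and leaves the last stage k cycles later
    return finish


n, k = 3, 4
print("non-pipelined:", non_pipelined_cycles(n, k), "clock cycles")   # 12
print("pipelined:    ", pipelined_cycles(n, k), "clock cycles")       # 6
print("simulated:    ", simulate_pipelined_cycles(n, k), "clock cycles")  # 6
```

Multiplying either cycle count by the cycle time gives the execution time in seconds.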
Point-05: Calculating Speed Up-

Speed up
= Non-pipelined execution time / Pipelined execution time
= n*k clock cycles / (k + n – 1) clock cycles
= n*k / (k + n – 1)
= n*k / [n + (k – 1)]
= k / [1 + (k – 1)/n]          (dividing numerator and denominator by n)

●For a very large number of instructions, n→∞ and (k – 1)/n→0. Thus, speed up approaches k.

●Practically, the total number of instructions is always finite.
●Therefore, speed up is always less than the number of stages in the pipeline.
●Theoretically, a k-stage pipeline is k times faster than serial execution.
●But this ideal speed up cannot be achieved in practice due to factors such as data dependencies, branches, and interrupts.
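
How quickly the speed up approaches k can be seen numerically. The sketch below is illustrative: the function name speed_up_k_stage is hypothetical, k = 4 is chosen to match the four-stage example, and the formula assumes an ideal pipeline with no stalls.

```python
def speed_up_k_stage(n, k):
    """Ideal speed up of a k-stage pipeline over n instructions: n*k / (k + n - 1)."""
    return (n * k) / (k + n - 1)


k = 4
for n in (1, 3, 10, 100, 1000, 1_000_000):
    print(f"n = {n:>9}: speed up = {speed_up_k_stage(n, k):.4f}")
```

With n = 1 the speed up is exactly 1 (no gain), with n = 3 it is 2, and for larger n it climbs toward 4 without ever reaching it.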
Important Notes

Note-01:

●The aim of pipelined architecture is to execute one complete instruction in one clock cycle.

●In other words, the aim of pipelining is to maintain CPI ≅ 1.

●Ideally, a pipelined architecture executes one complete instruction per clock cycle (CPI = 1).
●Practically, CPI = 1 is not reached exactly: the first instruction needs k clock cycles, stalls from data dependencies, branches, and interrupts add further cycles, and the registers between stages add delay to every cycle.
Note-02:
●The maximum speed up that can be achieved is always equal to the
number of stages.
●This is achieved when efficiency becomes 100%.
●Practically, efficiency is always less than 100%.
●Therefore speed up is always less than number of stages in pipelined
architecture.
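
The link between efficiency and speed up in Note-02 can be written out as a short derivation. It assumes efficiency is defined as speed up divided by the number of stages (η = S/k) and uses the ideal speed up for k stages and n instructions, S = nk/(k + n − 1).

```latex
\begin{align*}
\eta &= \frac{S}{k}
      = \frac{1}{k}\cdot\frac{n\,k}{k + n - 1}
      = \frac{n}{k + n - 1}, \\
\lim_{n \to \infty} \eta &= 1 \quad (100\%), \qquad
\eta < 1 \ \text{for every finite } n \text{ when } k > 1 .
\end{align*}
```

For the three-instruction, four-stage example this gives η = 3/6 = 50%, so the speed up is 0.5 x 4 = 2.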

Note-03:

Under ideal conditions,

●One complete instruction is executed per clock cycle i.e. CPI = 1.

●Speed up = Number of stages in pipelined architecture
Note-04:
●A 5-stage pipeline is a commonly cited design point (for example, the classic RISC pipeline); the best depth in practice depends on the processor design and workload.

Note-05:

In case only one instruction has to be executed, then-

●Non-pipelined execution gives better performance than pipelined

execution.
●This is because delays are introduced due to registers in pipelined
architecture.
●Thus, time taken to execute one instruction in non-pipelined
architecture is less.
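
Note-05 can be illustrated with numbers. This is a sketch under stated assumptions: the stage delays (10 ns each) and register delay (1 ns) are made-up values, the non-pipelined processor is assumed to have no inter-stage registers, and the function name is hypothetical.

```python
def single_instruction_times(stage_delays_ns, register_delay_ns):
    """Time for ONE instruction, non-pipelined versus pipelined.

    Non-pipelined: the stage delays are paid back to back, with no registers.
    Pipelined: the instruction passes through k stages, each one clock cycle
    long, where the cycle is the slowest stage plus the register delay.
    """
    k = len(stage_delays_ns)
    non_pipelined = sum(stage_delays_ns)
    pipelined = k * (max(stage_delays_ns) + register_delay_ns)
    return non_pipelined, pipelined


np_time, p_time = single_instruction_times([10, 10, 10, 10], register_delay_ns=1)
print("non-pipelined:", np_time, "ns")   # 40 ns
print("pipelined:    ", p_time, "ns")    # 44 ns
```

The pipeline only pays off once several instructions overlap, so that the per-instruction cost falls toward one clock cycle.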
Note-06:

High efficiency of pipelined processor is achieved when-

●All the stages are of equal duration.

●There are no conditional branch instructions.
●There are no interrupts.
●There are no register and memory conflicts.
Performance degrades when any of these conditions is not met.
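
The first condition, stages of equal duration, can be checked with a small calculation. The sketch below is illustrative only: the delays are made-up values, register delays and stalls are ignored, the non-pipelined time per instruction is assumed to be the sum of the stage delays, and efficiency is taken as speed up divided by the number of stages.

```python
def pipeline_efficiency(stage_delays_ns, n):
    """Efficiency for n instructions when the clock is set by the slowest stage."""
    k = len(stage_delays_ns)
    non_pipelined = n * sum(stage_delays_ns)
    pipelined = (k + n - 1) * max(stage_delays_ns)
    speed_up = non_pipelined / pipelined
    return speed_up / k


balanced = pipeline_efficiency([10, 10, 10, 10], n=1000)
unbalanced = pipeline_efficiency([5, 10, 20, 5], n=1000)
print(f"balanced stages:   {balanced:.1%}")
print(f"unbalanced stages: {unbalanced:.1%}")
```

Both pipelines do the same 40 ns of work per instruction, but the unbalanced one is clocked by its 20 ns stage and reaches only about half the efficiency of the balanced one.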
