Uploaded by

sukantakundu11


TEST-3 SOLUTIONS

Subject: Advanced Computer Architecture

1) Consider the following pipeline reservation table.

1 2 3 4 5 6 7 8
S1
X X X
S2 X X
X X X
S3

(a) What are the forbidden latencies?

(b) Draw the state transition diagram.
(c) List all the simple cycles and greedy cycles.
(d) Determine the optimal constant latency cycle and the minimal average latency.
(e) Let the pipeline clock period be τ = 20 ns. Determine the throughput of the pipeline.
(10 Marks)
Sol:

Forbidden latencies: 2, 4, 5 and 7

Permissible latencies: 1, 3, 6 and 8
Collision vector: C7C6C5C4C3C2C1 = 1011010
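The forbidden latencies and the collision vector can be cross-checked with a short Python sketch. The column positions of the marks are an assumption here: the table below is a hypothetical layout with three marks for S1, two for S2 and three for S3, chosen because it reproduces the stated answer. Forbidden latencies are the distances between any two marks in the same row.

```python
def forbidden_latencies(table):
    """table maps a stage name to the set of 1-based clock cycles marked X."""
    forbidden = set()
    for cycles in table.values():
        for a in cycles:
            for b in cycles:
                if b > a:
                    forbidden.add(b - a)  # distance between two marks in one row
    return forbidden


def collision_vector(forbidden, m):
    """Bit string C_m ... C_1, where C_i = 1 when latency i is forbidden."""
    return "".join("1" if i in forbidden else "0" for i in range(m, 0, -1))


# Assumed mark positions (hypothetical, not taken from the original table).
table = {"S1": {1, 6, 8}, "S2": {2, 4}, "S3": {3, 5, 7}}
forbidden = forbidden_latencies(table)
cv = collision_vector(forbidden, 7)
print(sorted(forbidden))  # [2, 4, 5, 7]
print(cv)                 # 1011010
```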

CASE 1: latency 3
Present state 1011010
Collision vector 1011010
PS with 3 shifts + 0001011
Next state 1011011

Present state 1011011 Present state 1011011

Collision vector 1011010 Collision vector 1011010

PS with 3 shifts + 0001011 PS with 8 shifts + 0000000
Next state 1011011 Next state 1011010

CASE 2: latency 6
Present state 1011010
Collision vector 1011010
PS with 6 shifts + 0000001
Next state 1011011
Present state 1011011 Present state 1011011
Collision vector 1011010 Collision vector 1011010
PS with 6 shifts + 0000001 PS with 8 shifts + 0000000
Next state 1011011 Next state 1011010

CASE 3: latency 1
Present state 1011010
Collision vector 1011010
PS with 1 shifts + 0101101
Next state 1111111

CASE 4: latency 8
Present state 1111111 Present state 1011010
Collision vector 1011010 Collision vector 1011010
PS with 8 shifts + 0000000 PS with 8 shifts + 0000000
Next state 1011010 Next state 1011010
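The four cases follow one rule: shift the present state right by the latency, then OR it with the collision vector. A minimal Python sketch of that rule, which also collects every reachable state, is given here. The function names are illustrative, and a latency of m+1 stands for "m+1 or more" (8+ for this 7-bit vector).

```python
def next_state(state, latency, cv):
    """Shift the state right by `latency` bits, then OR with the collision vector."""
    shifted = int(state, 2) >> latency
    return format(shifted | int(cv, 2), "0{}b".format(len(cv)))


def permissible(state):
    """Latencies whose bit C_i is 0, plus m+1 standing for 'm+1 or more'."""
    m = len(state)
    return [i for i in range(1, m + 1) if state[m - i] == "0"] + [m + 1]


def state_diagram(cv):
    """Map every reachable state to its {latency: next state} transitions."""
    diagram, todo = {}, [cv]
    while todo:
        s = todo.pop()
        if s in diagram:
            continue
        diagram[s] = {p: next_state(s, p, cv) for p in permissible(s)}
        todo.extend(diagram[s].values())
    return diagram


diagram = state_diagram("1011010")
for s in sorted(diagram):
    print(s, diagram[s])
```

For the collision vector 1011010 this yields three states: 1011010, 1011011 and 1111111.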

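State transition diagram (an asterisk marks a greedy, minimum-latency move):
1011010: 1* -> 1111111, 3 -> 1011011, 6 -> 1011011, 8+ -> 1011010
1011011: 3* -> 1011011, 6 -> 1011011, 8+ -> 1011010
1111111: 8+ -> 1011010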

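Latency cycles: (1, 8), (1, 8, 8), (1, 8, 3, 8), (1, 8, 8, 3, 8), (1, 8, 6, 8), (1, 8, 8, 6, 8), (8), (3), (6), (1, 8, 8, 6, 6, 8)
Simple cycles: (3), (6), (8), (1, 8), (3, 8), (6, 8)
Greedy cycles: (3) and (1, 8)
Optimal constant latency cycle: (3)
MAL: Lower bound = 3 (the maximum number of marks in any row of the reservation table)
Upper bound = 4+1 = 5 (the number of 1s in the collision vector, plus 1)
Average latency of greedy cycle (1, 8) = (1+8) / 2 = 4.5
Average latency of greedy cycle (3) = 3
MAL = 3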
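As a cross-check, the greedy cycles can be traced mechanically by always taking the smallest permissible latency from each state. The sketch below assumes the shift-and-OR transition rule used in the cases above; the helper names are hypothetical, and the two start states are the ones that have a non-trivial greedy move in the state diagram.

```python
from fractions import Fraction


def next_state(state, latency, cv):
    """Shift right by `latency`, then OR with the collision vector."""
    return format((int(state, 2) >> latency) | int(cv, 2), "0{}b".format(len(cv)))


def min_latency(state):
    """Smallest latency i with C_i = 0, or m+1 when every bit is 1."""
    m = len(state)
    for i in range(1, m + 1):
        if state[m - i] == "0":
            return i
    return m + 1


def greedy_cycle(start, cv):
    """Follow minimum-latency moves until a state repeats; return the loop."""
    seen, latencies, s = {}, [], start
    while s not in seen:
        seen[s] = len(latencies)
        p = min_latency(s)
        latencies.append(p)
        s = next_state(s, p, cv)
    return tuple(latencies[seen[s]:])


cv = "1011010"
cycles = {greedy_cycle(s, cv) for s in ("1011010", "1011011")}
averages = {c: Fraction(sum(c), len(c)) for c in cycles}
upper_bound = cv.count("1") + 1   # number of 1s in the collision vector, plus 1
mal = min(averages.values())      # the smallest greedy average gives the MAL
print(sorted(cycles), upper_bound, mal)
```

The greedy cycle (1, 8) averages 9/2 and (3) averages 3, so the smaller value, 3, is taken as the MAL.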

Given:
τ = 20 ns
Throughput of the pipeline = 1 / (MAL x τ) = 1 / (3 x 20 x 10^-9 s) ≈ 16.67 x 10^6 initiations per second, i.e. about 16.67 MIPS.

2) Describe the mechanisms for instruction pipelining in terms of prefetch buffers and multiple functional units.
(10 Marks)
Sol: Prefetch buffers:

Figure: prefetch buffers. Blocks: memory/cache, fetch, sequential buffers 1 and 2, target buffers 1 and 2 (instructions from branched locations), instruction pipeline.

There are three types of prefetch buffers, used to match the instruction fetch rate to the pipeline consumption rate:
1. Sequential buffers
2. Target buffers
3. Loop buffers
Sequential buffers:
• Sequential instructions are loaded into a pair of sequential buffers for in-sequence pipelining.
Target buffers:
• Instructions from a branch target are loaded into a pair of target buffers for out-of-sequence pipelining. Both buffers operate in FIFO fashion. These buffers become part of the pipeline as additional stages.
• A conditional branch instruction causes both the sequential buffers and the target buffers to fill with instructions.
• After the branch condition is checked, the appropriate instructions are taken from one of the two buffers. The instructions in the other buffer are discarded.
• Within each pair, the two buffers alternate to prevent a collision between instructions flowing into and out of the pipeline.
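The choice between the sequential and the target buffers can be illustrated with a small Python sketch. It is a toy model under stated assumptions: the function name, the buffer depth of 2 and the instruction labels are hypothetical, and real hardware fills the buffers concurrently rather than in a function call.

```python
from collections import deque


def resolve_branch(memory, branch_addr, target_addr, taken, depth=2):
    """Return the instructions issued after a conditional branch resolves."""
    # Both buffers fill while the branch condition is still unknown.
    sequential = deque(memory[a] for a in range(branch_addr + 1, branch_addr + 1 + depth))
    target = deque(memory[a] for a in range(target_addr, target_addr + depth))
    # Once the condition is known, one buffer is used and the other discarded.
    chosen, other = (target, sequential) if taken else (sequential, target)
    other.clear()
    issued = []
    while chosen:
        issued.append(chosen.popleft())  # FIFO order into the pipeline
    return issued


memory = ["I%d" % i for i in range(8)]   # hypothetical instruction stream
print(resolve_branch(memory, branch_addr=1, target_addr=5, taken=True))   # ['I5', 'I6']
print(resolve_branch(memory, branch_addr=1, target_addr=5, taken=False))  # ['I2', 'I3']
```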

Loop buffers:
• These buffers hold the sequential instructions contained in a small loop. The loop buffers are maintained by the fetch stage of the pipeline. Prefetched instructions in the loop body are executed repeatedly until all iterations complete execution.
• The loop buffer operates in two steps.
a. It contains instructions sequentially ahead of the current instruction. This saves the instruction fetch time from memory.
b. It recognizes when the target of a branch falls within the loop boundary.

Multiple functional units:
• The architecture in the figure is a pipelined scalar architecture. To resolve data dependences and resource dependences among successive instructions entering the pipeline, reservation stations (RS) are used with each functional unit. Operations wait in the reservation stations until their data dependences have been resolved. Each reservation station is uniquely identified by a tag, which is monitored by a tag unit.
• The tag unit keeps checking the tags from all currently used registers or reservation stations.
• This register-tagging technique allows the hardware to resolve conflicts between source and destination registers assigned to multiple instructions.
• Besides resolving conflicts, the reservation stations also serve as buffers to interface the pipelined functional units with the decode and issue units.
• The multiple functional units are supposed to operate in parallel once the dependences are resolved.
Figure: a pipelined scalar processor with multiple functional units. Blocks: instructions from memory, instruction fetch unit, decode and issue unit, tag unit, register file, reservation stations (RS), functional units (FU), load registers, memory.

PART-2

Answer any Two full questions.

3) Consider the five-stage pipelined processor specified by the following reservation table.

1 2 3 4 5 6
S1
X X
S2 X X
X
S3
X
S4
X X
S5

(a) What are the forbidden latencies?

(b) Draw the state transition diagram.
(c) List all the simple cycles and greedy cycles.
(d) Determine the optimal constant latency cycle and the minimal average latency
(MAL).
(10 Marks)
Sol: Forbidden latencies: 3, 4 and 5
Permissible latencies: 1, 2 and 6
Collision vector: C5C4C3C2C1 = 11100

CASE 1: latency 1
Present state 11100
Collision vector 11100
PS with 1 shifts + 01110
Next state 11110

Present state 11110 Present state 11110

Collision vector 11100 Collision vector 11100
PS with 1 shifts + 01111 PS with 6 shifts + 00000
Next state 11111 Next state 11100

Present state 11111

Collision vector 11100
PS with 6 shifts + 00000
Next state 11100

CASE 2: latency 2

Present state 11100

Collision vector 11100
PS with 2 shifts + 00111
Next state 11111

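From state 11111 the bits C1 and C2 are both 1, so latency 2 is not permissible; the only move is a latency of 6 or more:

Present state 11111
Collision vector 11100
PS with 6 shifts + 00000
Next state 11100

CASE 3: latency 6

Present state 11100
Collision vector 11100
PS with 6 shifts + 00000
Next state 11100

State transition diagram (an asterisk marks a greedy, minimum-latency move):
11100: 1* -> 11110, 2 -> 11111, 6+ -> 11100
11110: 1* -> 11111, 6+ -> 11100
11111: 6+ -> 11100 (the only move)

Latency cycles: (6), (2, 6), (1, 6), (1, 1, 6)
Simple cycles: (6), (2, 6), (1, 6), (1, 1, 6)
Greedy cycle: (1, 1, 6)
Optimal constant latency cycle: (6). A constant latency of 2 is not collision-free, because tasks started 2 cycles apart put every second task 4 cycles apart, and 4 is a forbidden latency.
MAL: Lower bound = 2
Upper bound = 3+1 = 4
Average greedy cycle latency = (1+1+6) / 3 = 8/3 ≈ 2.67
MAL = 8/3 ≈ 2.67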

4) Consider the following pipelined processor with four stages. This pipeline has a total
evaluation time of six clock cycles. All successor stages must be used after each clock
cycle.

Output

Input
S1 S2 S3 S4
(a) Specify the reservation table for this pipeline with six columns and four rows.
(b) List the set of forbidden latencies between task initiations.
(c) Draw the state diagram which shows all possible latency cycles
(d) List all greedy cycles from the state diagram
(e) What is the value of minimal average latency (MAL)?
(10 Marks)
Sol:
Reservation table:

1 2 3 4 5 6
S1
X X
S2 X X X
X X
S3
X
S4

Forbidden latencies: 2 and 4

Permissible latencies: 1, 3 and 5
Collision vector: C4C3C2C1 = 1010

CASE 1: latency 1
Present state 1010
Collision vector 1010
PS with 1 shifts + 0101
Next state 1111

From state 1111 every latency from 1 to 4 is forbidden, so the only move is a latency of 5 or more:

Present state 1111
Collision vector 1010
PS with 5 shifts + 0000
Next state 1010

CASE 2: latency 3

Present state 1010

Collision vector 1010
PS with 3 shifts + 0001
Next state 1011
Present state 1011 Present state 1011
Collision vector 1010 Collision vector 1010
PS with 3 shifts + 0001 PS with 5 shifts + 0000
Next state 1011 Next state 1010

CASE 3: latency 5

Present state 1010

Collision vector 1010
PS with 5 shifts + 0000
Next state 1010

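State transition diagram (an asterisk marks a greedy, minimum-latency move):
1010: 1* -> 1111, 3 -> 1011, 5+ -> 1010
1011: 3* -> 1011, 5+ -> 1010
1111: 5+ -> 1010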

Simple cycles: (3),(5),(3,5),(1,5)

Greedy cycles: (3) (1,5)
Average greedy cycle latency = (1+5) / 2 = 3
MAL: Lower bound = 3
Upper bound = 2+1 = 3
MAL = 3
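The list of simple cycles can be checked by enumerating them from the collision vector. The Python sketch below is one way to do it, not the method used in the solution: it rebuilds the states with the shift-and-OR rule and runs a depth-first search in which no state is visited twice. The latency m+1 stands for "m+1 or more" (5+ here), and all names are illustrative.

```python
from fractions import Fraction


def simple_cycles(cv):
    """List the simple latency cycles for the collision vector `cv`."""
    m = len(cv)

    def step(state, p):
        return format((int(state, 2) >> p) | int(cv, 2), "0{}b".format(m))

    def moves(state):
        return [i for i in range(1, m + 1) if state[m - i] == "0"] + [m + 1]

    # Discover all reachable states, starting from the initial state.
    states, todo = [], [cv]
    while todo:
        s = todo.pop()
        if s not in states:
            states.append(s)
            todo.extend(step(s, p) for p in moves(s))
    order = {s: k for k, s in enumerate(states)}

    found = []

    def dfs(start, state, path, visited):
        for p in moves(state):
            nxt = step(state, p)
            if nxt == start:
                found.append(tuple(path + [p]))  # closed a cycle
            elif nxt not in visited and order[nxt] > order[start]:
                # Only walk through later states, so each cycle is listed once.
                dfs(start, nxt, path + [p], visited | {nxt})

    for s in states:
        dfs(s, s, [], {s})
    return found


cycles = simple_cycles("1010")
best = min(Fraction(sum(c), len(c)) for c in cycles)
print(sorted(cycles), best)
```

For the collision vector 1010 this gives (3), (5), (3, 5) and (1, 5), with a smallest average latency of 3.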

5) Design an arithmetic pipeline unit for fixed-point multiplication of 8-bit integers using CSA and CPA. (10 Marks)
Sol:
An arithmetic pipeline unit for fixed-point multiplication of 8-bit integers using CSA and CPA:
PART-3

Answer any Two full questions.

6) How is the dot product operation
$S = \sum_{i=1}^{n} a_i \times b_i$
implemented without data forwarding? What are the advantages that accrue with internal data forwarding? (5+5 = 10 Marks)

Sol: The dot product operation
$S = \sum_{i=1}^{n} a_i \times b_i$

For example: A = (1, 2, 3, 4)

B = (4, 5, 6, 7)
A ● B = (1x4+2x5+3x6+4x7) = 60
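The worked example can be reproduced with a multiply step followed by an add step in a loop, which is the sequential structure the question refers to. This is a plain software sketch; the variable names are illustrative and do not correspond to specific hardware registers.

```python
def dot_product(a, b):
    """Accumulate a_i x b_i one pair at a time."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    total = 0
    for ai, bi in zip(a, b):
        product = ai * bi   # multiply step
        total += product    # add step, which consumes the multiplier output
    return total


print(dot_product((1, 2, 3, 4), (4, 5, 6, 7)))  # 60
```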
Figure: implementing the dot-product operation with internal data forwarding between a multiply unit and an add unit.

Advantages:
• Without internal data forwarding, the three instructions must be executed sequentially in a looping structure.
• With data forwarding, the output of the multiplier is fed directly into the input register R4 of the adder, and at the same time it is also routed to register R3, as shown in the figure.
• Internal data forwarding between the two functional units therefore reduces the total execution time through the pipelined processor.

7) Design a binary multiply pipeline unit for two 4-bit operands. Use minimum number
of CSA’s and CPA’s. Show all interconnections and bus width in the schematic
diagram. Calculate the output of each CSA and CPA.
(5+5 = 10 Marks)
Sol:
A binary multiply unit for two 4-bit operands:

For example: two 4-bit operands

      1111
x     1111
      1111
     11110
    111100
   1111000
  11100001

CSA1:
001111
011110
111100
S = 101101
C = 111100

CSA2:
0101101
0111100
1111000
S = 1101001
C = 1111000

CPA:
1101001
+ 1111000
S= 11100001
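The CSA and CPA values above can be verified with a short bit-level sketch in Python. A carry-save adder takes three inputs and produces a sum word (bitwise XOR) and a carry word (bitwise majority, shifted left by one); the carry-propagate adder then adds the last two words. The function names are illustrative.

```python
def csa(x, y, z):
    """Carry-save adder: return (sum word, shifted carry word)."""
    s = x ^ y ^ z
    c = ((x & y) | (y & z) | (x & z)) << 1
    return s, c


def multiply_4bit(a, b):
    """Multiply two 4-bit operands with two CSAs followed by one CPA."""
    partial = [(a << i) if (b >> i) & 1 else 0 for i in range(4)]
    s1, c1 = csa(partial[0], partial[1], partial[2])   # CSA1
    s2, c2 = csa(s1, c1, partial[3])                   # CSA2
    return s1, c1, s2, c2, s2 + c2                     # CPA adds the last pair


s1, c1, s2, c2, product = multiply_4bit(0b1111, 0b1111)
print(format(s1, "b"), format(c1, "b"))   # 101101 111100
print(format(s2, "b"), format(c2, "b"))   # 1101001 1111000
print(format(product, "b"))               # 11100001
```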

8) Describe dynamic instruction scheduling achieved in Tomasulo's register-tagging scheme built in the IBM 360/91 processor. (10 Marks)
Sol:
Dynamic instruction scheduling achieved in Tomasulo's register-tagging scheme built in the IBM 360/91 processor:
• This hardware dependence-resolution scheme was implemented with the multiple floating-point units of the IBM 360/91 processor. For the Model 91 processor, three RSs are used in the floating-point adder and two pairs in the floating-point multiplier.
• The scheme resolves resource conflicts as well as data dependences, using register tagging to allocate or deallocate the source and destination registers.
• An issued instruction whose operands are not available is forwarded to an RS associated with the functional unit it will use.
• It waits until its data dependences have been resolved and its operands become available.
• The dependence is resolved by monitoring the result bus.
• When all operands for an instruction are available, it is dispatched to the functional unit for execution.
• All working registers are tagged.
• If a source register is busy when an instruction reaches the issue stage, the tag for the source register is forwarded to an RS.
• When the register becomes available, the tag can signal the availability.
Total execution time is 13 cycles, from cycle 4 to cycle 16
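The tag mechanism can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the Model 91 design: the class, the tag names RS1 and RS2, and the two supported operations are hypothetical. An operand is held either as a value or as the tag of the station that will produce it, and a station becomes ready once every tag has been replaced by a value from the result bus.

```python
class ReservationStation:
    """One waiting operation; operands are ('value', v) or ('tag', t)."""

    def __init__(self, tag, op, sources):
        self.tag = tag
        self.op = op
        self.sources = list(sources)

    def capture(self, tag, value):
        """Watch the result bus: replace a matching tag with its value."""
        self.sources = [("value", value) if s == ("tag", tag) else s
                        for s in self.sources]

    def ready(self):
        return all(kind == "value" for kind, _ in self.sources)

    def execute(self):
        a, b = (v for _, v in self.sources)
        return a + b if self.op == "add" else a * b


mul = ReservationStation("RS1", "mul", [("value", 2), ("value", 3)])
add = ReservationStation("RS2", "add", [("tag", "RS1"), ("value", 4)])

waiting = add.ready()            # False: the add still holds the tag RS1
result = mul.execute()           # the multiplier finishes first
add.capture(mul.tag, result)     # the result bus broadcasts (RS1, 6)
print(waiting, add.ready(), add.execute())  # False True 10
```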
