0% found this document useful (0 votes)

46 views16 pages

Implementation of Multi Stage Processor

The document discusses the design and implementation of a 32-bit multi-stage RISC-V processor, highlighting the advantages of multi-stage processing over single-cycle processing. It details the five stages of the pipeline, potential hazards (data, control, and structural), and provides a Verilog implementation of the processor. The document emphasizes the efficiency and speed improvements achieved through pipelining in instruction execution.

Uploaded by

ceralap881

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views16 pages

Implementation of Multi Stage Processor

Uploaded by

ceralap881

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Design and Implementation of 32-bit Multi stage

RISC-V Processor

1. Why Choose a Multi-Stage Processor Over a Single-Cycle Processor?

i. Single Cycle processor

 Before diving into the multi-stage pipeline processor, let's first understand the single-
cycle processor. Then we can see why pipelining is important.

 A Single Cycle RISC-V Processor is a basic CPU design in which every

instruction is executed in exactly one clock cycle.
 This includes all five stages of instruction execution: instruction fetch, decode,
execute, memory access, and write-back.

 To understand better, can we execute a few instructions step-by-step in a Single Cycle

RISC-V Processor and observe how they go through each stage?

Program:

main:

addi x1, x0, 5

addi x2, x0, 10

add x3, x2, x1

https://drive.google.com/file/d/19tvVzC2Peg3M1gEwgkp9zjb7W3Y67ugn/view?usp=drive_
link
Cycle Instruction ID EX
IF MEM WB
1 addi x1, Decode, Add 0 + 5 No Write 5 -
Fetch
x0, 5 read x0 (0) memory x1
from
PC=0
2 Fetch from Decode, Add 0 + 10
addi x2, No Write 10
PC=4 read x0 (0)
x0, 10 memory – x2
3 add x3, x2, Fetch from Decode, Add 10 + 5 No
Write 15 -
x1 PC=8 read x2 memory
x3
(10), x1 (5)

 No need to handle data, control, or structural hazards since there’s no overlap between
instructions.
 The clock cycle has to be long enough to finish the slowest instruction so faster
instructions waste time.
 Only one instruction runs at a time, so it’s slow overall.
2. Pipeline Stages in a Multi-Stage RISC-V Processor
 That's why we use a multi-stage processor, it runs faster and is more efficient than
a single-cycle processor.
 Stages in Multi stage pipelined processor
5-stage pipeline:
IF → ID → EX → MEM → WB

 IF – Instruction Fetch
 Here we fetch an instruction from memory.
 PC register already contains the address of next instruction, so simply whatever is there
in PC from that memory location we read.
 ID – Instruction Decode
 Here we try to decode the opcode and find out the what kind of instruction it is.
 While decoding is going on it also do some fetching.
 Assuming that there will be 16bit immediate data, it will be taking that last 16bit of
instruction and it will be doing a sign extension to 32bits.
 EX-Execute
 Here we execute the instruction or some instructions we have to compute the effective
address.
 It’s actual memory address from which data will be loaded (LW) or to which data will
be stored(SW).
 MEM – Memory Access
 In this stage here it actually d memory access, read & write from memory.
 For branch instruction it decides whether to branch or not.
 WB – Write Back
 The result of an instruction is written back to the register file.
 After an instruction finishes calculating, we store the result into register in the register
file.

 Let us understand the pipeline stages in a multi-stage processor by taking an example.

Program:
.text
main:
addi x1, x0, 5
addi x2, x0, 10
nop
nop
add x3, x2, x1
link: https://drive.google.com/file/d/1kfnzHK05PBeWIAu1yKHBBrIyfgy1o7B-
/view?usp=drive_link

Cycle IF ID WB
EX MEM

1 addi x1, x0,

5
2 addi x2, x0, addi x1, x0,
10 5
3 NOP addi x2, x0, addi x1, x0,
10 5
4 NOP NOP addi x2, x0, addi x1, x0,
10 5
5 add x3, x2, NOP NOP addi x2, x0, addi x1, x0,
x1 10 5
6 add x3, x2, NOP NOP addi x2, x0,
x1 10
7 add x3, x2, NOP NOP
x1
8 add x3, x2, NOP
x1
9 add x3, x2,
x1
3. Micro-Operations in Each Pipeline Stage

IF_ID Stage:
The instruction is fetched from memory using the program counter, and the PC is
incremented by 4 to point to the next instruction.

ID/EX Stage :
The instruction is decoded, source registers are read, and control signals are generated
for the next stage.

EX/MEM Stage:
The ALU performs the required operation such as arithmetic or address calculation, and
the result is passed to the memory stage along with updated control signals.

MEM/WB Stage:
If it’s a load instruction, data is read from memory; otherwise, the ALU result is
prepared to be written back to the register file.
4. Types of Hazards in a Multi-Stage Pipeline

 Data Hazards:
When an instruction depends on the result of a previous instruction that hasn’t yet
completed.

Example:
addi x1, x0, 5 # x1 = 5
addi x2, x0, 10 # x2 = 10
add x3, x1, x2 # x3 = x1 + x2 → data hazard here

link:
https://drive.google.com/file/d/1hl8igFd6qln0DeALkVckzusGewKk0ejF/view?us
p=drive_link

Cycle IF ID EX WB
MEM

1 addi x1,
x0, 5
2 addi x2, addi x1,
x0, 10 x0, 5
3 add x3, addi x2, addi x1,
x1, x2 x0, 10 x0, 5
4 add x3, addi x2, addi x1,
x1, x2 x0, 10 x0, 5
5 add x3, addi x2, addi x1,
x1, x2 x0, 10 x0, 5
6 add x3, addi x2,
x1, x2 x0, 10
7 add x3,
x1, x2

 add x3, x1, x2 is trying to read x1 and x2 in its ID stage.

 But x1 and x2 haven’t reached WB yet, so their correct values aren't available yet.
 This is a Read After Write (RAW) data hazard.
 Control Hazards:
Hazards caused by branch or jump instructions that change the program counter
(PC).

Example:
What's is hazard in this
# Assume x1 = 5, x2 = 5 initially

addi x1, x0, 5 # x1 = 5

addi x2, x0, 5 # x2 = 5
beq x1, x2, target # If equal, jump to target
addi x3, x0, 10 # This should be skipped if branch is taken
addi x4, x0, 20 # This will be target

target:
addi x5, x0, 30 # This is where we land if beq taken

Link:
https://drive.google.com/file/d/1IAJcRL9DWJ0aPErHSCn9yp1pkTqZmGHF/vie
w?usp=drive_link

Cycle IF ID EX MEM WB

1 addi x1, x0,

5
2 addi x2, x0, addi x1, x0,
5 5
3 beq x1, x2, addi x2, x0, addi x1, x0,
target 5 5
4 addi x3, x0, beq x1, x2, addi x2, x0, addi x1, x0,
10 target 5 5
5 addi x4, x0, addi x3, x0, beq x1, x2, addi x2, x0, addi x1,
20 10 target 5 x0, 5

 In a 5-stage pipeline (like in Ripes), branch instructions like beq are only
resolved in the Execute (EX) stage, which is 2 cycles after the fetch.
 The branch decision (beq) is only made in the Execute (EX) stage.
 Meanwhile, the next instructions (addi x3, addi x4) are already fetched and
possibly entered decode or execute stages.
 This creates a Control Hazard — the CPU is unsure whether to continue with
x3/x4 or jump to target.

 Structural Hazard:
A Structural Hazard occurs when hardware resources are not sufficient to support
multiple instructions executing in parallel in the pipeline.

Example:
lw x1, 0(x2) # Instruction 1 — Load word from memory into x1
addi x3, x0, 5 # Instruction 2 — Set x3 = 5 (uses ALU, no memory access)
sw x4, 0(x5) # Instruction 3 — Store word from x4 into memory at address in x5

link:
https://drive.google.com/file/d/1V3A_KQz1b-
CuY_dOkxBQSieeDTePHF8T/view?usp=drive_link

 lw x1, 0(x2) and sw x4, 0(x5) involve memory access.

 If memory is not properly initialized or x2/x5 don't point to valid memory,
these memory-related instructions don't actually read or write correctly.
 But addi x3, x0, 5 is a pure ALU instruction (doesn't depend on memory),
so it always works and updates x3.
5. Implementation of Multi stage RISC-V processor using Verilog.

Design:

module pipe_riscv32(clk1, clk2);

input clk1, clk2;

reg [31:0] pc, IF_ID_IR, IF_ID_NPC;

reg [31:0] ID_EX_IR, ID_EX_NPC, ID_EX_A, ID_EX_B, ID_EX_Imm;
reg [2:0] ID_EX_type, EX_MEM_type, MEM_WB_type;
reg [31:0] EX_MEM_IR, EX_MEM_ALUOut, EX_MEM_B;
reg EX_MEM_cond;
reg [31:0] MEM_WB_IR, MEM_WB_ALUOut, MEM_WB_LMD;
reg [31:0] Reg [0:31]; // Register Bank 32x32
reg [31:0] MEM [0:1023]; // Memory 1024x32

reg HALTED;
reg TAKEN_BRANCH;

parameter ADD = 6'b000000,

SUB = 6'b000001,
AND = 6'b000010,
OR = 6'b000011,
SLT = 6'b000100,
MUL = 6'b000101,
HLT = 6'b111111,
LW = 6'b001000,
SW = 6'b001001,
ADDI= 6'b001010,
SUBI= 6'b001011,
SLTI= 6'b001100,
BNEQZ=6'b001101,
BEQZ= 6'b001110;

parameter RR_ALU = 3'b000,

RM_ALU = 3'b001,
LOAD = 3'b010,
STORE = 3'b011,
BRANCH = 3'b100,
HALT = 3'b101;

// IF Stage
always @(posedge clk1)
if (HALTED == 0) begin
if (((EX_MEM_IR[31:26] == BEQZ) && (EX_MEM_cond == 1)) ||
((EX_MEM_IR[31:26] == BNEQZ) && (EX_MEM_cond == 0))) begin
IF_ID_IR <= #2 MEM[EX_MEM_ALUOut];
TAKEN_BRANCH <= #2 1'b1;
IF_ID_NPC <= #2 EX_MEM_ALUOut + 1;
pc <= #2 EX_MEM_ALUOut + 1;
end else begin
IF_ID_IR <= #2 MEM[pc];
IF_ID_NPC <= #2 pc + 1;
pc <= #2 pc + 1;
end
end

// ID Stage
always @(posedge clk2)
if (HALTED == 0) begin
ID_EX_A <= #2 Reg[IF_ID_IR[25:21]];
if (IF_ID_IR[20:16] == 5'b00000)
ID_EX_B <= #2 0;
else
ID_EX_B <= #2 Reg[IF_ID_IR[20:16]];

ID_EX_NPC <= #2 IF_ID_NPC;

ID_EX_IR <= #2 IF_ID_IR;
ID_EX_Imm <= #2 {{16{IF_ID_IR[15]}}, IF_ID_IR[15:0]}; // sign-extend imm

case (IF_ID_IR[31:26])
ADD, SUB, AND, OR, SLT, MUL: ID_EX_type <= #2 RR_ALU;
ADDI, SUBI, SLTI: ID_EX_type <= #2 RM_ALU;
LW: ID_EX_type <= #2 LOAD;
SW: ID_EX_type <= #2 STORE;
BNEQZ, BEQZ: ID_EX_type <= #2 BRANCH;
HLT: ID_EX_type <= #2 HALT;
default: ID_EX_type <= #2 HALT;
endcase
end

// EX Stage
always @(posedge clk1)
if (HALTED == 0) begin
EX_MEM_type <= #2 ID_EX_type;
EX_MEM_IR <= #2 ID_EX_IR;
TAKEN_BRANCH <= #2 0;

case (ID_EX_type)
RR_ALU: begin
case (ID_EX_IR[31:26])
ADD: EX_MEM_ALUOut <= #2 ID_EX_A + ID_EX_B;
SUB: EX_MEM_ALUOut <= #2 ID_EX_A - ID_EX_B;
AND: EX_MEM_ALUOut <= #2 ID_EX_A & ID_EX_B;
OR: EX_MEM_ALUOut <= #2 ID_EX_A | ID_EX_B;
MUL: EX_MEM_ALUOut <= #2 ID_EX_A * ID_EX_B;
SLT: EX_MEM_ALUOut <= #2 (ID_EX_A < ID_EX_B);
default: EX_MEM_ALUOut <= #2 32'hxxxxxxxx;
endcase
end
RM_ALU: begin
case (ID_EX_IR[31:26])
ADDI: EX_MEM_ALUOut <= #2 ID_EX_A + ID_EX_Imm;
SUBI: EX_MEM_ALUOut <= #2 ID_EX_A - ID_EX_Imm;
SLTI: EX_MEM_ALUOut <= #2 (ID_EX_A < ID_EX_Imm);
default: EX_MEM_ALUOut <= #2 32'hxxxxxxxx;
endcase
end
LOAD, STORE: begin
EX_MEM_ALUOut <= #2 ID_EX_A + ID_EX_Imm;
EX_MEM_B <= #2 ID_EX_B;
end
BRANCH: begin
EX_MEM_ALUOut <= #2 ID_EX_NPC + ID_EX_Imm;
EX_MEM_cond <= #2 (ID_EX_A == 0); // assuming zero check for branch
end
endcase
end

// MEM Stage
always @(posedge clk2)
if (HALTED == 0) begin
MEM_WB_type <= #2 EX_MEM_type;
MEM_WB_IR <= #2 EX_MEM_IR;
case (EX_MEM_type)
RR_ALU, RM_ALU: MEM_WB_ALUOut <= #2 EX_MEM_ALUOut;
LOAD: MEM_WB_LMD <= #2 MEM[EX_MEM_ALUOut];
STORE:
if (TAKEN_BRANCH == 0)
MEM[EX_MEM_ALUOut] <= #2 EX_MEM_B;
endcase
end
// WB Stage
always @(posedge clk1)
if (HALTED == 0) begin
if (TAKEN_BRANCH == 0)
case (MEM_WB_type)
RR_ALU: Reg[MEM_WB_IR[15:11]] <= #2 MEM_WB_ALUOut;
RM_ALU, LOAD: Reg[MEM_WB_IR[20:16]] <= #2 (MEM_WB_type == LOAD
? MEM_WB_LMD : MEM_WB_ALUOut);
HALT: HALTED <= #2 1'b1;
endcase
end

endmodule
Testbench:

module pipe_riscv32_tb;

reg clk1, clk2;

integer i;

// Instantiate the processor

pipe_riscv32 DUT(clk1, clk2);

// Clock generation

initial begin

clk1 = 0; clk2 = 0;

forever begin

#5 clk1 = ~clk1; // Toggle clk1 every 5 time units

#5 clk2 = ~clk2; // Toggle clk2 after clk1

end

// Initialize memory and registers

initial begin

// Clear register file and memory

for (i = 0; i < 32; i = i + 1)

DUT.Reg[i] = 0;

for (i = 0; i < 1024; i = i + 1)

DUT.MEM[i] = 32'h00000000;
// -----------------------------

// Program: Simple instruction flow

// -----------------------------

// ADDI R1, R0, #5 => R1 = 5

// ADDI R2, R0, #10 => R2 = 10

// ADD R3, R1, R2 => R3 = R1 + R2 = 15

// SW R3, 100(R0) => MEM[100] = R3

// LW R4, 100(R0) => R4 = MEM[100]

// HLT

DUT.MEM[0] = {6'b001010, 5'd0, 5'd1, 16'd5}; // ADDI R1, R0, 5

DUT.MEM[1] = {6'b001010, 5'd0, 5'd2, 16'd10}; // ADDI R2, R0, 10

DUT.MEM[2] = {6'b000000, 5'd1, 5'd2, 5'd3, 11'd0}; // ADD R3, R1, R2

DUT.MEM[3] = {6'b111111, 26'd0}; // HLT

// Reset control flags

DUT.HALTED = 0;

DUT.TAKEN_BRANCH = 0;

DUT.pc = 0;

// Simulation time

#200;

// Output Register Contents

$display("\nFinal Register Values:");

for (i = 0; i < 8; i = i + 1)

$display("R[%0d] = %0d", i, DUT.Reg[i]);

$display("\nMemory[100] = %0d", DUT.MEM[100]);

$finish;

end

endmodule
6. Result

Final Register Values:

R[0] = 0
R[1] = 5
R[2] = 10
R[3] = 5
R[4] = 0
R[5] = 0
R[6] = 0
R[7] = 0

Memory[100] = 0
testbench.sv:56: $finish called at 200 (1s)

7. Simulation
8. References:
[1] I. Sen Gupta, Computer Organization and Architecture, National Programme on
Technology Enhanced Learning (NPTEL), IIT Kharagpur

[2] The RISC-V Instruction Set Manual, Volume I: Unprivileged ISA, Document Version
20191213, RISC-V Foundation.

Design of 32bit MIPS Processor
No ratings yet
Design of 32bit MIPS Processor
23 pages
FemtoRV32 Piplined Processor Report
No ratings yet
FemtoRV32 Piplined Processor Report
25 pages
Pipelining 2019
No ratings yet
Pipelining 2019
82 pages
Lec 2
No ratings yet
Lec 2
21 pages
CAO Fall 2024 Lecture 07 RISC V Pipelined Implementation
No ratings yet
CAO Fall 2024 Lecture 07 RISC V Pipelined Implementation
114 pages
Week 11
No ratings yet
Week 11
33 pages
Tiled Chip Multicore Processor Overview
No ratings yet
Tiled Chip Multicore Processor Overview
64 pages
Lecture 4.3 - The Processor - Pipelining
No ratings yet
Lecture 4.3 - The Processor - Pipelining
27 pages
Presentation 1
No ratings yet
Presentation 1
22 pages
Chapter - 04 RISC V
No ratings yet
Chapter - 04 RISC V
132 pages
Chapter 04 Processor 2
No ratings yet
Chapter 04 Processor 2
28 pages
Module 5 - Processor Structure and Function
No ratings yet
Module 5 - Processor Structure and Function
74 pages
Reduced Instruction Set Computer (Risc) Complex Instruction Set Computer (Cisc)
No ratings yet
Reduced Instruction Set Computer (Risc) Complex Instruction Set Computer (Cisc)
7 pages
Processor Structure and Function Overview
No ratings yet
Processor Structure and Function Overview
9 pages
Group 17 - 2151177
No ratings yet
Group 17 - 2151177
15 pages
SRM Pipelining 05
No ratings yet
SRM Pipelining 05
42 pages
DSD Report
No ratings yet
DSD Report
4 pages
RISC V Processor Architecture and 5 Stage Pipeline Implementation On FPGA Using Verilog
No ratings yet
RISC V Processor Architecture and 5 Stage Pipeline Implementation On FPGA Using Verilog
10 pages
DDCO Notes-162-171
No ratings yet
DDCO Notes-162-171
10 pages
CAAL-Micro Architechture
No ratings yet
CAAL-Micro Architechture
21 pages
8 Pipeline DDP Control
No ratings yet
8 Pipeline DDP Control
54 pages
Pipelining for Enhanced Performance
No ratings yet
Pipelining for Enhanced Performance
71 pages
RISC-V Pipelined Datapath Overview
No ratings yet
RISC-V Pipelined Datapath Overview
50 pages
FPGA
No ratings yet
FPGA
10 pages
RISC Pipeline Overview
No ratings yet
RISC Pipeline Overview
39 pages
Computer Architecture: Pipelining: Dr. Ashok Kumar Turuk
No ratings yet
Computer Architecture: Pipelining: Dr. Ashok Kumar Turuk
136 pages
Module 2
No ratings yet
Module 2
64 pages
Presentation 35191 Content Document 20250423021246PM
No ratings yet
Presentation 35191 Content Document 20250423021246PM
46 pages
Lec7 Pipelining
No ratings yet
Lec7 Pipelining
22 pages
Cs501 Notes
No ratings yet
Cs501 Notes
33 pages
2.pipeline RISC-V v2
No ratings yet
2.pipeline RISC-V v2
47 pages
CA Lecture 12
No ratings yet
CA Lecture 12
48 pages
Unit 5 Pipeline Hazard
No ratings yet
Unit 5 Pipeline Hazard
31 pages
Instruction Pipelining Explained
No ratings yet
Instruction Pipelining Explained
27 pages
Multi-Core Computer Architecture: Instruction Pipeline Hazards
No ratings yet
Multi-Core Computer Architecture: Instruction Pipeline Hazards
23 pages
Embedded Computer Architecture 5SAI0
No ratings yet
Embedded Computer Architecture 5SAI0
59 pages
Slides Chapter 5 Basic Processing Unit
No ratings yet
Slides Chapter 5 Basic Processing Unit
44 pages
CH10-Processor Structure and Function
No ratings yet
CH10-Processor Structure and Function
14 pages
William Stallings Computer Organization and Architecture 9 Edition
No ratings yet
William Stallings Computer Organization and Architecture 9 Edition
55 pages
ILP - Appendix C PDF
No ratings yet
ILP - Appendix C PDF
52 pages
IT3030E CA Chap5 CPU - Removed
No ratings yet
IT3030E CA Chap5 CPU - Removed
26 pages
Pipelining in Computer Architecture
No ratings yet
Pipelining in Computer Architecture
38 pages
Digital Design & CPU Basics
No ratings yet
Digital Design & CPU Basics
10 pages
CA Unit 3 Answers
No ratings yet
CA Unit 3 Answers
10 pages
Analysis of CPU
No ratings yet
Analysis of CPU
9 pages
L117-19 MIPS Pipeline Implementation
No ratings yet
L117-19 MIPS Pipeline Implementation
37 pages
EE739 InO Report Final
No ratings yet
EE739 InO Report Final
8 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
85 pages
Pipeline Registers in Pipelined Datapath
No ratings yet
Pipeline Registers in Pipelined Datapath
33 pages
L8 PipelineHazards 1
No ratings yet
L8 PipelineHazards 1
28 pages
CA07 2022S3 New
No ratings yet
CA07 2022S3 New
29 pages
William Stallings Computer Organization and Architecture 8 Edition Processor Structure and Function
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Processor Structure and Function
74 pages
Computer Architecture: Introduction To The Concept of Pipelined Processor
No ratings yet
Computer Architecture: Introduction To The Concept of Pipelined Processor
20 pages
CPU Architecture Essentials
No ratings yet
CPU Architecture Essentials
40 pages
MIPS Pipeline Multi-Cycle Operations Guide
No ratings yet
MIPS Pipeline Multi-Cycle Operations Guide
15 pages
MIPS Pipelining Explained
No ratings yet
MIPS Pipelining Explained
2 pages
12 - Processor Structure and Function
No ratings yet
12 - Processor Structure and Function
73 pages
Chapter 04 RISC V Removed
No ratings yet
Chapter 04 RISC V Removed
99 pages
Verification Interview Questions
50% (2)
Verification Interview Questions
10 pages
System - Verilog - Interview - QA
100% (1)
System - Verilog - Interview - QA
28 pages
AXI Protocol Questions
83% (12)
AXI Protocol Questions
5 pages
System Verilog Constraints Examples
67% (3)
System Verilog Constraints Examples
46 pages
Fifo Verif Plan
50% (2)
Fifo Verif Plan
20 pages
AXI Assertions
67% (3)
AXI Assertions
42 pages
UVM Interview Prep Guide
100% (1)
UVM Interview Prep Guide
12 pages
SystemVerilog Assertion Examples
50% (4)
SystemVerilog Assertion Examples
9 pages
UVM Interview Handbook
100% (1)
UVM Interview Handbook
55 pages
SystemVerilog Randomization Constraints
100% (3)
SystemVerilog Randomization Constraints
24 pages
AXI Protocol Handshaking Explained
100% (3)
AXI Protocol Handshaking Explained
7 pages
System Verilog
No ratings yet
System Verilog
132 pages
Solving Complex Users' Assertions: by Ben Cohen
No ratings yet
Solving Complex Users' Assertions: by Ben Cohen
8 pages
Advanced UVM: Architecting A UVM Testbench
100% (2)
Advanced UVM: Architecting A UVM Testbench
20 pages
UVM Interview Questions
100% (10)
UVM Interview Questions
27 pages
AMD Previous Int Questions
No ratings yet
AMD Previous Int Questions
9 pages
System Verilog Interview Questions
100% (1)
System Verilog Interview Questions
56 pages
Verilog Text Book
100% (1)
Verilog Text Book
431 pages
Digital Logic RTL & Verilog Interview Questions Preview
33% (6)
Digital Logic RTL & Verilog Interview Questions Preview
34 pages
AMD Interview Question 2nd Round
No ratings yet
AMD Interview Question 2nd Round
1 page
Asic Interview Questions
No ratings yet
Asic Interview Questions
13 pages
Conformal Verification Guide 8.1
No ratings yet
Conformal Verification Guide 8.1
98 pages
SystemVerilog Assertions for Signal Integrity
No ratings yet
SystemVerilog Assertions for Signal Integrity
4 pages
Uvm Code Examples
100% (1)
Uvm Code Examples
115 pages
SystemVerilog DPI Interface Guide
No ratings yet
SystemVerilog DPI Interface Guide
6 pages
Interview Questions For System Verilog: 1. What Is Clocking Block?
100% (2)
Interview Questions For System Verilog: 1. What Is Clocking Block?
11 pages
System Verilog Interview Questions
100% (2)
System Verilog Interview Questions
2 pages
Systemverilog Interview Questions
100% (1)
Systemverilog Interview Questions
39 pages
Verilog Interview Questions
No ratings yet
Verilog Interview Questions
67 pages
SystemVerilog Callback & Patterns
83% (6)
SystemVerilog Callback & Patterns
22 pages
3. Submicron CMOS Capacitors and Resistors
No ratings yet
3. Submicron CMOS Capacitors and Resistors
14 pages
Hey Perplexity, - I Want You To Make A List of Topic
No ratings yet
Hey Perplexity, - I Want You To Make A List of Topic
2 pages
Leetcode 7day 105
No ratings yet
Leetcode 7day 105
6 pages
ASIC by Sebastian Smith
No ratings yet
ASIC by Sebastian Smith
506 pages
Uhv Notes Co3 & Co4
No ratings yet
Uhv Notes Co3 & Co4
32 pages
Final Product
No ratings yet
Final Product
2 pages
Booth Multiplier
No ratings yet
Booth Multiplier
10 pages
Chapter-4: Microprocessor: 8086 and Modern Microprocessors
No ratings yet
Chapter-4: Microprocessor: 8086 and Modern Microprocessors
50 pages
Embedded System Design Overview BEC601
No ratings yet
Embedded System Design Overview BEC601
54 pages
Unit 5
No ratings yet
Unit 5
29 pages
Introduction To High Performance Scientific Computing
No ratings yet
Introduction To High Performance Scientific Computing
464 pages
Lecture: Pipelining Basics
No ratings yet
Lecture: Pipelining Basics
28 pages
Computer Architecture Exam Guide
No ratings yet
Computer Architecture Exam Guide
38 pages
Embedded Systems Design Overview
No ratings yet
Embedded Systems Design Overview
88 pages
Arm 7 Architecture
75% (4)
Arm 7 Architecture
22 pages
Development of The ARM Architecture
No ratings yet
Development of The ARM Architecture
44 pages
Computer Organization and Architecture Pipelining Set Execution, Stages and Throughput
No ratings yet
Computer Organization and Architecture Pipelining Set Execution, Stages and Throughput
7 pages
Qual A Diferença Entre Multiprocessamento e Multiprogramação?
No ratings yet
Qual A Diferença Entre Multiprocessamento e Multiprogramação?
465 pages
Pipelining Speedup Calculation
No ratings yet
Pipelining Speedup Calculation
11 pages
Modulo15b RiscV DDCArv Ch7
No ratings yet
Modulo15b RiscV DDCArv Ch7
26 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
214 pages
MIPS Processor Quiz: Hazards & Pipelining
No ratings yet
MIPS Processor Quiz: Hazards & Pipelining
30 pages
RISC vs CISC: CPU Architecture Explained
No ratings yet
RISC vs CISC: CPU Architecture Explained
9 pages
Overview of RISC Processor Architecture
No ratings yet
Overview of RISC Processor Architecture
9 pages
Anna University CA
No ratings yet
Anna University CA
9 pages
Computer Architecture Basics
No ratings yet
Computer Architecture Basics
69 pages
ASSIGNMENT1 Acsa
No ratings yet
ASSIGNMENT1 Acsa
3 pages
Optimizing CPU Performance with Pipelining
No ratings yet
Optimizing CPU Performance with Pipelining
82 pages
Modulo15 RiscV DDCArv Ch7
No ratings yet
Modulo15 RiscV DDCArv Ch7
34 pages
Computer Architecture MCQ
No ratings yet
Computer Architecture MCQ
102 pages
HPC Question Bank From SNGCE, Kadayirippu
No ratings yet
HPC Question Bank From SNGCE, Kadayirippu
3 pages
Arm Notes-1
No ratings yet
Arm Notes-1
10 pages
Q: What Is Instruction Level Parallelism (ILP) ? Explain Its Concepts
No ratings yet
Q: What Is Instruction Level Parallelism (ILP) ? Explain Its Concepts
18 pages
07 MIPS Pipelining CH4
No ratings yet
07 MIPS Pipelining CH4
73 pages
Cho - Deca (24cse0103)
No ratings yet
Cho - Deca (24cse0103)
9 pages
Dpco Unit IV Processor
No ratings yet
Dpco Unit IV Processor
26 pages
Tentative Outline CS-353 Computer Architecture CS Dept.
No ratings yet
Tentative Outline CS-353 Computer Architecture CS Dept.
3 pages

Implementation of Multi Stage Processor

Uploaded by

Implementation of Multi Stage Processor

Uploaded by

Design and Implementation of 32-bit Multi stage

1. Why Choose a Multi-Stage Processor Over a Single-Cycle Processor?

i. Single Cycle processor

 A Single Cycle RISC-V Processor is a basic CPU design in which every

 To understand better, can we execute a few instructions step-by-step in a Single Cycle

addi x1, x0, 5

addi x2, x0, 10

add x3, x2, x1

 Let us understand the pipeline stages in a multi-stage processor by taking an example.

1 addi x1, x0,

 add x3, x1, x2 is trying to read x1 and x2 in its ID stage.

addi x1, x0, 5 # x1 = 5

1 addi x1, x0,

 lw x1, 0(x2) and sw x4, 0(x5) involve memory access.

module pipe_riscv32(clk1, clk2);

reg [31:0] pc, IF_ID_IR, IF_ID_NPC;

parameter ADD = 6'b000000,

parameter RR_ALU = 3'b000,

ID_EX_NPC <= #2 IF_ID_NPC;

reg clk1, clk2;

// Instantiate the processor

pipe_riscv32 DUT(clk1, clk2);

#5 clk1 = ~clk1; // Toggle clk1 every 5 time units

#5 clk2 = ~clk2; // Toggle clk2 after clk1

// Initialize memory and registers

// Clear register file and memory

for (i = 0; i < 32; i = i + 1)

for (i = 0; i < 1024; i = i + 1)

// Program: Simple instruction flow

// ADDI R1, R0, #5 => R1 = 5

// ADDI R2, R0, #10 => R2 = 10

// ADD R3, R1, R2 => R3 = R1 + R2 = 15

// SW R3, 100(R0) => MEM[100] = R3

// LW R4, 100(R0) => R4 = MEM[100]

DUT.MEM[0] = {6'b001010, 5'd0, 5'd1, 16'd5}; // ADDI R1, R0, 5

DUT.MEM[1] = {6'b001010, 5'd0, 5'd2, 16'd10}; // ADDI R2, R0, 10

DUT.MEM[2] = {6'b000000, 5'd1, 5'd2, 5'd3, 11'd0}; // ADD R3, R1, R2

DUT.MEM[3] = {6'b111111, 26'd0}; // HLT

// Reset control flags

// Output Register Contents

$display("R[%0d] = %0d", i, DUT.Reg[i]);

$display("\nMemory[100] = %0d", DUT.MEM[100]);

Final Register Values:

You might also like