Lec03 - Processor Structure and Function
Contents
Processor organization
Register organization
Instruction cycle
Instruction pipelining
The x86 processor family
Processor Organization
Processor Requirements:
Fetch instruction
The processor reads an instruction from memory (register, cache, main memory)
Interpret instruction
The instruction is decoded to determine what action is required
Fetch data
The execution of an instruction may require reading data from memory or an I/O
module
Process data
The execution of an instruction may require performing some arithmetic or logical
operation on data
Write data
The results of an execution may require writing data to memory or an I/O module
In order to do these things the processor needs to store some data
temporarily and therefore needs a small internal memory
CPU With the System Bus
CPU With the System Bus
Major components of the processor are
Arithmetic and logic unit (ALU)
does the actual computation or processing of data
Control unit (CU)
controls the movement of data and instructions into and
out of the processor and controls the operation of the
ALU
Registers
a set of storage locations
CPU Internal Structure
CPU Internal Structure
Internal processor bus
Used to transfer data between various registers and ALU –
because the ALU in fact operates only on data in the
internal processor memory
Note the similarity between the internal structure
of the computer as a whole and the internal
structure of the processor.
In both cases, there is a small collection of major
elements (computer: processor, I/O, memory;
processor: control unit, ALU, registers) connected
by data paths.
Register Organization
Within the processor there is a set of registers
that function as a level of memory above main
memory and cache in the hierarchy
The registers in the processor perform two roles:
User-Visible Registers
Enable the machine or assembly language programmer to minimize main memory references by optimizing use of registers
Control and Status Registers
Used by the control unit to control the operation of the processor and by privileged operating system programs to control the execution of programs
User-Visible Registers
Referenced by means of the machine language that the processor executes
Categories:
• General purpose
  • Can be assigned to a variety of functions by the programmer
• Data
  • May be used only to hold data and cannot be employed in the calculation of an operand address
• Address
  • May be somewhat general purpose or may be devoted to a particular addressing mode
  • Examples: segment pointers, index registers, stack pointer
• Condition codes
  • Also referred to as flags
  • Bits set by the processor hardware as the result of operations
Table 14.1 Condition Codes
Control and Status Registers
Four registers are essential to instruction execution:
Program counter (PC)
Contains the address of an instruction to be fetched
Instruction register (IR)
Contains the instruction most recently fetched
Memory address register (MAR)
Contains the address of a location in memory
Memory buffer register (MBR)
Contains a word of data to be written to memory or the
word most recently read
Program Status Word (PSW)
Register or set of registers that contain status information
Common fields or flags include:
Sign: Contains the sign bit of the result of the last arithmetic
operation.
Zero: Set when the result is 0.
Carry: Set if an operation resulted in a carry (addition) into or borrow (subtraction) out of a high-order bit.
Equal: Set if a logical compare result is equality.
Overflow: Used to indicate arithmetic overflow.
Interrupt Enable/Disable: Used to enable or disable interrupts.
Supervisor: Indicates whether the processor is executing in
supervisor or user mode. Certain privileged instructions can be
executed only in supervisor mode, and certain areas of memory
can be accessed only in supervisor mode.
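To make the flag definitions concrete, here is a minimal sketch in Python (an assumed 8-bit addition; the function name and encoding are invented for illustration, not any particular processor's PSW logic):

```python
def add_and_set_flags(a: int, b: int, width: int = 8):
    """Add two 'width'-bit values and derive PSW-style flags (illustrative only)."""
    mask = (1 << width) - 1
    raw = (a & mask) + (b & mask)
    result = raw & mask

    sign = (result >> (width - 1)) & 1            # Sign: sign bit of the result
    zero = int(result == 0)                       # Zero: result is 0
    carry = int(raw > mask)                       # Carry: carry out of the high-order bit
    a_sign = (a >> (width - 1)) & 1
    b_sign = (b >> (width - 1)) & 1
    overflow = int(a_sign == b_sign and sign != a_sign)   # Overflow: two's-complement overflow

    return result, {"S": sign, "Z": zero, "C": carry, "V": overflow}

# 0x7F + 0x01 = 0x80: no carry out, but two's-complement overflow (127 + 1 -> -128)
print(add_and_set_flags(0x7F, 0x01))
```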
Example Microprocessor Register Organizations
Example Microprocessor Register
Organizations
MC68000
32-bit registers - 8 data registers and 9 address registers
Data registers are used primarily for data manipulation and addressing
as index registers
32-bit program counter and a 16-bit status register
Intel 8086
16-bit
Every register is special purpose
4 data, 4 pointer and index, and 4 segment registers
Intel 80386
32-bit - extension of the 8086
There is NO universally accepted philosophy concerning the best
way to organize processor registers
Instruction Cycle
Includes the following stages:
Fetch
Read the next instruction from memory into the processor
Execute
Interpret the opcode and perform the indicated operation
Interrupt
If interrupts are enabled and an interrupt has occurred, save the current process state and service the interrupt
Instruction Cycle
Indirect stage
The execution of an instruction may involve one or
more operands in memory, each of which requires
a memory access.
After an instruction is fetched, it is examined to
determine if any indirect addressing is involved.
If so, the required operands are fetched using
indirect addressing.
Instruction Cycle State Diagram
Data Flow, Fetch Cycle
Data Flow, Fetch Cycle (2)
In general, the steps during the fetch cycle are:
1. PC contains the address of the next instruction
2. Address is moved to the MAR
3. Address is placed on the address bus and the memory location is identified
4. Control unit sends a memory read control signal
5. Result is placed on the data bus, copied to the MBR, then to the IR
6. Meanwhile the PC is incremented by 1, preparing for the next fetch
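As an illustration of these steps, a minimal register-transfer sketch in Python (the CPU class, word-addressed memory, and increment-by-1 are simplifying assumptions taken from the slide, not a model of a real processor):

```python
# Minimal register-transfer sketch of the fetch cycle.
class CPU:
    def __init__(self, memory):
        self.memory = memory      # list standing in for main memory
        self.PC = 0               # program counter
        self.MAR = 0              # memory address register
        self.MBR = 0              # memory buffer register
        self.IR = 0               # instruction register

    def fetch(self):
        self.MAR = self.PC                # step 2: address moved to MAR
        self.MBR = self.memory[self.MAR]  # steps 3-5: memory read, result into MBR
        self.IR = self.MBR                # step 5: instruction copied into IR
        self.PC += 1                      # step 6: PC incremented for the next fetch

cpu = CPU(memory=[0x1940, 0x5941])
cpu.fetch()
print(hex(cpu.IR), cpu.PC)                # 0x1940 1
```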
Data Flow, Indirect Cycle
Data Flow, Indirect Cycle (2)
After the fetch cycle, the IR is examined
If indirect addressing is indicated, the indirect cycle is performed as follows:
1. Rightmost N bits of the MBR (the address field) are transferred to the MAR, placed on the address bus, and the memory location is identified
2. Control unit sends a memory read control signal
3. Result (the address of the operand) is moved to the MBR
The FETCH and INDIRECT cycles are predictable.
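Continuing the same hypothetical CPU sketch, an indirect cycle could replace the address field held in the MBR with the operand address it points to (the 12-bit address field width is an assumption):

```python
ADDR_BITS = 12                                    # assumed width of the address field

def indirect(cpu):
    cpu.MAR = cpu.MBR & ((1 << ADDR_BITS) - 1)    # step 1: rightmost N bits of MBR to MAR
    cpu.MBR = cpu.memory[cpu.MAR]                 # steps 2-3: memory read; the operand's
                                                  # direct address now sits in the MBR

# word 0 is an instruction whose address field points at word 2,
# and word 2 holds the direct address of the operand
cpu.memory = [0x1002, 0x0000, 0x0F00]
cpu.PC = 0
cpu.fetch()
indirect(cpu)
print(hex(cpu.MBR))                               # 0xf00
```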
Data Flow, Interrupt Cycle
Data Flow, Interrupt Cycle
Simple and predictable
Current PC is saved to allow resumption after the interrupt
To process the interrupt:
1. Contents of the PC are copied to the MBR
2. A special memory location (e.g. derived from the stack pointer) is loaded into the MAR; the stack address is placed on the address bus and the stack location is identified
3. MBR (the saved PC) is written to memory
4. PC is loaded with the address of the interrupt-handling routine
Now the next instruction (the first of the interrupt handler routine) can be fetched
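A sketch of the same four steps, reusing the hypothetical CPU above; the descending stack, the in-memory stack pointer location, and the handler address are all assumptions made for illustration:

```python
def interrupt(cpu, sp_addr, handler_addr):
    cpu.MBR = cpu.PC                   # step 1: contents of PC copied to MBR
    cpu.memory[sp_addr] -= 1           # assumed descending stack: reserve the next slot
    cpu.MAR = cpu.memory[sp_addr]      # step 2: stack location loaded into MAR
    cpu.memory[cpu.MAR] = cpu.MBR      # step 3: MBR (the saved PC) written to memory
    cpu.PC = handler_addr              # step 4: PC loaded with the handler's address
                                       # the next fetch brings in the handler's first instruction
```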
Pipelining Strategy
Similar to the use of an assembly line in a manufacturing plant
New inputs are accepted at one end before previously accepted inputs appear as outputs at the other end
To apply this concept to instruction execution we must recognize that an instruction has a number of stages
Two-Stage Instruction Pipeline
Additional Stages – Increase Speedup
Fetch instruction (FI)
Read the next expected instruction into a buffer
Decode instruction (DI)
Determine the opcode and the operand specifiers
Calculate operands (CO)
Calculate the effective address of each source operand; this may involve displacement, register indirect, indirect, or other forms of address calculation
Fetch operands (FO)
Fetch each operand from memory; operands in registers need not be fetched
Execute instruction (EI)
Perform the indicated operation and store the result, if any, in the specified destination operand location
Write operand (WO)
Store the result in memory
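As a rough companion to the timing diagram referenced next, a small sketch (idealized: one cycle per stage, no stalls or hazards) that prints which stage each instruction occupies in each clock cycle:

```python
STAGES = ["FI", "DI", "CO", "FO", "EI", "WO"]     # the six stages above

def timing_diagram(n_instructions: int):
    k = len(STAGES)
    total_cycles = k + n_instructions - 1         # [k + (n - 1)] cycles in total
    for i in range(n_instructions):
        row = ["--"] * total_cycles
        for s, name in enumerate(STAGES):
            row[i + s] = name                     # instruction i is in stage s at cycle i + s
        print(f"I{i + 1:<2}", " ".join(row))

timing_diagram(9)                                 # nine instructions finish in 14 cycles
```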
Timing Diagram for Instruction Pipeline Operation
The Effect of a Conditional Branch on Instruction Pipeline Operation
Assume that instruction 3 is a conditional branch to instruction 15
Six-Stage Instruction Pipeline
Alternative Pipeline Depiction
Calculating Performance
Cycle time (τ) of an instruction pipeline:
τ = max[τi] + d = τm + d,   1 ≤ i ≤ k
where
τm = delay through the stage which experiences the largest delay
k = number of stages in the instruction pipeline
d = time delay of a latch, needed to advance signals and data from one stage to the next
Calculating Performance (2)
The total time required for the pipeline to execute n instructions is:
Tk,n = [k + (n − 1)]τ
where
k = number of stages in the pipeline
n = number of instructions
τ = cycle time
The speedup factor for the instruction pipeline compared to execution without the pipeline is defined as:
Sk = T1,n / Tk,n = nkτ / [k + (n − 1)]τ = nk / [k + (n − 1)]
Calculating Performance - Example
Given number of stages (k) = 6, number of
instructions (n) = 9, τ = 1 sec.
The total time required for the pipeline is:
Tk,n = [6 + (9 − 1)] × 1 = 14 sec
The speedup factor for the instruction pipeline compared to execution without the pipeline is:
Sk = T1,n / Tk,n = (9)(6)(1) / ([6 + (9 − 1)] × 1) = 54 / 14 = 3.86
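The same calculation expressed directly from the formulas above (a quick check; nothing beyond the given equations is assumed):

```python
def pipeline_time(k: int, n: int, tau: float) -> float:
    return (k + (n - 1)) * tau                    # T_k,n = [k + (n - 1)] * tau

def speedup(k: int, n: int) -> float:
    return (n * k) / (k + (n - 1))                # S_k = nk / [k + (n - 1)]

print(pipeline_time(k=6, n=9, tau=1))             # 14
print(round(speedup(k=6, n=9), 2))                # 3.86
```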
Speedup Factors with Instruction Pipelining
• The larger the number of pipeline stages, the greater the potential for speedup
• However, in practice, more stages also bring increased cost and increased delays between stages
Pipeline Hazards
Occur when the pipeline, or some portion of the pipeline, must stall because conditions do not permit continued execution
Also referred to as a pipeline bubble
There are three types of hazards:
• Resource
• Data
• Control
Resource Hazards
A resource hazard occurs when two or more instructions that are already in the pipeline need the same resource
The result is that the instructions must be executed in serial rather than in parallel for a portion of the pipeline
A resource hazard is sometimes referred to as a structural hazard
Data Hazards
• A data hazard occurs when there is a conflict in the access of an operand location
• Two instructions in a program are to be executed in sequence and both access a particular memory or register operand
Types of Data Hazard
Read after write (RAW), or true dependency
An instruction modifies a register or memory location
Succeeding instruction reads data in memory or register location
Hazard occurs if the read takes place before write operation is
complete
Write after read (WAR), or antidependency
An instruction reads a register or memory location
Succeeding instruction writes to the location
Hazard occurs if the write operation completes before the read
operation takes place
Write after write (WAW), or output dependency
Two instructions both write to the same location
Hazard occurs if the write operations take place in the reverse
order of the intended sequence
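One way to see the three cases at a glance is to compare the read and write sets of two instructions that are in the pipeline together; a hypothetical sketch (the register names and dictionary layout are invented):

```python
def classify_hazards(first, second):
    """first/second: dicts with 'reads' and 'writes' register-name sets.
    Returns the potential hazards between two in-flight instructions;
    whether a stall is actually needed depends on the pipeline timing."""
    hazards = []
    if first["writes"] & second["reads"]:
        hazards.append("RAW (true dependency)")
    if first["reads"] & second["writes"]:
        hazards.append("WAR (antidependency)")
    if first["writes"] & second["writes"]:
        hazards.append("WAW (output dependency)")
    return hazards

# ADD r3, r1, r2  followed by  SUB r4, r3, r5: SUB needs r3 before ADD has written it
i_add = {"reads": {"r1", "r2"}, "writes": {"r3"}}
i_sub = {"reads": {"r3", "r5"}, "writes": {"r4"}}
print(classify_hazards(i_add, i_sub))   # ['RAW (true dependency)']
```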
Control Hazard
Also known as a branch hazard
Occurs when the pipeline makes the wrong
decision on a branch prediction
Brings instructions into the pipeline that must
subsequently be discarded
Dealing with Branches:
Multiple streams
Prefetch branch target
Loop buffer
Branch prediction
Delayed branch
Multiple Streams
A simple pipeline suffers a penalty for
a branch instruction because it must
choose one of two instructions to fetch
next and may make the wrong choice
A brute-force approach is to replicate
the initial portions of the pipeline and
allow the pipeline to fetch both
instructions, making use of two streams
Drawbacks:
• With multiple pipelines there are contention delays
for access to the registers and to memory
• Additional branch instructions may enter the
pipeline before the original branch decision is
resolved
Prefetch Branch Target
When a conditional branch is
recognized, the target of the branch is
prefetched, in addition to the
instruction following the branch
Target is then saved until the branch
instruction is executed
If the branch is taken, the target has already been prefetched
IBM 360/91 uses this approach
Loop Buffer
Small, very high-speed memory maintained by the
instruction fetch stage of the pipeline and containing the n
most recently fetched instructions, in sequence
Benefits:
Instructions fetched in sequence will be available without the
usual memory access time
If a branch occurs to a target just a few locations ahead of the
address of the branch instruction, the target will already be in the
buffer
This strategy is particularly well suited to dealing with loops
Similar in principle to a cache dedicated to instructions
Differences:
The loop buffer only retains instructions in sequence
Is much smaller in size and hence lower in cost
Branch Prediction
Various techniques can be used to predict whether a branch will be taken:
Static approaches (do not depend on the execution history up to the time of the conditional branch instruction):
1. Predict never taken
2. Predict always taken
3. Predict by opcode
Dynamic approaches (depend on the execution history):
1. Taken/not taken switch
2. Branch history table
Branch Prediction (2)
Predict never taken (Static)
Assume that jump will not happen
Always fetch instructions in sequence
Predict always taken (Static)
Assume that jump will happen
Always fetch target instruction
Analysis of program behavior shows that conditional branches are taken more than 50% of the time
Predict by Opcode (Static)
The decision is based on the opcode of the branch instruction
The processor assumes the branch will be taken for certain branch opcodes and not for others
Success rates of 75% have been reported
Taken/not taken switch
Based on previous history.
Using 1 or more bits associated with each conditional branch
instruction to reflect the recent history
These bits are referred to as a taken/not taken switch
Helps CPU to make a particular decision for the next time the instruction
is encountered
Kept in temporary high-speed storage
Associate the history bits with any conditional branch instruction in a
cache (or)
Maintain a small table for recently executed branch instructions with one
or more history bits in each entry.
Using 1 bit – records whether the last execution of this instruction resulted in a branch or not; a prediction error may occur twice for each use of the loop, once on entering the loop and once on exiting it
Using 2 bits – records the last two instances of execution of the associated instruction, or records the state in some other way (see the sketch below)
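A minimal sketch of the 2-bit scheme as a saturating counter (the state encoding and starting state are assumptions; it mirrors the state diagram on the following slides):

```python
class TwoBitPredictor:
    """2-bit saturating counter: states 0-1 predict 'not taken', 2-3 predict 'taken'."""
    def __init__(self):
        self.state = 2                          # assumed starting state: weakly taken

    def predict(self) -> bool:
        return self.state >= 2                  # True = predict taken

    def update(self, taken: bool):
        self.state = min(3, self.state + 1) if taken else max(0, self.state - 1)

# A loop branch that is taken twice, falls through once, then is taken again:
# the single not-taken outcome causes only one misprediction, unlike the 1-bit scheme.
p = TwoBitPredictor()
for outcome in (True, True, False, True):
    print("predict taken:", p.predict(), "actual taken:", outcome)
    p.update(outcome)
```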
Taken/Not Taken Switch
Branch Prediction State Diagram
Branch History Table
A small cache memory
Associated with instruction fetch stage of the
pipeline
Each entry in the table contains three elements:
The address of a branch instruction
Some number of history bits recording the use-state of the
instruction
Information about target instruction
Either target address or target instruction
Storage of target address results in a smaller table but
greater instruction fetch time compared with storing
target instruction
Branch History Table (2)
(a) Predict never taken strategy
Instruction fetch stage always fetches the next
sequential address
If branch is taken, flush the pipeline and fetch the next
instruction from the target address
(b) Branch History table strategy
Each prefetch triggers a lookup in the branch history
table
If no match is found, next sequential address is used for
the fetch
If a match is found, prediction is made based on the state
of the instruction
Next sequential address or branch target address is
fed to the select logic
When the branch instruction is executed, the execute
stage sends the result to branch history table logic.
State of the instruction is updated
If prediction is incorrect, select logic is redirected to
correct address for next fetch
If the conditional branch instruction is not in the table, it is added to the table, replacing one of the existing entries (see the sketch below).
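A hypothetical sketch of the lookup/update path of such a table (the dictionary-based entry layout and 2-bit state are assumptions):

```python
class BranchHistoryTable:
    """Toy branch history table: each entry holds a 2-bit state and a target address."""
    def __init__(self):
        self.entries = {}                       # branch instruction address -> entry

    def next_fetch_address(self, branch_addr, next_seq_addr):
        e = self.entries.get(branch_addr)
        if e is None or e["state"] < 2:
            return next_seq_addr                # no match or predict not taken: sequential fetch
        return e["target"]                      # predict taken: fetch from the stored target

    def update(self, branch_addr, taken, target):
        e = self.entries.setdefault(branch_addr, {"state": 2, "target": target})
        e["target"] = target
        e["state"] = min(3, e["state"] + 1) if taken else max(0, e["state"] - 1)

bht = BranchHistoryTable()
bht.update(branch_addr=0x100, taken=True, target=0x200)
print(hex(bht.next_fetch_address(0x100, next_seq_addr=0x101)))   # 0x200
```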
Intel 80486 Pipelining
Fetch
Objective is to fill the prefetch buffers with new data as soon as the old
data have been consumed by the instruction decoder
Operates independently of the other stages to keep the prefetch buffers
full
Decode stage 1
All opcode and addressing-mode information is decoded in the D1 stage
3 bytes of instruction are passed to the D1 stage from the prefetch buffers
D1 decoder can then direct the D2 stage to capture the rest of the
instruction
Decode stage 2
Expands each opcode into control signals for the ALU
Also controls the computation of the more complex addressing modes
Execute
Stage includes ALU operations, cache access, and register update
Write back
Updates registers and status flags modified during the preceding execute
stage
80486 Instruction Pipeline Examples
Interrupt Processing
Interrupts and Exceptions
Interrupts
Generated by a signal from hardware and it may occur at random
times during the execution of a program
Maskable
Nonmaskable
Exceptions
Generated from software and is provoked by the execution of an
instruction
Processor detected
Programmed
Interrupt vector table
Every type of interrupt is assigned a number
Number is used to index into the interrupt vector table
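As a sketch of the dispatch idea only (the table below is a toy stand-in, not the real x86 interrupt descriptor table; the handler addresses are invented):

```python
# Toy interrupt vector table: the interrupt number indexes the table to find
# the address of its handler (addresses invented for illustration).
VECTOR_TABLE = {
    0: 0x1000,     # e.g. divide error
    2: 0x1100,     # e.g. NMI interrupt
    14: 0x1200,    # e.g. page fault
}

def dispatch(interrupt_number: int) -> int:
    """Return the handler address the processor would transfer control to."""
    return VECTOR_TABLE[interrupt_number]

print(hex(dispatch(14)))               # 0x1200
```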
Table 14.3 x86 Exception and Interrupt Vector Table (unshaded: exceptions; shaded: interrupts)
Summary
Processor Structure and Function (Lec03)
Processor organization
Register organization
  User-visible registers
  Control and status registers
Instruction cycle
  The indirect cycle
  Data flow
Instruction pipelining
  Pipelining strategy
  Pipeline performance
  Pipeline hazards
  Dealing with branches
  Intel 80486 pipelining
The x86 processor family
  Register organization
  Interrupt processing