0% found this document useful (0 votes)

438 views15 pages

Module 1 - Advanced Computer Architecture

This document introduces basic concepts of computer architecture and organization. It discusses instruction sets, hardware components, system organization, and the Von Neumann architecture. It also covers Flynn's classification of computers as SISD, SIMD, MIMD, or MISD based on their instruction and data streams. Finally, it discusses factors that impact computer performance such as instruction count, clock cycle times, and cycles per instruction.

Uploaded by

Dream Catcher

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

438 views15 pages

Module 1 - Advanced Computer Architecture

Uploaded by

Dream Catcher

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 15

INTRODUCTION TO COMPUTER

ARCHITECTURE

BASIC CONCEPTS OF COMPUTER ARCHITECTURE

Computer Architecture is the design

of
computers, including their instruction sets,
hardware components, and system
organization.
It refers to the understanding of the
components
that
Moremake up the computer and
specifically, the way refers
architecture they
to
are
the attributes of the system that are visible to
interconnected.
the
programmer those attributes that have a
direct
- Instruction Sets
impact on the execution of a program.
- Data Representation

- Addressing

- I/O

1
On the other hand, Computer Organization is
the underlying implementation of the architecture
which is transparent to the programmer. An
architecture can have a number of organizational
implementations:

- Control Signals

- Technologies

- Device Implementations

Most computers follow the Von Neumann

Architecture. It is also known as the Stored
Program Architecture or the Fetch-Decode-
Execute Architecture.

A computer follows the Von Neumann

Architecture if it meets the following criteria:

1. It has three basic hardware subsystems:

a CPU, a main memory system, and an
I/O system.
2. It is a storedprogram computer.
Programs (together with data) are stored
in main memory during execution.
3. It carries out instructions sequentially.
4. It has, or at least appears to have, a
single path between the main memory
and the control unit of the CPU.

2
ARCHITECTURAL CLASSIFICATION SCHEMES

Flynns Classification of Computers (in terms

of multiplicity of instruction-data streams) is the
most universally accepted method of classifying
computers.

Definitions of Terms:

1. Instruction Stream (IS) a sequence of

instructions as executed by a machine.

2. Data Stream (DS) a sequence of data

including input, partial, or temporary
results, called for by the instruction
stream.

Both instructions and data are fetched from the

memory units (MU). Instructions are decoded by
the control unit (CU), which sends the decoded
instruction stream to the processor unit (PU) for
execution.

Any computer can be placed in one of four broad

categories:
1. SISD (Single Instruction Stream over a
Single Data stream)
2. SIMD (Single Instruction Stream over a
Multiple Data stream)
3. MIMD (Multiple Instruction Stream over
a Multiple Data stream)
4. MISD (Multiple Instruction Stream over
a Single Data stream)

3
SISD (Single Instruction Stream over a Single
Data stream)

An SISD machine is a conventional sequential

machine (Von Neumann). A program executed by
the processor constitutes the single instruction
stream, and the sequence of data items that it
operates on constitutes the single data stream.

IS DS
CU PU MU
I/O

Instructions are executed sequentially but may be

overlapped in their execution stages (pipelining).

Most SISD uniprocessor systems are pipelined.

4
SIMD (Single Instruction Stream over a Multiple
Data stream)

A single stream of instructions is broadcast to a

number of processors. Each processor operates
on its own data. The multiple data streams are
the sequences of data items accessed by the
individual processors in their own memories.

In other words, an SIMD computer has several

processors running the same program in lockstep
but each operating on different sets of data. This
type of processing is also called array
processing.

DS DS
PU1 LM1

. . data
IS sets
CU IS . . loaded
program
is . . from
hosts
loaded DS DS
from host PUn LMn

5
MIMD (Multiple Instruction Stream over a
Multiple Data stream)

These are the parallel computers (multiprocessor

and multiple computer systems). They involve a
number of independent processors, each
executing a different program and accessing its
own sequence of data items (or the same program
and the same data but not in lockstep as in SIMD
machines).

IS
IS DS
CU1 PU1
I/O
. .
. . Shared
Memory
. .
I/O IS DS
CUn PUn
IS

6
MISD (Multiple Instruction Stream over a Single
Data stream)

A common data structure is manipulated by

separate processors, and each executes a
different program.

This is also known as systolic arrays for

pipelined execution of specific algorithms.

This form of computation does not arise often in

practice.

IS
.. . IS

CU1 CU2 .. . CUn

Memory
(Program IS IS IS
and
Data) DS DS DS DS
PU1 PU2 .. . PUn

I/O

7
SYSTEM ATTRIBUTES TO PERFORMANCE

The ideal performance of a computer system

demands a perfect match between machine
capability and program behavior.

Machine capability can be enhanced with better

hardware technology, innovative architectural
features, and efficient resource management.

Program behavior is affected by algorithm design,

data structures, language efficiency, programmer
skill, and compiler technology.

The simplest measure of program performance is

the turnaround time (the interval from the time
of submission to the time of completion. It is the
sum of the periods spent for disk and memory
accesses, I/O activities, compilation time, OS
overhead, and CPU time). In order to reduce
turnaround time, one must reduce all these time
factors.

In a multiprogrammed computer, the I/O and

system overheads of a given program may overlap
with the CPU times in other programs. Therefore,
it is fair to compare just the total CPU time
needed for program execution.

8
The CPU of todays modern digital computer is
driven by a clock with a constant clock rate or
clock frequency (f in megahertz). The inverse of
the clock rate is the period or cycle time ( = 1/f
in seconds).

The size of the program is determined by its

Instruction Count c(I ), in terms of the number of
machine instructions to be executed in the
program.

Different machine instructions may require

different numbers of clock cycles to execute.

Example:

For the Intel microprocessors, the MOV

instruction (register to register) takes 2 cycles
to execute. The MOV instruction (memory to
register) takes 8 cycles to execute. While the
SHR instruction takes 4 cycles to execute.

Therefore, the cycles per instruction (CPI)

becomes an important parameter for measuring
the time needed to execute each instruction.

For a given instruction set, the average CPI over

all instruction types can be computed.

9
The CPU Time (T in seconds/program) needed to
execute the program is estimated by finding the
product of the three contributing factors:

CPU Time (T) = I CPI

Example 1:

A 40-MHz processor was used to execute a

program with 50,000 instructions. The average
CPI is estimated to be 3.5 cycles/instruction.
Calculate the total execution time.

Solution:

1 1
25 ns
f 40106

CPUTime (T) I CPI

c
9
500003.5
2510

4.375 ms

10
Example 2:

A 40-MHz processor was used to execute a

benchmark program with the following
instruction mix and clock cycle counts:

Instruction Instruction Clock Cycle

Type Count Count
Integer 45,000 1
Arithmetic
Data Transfer 32,000 2
Floating Point 15,000 2
Control 8,000 2
Transfer

Determine the effective CPI and execution time for

this program.

Solution:

1 1
25 ns
f 40106

TotalCycles
450001 3
2000 2 1
5000 2 8
000 2
TotalCycles 155,000
cycles
11
Total Number of
CPI Cycles
Total Number of
Instructions

155000

45000 32000 15000
8000
155000

45000 32000 15000
8000
155000

100000

1.55cycles/instructi
on

CPUTime (T) I CPI

c
9
1000001.55
2510

3.875 ms

12
The execution of an instruction requires going
through a cycle of events involving instruction
fetch, decode, operand(s) fetch, execution, and
store results.

Only the instruction decode and execution phases

are carried out in the CPU. The remaining three
operations may be required to access memory.

Memory cycle is defined as the time needed to

complete one memory reference (read or write).
Usually, a memory cycle is k times the processor
cycle . The value of k depends on the speed of
the memory technology and processor-memory
interconnection scheme used.

The CPI of an instruction can be divided into two

component terms corresponding to the total
processor cycles and memory cycles needed to
complete the execution of the instruction.

CPU Time (T) = I (p + m k)

where:

p is the number of processor cycles

needed for the instruction decode and
execute
m is the number of memory references
needed
k is the ratio between memory cycle
and processor cycle

13
Introduction to Computer Architecture

MIPS Rate

The processor speed is often measured in terms

of million instructions per second.

Let C be the total number of clock pulses or

cycles needed to execute a given program.

C I CPI
c
CPUTime (T) I CPI
c
C

C

f

The equation for the MIPS rate is:

Ic
MIPS
T106

Since T I CPI , then a second equation for

c
the MIPS rate can be derived as:
f
MIPS
CPI106

14
Introduction to Computer Architecture

Since CPI C/I , then a third equation for the

c
MIPS rate can be derived as:
f Ic
MIPS
C106

Example 2:

A 40-MHz processor was used to execute a

benchmark program with the following
instruction mix and clock cycle counts:

Instruction Instruction Clock Cycle

Type Count Count
Integer 45,000 1
Arithmetic
Data Transfer 32,000 2
Floating Point 15,000 2
Control 8,000 2
Transfer

Determine the MIPS rate of the system.

Solution:

From the previous example:

Ic = 100,000 instructions
T = 3.875 ms

I 100000
MIPS c 25.81MIPS
6 3
T10 3.87510
106
15

Csa - 1
No ratings yet
Csa - 1
15 pages
Comparc Cpo203
No ratings yet
Comparc Cpo203
39 pages
StudM1p1Parallel Computer Modelsppt1shared
No ratings yet
StudM1p1Parallel Computer Modelsppt1shared
107 pages
Aca Unit 1
No ratings yet
Aca Unit 1
34 pages
Advanced Computer Architecture - Unit 1 - WWW - Rgpvnotes.in
100% (1)
Advanced Computer Architecture - Unit 1 - WWW - Rgpvnotes.in
20 pages
Chapter 1 Summary
No ratings yet
Chapter 1 Summary
12 pages
Introduction Mod1
No ratings yet
Introduction Mod1
120 pages
Lec 3
No ratings yet
Lec 3
41 pages
CSC232 - Chp1 (Compatibility Mode)
No ratings yet
CSC232 - Chp1 (Compatibility Mode)
50 pages
Computer Architecture & Performance
No ratings yet
Computer Architecture & Performance
31 pages
Computer Architecture Overview
No ratings yet
Computer Architecture Overview
38 pages
Slide 1
No ratings yet
Slide 1
33 pages
Computer Architecture Unit1
No ratings yet
Computer Architecture Unit1
20 pages
DHXD - Chuong 8. Performance
No ratings yet
DHXD - Chuong 8. Performance
27 pages
Computer Organization Overview
No ratings yet
Computer Organization Overview
17 pages
Ilovepdf - Merged (4) 36 274
No ratings yet
Ilovepdf - Merged (4) 36 274
120 pages
Computer Organization and Architecture (AT70.01)
No ratings yet
Computer Organization and Architecture (AT70.01)
29 pages
Performance
No ratings yet
Performance
51 pages
CSA Performance
No ratings yet
CSA Performance
40 pages
Lec 1
No ratings yet
Lec 1
32 pages
4 Perfrmance
No ratings yet
4 Perfrmance
30 pages
CHAPTER 1 and 2
No ratings yet
CHAPTER 1 and 2
25 pages
Lecture 1 Computer Abstraction and Performance
No ratings yet
Lecture 1 Computer Abstraction and Performance
25 pages
23-Performance Parameters-21-02-2023
No ratings yet
23-Performance Parameters-21-02-2023
16 pages
Computer Organisation and Architecture
No ratings yet
Computer Organisation and Architecture
4 pages
Flynn's Computer Architecture Overview
No ratings yet
Flynn's Computer Architecture Overview
17 pages
Cs405-Computer System Architecture: Module - 1 Parallel Computer Models
No ratings yet
Cs405-Computer System Architecture: Module - 1 Parallel Computer Models
91 pages
Parallel Computer Models Overview
No ratings yet
Parallel Computer Models Overview
20 pages
Abstraction in Computer Architecture
No ratings yet
Abstraction in Computer Architecture
74 pages
Stored Program Concept
No ratings yet
Stored Program Concept
27 pages
Computer Architecture Basics
No ratings yet
Computer Architecture Basics
64 pages
Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
80% (5)
Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
118 pages
Performance of Computers: Factors Affecting Computer Performance
No ratings yet
Performance of Computers: Factors Affecting Computer Performance
4 pages
CSE 332: Understanding Computer Performance
No ratings yet
CSE 332: Understanding Computer Performance
41 pages
Understanding Computer Architecture Basics
No ratings yet
Understanding Computer Architecture Basics
54 pages
Computer Architecture & Performance
No ratings yet
Computer Architecture & Performance
56 pages
Parallel Computing for Tech Students
No ratings yet
Parallel Computing for Tech Students
14 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
ICT123 Computer Architecture: Week 13 Review
No ratings yet
ICT123 Computer Architecture: Week 13 Review
21 pages
Chapter 1
No ratings yet
Chapter 1
53 pages
Computer Organization and Architecture: Faculty: Dr. Asif Uddin Khan
No ratings yet
Computer Organization and Architecture: Faculty: Dr. Asif Uddin Khan
23 pages
Co Unit1 Part3
No ratings yet
Co Unit1 Part3
11 pages
CA Week2
No ratings yet
CA Week2
27 pages
Overview of Computer Hardware Components
No ratings yet
Overview of Computer Hardware Components
20 pages
CompArch Studcopy4units
No ratings yet
CompArch Studcopy4units
22 pages
M1-CS405 Computer System Architecture-Ktustudents - in
100% (1)
M1-CS405 Computer System Architecture-Ktustudents - in
97 pages
Flynn's Classification & Performance Metrics
No ratings yet
Flynn's Classification & Performance Metrics
8 pages
Advanced Computer Architecture Course Overview
No ratings yet
Advanced Computer Architecture Course Overview
56 pages
Isa Architecture
No ratings yet
Isa Architecture
108 pages
Computer Architecture Overview
No ratings yet
Computer Architecture Overview
34 pages
Different Types of Computer Architecture
No ratings yet
Different Types of Computer Architecture
8 pages
Computer Performance Analysis
No ratings yet
Computer Performance Analysis
23 pages
Computer Architecture for Students
No ratings yet
Computer Architecture for Students
118 pages
Computer Architecture Unit 1
No ratings yet
Computer Architecture Unit 1
59 pages
Computer Architecture Overview
No ratings yet
Computer Architecture Overview
68 pages
Lecture 16 Technology, Performance, Powerwall
No ratings yet
Lecture 16 Technology, Performance, Powerwall
9 pages
CA Slides#2 Architectural Classification
No ratings yet
CA Slides#2 Architectural Classification
22 pages
Module 2 (26-10-2024)
No ratings yet
Module 2 (26-10-2024)
50 pages
CA Performance
No ratings yet
CA Performance
26 pages
JOANNE Problem 5
No ratings yet
JOANNE Problem 5
3 pages
Fig. 1 Architectures of metal/CO2 Electrochemical Cells As Capture Systems
No ratings yet
Fig. 1 Architectures of metal/CO2 Electrochemical Cells As Capture Systems
1 page
PIC16F161X Press Presentation November 2014
No ratings yet
PIC16F161X Press Presentation November 2014
28 pages
Understanding Network Topologies
No ratings yet
Understanding Network Topologies
49 pages
Human Computer Interaction: Lecture 1. Usability
No ratings yet
Human Computer Interaction: Lecture 1. Usability
3 pages
Theories of Learning Insights
No ratings yet
Theories of Learning Insights
1 page
Header, Footer, Footnotes Guide
No ratings yet
Header, Footer, Footnotes Guide
1 page
Priyanshu Ranjan
No ratings yet
Priyanshu Ranjan
13 pages
Learning OpenStack High Availability - Sample Chapter
100% (1)
Learning OpenStack High Availability - Sample Chapter
15 pages
Peer-to-Peer vs. Client/Server Networks
No ratings yet
Peer-to-Peer vs. Client/Server Networks
18 pages
HNC100 Profibus-DP Interface: Connection To Siemens S7 and S5
No ratings yet
HNC100 Profibus-DP Interface: Connection To Siemens S7 and S5
82 pages
FINALBOOKPYTHON
100% (1)
FINALBOOKPYTHON
225 pages
OST Questions
No ratings yet
OST Questions
6 pages
Microprocessor EE2354 Model Answer Key
No ratings yet
Microprocessor EE2354 Model Answer Key
25 pages
Flynn's Taxonomy in Parallel Computing
No ratings yet
Flynn's Taxonomy in Parallel Computing
29 pages
Sample OPnsense
No ratings yet
Sample OPnsense
17 pages
Recovered
No ratings yet
Recovered
30 pages
SecuriFire Studio 2.4.7 Release Notes
No ratings yet
SecuriFire Studio 2.4.7 Release Notes
7 pages
Orangerx Openlrsng 433mhz
No ratings yet
Orangerx Openlrsng 433mhz
2 pages
LG Arx10 Na9640p
No ratings yet
LG Arx10 Na9640p
77 pages
Object Detection and Counting Using ESP8266 and TF Luna LiDAR
No ratings yet
Object Detection and Counting Using ESP8266 and TF Luna LiDAR
7 pages
Hirschmann SPIDER 1TX 1FX SM EEC Datasheet
No ratings yet
Hirschmann SPIDER 1TX 1FX SM EEC Datasheet
2 pages
5 File Handling 1
No ratings yet
5 File Handling 1
56 pages
The Intel Microprocessors Chapter 1
100% (1)
The Intel Microprocessors Chapter 1
195 pages
ElectronicsLab20240925 Week3
No ratings yet
ElectronicsLab20240925 Week3
132 pages
GNS 430W Drawings PDF
No ratings yet
GNS 430W Drawings PDF
61 pages
Intel Aero Compute Board Guide
No ratings yet
Intel Aero Compute Board Guide
19 pages
Advanced VLSI Course Guide
No ratings yet
Advanced VLSI Course Guide
3 pages
UNV Video Management Server Manual
No ratings yet
UNV Video Management Server Manual
67 pages
9910 An-H48
No ratings yet
9910 An-H48
4 pages
Program - 1 Write A Program To Perform Insertion & Deletion in An Array
No ratings yet
Program - 1 Write A Program To Perform Insertion & Deletion in An Array
16 pages
Pod HD500
No ratings yet
Pod HD500
71 pages
Ahmet Aslan's Web Developer Resume
0% (1)
Ahmet Aslan's Web Developer Resume
1 page
Majid Shaikh: IT Professional Profile
No ratings yet
Majid Shaikh: IT Professional Profile
2 pages
Broadcast and Collision Domain
No ratings yet
Broadcast and Collision Domain
3 pages
Install - Guide CentOS7 Warewulf PBSPro 1.3.7 x86 - 64
No ratings yet
Install - Guide CentOS7 Warewulf PBSPro 1.3.7 x86 - 64
61 pages
Infinibox User Documentation
No ratings yet
Infinibox User Documentation
375 pages

Module 1 - Advanced Computer Architecture

Uploaded by

Module 1 - Advanced Computer Architecture

Uploaded by

INTRODUCTION TO COMPUTER

BASIC CONCEPTS OF COMPUTER ARCHITECTURE

Computer Architecture is the design

Most computers follow the Von Neumann

A computer follows the Von Neumann

1. It has three basic hardware subsystems:

Flynns Classification of Computers (in terms

1. Instruction Stream (IS) a sequence of

2. Data Stream (DS) a sequence of data

Both instructions and data are fetched from the

Any computer can be placed in one of four broad

An SISD machine is a conventional sequential

Instructions are executed sequentially but may be

Most SISD uniprocessor systems are pipelined.

A single stream of instructions is broadcast to a

In other words, an SIMD computer has several

These are the parallel computers (multiprocessor

A common data structure is manipulated by

This is also known as systolic arrays for

This form of computation does not arise often in

CU1 CU2 .. . CUn

The ideal performance of a computer system

Machine capability can be enhanced with better

Program behavior is affected by algorithm design,

The simplest measure of program performance is

In a multiprogrammed computer, the I/O and

The size of the program is determined by its

Different machine instructions may require

For the Intel microprocessors, the MOV

Therefore, the cycles per instruction (CPI)

For a given instruction set, the average CPI over

CPU Time (T) = I CPI

A 40-MHz processor was used to execute a

CPUTime (T) I CPI

A 40-MHz processor was used to execute a

Instruction Instruction Clock Cycle

Determine the effective CPI and execution time for

CPUTime (T) I CPI

Only the instruction decode and execution phases

Memory cycle is defined as the time needed to

The CPI of an instruction can be divided into two

CPU Time (T) = I (p + m k)

p is the number of processor cycles

The processor speed is often measured in terms

Let C be the total number of clock pulses or

The equation for the MIPS rate is:

Since T I CPI , then a second equation for

Since CPI C/I , then a third equation for the

A 40-MHz processor was used to execute a

Instruction Instruction Clock Cycle

Determine the MIPS rate of the system.

From the previous example:

You might also like