HIGH PERFORMANCE COMPUTING – UNIT-1
Chapter 1: Modern Processor
1.1 Stored-program computer architecture
1.2 General-purpose cache-based microprocessor architecture.
1.2.1 Performance metrics and benchmarks.
1.2.2 Transistors galore: Moore’s Law.
1.2.3 Pipelining
1.2.4 Superscalarity
1.2.5 SIMD
1.3 Memory hierarchies
1.3.1 Cache
1.3.2 Cache mapping
1.3.3 Pre-fetch
1.4 Multicore processors
1.5 Multithreaded processors
1.6 Vector processors
Stored Program Architecture:
Before Stored Program Architecture:
Early computers like ENIAC used hard-wired programs:
Programs were not stored in memory.
Changing a program required manually rewiring the hardware.
This process was time-consuming and error-prone.
Evolution with EDVAC:
EDVAC (Electronic Discrete Variable Automatic Computer) was one of the
first computers to use stored program architecture.
It was proposed in 1945 by John von Neumann, who suggested that both
instructions and data should be stored in the same memory.
Hence, the architecture is often called the Von Neumann Architecture.
What is Stored Program Architecture?
A computer model where:
Program instructions (code) and data are both stored in main memory
(RAM).
The CPU fetches and executes instructions sequentially using a common
bus.
This model is used by almost all general-purpose computers today.
Based on SISD Model:
SISD: Single Instruction, Single Data
A single processor executes one instruction at a time on one data item.
Describes most traditional serial processors.
Represents the basic sequential processing model.
Why is Stored Program Architecture Important?
Flexible programming: Programs can be loaded, modified, and executed
easily.
Automatic execution: No need to rewire hardware to run different programs.
Efficient use of hardware: One memory for both code and data reduces
complexity.
Laid the foundation of modern computing.
How is Stored Program Architecture Structured?
Main Components:
CPU (Central Processing Unit):
Includes ALU, registers, and control unit
Memory:
Stores data and instructions
I/O System:
Handles input/output devices
Instruction Cycle:
Fetch: Get the instruction from memory
Decode: Identify the operation
Execute: Perform the operation (e.g., add, load, store)
Von Neumann Bottleneck
Instructions and data use the same bus for communication with memory.
Only one access can happen at a time (either data or instruction).
This causes a bottleneck and slows down performance, especially in high-speed computing.

General-Purpose Cache-Based Microprocessor Architecture
Why the name?
General purpose because these microprocessors are designed to execute a
wide range of applications, including scientific computing, everyday
software, and operating systems.
Cache-based indicates that the design includes multiple levels of cache
memory (like L1, L2) to reduce memory latency and increase speed.
The term microprocessor refers to a CPU implemented on a single chip.
What is a General-Purpose Cache-Based Microprocessor Architecture?
It is a hardware architecture for CPUs that:
Implements the stored-program digital computer model
Includes arithmetic units (for FP and INT operations), registers,
caches, and control logic
Executes code using a structured pipeline and execution units
Though extremely complex, only a small portion of the chip actually
performs computations (INT/FP units); the rest supports data movement and
control.
Components:
Main Memory
Memory Interface
L2 unified cache
L1 Data cache
L1 instruction Cache
Memory Queue
INT/FP Queue
FP register File
INT register File
Shift Mask
INT Operation
LD: Load (data transfer, memory to register)
ST: Store (data transfer, register to memory)
FP mult: Floating-Point Multiply
FP add: Floating-Point Add
Example :
LOAD R1, [R2 + 8]: Load the value at memory address R2 + 8 into register R1
Components Used:
L1 Instruction Cache – fetches the LOAD instruction.
INT Reg. File – provides the address base (value of R2).
Memory Queue – queues the load request.
L1 Data Cache – checks if data is cached.
L2 Unified Cache / Main Memory – accessed if L1 cache misses.
LD Unit – performs the actual data fetch.
INT Reg. File – stores the result in R1.
Transistors galore: Moore’s Law
Computers were used for scientific computing even before the personal-computer era.
Every ~2 years, the number of transistors in chips doubles.
More transistors = more complex logic = better performance.
Even though fabrication processes have changed dramatically (e.g., feature sizes shrank from 90 nm to 5 nm), the doubling trend has stayed on track.
More transistors allowed: better CPUs, more cores, more cache, faster instructions.
In practice, Moore’s Law has meant that as transistor counts increase, achievable performance increases as well.
Advanced Techniques Enabled by More Transistors:
Pipelined Functional Units
Superscalar Architecture
Data parallelism through SIMD (Single Instruction, Multiple Data)
Out-of-Order Execution
Larger caches
Simplified Instruction Set
Pipelined Functional Units :
The term “pipelined” means that multiple steps of an operation can be in
progress simultaneously, each working on a different input.
"Functional units" refer to hardware blocks like adders, multipliers, etc.,
inside the CPU that perform specific operations.
So, pipelined functional units are those CPU parts that break down complex
operations into smaller steps, allowing concurrent execution at different
stages.