LECTURE - 22
Topics for Today
Main memory
Scribe for today?
Main Memory
DRAM versus SRAM
DRAM is cheaper, but slower
Reducing the number of pins
At the cost of some performance
Address sent in two parts: row address (with RAS) + column address (with CAS)
Performance metrics: latency and bandwidth
#cycles to send address
#cycles to access a word
#cycles to send the data word
Main Memory Performance:
One-Word Wide Memory
[Figure: CPU -- bus (1 word) -- Cache -- bus (1 word) -- Main Memory]
Suppose:
  #cycles to send address = 4
  #cycles to access 1 word = 24
  #cycles to send data word = 4
  Cache line = 4 words
What is the miss penalty?
  4 x (4 + 24 + 4) = 128 cycles
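A minimal C sketch of this calculation (the cycle counts are the assumed values above); with a one-word-wide memory and bus, every word of the 4-word line pays the full address + access + transfer cost:

#include <stdio.h>

int main(void) {
    int addr_cycles = 4, access_cycles = 24, transfer_cycles = 4;
    int line_words  = 4;
    /* one-word-wide organization: each word pays the full round trip */
    int penalty = line_words * (addr_cycles + access_cycles + transfer_cycles);
    printf("miss penalty = %d cycles\n", penalty);   /* prints 128 */
    return 0;
}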
Technique-1: Wider Memory
[Figure: CPU -- bus (1 word) -- Mux -- Cache -- bus (2 words) -- Main Memory]
What is the miss penalty now?
  2 x (4 + 24 + 4) = 64 cycles
Disadvantages?
  Larger bus width (cost)
  Unit of memory addition is larger
  Read-modify-write for single-byte writes, if error correction is present
Technique-2: Interleaved-Memory
[Figure: CPU -- bus (1 word) -- Cache -- bus (1 word) -- Bank-1, Bank-2, Bank-3, Bank-4]
What is the miss penalty now?
  4 + 24 + 4 x 4 = 44 cycles
Notion of interleaving factor
Can the interleaving factor be anything?
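A small C sketch comparing the three organizations under the same assumed cycle counts; in the interleaved case the four bank accesses overlap, so only the word transfers are serialized:

#include <stdio.h>

int main(void) {
    int addr = 4, access = 24, xfer = 4, line = 4;
    int one_word_wide = line * (addr + access + xfer);        /* 128 cycles */
    int two_word_wide = (line / 2) * (addr + access + xfer);  /*  64 cycles */
    int interleaved   = addr + access + line * xfer;          /*  44 cycles */
    printf("%d %d %d\n", one_word_wide, two_word_wide, interleaved);
    return 0;
}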
Technique-3: Independent
Memory Banks
Multiple independent accesses
Separate address and data lines for each bank
Needed for the miss-under-miss (non-blocking cache) scheme
Also allows I/O in parallel with CPU accesses
Each independent bank may itself be
interleaved
Super-bank number and bank number
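A rough sketch of one possible decomposition (the geometry, 4 super-banks each with 4 word-interleaved banks of 1024 words, is an assumption for illustration, not from the lecture):

#include <stdio.h>

#define BANKS_PER_SUPER 4      /* interleaving factor inside a super-bank (assumed) */
#define WORDS_PER_BANK  1024   /* assumed bank size */
#define SUPER_BANKS     4      /* assumed number of independent super-banks */

int main(void) {
    unsigned addr   = 12345;                     /* arbitrary word address */
    unsigned bank   = addr % BANKS_PER_SUPER;    /* bank number: low-order interleaving bits */
    unsigned index  = addr / BANKS_PER_SUPER;    /* word index within the super-bank */
    unsigned offset = index % WORDS_PER_BANK;    /* word offset inside the bank */
    unsigned super  = (index / WORDS_PER_BANK) % SUPER_BANKS;  /* super-bank number */
    printf("addr %u -> super-bank %u, bank %u, offset %u\n", addr, super, bank, offset);
    return 0;
}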
Memory-Bank Conflicts
Code access patterns can often cause memory-bank conflicts
Under such conflicts, the independent memory-bank organization gives no benefit
Example:
int x[2][512];
int i, j;
for(j = 0; j < 512; j++) {
  for(i = 0; i < 2; i++) {
    x[i][j]++;    /* x[0][j] and x[1][j] are 512 words apart */
  }
}
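A quick check of the conflict, assuming 4 word-interleaved banks (an assumption for illustration): x[0][j] and x[1][j] are 512 words apart, and 512 is a multiple of 4, so both accesses of every inner iteration land in the same bank:

#include <stdio.h>

#define NBANKS 4               /* assumed number of word-interleaved banks */

int x[2][512];

int main(void) {
    for (int j = 0; j < 4; j++) {           /* a few iterations are enough to see the pattern */
        for (int i = 0; i < 2; i++) {
            int word_addr = i * 512 + j;    /* word offset of x[i][j] from x[0][0] */
            printf("x[%d][%d] -> bank %d\n", i, j, word_addr % NBANKS);
        }
    }
    return 0;
}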
Technique-4: Avoiding Memory-
Bank Conflicts
Software solutions:
Loop interchange (works for this example)
Expand array size so that it is not a power of two
Hardware solution:
Use prime number of banks
Bank number = Addr % #banks
Addr within bank = Addr / #banks
Alternative: Addr within bank = Addr % #words-within-bank
  (valid if #words within bank and #banks are co-prime)
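A small sketch checking that last claim with assumed sizes (7 banks of 512 words each); because 7 and 512 are co-prime, the pair (Addr % 7, Addr % 512) is distinct for every address, so the in-bank address needs no division by the bank count:

#include <assert.h>
#include <stdio.h>

#define NBANKS     7      /* prime number of banks (assumed) */
#define BANK_WORDS 512    /* words per bank; co-prime with NBANKS */

int main(void) {
    static int seen[NBANKS][BANK_WORDS];     /* zero-initialized */
    for (int addr = 0; addr < NBANKS * BANK_WORDS; addr++) {
        int bank   = addr % NBANKS;          /* bank number */
        int offset = addr % BANK_WORDS;      /* address within the bank */
        assert(!seen[bank][offset]);         /* no two addresses collide */
        seen[bank][offset] = 1;
    }
    printf("mapping is one-to-one for %d addresses\n", NBANKS * BANK_WORDS);
    return 0;
}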
Technique-5: DRAM-Specific
Interleaving
DRAM has RAS and CAS
Usually RAS and CAS are given one after
another
Same RAS can be used to read multiple
columns
DRAMs come with separate signals to allow
such access
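A toy cost model of that idea (the cycle counts below are assumptions, not from the lecture): once a row has been opened with RAS, further columns in the same row pay only the CAS cost:

#include <stdio.h>

#define RAS_CYCLES 15    /* assumed cost to open a row */
#define CAS_CYCLES 10    /* assumed cost to read a column from the open row */

static int open_row = -1;        /* currently open row, -1 = none */

int dram_access(int row, int col) {
    int cycles = 0;
    if (row != open_row) {       /* row miss: must issue RAS first */
        cycles += RAS_CYCLES;
        open_row = row;
    }
    cycles += CAS_CYCLES;        /* CAS for the requested column */
    (void)col;                   /* column value does not affect the cost here */
    return cycles;
}

int main(void) {
    int total = 0;
    for (int col = 0; col < 4; col++)       /* four words from the same row */
        total += dram_access(7, col);
    printf("total cycles = %d\n", total);   /* 15 + 4*10 = 55, versus 4*(15+10) = 100 */
    return 0;
}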
Now, various remarks before finishing up with
memory-hierarchy design
Virtual Memory and Protection
OS requires support in terms of:
Two modes (at least) of execution: user,
supervisor/kernel
Some CPU state which is readable but not
writable in user mode
TLB
User/supervisor mode bit
Mechanisms to switch between the modes
System calls
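A toy model of these requirements (names like write_ptb and syscall_entry are made up for illustration, not from any real ISA): privileged state is readable anywhere but writable only in supervisor mode, and the mode is switched only through a controlled entry point:

#include <stdio.h>

typedef enum { USER, SUPERVISOR } Mode;

static Mode mode = USER;          /* the user/supervisor mode bit */
static int  page_table_base;      /* readable everywhere, writable only in kernel mode */

int write_ptb(int value) {
    if (mode != SUPERVISOR)
        return -1;                /* a real CPU would raise a protection fault */
    page_table_base = value;
    return 0;
}

void syscall_entry(void)  { mode = SUPERVISOR; }   /* system call: switch into kernel */
void syscall_return(void) { mode = USER; }         /* return to user mode */

int main(void) {
    if (write_ptb(42) < 0) printf("user mode: write rejected\n");
    syscall_entry();
    write_ptb(42);                /* allowed in supervisor mode */
    syscall_return();
    printf("page_table_base = %d\n", page_table_base);
    return 0;
}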
ILP and Caching
Superscalar execution:
Cache must have enough ports to match the
peak bandwidth
Hit-under-miss, Miss-under-miss required
Speculative execution:
Suppress exception on speculative instructions
Don't stall the cache on a speculative instruction
cache miss
ILP vs. Caching:
Compiler Choices
Version-1 (i in the inner loop): the inner-loop iterations are independent, which favors ILP, but the column-wise access pattern has poor spatial locality.

int x[32][512];
int i, j;
for(j = 1; j < 512; j++) {     /* j starts at 1 so that x[i][j-1] stays in bounds */
  for(i = 0; i < 32; i++) {
    x[i][j] = 2*x[i][j-1];
  }
}

Version-2 (j in the inner loop): the row-major traversal gives good spatial locality, but the x[i][j-1] dependence serializes the inner loop.

int x[32][512];
int i, j;
for(i = 0; i < 32; i++) {
  for(j = 1; j < 512; j++) {
    x[i][j] = 2*x[i][j-1];
  }
}
Caches and Consistency
I/O using caches?
Interferes with the CPU, and may throw out useful blocks
I/O using main memory
Write-through ==> No problem for CPU output
What about input?
Approach-1: OS marks memory block as non-cacheable
Approach-2: OS flushes the cache block after input
Approach-3: h/w checks if block is present in cache,
invalidate if cached (parallel set of tags for perf.)
Multi-processors want the same data in many caches: the cache-coherence problem