Lecture 8: Cont. Cache Memory

William Stallings
Computer Organization and Architecture
8th Edition

Chapter 4: Cache Memory

Original slides by Adrian J. Pullin
Cont. Cache Memory
Lecture Outcomes
Understanding of:
• Replacement Algorithms
• Write Policy
• Cache Performance
• Locality of Reference
• Pentium 4 Cache Organization
• ARM Cache Organization
Replacement Algorithms (1) Direct mapping

• No choice
• Each block only maps to one line
• Replace that line
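Direct mapping needs no replacement policy because the line is fully determined by the block address. A one-line Python sketch (the function name is illustrative):

```python
# Direct mapping: block i can live only in line (i mod number_of_lines),
# so on a miss that line is replaced unconditionally.
def line_for_block(block, num_lines):
    return block % num_lines

print(line_for_block(17, num_lines=8))   # block 17 always maps to line 1
```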
Replacement Algorithms (2) Associative & Set Associative
• Hardware-implemented algorithm (for speed)
• Least Recently used (LRU)
• e.g. in 2 way set associative
– Which of the two blocks is least recently used?
• First in first out (FIFO)
– replace block that has been in cache longest
• Least frequently used (LFU)
– replace the block that has had the fewest hits
• Random
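A minimal Python sketch of LRU in a 2-way set-associative cache (the class and method names are invented for illustration; real hardware typically tracks recency with a single USE bit per set):

```python
# Sketch of LRU replacement in a 2-way set-associative cache.
class Cache2Way:
    def __init__(self, num_sets):
        self.num_sets = num_sets
        # Each set holds up to two block numbers, ordered LRU -> MRU.
        self.sets = [[] for _ in range(num_sets)]

    def access(self, block):
        """Return True on a hit, False on a miss (with LRU replacement)."""
        s = self.sets[block % self.num_sets]
        if block in s:
            s.remove(block)          # hit: move block to MRU position
            s.append(block)
            return True
        if len(s) == 2:
            s.pop(0)                 # miss: evict the least recently used block
        s.append(block)              # fill with the new block
        return False

cache = Cache2Way(num_sets=4)
for b in [0, 4, 0, 8, 4]:            # blocks 0, 4 and 8 all map to set 0
    print(b, "hit" if cache.access(b) else "miss")
```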
Write Policy
• Must not overwrite a cache block unless main memory is
up to date
• Multiple CPUs may have individual caches
• I/O may address main memory directly
Write through
• All writes go to main memory as well as cache
• Multiple CPUs can monitor main memory traffic to keep
local (to CPU) cache up to date
• Lots of traffic
• Slows down writes
• Remember bogus write-through caches!
Write back
• Updates initially made in cache only
• Update bit for cache slot is set when update occurs
• If block is to be replaced, write to main memory only if
update bit is set
• Other caches get out of sync
• I/O must access main memory through cache
• N.B. 15% of memory references are writes
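A minimal sketch contrasting the two policies on a single cache line (names are illustrative; the dirty flag plays the role of the update bit):

```python
# Write-through vs. write-back on a single cache line (illustrative only).
class Line:
    def __init__(self, tag):
        self.tag, self.data, self.dirty = tag, None, False

def write_through(line, data, memory):
    line.data = data
    memory[line.tag] = data          # every write also goes to main memory

def write_back(line, data):
    line.data = data
    line.dirty = True                # defer the memory update (set update bit)

def evict(line, memory):
    if line.dirty:                   # write back only if the line was modified
        memory[line.tag] = line.data

memory = {}
a, b = Line(tag=1), Line(tag=2)
write_through(a, 10, memory)         # memory[1] == 10 immediately
write_back(b, 20)                    # memory still has no entry for tag 2
evict(b, memory)                     # now memory[2] == 20
print(memory)                        # {1: 10, 2: 20}
```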
Multilevel Caches
• High logic density enables caches on chip
– Faster than bus access
– Frees bus for other transfers
• Common to use both on and off chip cache
– L1 on chip, L2 off chip in static RAM
– L2 access much faster than DRAM or ROM
– L2 often uses separate data path
– L2 may now be on chip
– Resulting in L3 cache
– L3 accessed via the bus, or now also on chip…
Measuring Cache Performance
• No cache: Often about 10 cycles per memory access
• Simple cache:
– tave = hC + (1-h)M
– C is often 1 clock cycle
– Assume M is 17 cycles (to load an entire cache line)
– Assume h is about 90%
– tave = 0.9(1) + 0.1(17) = 2.6 cycles/access
– What happens when h is 95%?

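These figures are easy to check, and the same arithmetic answers the 95% question (a plain Python sketch):

```python
# Average access time: t_ave = h*C + (1 - h)*M
def t_ave(h, C=1, M=17):
    return h * C + (1 - h) * M

print(t_ave(0.90))   # 2.6 cycles/access, as above
print(t_ave(0.95))   # 1.8 cycles/access
```

Raising the hit rate from 90% to 95% cuts the average access time from 2.6 to 1.8 cycles per access.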
Multi-level cache performance
• tave = h1C1 + (1-h1) h2C2 + (1-h1) (1-h2) M
– h1 = hit rate in primary cache
– h2 = hit rate in secondary cache
– C1 = time to access primary cache
– C2 = time to access secondary cache
– M = miss penalty (time to load an entire cache line
from main memory)
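The two-level formula can be evaluated the same way; as a sketch, here it is with the parameter values used in the worked example later in the lecture (h1 = 0.95, h2 = 0.90, C1 = 1, C2 = 25, M = 500):

```python
# Two-level average access time:
# t_ave = h1*C1 + (1 - h1)*h2*C2 + (1 - h1)*(1 - h2)*M
def t_ave2(h1, h2, C1, C2, M):
    return h1 * C1 + (1 - h1) * h2 * C2 + (1 - h1) * (1 - h2) * M

print(t_ave2(h1=0.95, h2=0.90, C1=1, C2=25, M=500))   # 4.575 cycles/access
```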
Processor Performance Without Cache

• 5GHz processor, cycle time = 0.2ns


• Memory access time = 100ns = 500 cycles
• Ignoring memory access time, Clocks Per Instruction (CPI) = 1
• Assuming one instruction fetch per instruction and no memory data accesses:
CPI = 1 + # stall cycles
= 1 + 500 = 501

Performance with Level 1 Cache

• Assume hit rate, h1 = 0.95


• 5GHz processor, cycle time = 0.2ns
• Memory access time = 100ns = 500 cycles
• L1 access time = 0.2 ns / cycle time (0.2 ns) = 1 cycle
• CPI = 1 + # stall cycles
= 1 + 0.05 x 500
= 26
• Processor speed increase due to cache
= 501/26 ≈ 19.3×

Performance with L1 and L2 Caches

• Assume:
– L1 hit rate, h1 = 0.95
– L2 hit rate, h2 = 0.90 (this is very optimistic!)
– L2 access time = 5ns = 25 cycles
• CPI = 1 + # stall cycles
= 1 + 0.05 (25 + 0.10 x 500)
= 1 + 3.75 = 4.75
• Processor speed increase due to both caches
= 501/4.75 ≈ 105.5×
• Speed increase due to L2 cache (over L1 alone)
= 26/4.75 ≈ 5.47×
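The three CPI figures (no cache, L1 only, L1 + L2) can be reproduced directly from the stall-cycle model used in these slides:

```python
# CPI = 1 + stall cycles per instruction (instruction fetches only,
# no data accesses), using the slide parameters.
M, C2 = 500, 25            # main memory and L2 access times in cycles
h1, h2 = 0.95, 0.90        # L1 and L2 hit rates

cpi_no_cache = 1 + M                                   # 501
cpi_l1       = 1 + (1 - h1) * M                        # 26
cpi_l1_l2    = 1 + (1 - h1) * (C2 + (1 - h2) * M)      # 4.75

print(cpi_no_cache, cpi_l1, cpi_l1_l2)
print(cpi_no_cache / cpi_l1_l2)                        # ~105.5x overall speedup
```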

Example: Hit Ratio (L1 & L2)
[Figure omitted: total hit ratio (L1 and L2) for 8 Kbyte and 16 Kbyte L1 caches]
Unified vs. Split Caches
• One cache for data and instructions or two, one for data and one for
instructions
• Advantages of unified cache
– Higher hit rate: balances the load between instruction and data fetches
– Only one cache to design and implement
• Advantages of split cache
– Eliminates cache contention between the instruction fetch/decode unit and the execution unit
– Important in pipelining
Pentium 4 Cache
• 80386 – no on chip cache
• 80486 – 8 KByte cache using 16-byte lines and four-way set-associative organization
• Pentium (all versions) – two on chip L1 caches
– Data & instructions
• Pentium III – L3 cache added off chip
• Pentium 4
– L1 caches
• 8k bytes
• 64 byte lines
• four way set associative
– L2 cache
• Feeding both L1 caches
• 256k
• 128 byte lines
• 8 way set associative
– L3 cache on chip
Pentium 4 Design Reasoning
• Decodes instructions into RISC-like micro-ops before L1 cache
• Micro-ops fixed length
– Superscalar pipelining and scheduling
• Pentium instructions long & complex
• Performance improved by separating decoding from scheduling & pipelining
– (More later – ch14)
• Data cache is write back
– Can be configured to write through
• L1 cache controlled by two bits in control register CR0
– CD = cache disable
– NW = not write-through
– Two instructions: invalidate (flush) the cache, and write back then invalidate
• L2 and L3 8-way set-associative
– Line size 128 bytes
ARM Cache Features

Core              Cache Type   Cache Size (kB)    Line Size (words)   Associativity   Location   Write Buffer Size (words)
ARM720T           Unified      8                  4                   4-way           Logical    8
ARM920T           Split        16/16 D/I          8                   64-way          Logical    16
ARM926EJ-S        Split        4-128/4-128 D/I    8                   4-way           Logical    16
ARM1022E          Split        16/16 D/I          8                   64-way          Logical    16
ARM1026EJ-S       Split        4-128/4-128 D/I    8                   4-way           Logical    8
Intel StrongARM   Split        16/16 D/I          4                   32-way          Logical    32
Intel XScale      Split        32/32 D/I          8                   32-way          Logical    32
ARM1136-JF-S      Split        4-64/4-64 D/I      8                   4-way           Physical   32
ARM Cache Organization
• Small FIFO write buffer
– Enhances memory write performance
– Between cache and main memory
– Small compared with the cache
– Data put in write buffer at processor clock speed
– Processor continues execution
– External writes to memory proceed in parallel until the buffer is empty
– If buffer full, processor stalls
– Data in write buffer not available until written
• So keep buffer small
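A minimal sketch of the stall-on-full behavior described above (the depth and names are illustrative, not a model of any particular ARM core):

```python
# FIFO write buffer: the processor deposits writes at full speed and
# stalls only when the buffer is full; a slower external interface
# drains entries to main memory in parallel. Illustrative sketch.
from collections import deque

class WriteBuffer:
    def __init__(self, depth=8):
        self.fifo = deque()
        self.depth = depth

    def cpu_write(self, addr, data):
        """Return False if the processor must stall (buffer full)."""
        if len(self.fifo) >= self.depth:
            return False
        self.fifo.append((addr, data))   # accepted at processor clock speed
        return True

    def drain_one(self, memory):
        """Called by the memory interface; empties the buffer in parallel."""
        if self.fifo:
            addr, data = self.fifo.popleft()
            memory[addr] = data
```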
ARM Cache and Write Buffer Organization
[Figure omitted]
Review Questions

❑What are the differences among sequential access, direct access, and random
access?
❑What is the general relationship among access time, memory cost, and capacity?
❑How does the principle of locality relate to the use of multiple memory levels?
❑What is the distinction between spatial locality and temporal locality?
❑In general, what are the strategies for exploiting spatial locality and temporal
locality?
Thank you
