Cache Assignment

The document outlines a project to simulate L1 cache for quad-core processors in C++, emphasizing cache coherence using the MESI protocol. It details simulation parameters, input requirements, expected outputs, and the structure of a report to be submitted. The project includes performance metrics such as cache hit rates and execution cycles, with a focus on varying cache parameters to analyze their impact on execution time.

Uploaded by

Mayank Goel

1 Problem statement

Simulate L1 cache in C++ for quad core processors, with cache coherence support.

1.1 Simulation details

1. Memory addresses are 32 bits wide. If an address has fewer than 32 bits, assume the remaining most
significant bits are 0. E.g. the address 0x817b08 is actually 0x00817b08.
2. Each memory reference accesses 32 bits (4 bytes) of data. That is, the word size is 4 bytes.
3. We are only interested in the data cache and will not model the instruction cache.
4. Each processor has its own L1 data cache. The L1 data caches are backed up by main memory —
there is no L2 data cache.
5. L1 data cache uses write-back, write-allocate policy and LRU replacement policy.
6. L1 data caches are kept coherent using MESI cache coherence protocol.
7. Initially all the caches are empty.
8. Ties are broken arbitrarily on the bus, when multiple cores attempt bus transactions simultaneously.
9. An L1 cache hit takes 1 cycle. Fetching a block from memory into the cache takes an additional 100 cycles.
Sending a word from one cache to another (e.g., BusUpdate) takes only 2 cycles. However, sending a cache
block with N words (each word is 4 bytes) to another cache takes 2N cycles. Assume that evicting a dirty
cache block to memory when it gets replaced takes 100 cycles.
10. Assume that the caches are blocking. That is, if there is a cache miss, the cache cannot process further
requests from the processor core and the core is completely halted. However, the snooping transactions
from the bus still need to be processed in the cache.
11. In each cycle, each core can execute at most one memory reference instruction.
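The latencies in item 9 can be kept in one place so that every part of the simulator charges the same costs. The following is a minimal sketch, not a prescribed design: all names (`kHitCycles`, `cacheToCacheCycles`, `missStallCycles`) are hypothetical, and the way the costs are combined in `missStallCycles` (a dirty eviction is serialised with the fill) is an assumption that the handout leaves open and that should be stated in the report if adopted.

```cpp
#include <cstdint>

// Latencies taken from item 9 of the simulation details.
constexpr std::uint64_t kHitCycles = 1;          // L1 cache hit
constexpr std::uint64_t kMemFetchCycles = 100;   // block fetched from main memory
constexpr std::uint64_t kWritebackCycles = 100;  // dirty block evicted to main memory
constexpr std::uint64_t kCyclesPerWord = 2;      // one word sent from cache to cache
constexpr std::uint64_t kWordBytes = 4;          // word size from item 2

// Cycles to send a block of blockBytes bytes to another cache: 2N for N words.
constexpr std::uint64_t cacheToCacheCycles(std::uint64_t blockBytes) {
    return kCyclesPerWord * (blockBytes / kWordBytes);
}

// ASSUMPTION (not in the handout): the stall for a miss is the fill latency,
// plus the writeback latency when the victim line is dirty.
constexpr std::uint64_t missStallCycles(std::uint64_t blockBytes,
                                        bool suppliedByOtherCache,
                                        bool evictsDirtyLine) {
    return (suppliedByOtherCache ? cacheToCacheCycles(blockBytes) : kMemFetchCycles) +
           (evictsDirtyLine ? kWritebackCycles : 0);
}
```

With the default 32-byte block, a cache-to-cache transfer moves 8 words and therefore costs 16 cycles under this model.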

You may need to make additional assumptions. Clearly state those assumptions in your report.
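Items 6 and 10 require each cache to react to bus traffic even while its own core is stalled. The sketch below shows the snooping side of textbook MESI for two bus transactions. The transaction names (`BusRd`, `BusRdX`) and the `SnoopResult` type are assumptions made for illustration; the handout does not fix the set of bus transactions, so a real submission may need more of them (for example an upgrade or invalidate request).

```cpp
// The four MESI states of a cache line.
enum class Mesi { Modified, Exclusive, Shared, Invalid };

// Bus transactions observed by a snooping cache (hypothetical names).
enum class BusOp {
    BusRd,   // another core wants to read the block
    BusRdX   // another core wants to write the block
};

struct SnoopResult {
    Mesi next;   // state of the local copy after the snoop
    bool flush;  // true if this cache held the only up-to-date copy and must supply it
};

// State change of a local line when another core's transaction is snooped.
inline SnoopResult snoop(Mesi current, BusOp op) {
    if (current == Mesi::Invalid) {
        return {Mesi::Invalid, false};      // nothing cached, nothing to do
    }
    const bool dirty = (current == Mesi::Modified);
    if (op == BusOp::BusRd) {
        return {Mesi::Shared, dirty};       // M, E and S all end up Shared
    }
    return {Mesi::Invalid, dirty};          // BusRdX invalidates every other copy
}
```

Keeping the transition in a pure function like this makes it straightforward to unit test the protocol separately from the timing model.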

1.2 Input details and command line parameters

Trace files will be input to your simulator. Each parallel application is run using 4 threads on 4 processor
cores, and 4 separate trace files with memory reference instructions have been generated.
E.g. app1 [Link], app1 [Link], app1 [Link] and app1 [Link] are 4 trace
files for the first application app1 running with 4 threads on 4 processor cores. Each of the four files contains
lines like the following, where the first column denotes Read (R) vs. Write (W), and the second column
denotes the memory address for the read/write operation. Four similar files will be given for other applications.

W 0x7e1afe78
R 0x7e1ac04c
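Lines in this format can be parsed with the standard library alone. The sketch below is one possible approach; `MemRef` and `parseTraceLine` are hypothetical names. Because the address is parsed as a number, an address written with fewer than 8 hex digits is zero-extended automatically, which matches item 1 of the simulation details.

```cpp
#include <cstddef>
#include <cstdint>
#include <sstream>
#include <stdexcept>
#include <string>

// One memory reference from a trace file.
struct MemRef {
    bool isWrite;
    std::uint32_t address;
};

// Parses one line such as "W 0x7e1afe78". Returns false on malformed input.
inline bool parseTraceLine(const std::string& line, MemRef& out) {
    std::istringstream in(line);
    char op = 0;
    std::string addr;
    if (!(in >> op >> addr)) return false;
    if (op != 'R' && op != 'W') return false;
    try {
        std::size_t used = 0;
        // Base 16 accepts an optional "0x" prefix.
        const unsigned long value = std::stoul(addr, &used, 16);
        if (used != addr.size() || value > 0xFFFFFFFFul) return false;
        out.isWrite = (op == 'W');
        out.address = static_cast<std::uint32_t>(value);
    } catch (const std::exception&) {
        return false;  // not a number, or out of range
    }
    return true;
}
```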

Traces for two parallel applications are available at [Link] 2025/assignment3_traces.zip.

$ make should create an executable L1simulate, which should take in the
following input parameters from the command line.
$ ./L1simulate -h
-t <tracefile>: name of parallel application (e.g. app1) whose 4 traces are to be used
in simulation
-s <s>: number of set index bits (number of sets in the cache = S = 2^s)
-E <E>: associativity (number of cache lines per set)
-b <b>: number of block bits (block size = B = 2^b)
-o <outfilename>: logs output in file for plotting etc.
-h: prints this help
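The -s and -b parameters determine how a 32-bit address is divided: the low b bits select a byte within the block, the next s bits select the set, and the remaining high bits form the tag. A minimal sketch of that split follows; `AddressParts` and `splitAddress` are hypothetical names, and the function assumes s + b < 32.

```cpp
#include <cstdint>

// The three fields of a 32-bit address as seen by the cache.
struct AddressParts {
    std::uint32_t tag;
    std::uint32_t setIndex;
    std::uint32_t blockOffset;
};

// Splits addr using s set-index bits and b block-offset bits.
// ASSUMPTION: s + b < 32, so every shift below is well defined.
inline AddressParts splitAddress(std::uint32_t addr, unsigned s, unsigned b) {
    const std::uint32_t offsetMask = (std::uint32_t{1} << b) - 1;
    const std::uint32_t setMask = (std::uint32_t{1} << s) - 1;
    return {addr >> (s + b), (addr >> b) & setMask, addr & offsetMask};
}
```

For example, with s = 6 and b = 5 the address 0x7e1afe78 splits into tag 0xfc35f, set index 51, and block offset 24.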

1.3 Simulation output

Your program should generate the following output in each run. You can generate additional output as well.
1. Number of read/write instructions per core.
2. Total execution cycles per core to complete its instruction trace.
3. Number of idle cycles per core (these are cycles where the core is waiting for the request to the cache
to be completed i.e. it is not a 1-cycle L1 cache hit).
4. Data cache miss rate for each core (miss rate = #misses/#accesses).
5. Number of cache evictions per core.
6. Number of writebacks per core.
7. Number of invalidations on the bus.
8. Amount of data traffic (in bytes) on the bus.
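The per-core outputs above map naturally onto a small counter structure. The sketch below is one possible layout; `CoreStats` and `missRate` are hypothetical names, and returning 0 for a core with no accesses is an assumption made to avoid dividing by zero.

```cpp
#include <cstdint>

// Counters for outputs 1 to 6, kept once per core.
struct CoreStats {
    std::uint64_t reads = 0;
    std::uint64_t writes = 0;
    std::uint64_t totalCycles = 0;
    std::uint64_t idleCycles = 0;
    std::uint64_t misses = 0;
    std::uint64_t evictions = 0;
    std::uint64_t writebacks = 0;
};

// miss rate = #misses / #accesses, where accesses = reads + writes.
// ASSUMPTION: a core with no accesses reports a miss rate of 0.
inline double missRate(const CoreStats& s) {
    const std::uint64_t accesses = s.reads + s.writes;
    return accesses == 0 ? 0.0
                         : static_cast<double>(s.misses) / static_cast<double>(accesses);
}
```

Bus-wide outputs (invalidations and data traffic in bytes) would be counted once for the whole system rather than per core.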

1.4 Experiments and report

Write a PDF report in LaTeX with three sections on the following:
1. Your implementation (e.g., main classes, data structures used, flow chart of important functions etc.).
2. Experimental section with graphs of the above simulation outputs with default parameters as 4KB
2-way set associative L1 cache per processor, with 32-byte block size. Run your simulator 10 times for
the same application with these same default parameters, and report distributions of the outputs over
the 10 runs. Remark on which outputs remain the same vs. which outputs change with each simulator
run, and why.
3. Add code in the simulator to compute maximum execution time for any core for an application. Vary
the cache parameters (cache size, associativity, block size) from default to 3 other values, in powers of
2, one parameter at a time, keeping the others constant. Plot maximum execution time vs. each cache
parameter, and explain your observations.
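The experiments are described in terms of cache size, while the command line takes s, E and b. Since cache size = 2^s × E × 2^b bytes, the default 4KB, 2-way cache with 32-byte blocks has 4096 / (2 × 32) = 64 sets, so s = 6, E = 2 and b = 5. The helper functions below sketch that conversion and the maximum-over-cores metric from item 3; all names are hypothetical.

```cpp
#include <algorithm>
#include <array>
#include <cstdint>

// Returns log2(x) when x is a power of two, or -1 otherwise.
inline int log2Exact(std::uint64_t x) {
    if (x == 0 || (x & (x - 1)) != 0) return -1;
    int bits = 0;
    while (x > 1) {
        x >>= 1;
        ++bits;
    }
    return bits;
}

// Number of set-index bits s such that 2^s * ways * blockBytes == cacheBytes.
// Returns -1 if the combination does not give a power-of-two number of sets.
inline int setIndexBits(std::uint64_t cacheBytes, std::uint64_t ways,
                        std::uint64_t blockBytes) {
    if (ways == 0 || blockBytes == 0) return -1;
    if (cacheBytes % (ways * blockBytes) != 0) return -1;
    return log2Exact(cacheBytes / (ways * blockBytes));
}

// Maximum execution time over the four cores.
inline std::uint64_t maxExecutionCycles(const std::array<std::uint64_t, 4>& perCore) {
    return *std::max_element(perCore.begin(), perCore.end());
}
```

Note that when the cache size is held constant, doubling the associativity or the block size halves the number of sets, so s decreases by one.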

1.5 What to submit and grading criteria

Use git and commit regularly. Submit the repo (without the memory traces) on Moodle as
entrynum1 entrynum2 [Link], including (a) source code and Makefile, (b) readme on how to compile
and run your code, and (c) PDF report as described above.
1. Code compiles and runs correctly with given traces [6 marks]
2. Code runs correctly on unseen traces [3 marks]
3. Report [6 marks]
4. Q/A on code and observations [5 marks]
5. Bonus [2 marks] Generate interesting traces (e.g. to demonstrate false sharing). These traces can be
small, and even hand-generated.
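As an illustration of the bonus item, false sharing arises when two cores repeatedly write different words that fall in the same cache block, so each write invalidates the other core's copy. The sketch below generates such trace lines in the format shown in section 1.2; the function names and the example addresses are hypothetical, and writing the lines to per-core trace files is left out.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <string>
#include <vector>

// Builds `count` identical write references to one address, one per trace line.
inline std::vector<std::string> repeatedWrites(std::uint32_t addr, int count) {
    char buf[16];
    std::snprintf(buf, sizeof buf, "W 0x%08x", static_cast<unsigned>(addr));
    const std::size_t n = count > 0 ? static_cast<std::size_t>(count) : 0;
    return std::vector<std::string>(n, std::string(buf));
}

// True when two addresses fall in the same block of 2^blockBits bytes.
inline bool sameBlock(std::uint32_t a, std::uint32_t b, unsigned blockBits) {
    return (a >> blockBits) == (b >> blockBits);
}
```

For instance, giving one core `repeatedWrites(0x1000, 100)` and another `repeatedWrites(0x1004, 100)` produces writes to distinct words of the same 32-byte block (b = 5), whereas moving the second address to 0x1020 places it in a different block and should remove the coherence traffic.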

