A. Instruction-Level Parallelism: Ntroduction

VLIW and superscalar processors aim to increase instruction-level parallelism (ILP) to improve processor throughput. VLIW exposes more parallelism to software by having the compiler schedule instructions, while superscalar does dynamic scheduling in hardware. VLIW simplifies the processor design by removing complex decode and dispatch logic. It relies on the compiler to schedule instructions for parallel execution on functional units, whereas superscalar identifies parallelism at runtime. Both can achieve ILP but take different approaches through static versus dynamic scheduling.

Uploaded by

Jessica Lovey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

117 views3 pages

A. Instruction-Level Parallelism: Ntroduction

Uploaded by

Jessica Lovey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

INTRODUCTION :

Very Long Instruction Word (VLIW) processors [2, 3] are examples of architectures for hich the program pro!ides explicit information regarding parallelism" #he compiler identifies the parallelism in the program and communicates it to the hard are $y specifying hich operations are independent of one another" #his information is of direct !alue to the hard are, since it %no s ith no further chec%ing hich operations it can start executing in the same cycle" In this report, e introduce the &xplicitly 'arallel Instruction (omputing (&'I() style of architecture, an e!olution of VLIW hich has a$sor$ed many of the $est ideas of superscalar processors, al$eit in a form adapted to the &'I( philosophy" &'I( is not so much an architecture as it is a philosophy of ho to $uild IL' processors along ith a set of architectural features that support this philosophy" In this sense &'I( is li%e )I*(+ it denotes a class of architectures, all of hich su$scri$e to a common architectural philosophy" ,ust as there are many distinct )I*( architectures (-e lett.'ac%ard/s '0)I*(, *ilicon 1raphic/s 2I'* and *un/s *'0)() there can $e more than one instruction set architecture (I*0) ithin the &'I( fold" "

A. Instruction-level parallelism
0 common design goal for general.purpose processors is to maximi3e throughput, hich may $e defined $roadly as the amount of or% performed in a gi!en time" 0!erage processor throughput is a function of t o !aria$les4 the a!erage num$er of cloc% cycles re5uired to execute an instruction, and the fre5uency of cloc% cycles" #o increase throughput, then,

a designer could increase the cloc% rate of the architecture, or increase the a!erage instruction-level parallelism(IL') of the architecture" 2odern processor design has focused on executing more instructions in a gi!en num$er of cloc% cycles, that is, increasing IL'" 0 num$er of techni5ues may $e used" 6ne techni5ue, pipelining, is particularly popular $ecause it is relati!ely simple, and can $e used in con7unction ith superscalar and VLIW techni5ues" 0ll modern ('8 architectures are pipelined""

B. Pipelining
0ll instructions are executed in multiple stages" 9or example, a simple processor may ha!e fi!e stages4 first the instruction must $e fetched from cache, then it must $e decoded, the instruction must $e executed, and any memory referenced $y the instruction must $e loaded or stored" 9inally the result of the instruction is stored in registers" #he output from one stage ser!es as the input to the next stage, forming a pipeline of instruction implementation" #hese stages are fre5uently independent of each other, so, if separate hard are is used to perform each stage, multiple instructions may $e :in flight; at once, ith each instruction at a different stage in the pipeline" Ignoring potential pro$lems, the theoretical increase in speed is

proportional to the length of the pipeline4 longer pipelines means more simultaneous in.flight instructions and therefore fe er a!erage cycles per instruction" #he ma7or potential pro$lem ith pipelining is the potential for hazards" 0 ha3ard occurs hen an instruction in the pipeline cannot $e executed" -ennessey and 'atterson identify three types of ha3ards4 structural hazards, here there simply isn<t sufficient hard are to execute all paralleli3a$le instructions at once+ data hazards, here an instruction depends on the result of a pre!ious instruction+ and control hazards, hich arise from instructions hich change the program counter (ie, $ranch instructions)" Various techni5ues exist for managing ha3ards" #he simplest of these is simply to stall the pipeline until the instruction causing the ha3ard has completed"

VLIW :

VLIW architecture 0ll this additional hard are is complex, and contri$utes to the transistor count of the processor" 0ll other things $eing e5ual, more transistors e5uals more po er consumption, more heat, and less on.die space for cache" #hus it seems $eneficial to expose more of the architecture<s parallelism to the programmer" #his ay, not only is the architecture simplified, $ut programmers ha!e more control o!er the hard are, and can ta%e $etter ad!antage of it" VLIW is an architecture designed to help soft are designers extract more parallelism from their soft are than ould $e possi$le using a traditional )I*( design" It is an alternati!e to $etter.%no n superscalar architectures" VLIW is a lot simpler than superscalar designs, $ut has not so far $een commercially successful" 9igure sho s a typical VLIW architecture" =ote the simplified instruction decode and dispatch"

A. ILP in VLIW
VLIW and superscalar approach the IL' pro$lem differently" #he %ey difference $et een the t o is here instruction scheduling is performed4 in a superscalar architecture, scheduling is performed in hard are (and is called dynamic scheduling, $ecause the schedule of a gi!en piece of code may differ depending on the code path follo ed), hereas in a VLIW scheduling is performed in soft are ( static scheduling, $ecause the schedule is :$uilt in to the $inary; $y the compiler or assem$ly language programmer)"

B. Superscalar
8sually, the execution phase of the pipeline ta%es the longest" 6n modern hard are, the execution of the instruction may $e performed $y one of a num$er of functional units" 9or example, integer instructions may $e

executed $y the 0L8, hereas floating.point operations are performed $y the 9'8" 6n a traditional, scalar pipelined architecture, either one or the other of these units ill al ays $e idle, depending on the instruction $eing executed" 6n a superscalar architecture, instructions may $e executed in parallel on multiple functional units" #he pipeline is essentially split after instruction issue"

C. Interloc ing
0nother architecture feature present in some )I*( and VLIW architectures $ut ne!er in superscalar/s is lac% of interloc%s" In a pipelined processor, it is important to ensure that a stall some here in the pipeline on<t result in the machine performing incorrectly" #his could happen if later stages of the pipeline do not detect the stall, and thus proceed as if the stalled stage had completed" #o pre!ent this, most architectures incorporate interloc s on the pipeline stages" )emo!ing interloc%s from the architecture is $eneficial, $ecause they complicate the design and can ta%e time to set up, lo ering the o!erall cloc% rate" -o e!er, doing so means that the compiler (or assem$ly.language programmer) must %no details a$out the timing of pipeline stages for each instruction in the processor, and insert =6's into the code to ensure correctness" #his ma%es code incredi$ly hard are.specific" >oth the architectures studied in detail $elo are fully interloc%ed, though *un<s ill.fated 20,( architecture as not, and relied on fast, uni!ersal ,I# compilation to sol!e the hard are pro$lems"

Instruction-Level Parallelism Explained
No ratings yet
Instruction-Level Parallelism Explained
3 pages
A Comparative Report Between EPIC and VLIW Architecture
No ratings yet
A Comparative Report Between EPIC and VLIW Architecture
2 pages
EPIC vs. VLIW: Compiler Support Analysis
No ratings yet
EPIC vs. VLIW: Compiler Support Analysis
4 pages
VLIW Architecture in ECC Processors
No ratings yet
VLIW Architecture in ECC Processors
19 pages
Advanced Pipelining Techniques Explained
No ratings yet
Advanced Pipelining Techniques Explained
8 pages
VLIW vs. Superscalar Processors
No ratings yet
VLIW vs. Superscalar Processors
35 pages
Very Long Instruction Word A Higher Performance PR
No ratings yet
Very Long Instruction Word A Higher Performance PR
6 pages
Pipelining - Lec 2-3-4
No ratings yet
Pipelining - Lec 2-3-4
72 pages
Superscalar and Vliw Architectures
No ratings yet
Superscalar and Vliw Architectures
2 pages
VLIW Architecture
No ratings yet
VLIW Architecture
5 pages
Advanced Computer Architecture Prof Thriveni T K
No ratings yet
Advanced Computer Architecture Prof Thriveni T K
59 pages
ACA Mod2
No ratings yet
ACA Mod2
45 pages
Mod5 1
No ratings yet
Mod5 1
18 pages
ACA Module2 2018.PDF Extra
No ratings yet
ACA Module2 2018.PDF Extra
48 pages
VLIW Philips
No ratings yet
VLIW Philips
11 pages
Very Long Instruction Word
No ratings yet
Very Long Instruction Word
1 page
Pipelining - Lec 3-Modified
No ratings yet
Pipelining - Lec 3-Modified
36 pages
Chapter 04 Processors and Memory Hierarchy
75% (8)
Chapter 04 Processors and Memory Hierarchy
50 pages
Chapter 04 Processors and Memory Hierarchy PDF
No ratings yet
Chapter 04 Processors and Memory Hierarchy PDF
50 pages
15CS72 ACA Module2Final
No ratings yet
15CS72 ACA Module2Final
29 pages
Computer Architecture Unit 3
No ratings yet
Computer Architecture Unit 3
8 pages
Crusoe Is The New Microprocessor Which Has Been Designed Specially For The
No ratings yet
Crusoe Is The New Microprocessor Which Has Been Designed Specially For The
31 pages
Zareen 13
No ratings yet
Zareen 13
13 pages
VLIW Processor Architecture Guide
No ratings yet
VLIW Processor Architecture Guide
23 pages
Name: Rafi Dar: Very Large Instruction Word
No ratings yet
Name: Rafi Dar: Very Large Instruction Word
18 pages
OLP Notes
No ratings yet
OLP Notes
11 pages
Computer Architecture 09-Superscalar
No ratings yet
Computer Architecture 09-Superscalar
83 pages
Crusoe: Mobile Microprocessor Revolution
No ratings yet
Crusoe: Mobile Microprocessor Revolution
17 pages
Extracting ILP on MIMD Architectures
No ratings yet
Extracting ILP on MIMD Architectures
10 pages
VLIW Architecture and Processor Performance
No ratings yet
VLIW Architecture and Processor Performance
30 pages
Understanding Implicit Parallelism in Computing
No ratings yet
Understanding Implicit Parallelism in Computing
17 pages
VLIW Processor Architecture Overview
No ratings yet
VLIW Processor Architecture Overview
3 pages
Very Large Scale Instruction Word
No ratings yet
Very Large Scale Instruction Word
22 pages
Generating A Periodic Pattern For VLIW
No ratings yet
Generating A Periodic Pattern For VLIW
18 pages
Advantages of Tomasulo's Scheme in CPUs
No ratings yet
Advantages of Tomasulo's Scheme in CPUs
30 pages
MIPS Superscalar Simulator
No ratings yet
MIPS Superscalar Simulator
5 pages
18 20210619 Computer Architecture Super Pipelined VLIW Processor Architecture
No ratings yet
18 20210619 Computer Architecture Super Pipelined VLIW Processor Architecture
15 pages
Cs2354 Advanced Computer Architecture 2 Marks
No ratings yet
Cs2354 Advanced Computer Architecture 2 Marks
10 pages
Unit 2 - Advanced Computer Architecture - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Advanced Computer Architecture - WWW - Rgpvnotes.in
15 pages
Superpipelining in Computer Architecture
No ratings yet
Superpipelining in Computer Architecture
7 pages
Vliw Architecture
No ratings yet
Vliw Architecture
30 pages
Lecture #2
No ratings yet
Lecture #2
11 pages
EPIC Architecture: A New Paradigm in ILP
No ratings yet
EPIC Architecture: A New Paradigm in ILP
8 pages
VLIW Processor Architecture Overview
No ratings yet
VLIW Processor Architecture Overview
7 pages
RHD'L: Instruction-Level Parallel Processing: History, Overview and Perspective
No ratings yet
RHD'L: Instruction-Level Parallel Processing: History, Overview and Perspective
57 pages
RHD'L: Instruction-Level Parallel Processing: History, Overview and Perspective
No ratings yet
RHD'L: Instruction-Level Parallel Processing: History, Overview and Perspective
42 pages
Aca Notes
No ratings yet
Aca Notes
23 pages
L8 Processor Types Parallelism Other Features Jan24 2024
No ratings yet
L8 Processor Types Parallelism Other Features Jan24 2024
2 pages
Lec9 Multiple Issue Processors
No ratings yet
Lec9 Multiple Issue Processors
33 pages
VLIW Processors Explained
No ratings yet
VLIW Processors Explained
17 pages
Module II
No ratings yet
Module II
60 pages
Module3
No ratings yet
Module3
49 pages
CNC Boot System Guide
No ratings yet
CNC Boot System Guide
8 pages
CPEN 100 Structural Components of Microprocessor and Microcontroller.1
No ratings yet
CPEN 100 Structural Components of Microprocessor and Microcontroller.1
35 pages
Samsung Confidential: 8. Block Diagram and Schematic
No ratings yet
Samsung Confidential: 8. Block Diagram and Schematic
44 pages
Computer Project File
No ratings yet
Computer Project File
15 pages
Chapter 1 Summary
No ratings yet
Chapter 1 Summary
12 pages
Programming Fundamentals Lec 1,2.3
No ratings yet
Programming Fundamentals Lec 1,2.3
34 pages
Banking Services in NCT of Delhi
No ratings yet
Banking Services in NCT of Delhi
15 pages
Stanford Advanced Caches
No ratings yet
Stanford Advanced Caches
46 pages
Ict Syllabus
No ratings yet
Ict Syllabus
99 pages
HowTo - Repair Windows 7 Install After Replacing Motherboard - Dowd and Associates
No ratings yet
HowTo - Repair Windows 7 Install After Replacing Motherboard - Dowd and Associates
4 pages
ATtiny Microcontroller Comparison Chart - Wikipedia
No ratings yet
ATtiny Microcontroller Comparison Chart - Wikipedia
15 pages
X20CP158x and X20CP358x: 1 General Information
No ratings yet
X20CP158x and X20CP358x: 1 General Information
23 pages
Poweredge r510 Specs en
No ratings yet
Poweredge r510 Specs en
2 pages
Hardware Platform Monitoring Guide
No ratings yet
Hardware Platform Monitoring Guide
370 pages
Secure PCI 5.x Encrypting PIN Pad
100% (1)
Secure PCI 5.x Encrypting PIN Pad
2 pages
Platform Technologies
100% (1)
Platform Technologies
2 pages
Productos Excluidos 140319
No ratings yet
Productos Excluidos 140319
4 pages
Student E-Commerce Project
No ratings yet
Student E-Commerce Project
23 pages
Understanding Computer Basics
No ratings yet
Understanding Computer Basics
9 pages
PCIe Interposer Datasheet v1113 PDF
No ratings yet
PCIe Interposer Datasheet v1113 PDF
12 pages
Computer System Structure Overview
No ratings yet
Computer System Structure Overview
58 pages
Tutorial Task
No ratings yet
Tutorial Task
1 page
Microprocessor 8257 DMA Guide
No ratings yet
Microprocessor 8257 DMA Guide
5 pages
Operating Systems MCQs for Job Prep
No ratings yet
Operating Systems MCQs for Job Prep
37 pages
Basic Computer Engineering Quiz
No ratings yet
Basic Computer Engineering Quiz
4 pages
AT89S52
No ratings yet
AT89S52
37 pages
SGI L1 and L2 Controller Software User's Guide
No ratings yet
SGI L1 and L2 Controller Software User's Guide
144 pages
Computer Basics for Students
No ratings yet
Computer Basics for Students
5 pages
Advanced Embedded Systems Syllabus
No ratings yet
Advanced Embedded Systems Syllabus
11 pages
Recovering Windows XP
No ratings yet
Recovering Windows XP
43 pages

A. Instruction-Level Parallelism: Ntroduction

Uploaded by

A. Instruction-Level Parallelism: Ntroduction

Uploaded by

INTRODUCTION :

You might also like