Compiler Design - Complete Study Notes 📚
1. Phases of Compiler (WITH MEMORY TRICK! 🧠)
Memory Trick: "Lexy Sarah Sees Incredible Code Optimization"
Lexical Analysis
Syntax Analysis
Semantic Analysis
Intermediate Code Generation
Code Optimization
Object Code Generation
Detailed Explanation:
Phase 1: Lexical Analysis (Scanner)
What it does: Converts source code into tokens
Input: Character stream
Output: Token stream
Example:
int x = 10;
Tokens: <INT>, <IDENTIFIER,"x">, <ASSIGN>, <NUMBER,"10">, <SEMICOLON>
Phase 2: Syntax Analysis (Parser)
What it does: Checks grammatical structure
Input: Token stream
Output: Parse tree/Abstract Syntax Tree
Example: Verifies int x = 10; follows grammar rules
Phase 3: Semantic Analysis
What it does: Type checking, scope resolution
Input: Parse tree
Output: Annotated parse tree
Example: Ensures you can't do int x = "hello";
Phase 4: Intermediate Code Generation
What it does: Creates platform-independent code
Input: Annotated parse tree
Output: Three-address code
Example: t1 = 10; x = t1;
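For a slightly larger statement, each instruction still has at most one operator, with compiler-generated temporaries (t1, t2 here are illustrative names):
a = b + c * d;   becomes:
t1 = c * d
t2 = b + t1
a  = t2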
Phase 5: Code Optimization
What it does: Improves code efficiency
Input: Intermediate code
Output: Optimized intermediate code
Example: Removes dead code, constant folding
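A small before/after sketch of what this phase could do (assuming y is never used later in the program):
Before optimization:
t1 = 4 * 2        (constant folding: 4 * 2 becomes 8)
x  = t1           (constant propagation: x can be set to 8 directly)
t2 = x + 0        (algebraic simplification: x + 0 is just x)
y  = t2           (y is never used afterwards: dead code)
After optimization:
x = 8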
Phase 6: Code Generation
What it does: Generates target machine code
Input: Optimized intermediate code
Output: Assembly/Machine code
2. Compiler vs Interpreter
Aspect | Compiler | Interpreter
Translation | Entire program at once | Line by line
Execution | After compilation finishes | During translation
Speed | Faster execution | Slower execution
Memory | More memory needed (object code is stored) | Less memory
Error Detection | Reports errors together after analyzing the whole program | Stops at the first error
Examples | C, C++ | Python, JavaScript
Note: Java is a hybrid: javac compiles source to bytecode, which the JVM then interprets or JIT-compiles.
Memory Trick: Compiler = Complete translation first, Interpreter = Immediate execution
3. Tokens, Patterns, and Lexemes
Easy Definition:
Lexeme: The actual string in source code
Token: Category/type of the lexeme
Pattern: Rule that describes the token
Example:
int count = 42;
Lexeme | Token | Pattern
int | KEYWORD | the reserved word int
count | IDENTIFIER | [a-zA-Z][a-zA-Z0-9]*
= | ASSIGN | =
42 | NUMBER | [0-9]+
; | SEMICOLON | ;
Memory Trick: Lexeme = Literal string, Token = Type, Pattern = the rule that describes it (usually a regular expression)
4. LEX - Lexical Analyzer Generator
LEX Structure:
%{
/* C declarations */
%}
/* LEX definitions */
%%
/* LEX rules */
%%
/* User functions */
Example LEX Program:
%{
#include <stdio.h>
%}
%%
[0-9]+ { printf("NUMBER: %s\n", yytext); }
[a-zA-Z]+ { printf("IDENTIFIER: %s\n", yytext); }
"+" { printf("PLUS\n"); }
"=" { printf("ASSIGN\n"); }
[ \t\n] { /* ignore whitespace */ }
%%
int main() {
yylex();
return 0;
}
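On a typical Unix system, the program above can be built and run roughly like this (the file name scanner.l is assumed; with flex the support library is -lfl, with classic lex it is -ll, and that library also supplies the yywrap function this program relies on):
lex scanner.l          # or: flex scanner.l  (either way, lex.yy.c is generated)
cc lex.yy.c -lfl       # produces a.out
./a.out                # type some input, end with Ctrl-D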
5. Input Buffering Techniques
Why Buffer?
Reading character by character is expensive
Need lookahead for token recognition
Techniques:
1. Single Buffer
Problem: What if token spans buffer boundary?
2. Double Buffer (Better!)
Buffer 1: |----token----|
Buffer 2: |--continues--|
3. Sentinels
Use a special sentinel (EOF) character so only one test per character is needed, instead of a separate end-of-buffer check
Trick: Place EOF at end of each buffer half
Memory Trick: "Double Sentinel Buffers Work Better"
6. Context-Free Grammar (CFG)
Definition:
A CFG is a 4-tuple: G = (V, T, P, S)
V: Variables (Non-terminals)
T: Terminals
P: Productions
S: Start symbol
Example:
E → E + T | T
T → T * F | F
F → (E) | id
This generates expressions like: id + id * id
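For example, a leftmost derivation of id + id * id (each step expands the leftmost non-terminal):
E ⇒ E + T ⇒ T + T ⇒ F + T ⇒ id + T ⇒ id + T * F ⇒ id + F * F ⇒ id + id * F ⇒ id + id * id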
Memory Trick: "Very Talented Programmers Start here"
7. Top-Down vs Bottom-Up Parsing
Top-Down Parsing
Strategy: Start from start symbol, derive input
Types: Recursive Descent, LL(1) (a minimal recursive-descent sketch in C appears at the end of this section)
Think: Root to leaves (like reading a book)
Bottom-Up Parsing
Strategy: Start from input, reduce to start symbol
Types: LR(0), SLR(1), LALR(1), LR(1)
Think: Leaves to root (like building a pyramid)
Memory Trick:
Top-Down = Tree from Top
Bottom-Up = Build from Base
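To make the top-down strategy concrete, here is a minimal recursive-descent parser in C for the expression grammar of Section 6, with left recursion removed (E → T E', E' → + T E' | ε, and similarly for T); the loops stand in for E' and T', tokens are single characters, and 'i' stands for the id token (all names are illustrative):
#include <stdio.h>
#include <stdlib.h>

static const char *input;              /* current position in the token string */

static void error(void) { printf("syntax error\n"); exit(1); }
static void match(char c) { if (*input == c) input++; else error(); }

static void E(void);
static void T(void);
static void F(void);

/* E -> T E'      E' -> + T E' | epsilon   (left recursion removed) */
static void E(void) { T(); while (*input == '+') { match('+'); T(); } }

/* T -> F T'      T' -> * F T' | epsilon */
static void T(void) { F(); while (*input == '*') { match('*'); F(); } }

/* F -> ( E ) | id        here 'i' plays the role of the id token */
static void F(void) {
    if (*input == '(') { match('('); E(); match(')'); }
    else if (*input == 'i') { match('i'); }
    else error();
}

int main(void) {
    input = "i+i*i";                   /* stands for: id + id * id */
    E();
    if (*input == '\0') printf("accepted\n"); else error();
    return 0;
}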
8. Syntax-Directed Translation
S-Attributed Definitions
Rule: Use only Synthesized attributes
Direction: Information flows UP the parse tree
Example: Expression evaluation
E → E1 + T { E.val = E1.val + T.val }
T → F { T.val = F.val }
F → num { F.val = num.lexval }
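For instance, on the input 3 + 4 the values are synthesized from the leaves upward:
F.val = 3  →  T.val = 3  →  E1.val = 3
F.val = 4  →  T.val = 4
E.val = E1.val + T.val = 3 + 4 = 7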
L-Attributed Definitions
Rule: Use synthesized + Limited inherited attributes
Direction: Information flows UP and Left-to-right
Restriction: An inherited attribute may depend only on the parent's attributes and on siblings to its left
Memory Trick:
S = Synthesized = Simple (only up)
L = Limited = Left-to-right allowed
9. Types of Grammar (COMPLETE KNOWLEDGE)
Chomsky Hierarchy (Remember: "Uncles Can Count Regularly")
Type 0: Unrestricted Grammar
Production: α → β (α is any string containing at least one non-terminal; β is any string, possibly shorter or empty)
Automaton: Turing Machine
Example: aA → a (the right side may even be shorter than the left, unlike Type 1)
Type 1: Context-Sensitive Grammar
Production: αAβ → αγβ (|γ| ≥ 1)
Automaton: Linear Bounded Automaton
Rule: Left side ≤ Right side length
Type 2: Context-Free Grammar
Production: A → α (single non-terminal on left)
Automaton: Pushdown Automaton
Most important for compilers!
Type 3: Regular Grammar
Production: A → aB or A → a
Automaton: Finite Automaton
Used in lexical analysis
Grammar Properties:
Ambiguous Grammar
Multiple parse trees for same string
Example: E → E + E | E * E | id
String "id + id * id" has 2 parse trees
Left Recursion
Direct: A → Aα
Indirect: A → Bα, B → Aβ
Problem: Infinite loop in top-down parsing
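The standard fix rewrites A → Aα | β as A → β A', A' → α A' | ε. Applied to the expression grammar from Section 6:
E → E + T | T   becomes   E → T E',  E' → + T E' | ε
T → T * F | F   becomes   T → F T',  T' → * F T' | ε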
Left Factoring
Problem: A → αβ | αγ
Solution: A → αA', A' → β | γ
10. Type Checking
Static Type Checking
When: Compile time
Advantage: Catches errors early, faster execution
Languages: C, C++, Java
Example: int x = "hello"; ← Error caught at compile time
Dynamic Type Checking
When: Runtime
Advantage: More flexible
Languages: Python, JavaScript
Example: Variable can change type during execution
Memory Trick:
Static = Strictly checked at Start
Dynamic = Determined During execution
11. Polymorphic Functions
Definition:
Functions that work with multiple types
Types:
1. Parametric Polymorphism (Generics)
static <T> void swap(T[] arr, int i, int j) {
    T tmp = arr[i];      // works with any reference type T
    arr[i] = arr[j];
    arr[j] = tmp;
}
2. Ad-hoc Polymorphism (Overloading)
int add(int a, int b) { return a + b; }
double add(double a, double b) { return a + b; }
3. Subtype Polymorphism (Inheritance)
Animal a = new Dog(); // Dog inherits from Animal
Memory Trick: "Polymorphic = People Adding Similar functions"
12. Storage Allocation Strategies
1. Static Allocation
When: Compile time
Where: Global variables, static variables
Lifetime: Entire program execution
2. Stack Allocation
When: Function calls
Where: Local variables, parameters
Lifetime: Function scope
Structure: LIFO (Last In, First Out)
3. Heap Allocation
When: Runtime (dynamic)
Where: malloc(), new operator
Lifetime: Until explicitly freed
Management: Garbage collection or manual
Memory Trick: "Static Stays, Stack Shrinks, Heap Hangs around"
13. Goals of Code Generation
Primary Goals:
1. Correctness: Generated code must be semantically equivalent
2. Efficiency: Optimize for speed and space
3. Target Independence: Easy to retarget
Specific Goals:
Register Allocation: Minimize memory access
Instruction Selection: Choose best instructions
Instruction Scheduling: Avoid pipeline stalls
Memory Trick: "Correct Efficient Targeted code"
14. DAG (Directed Acyclic Graph) Representation
Purpose:
Represent basic blocks efficiently
Identify common subexpressions
Enable optimizations
Example:
a = b + c
d = b + c
e = d + a
DAG shows b + c computed once, used twice!
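One way to list the DAG nodes for this block (node names n1, n2 are illustrative):
n1: + (b, c)      attached identifiers: a, d   (the expression is built only once)
n2: + (n1, n1)    attached identifier:  e      (e = d + a, and both d and a refer to n1)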
Benefits:
Dead Code Elimination: Unused computations
Common Subexpression: Avoid recomputation
Constant Folding: Compute constants at compile time
15. Back Patching in Code Generation
Problem:
Jump addresses unknown during code generation
Forward references need to be "patched"
Solution - Back Patching:
1. Generate jump with placeholder address
2. Keep list of locations to patch
3. When target known, patch all locations
Example:
if (condition) goto L1
stmt1
goto L2
L1: stmt2
L2: next_stmt
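With assumed instruction numbers, the emitted code might look like this, where "_" marks an unfilled target kept on a patch list:
Before patching:
100: if condition goto _
101: stmt1
102: goto _
103: stmt2
104: next_stmt
After patching (targets of 100 and 102 filled in once L1 and L2 are known):
100: if condition goto 103
101: stmt1
102: goto 104
103: stmt2
104: next_stmt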
Memory Trick: "BackPatch = Blank first, Patch later"
16. Code Optimization
Types:
Machine Independent Optimizations:
Constant Folding: 3 + 4 → 7
Constant Propagation: Replace variables with constants
Dead Code Elimination: Remove unreachable code
Common Subexpression Elimination
Machine Dependent Optimizations:
Register Allocation
Instruction Scheduling
Peephole Optimization
Levels:
Local: Within basic block
Global: Within function
Interprocedural: Across functions
17. Peephole Optimization
Definition:
Optimize small "window" of instructions (usually 3-5)
Types:
1. Redundant Instruction Elimination
Before: MOV R1, R2
MOV R2, R1
After: MOV R1, R2
2. Constant Folding
Before: MOV R1, #3
ADD R1, #4
After: MOV R1, #7
3. Strength Reduction
Before: MUL R1, #2
After: SHL R1, #1 (shift left = multiply by 2)
4. Algebraic Simplification
Before: ADD R1, #0
After: (remove instruction)
Memory Trick: "Peep through small hole, optimize locally, efficiently"
18. Data Flow Analysis
Purpose:
Determine how data flows through program
Enable optimizations
Safety analysis
Key Concepts:
Reaching Definitions
Which definitions reach which uses?
Use: Dead code elimination
Live Variable Analysis
Which variables are "live" (will be used later)?
Use: Register allocation
Available Expressions
Which expressions are available (already computed)?
Use: Common subexpression elimination
Direction:
Forward: Information flows with program execution
Backward: Information flows against program execution
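A tiny live-variable example (assuming nothing outside these three lines uses a, b, or c):
1: a = 1          a is live after this line (it is used at line 2)
2: b = a + 2      a is dead after this line (no later use); b is live
3: c = b * b      last use of b, so b is dead afterwards; if c is never used later, line 3 itself is dead code
Liveness flows backward from each use toward earlier lines, which is why live-variable analysis is a backward analysis.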
19. Cross Compiler
Definition:
Compiler that runs on one machine but generates code for another
Example:
Host: x86 PC
Target: ARM processor (mobile phones)
Use Case: Embedded systems development
Why Needed?
Target machine too small for compiler
Development convenience
Different architectures
Memory Trick: "Cross Compiler Compiles Crosswise"
20. Properties of Optimizing Compilers
Essential Properties:
1. Correctness Preservation
Optimized code must behave identically to original
Most Important Property!
2. Efficiency Improvement
Time: Faster execution
Space: Smaller code size
Energy: Lower power consumption
3. Compile-Time Efficiency
Optimization shouldn't take too long
Trade-off between compile time and runtime benefit
4. Debugging Support
Maintain correlation between source and optimized code
Support for debugging optimized code
5. Predictability
Similar code patterns should be optimized similarly
Developers can reason about performance
Memory Trick: "Correct Efficient Compilation Debugs Predictably"
Quick Review Mnemonics 🎯
1. Compiler Phases: "Lexy Sarah Sees Incredible Code Optimization"
2. Grammar Types: "Uncles Can Count Regularly"
3. Storage Types: "Static Stays, Stack Shrinks, Heap Hangs around"
4. Parsing Types: "Top-Down from Top, Bottom-Up from Base"
5. Attributes: "S = Simple UP, L = Limited left-right"
Exam Tips 💡
1. Draw diagrams for phases, parse trees, DAGs
2. Give examples for every concept
3. Compare and contrast (compiler vs interpreter, static vs dynamic)
4. Remember the "why" behind each technique
5. Practice code examples for LEX, grammar, optimizations
Good Luck! You've got this! 🚀