Document 4: Compiler Design – Lexical Analysis
Notes
1. Introduction to Lexical Analysis
First phase of the compiler.
Converts source code into tokens.
Removes whitespace and comments.
Passes the token stream to syntax analysis.
2. Role of Lexical Analyzer
Reads input characters.
Groups them into lexemes.
Produces tokens (token name + attribute value).
Reports lexical errors.
3. Tokens, Lexemes, and Patterns
Token: Category of lexemes (e.g., keyword, identifier).
Lexeme: Actual string in source (e.g., int, main).
Pattern: Rule that defines a token (e.g., [a-zA-Z][a-zA-Z0-9]* for identifiers).
4. Example
Source Code:
int sum = a + b;
Tokens:
1. Keyword: int
2. Identifier: sum
3. Operator: =
4. Identifier: a
5. Operator: +
6. Identifier: b
7. Delimiter: ;
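To make the grouping of characters into lexemes concrete, here is a minimal hand-written scanner in C for the statement above (an illustrative sketch, not production code; the printed category names simply mirror the token list).

#include <ctype.h>
#include <stdio.h>
#include <string.h>

/* Print one token per lexeme in the input string. */
static void scan(const char *p) {
    while (*p) {
        if (isspace((unsigned char)*p)) {        /* skip whitespace */
            p++;
        } else if (isalpha((unsigned char)*p)) { /* identifier or keyword */
            char buf[64];
            int n = 0;
            while (isalnum((unsigned char)*p) && n < 63)
                buf[n++] = *p++;
            buf[n] = '\0';
            if (strcmp(buf, "int") == 0)
                printf("Keyword: %s\n", buf);
            else
                printf("Identifier: %s\n", buf);
        } else if (*p == '=' || *p == '+') {     /* operators */
            printf("Operator: %c\n", *p++);
        } else if (*p == ';') {                  /* delimiter */
            printf("Delimiter: %c\n", *p++);
        } else {                                 /* anything else is a lexical error */
            fprintf(stderr, "Lexical error: unexpected '%c'\n", *p++);
        }
    }
}

int main(void) {
    scan("int sum = a + b;");  /* prints the seven tokens listed above */
    return 0;
}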
5. Finite Automata in Lexical Analysis
Regular expressions are used to describe token patterns.
An NFA (nondeterministic finite automaton) is converted to a DFA (deterministic finite automaton) for efficient scanning.
The resulting DFA is used to implement the lexical analyzer.
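As a small illustration, the identifier pattern [a-zA-Z][a-zA-Z0-9]* compiles into a two-state DFA. The C sketch below hard-codes its transition function (the state numbering is an assumption made for this example):

#include <ctype.h>
#include <stdio.h>

/* States: 0 = start, 1 = in identifier (accepting), -1 = reject. */
static int step(int state, char c) {
    switch (state) {
    case 0:  return isalpha((unsigned char)c) ? 1 : -1; /* first char: letter */
    case 1:  return isalnum((unsigned char)c) ? 1 : -1; /* rest: letter/digit */
    default: return -1;
    }
}

/* Returns 1 if the whole string matches [a-zA-Z][a-zA-Z0-9]*. */
static int is_identifier(const char *s) {
    int state = 0;
    for (; *s; s++) {
        state = step(state, *s);
        if (state < 0) return 0;   /* dead state: no valid transition */
    }
    return state == 1;             /* accept only in state 1 */
}

int main(void) {
    printf("%d\n", is_identifier("sum"));    /* 1 */
    printf("%d\n", is_identifier("2fast"));  /* 0: starts with a digit */
    return 0;
}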
6. Regular Expressions Examples
Identifier: [a-zA-Z][a-zA-Z0-9]*
Integer Constant: [0-9]+
Whitespace: [ \t\n]+
7. Lexical Errors
Unrecognized symbols.
Unterminated strings/comments.
Illegal identifiers (e.g., 9abc, which starts with a digit).
Error Handling:
Panic-mode recovery: skip characters until a valid token can be formed (see the sketch below).
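A sketch of what panic-mode recovery can look like in C (the set of characters that may start a token is an assumption made for this toy language):

#include <ctype.h>
#include <stdio.h>

/* Returns 1 if c can begin a valid token in our toy language. */
static int can_start_token(char c) {
    return isalnum((unsigned char)c) || c == '=' || c == '+' || c == ';';
}

/* Panic-mode recovery: report the bad character, then skip input
 * until a character that can begin a valid token (or whitespace)
 * is found. */
static const char *recover(const char *p) {
    fprintf(stderr, "Lexical error: unexpected '%c'\n", *p);
    p++;
    while (*p && !can_start_token(*p) && !isspace((unsigned char)*p))
        p++;
    return p;   /* scanning resumes here */
}

int main(void) {
    const char *resume = recover("@#sum");
    printf("resumed at: %s\n", resume);   /* prints "resumed at: sum" */
    return 0;
}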
8. Symbol Table
Stores identifiers with attributes (type, scope, memory location).
Used by later phases of the compiler.
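A minimal symbol-table sketch in C (a fixed-size linear list; real compilers usually prefer hash tables, and the attribute fields shown are illustrative assumptions):

#include <stdio.h>
#include <string.h>

#define MAX_SYMBOLS 128

struct symbol {
    char name[32];   /* identifier lexeme */
    char type[16];   /* e.g., "int" */
    int  scope;      /* nesting level */
};

static struct symbol table[MAX_SYMBOLS];
static int count = 0;

/* Returns the index of name, or -1 if absent. */
static int lookup(const char *name) {
    for (int i = 0; i < count; i++)
        if (strcmp(table[i].name, name) == 0)
            return i;
    return -1;
}

/* Inserts name if not already present; returns its index. */
static int insert(const char *name, const char *type, int scope) {
    int i = lookup(name);
    if (i >= 0) return i;                    /* already recorded */
    if (count >= MAX_SYMBOLS) return -1;     /* table full */
    snprintf(table[count].name, sizeof table[count].name, "%s", name);
    snprintf(table[count].type, sizeof table[count].type, "%s", type);
    table[count].scope = scope;
    return count++;
}

int main(void) {
    insert("sum", "int", 0);
    printf("sum at index %d\n", lookup("sum"));   /* sum at index 0 */
    return 0;
}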
9. Lex Tool
Lex (or Flex) is a tool to automatically generate lexical analyzers.
Input: a set of token definitions (regular expressions with associated actions).
Output: a C program that scans tokens.
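For instance, a minimal Flex specification covering the tokens used in this document might look like the following (an illustrative sketch; the rule set and messages are assumptions). Compiling with flex scanner.l followed by cc lex.yy.c -o scanner yields a working scanner.

%option noyywrap
%{
#include <stdio.h>
%}

%%
"int"                 { printf("Keyword: %s\n", yytext); }
[a-zA-Z][a-zA-Z0-9]*  { printf("Identifier: %s\n", yytext); }
[0-9]+                { printf("Integer: %s\n", yytext); }
[=+]                  { printf("Operator: %s\n", yytext); }
";"                   { printf("Delimiter: %s\n", yytext); }
[ \t\n]+              ;  /* skip whitespace */
.                     { fprintf(stderr, "Lexical error: '%s'\n", yytext); }
%%

int main(void) { yylex(); return 0; }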
10. Conclusion
Lexical analysis is the foundation phase of compiler design. It simplifies input by converting
raw code into meaningful tokens, which allows further phases like syntax and semantic
analysis to work effectively.