0% found this document useful (0 votes)

13 views20 pages

Lec 5

Uploaded by

Mohammad Humayun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views20 pages

Lec 5

Uploaded by

Mohammad Humayun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

 Recognition of Tokens
 Transition Diagrams
 Recognition of Reserved Words and Identifiers
 Recognizing Whitespace
 Recognizing Numbers

 Finite Automata
 NFA
 Transition Tables

1
Recognition of Tokens
 Now we see how to build a piece of code that examines the input
string and finds a prefix that is a lexeme matching one of the
patterns.
 Our current goal is to perform the lexical analysis needed for the
following grammar.

 Recall that the terminals are the tokens & the non-terminals
produce terminals.

2
Recognition of Tokens..
 A regular definition for the terminals is

3
Recognition of Tokens…
 We also want the lexer to remove whitespace so we define a new
token
ws → ( blank | tab | newline ) +

 where blank, tab, and newline are symbols used to represent the
corresponding ascii characters.

 If the lexer recognizes the token ws, it does not return it to the
parser but instead goes on, to recognize the next token, which is
then returned.

4
Recognition of Tokens..
 Our goal for the lexical analyzer is summarized below:

5
Transition Diagram
 As an intermediate step in the construction of a lexical analyzer, we
first convert patterns into stylized flowcharts, called "transition
diagrams”.

 Conversion of RE patterns to Transition Diagram.

 Transition diagrams have a collection of nodes or circles, called

states
 Each state represents a condition that could occur during the process
of scanning the input looking for a lexeme that matches one of
several patterns.

6
Transition Diagram..
 Edges are directed from one state of the transition diagram to
another.
 Each edge is labeled by a symbol or set of symbols.
 Some important conventions:
 The double circles represent accepting or final states at which point a
lexeme has been found. There is often an action to be done (e.g.,
returning the token), which is written to the right of the double circle.

 If we have moved one (or more) characters too far in finding the
token, one (or more) stars are drawn.

 An imaginary start state exists and has an arrow coming from it to

indicate where to begin the process.
7
Transition Diagram…
 A transition diagram that recognizes the lexemes matching the
token relop.

8
Recognition of Reserved Words and Identifiers
 Recognizing keywords and identifiers presents a problem.
 The transition diagram below corresponds to the regular definition
given previously.

9
Recognition of Reserved Words and Identifiers
 Two questions arises:

 How do we distinguish between identifiers and keywords such

as then, which also match the pattern in the transition diagram?

 What is (gettoken(), installID())?

 We will use the method, i.e having the keywords installed into the
identifier table prior to any invocation of the lexer.
 The table entry will indicate that the entry is a keyword.

10
Recognition of Reserved Words and Identifiers..
 installID() checks if the lexeme is already in the table. If it is not
present, the lexeme is installed as an id token. In either case a
pointer to the entry is returned.

 gettoken() examines the lexeme and returns the token name,

either id or a name corresponding to a reserved keyword.

 So far we have transition diagrams for identifiers (this diagram also

handles keywords) and the relational operators.

 What remains are whitespace, and numbers.

11
Recognizing Whitespace
 Recognizing Whitespace

 The delim in the diagram represents any of the whitespace

characters, say space, tab, and newline.
 The final star is there because we needed to find a non-whitespace
character in order to know when the whitespace ends and this
character begins the next token.
 There is no action performed at the accepting state.

12
Recognizing Numbers
 The transition diagram for token number

13
Finite Automata
 Finite automata are like the graphs in transition diagrams but they
simply decide if an input string is in the language (generated by our
regular expression).

 Finite automata are recognizers, they simply say "yes" or "no" about
each possible input string.

 There are two types of Finite automata:

 Nondeterministic finite automata (NFA) have no restrictions on the

labels of their edges. A symbol can label several edges out of the
same state, and ε, the empty string, is a possible label.

14
Finite Automata..

 Deterministic finite automata (DFA) have exactly one edge, for

each state, and for each symbol of its input alphabet with that
symbol leaving that state.

 So if you know the next symbol and the current state, the next state is
determined. That is, the execution is deterministic, hence the name.

 Both deterministic and nondeterministic finite automata are

capable of recognizing the same languages.

15
N - Finite Automata
 A nondeterministic finite automaton (NFA) consists of:

1. A finite set of states S.

2. A set of input symbols Σ, the input alphabet. We assume that ε, which
stands for the empty string, is never a member of Σ.
3. A transition function that gives, for each state, and for each symbol in
Σ U {ε} a set of next states.
4. A state S0 from S that is distinguished as the start state (or initial state)
5. A set of states F, a subset of S, that is distinguished as the accepting
states (or final states).

16
N - Finite Automata..
 An NFA is basically a flow chart like the transition diagrams we
have already seen.

 Indeed an NFA can be represented by a transition graph whose

nodes are states and whose edges are labeled with elements of Σ
∪ ε.
 The differences between a transition graph and previous transition
diagrams are:
 Possibly multiple edges with the same label leaving a single state.
 An edge may be labeled with ε.

17
N - Finite Automata...
 Ex: The transition graph for an NFA recognizing the language of
regular expression (a | b) * abb
 This ex, describes all strings of a's and b's ending in the particular
string abb.

18
Transition Tables
 Transition Table is an equivalent way to represent an NFA, in which,
for each state s and input symbol x (and ε), the set of successor
states x leads to from s.

 The empty set φ is used when there is no edge labeled x

emanating from s.

Transition Table for (a | b) * abb

19
Thank You

Compiler Design: Lexical Analysis & Automata
No ratings yet
Compiler Design: Lexical Analysis & Automata
35 pages
Week 06 TD, Short Dfa Nfa
No ratings yet
Week 06 TD, Short Dfa Nfa
22 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
31 pages
Recognition of Tokens
No ratings yet
Recognition of Tokens
34 pages
Unit II - Lexical Analysis-20-1-2021
No ratings yet
Unit II - Lexical Analysis-20-1-2021
49 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
11 pages
2 Lexical
100% (1)
2 Lexical
7 pages
Compiler Construction CS-4207: Instructor Name: Atif Ishaq
100% (1)
Compiler Construction CS-4207: Instructor Name: Atif Ishaq
17 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
51 pages
Token Recognition in Compiler Design
No ratings yet
Token Recognition in Compiler Design
51 pages
Chapter 3 - Lexical Analysis
100% (1)
Chapter 3 - Lexical Analysis
51 pages
CH 2
No ratings yet
CH 2
36 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
55 pages
Lect 07
No ratings yet
Lect 07
46 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
34 pages
Lect 03
No ratings yet
Lect 03
19 pages
File 1675742677 110405 LexicalAnalysis-Continue1
No ratings yet
File 1675742677 110405 LexicalAnalysis-Continue1
39 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
51 pages
ch-2.pdf 2
No ratings yet
ch-2.pdf 2
27 pages
Lexical Analysis for Programmers
No ratings yet
Lexical Analysis for Programmers
67 pages
Lexical Analysis and Token Recognition
100% (3)
Lexical Analysis and Token Recognition
51 pages
Lec 4
No ratings yet
Lec 4
17 pages
Transition Diagrams in Compiler Design
No ratings yet
Transition Diagrams in Compiler Design
4 pages
6-Lexical Analysis Part5
No ratings yet
6-Lexical Analysis Part5
20 pages
2 - Compilers (Lexical Analysis)
No ratings yet
2 - Compilers (Lexical Analysis)
60 pages
Week 02
No ratings yet
Week 02
28 pages
Lexical Analysis All Token List and Diffence
No ratings yet
Lexical Analysis All Token List and Diffence
4 pages
Compilers CH 3
No ratings yet
Compilers CH 3
58 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
48 pages
Recognition of Tokens: Expr STMT Expr STMT STMT STMT Expr Term Term Term Term If Then Else Relop Id Num
No ratings yet
Recognition of Tokens: Expr STMT Expr STMT STMT STMT Expr Term Term Term Term If Then Else Relop Id Num
15 pages
1st Phase Lexical Analyzer
No ratings yet
1st Phase Lexical Analyzer
33 pages
Compiler Construction Lecture 3-4
No ratings yet
Compiler Construction Lecture 3-4
78 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
10 pages
Lecture 2b
No ratings yet
Lecture 2b
37 pages
2 - 3recognition of Tokens
No ratings yet
2 - 3recognition of Tokens
17 pages
CD - Unit1 - Lecture4 5 6 7
No ratings yet
CD - Unit1 - Lecture4 5 6 7
50 pages
Lecture 3
No ratings yet
Lecture 3
31 pages
Token Specification and Language Operations
No ratings yet
Token Specification and Language Operations
23 pages
Chapter 2
No ratings yet
Chapter 2
31 pages
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
No ratings yet
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
52 pages
Lec 2
No ratings yet
Lec 2
30 pages
Compiler Construction: Lexical Analysis
No ratings yet
Compiler Construction: Lexical Analysis
37 pages
Chapter 2
No ratings yet
Chapter 2
56 pages
Compiler 2
No ratings yet
Compiler 2
38 pages
CP 324 Lexical Analysis l2
No ratings yet
CP 324 Lexical Analysis l2
26 pages
Regular Expression
No ratings yet
Regular Expression
6 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
64 pages
Compiler Course: Lexical Analysis
No ratings yet
Compiler Course: Lexical Analysis
50 pages
CompilerD L3
No ratings yet
CompilerD L3
36 pages
CD PPTS 2
No ratings yet
CD PPTS 2
27 pages
Chapter 2
No ratings yet
Chapter 2
91 pages
Lecture 04
No ratings yet
Lecture 04
37 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
88 pages
Applications of FA
No ratings yet
Applications of FA
29 pages
Lec 2
No ratings yet
Lec 2
21 pages
Lec 3
No ratings yet
Lec 3
10 pages
Parser Lec5
No ratings yet
Parser Lec5
13 pages
Lec 1
No ratings yet
Lec 1
15 pages
Assignment 2 G1
No ratings yet
Assignment 2 G1
2 pages
Interpolation vs Polynomial Approximation
No ratings yet
Interpolation vs Polynomial Approximation
3 pages
C++ Operator Overloading Guide
No ratings yet
C++ Operator Overloading Guide
34 pages
Computer Organization & Assembly Course
No ratings yet
Computer Organization & Assembly Course
4 pages
UCP Mid Term Exam Schedule Fall 2022
No ratings yet
UCP Mid Term Exam Schedule Fall 2022
17 pages
GED 405 Presentation
No ratings yet
GED 405 Presentation
14 pages
LANGUAGE w2 Day 1
No ratings yet
LANGUAGE w2 Day 1
33 pages
Plaidoirie en Francais-1.Fr - en
No ratings yet
Plaidoirie en Francais-1.Fr - en
67 pages
Developments in Hydroforming
No ratings yet
Developments in Hydroforming
9 pages
Overall Electrical Wiring Diagrams SB079W
No ratings yet
Overall Electrical Wiring Diagrams SB079W
3 pages
Sample Size Determination
No ratings yet
Sample Size Determination
4 pages
English Worksheets For Playgroup-By Activity Wallet
No ratings yet
English Worksheets For Playgroup-By Activity Wallet
20 pages
l5 - Data Analysis (c20)
No ratings yet
l5 - Data Analysis (c20)
48 pages
Hive Database Setup Guide
No ratings yet
Hive Database Setup Guide
2 pages
Evacuated Tube Collector
No ratings yet
Evacuated Tube Collector
5 pages
Internal Combustion Engines A
No ratings yet
Internal Combustion Engines A
3 pages
Beam Design Principles and Analysis
No ratings yet
Beam Design Principles and Analysis
49 pages
U 4
No ratings yet
U 4
19 pages
Smart Garbage System
No ratings yet
Smart Garbage System
4 pages
Being Transgender What You Should Know
No ratings yet
Being Transgender What You Should Know
256 pages
ALS Project ENGLISH CORE CLASS XII 2024-25-1
No ratings yet
ALS Project ENGLISH CORE CLASS XII 2024-25-1
3 pages
Week 4 Welding
No ratings yet
Week 4 Welding
57 pages
From RET To MK Party - The Mobilisation of Existing Communities To Drive Political Messaging - Final
100% (2)
From RET To MK Party - The Mobilisation of Existing Communities To Drive Political Messaging - Final
17 pages
Timeline
No ratings yet
Timeline
1 page
GE8077 Total Quality Management-By WWW - LearnEngineering.in
No ratings yet
GE8077 Total Quality Management-By WWW - LearnEngineering.in
120 pages
Topic 9
No ratings yet
Topic 9
45 pages
Ibm FW Bios Bce148b-1.21 Linux I386
No ratings yet
Ibm FW Bios Bce148b-1.21 Linux I386
3 pages
Dissolved Gas Flotation (DGF) Unit
No ratings yet
Dissolved Gas Flotation (DGF) Unit
17 pages
S4 Hana 6 Important Question
100% (1)
S4 Hana 6 Important Question
15 pages
BRTF14: Tetra Optical Macro Slave Repeater
No ratings yet
BRTF14: Tetra Optical Macro Slave Repeater
2 pages
Master American Accent: 9 Tips
No ratings yet
Master American Accent: 9 Tips
20 pages
Grade 6 Math Quiz Bee
100% (2)
Grade 6 Math Quiz Bee
58 pages
Group 4
No ratings yet
Group 4
20 pages
Geo English
No ratings yet
Geo English
5 pages
Essential AI Tools for Journalists
No ratings yet
Essential AI Tools for Journalists
20 pages

Lec 5

Uploaded by

Lec 5

Uploaded by

Contents

 Conversion of RE patterns to Transition Diagram.

 Transition diagrams have a collection of nodes or circles, called

 An imaginary start state exists and has an arrow coming from it to

 How do we distinguish between identifiers and keywords such

 What is (gettoken(), installID())?

 gettoken() examines the lexeme and returns the token name,

 So far we have transition diagrams for identifiers (this diagram also

 What remains are whitespace, and numbers.

 The delim in the diagram represents any of the whitespace

 There are two types of Finite automata:

 Nondeterministic finite automata (NFA) have no restrictions on the

 Deterministic finite automata (DFA) have exactly one edge, for

 Both deterministic and nondeterministic finite automata are

1. A finite set of states S.

 Indeed an NFA can be represented by a transition graph whose

 The empty set φ is used when there is no edge labeled x

Transition Table for (a | b) * abb

You might also like