C2ex Java

Uploaded by

ashrafmuzammil.26csa

Date: Ex – 1(b) Lexical Analyser

Aim:
To develop a Lexical Analyzer that processes C code to identify and classify keywords,
identifiers, operators, punctuation, constants, and lexemes from a source file.

Algorithm:

1. Initialize Data Structures:

● Use LinkedHashSet to store identifiers, preserving insertion order.

● Define lists for keywords, operators, punctuation, constants, and lexemes.
● Initialize predefined sets of keywords, operators, and punctuation symbols.

2. Read File Line by Line:

● Open and read the file using BufferedReader.

● Split each line into tokens at every boundary before or after a non-word character, then trim whitespace and discard empty tokens.

3. Handle Special Tokens:

● Skip preprocessor directives and headers (e.g., #include <stdio.h>).

● Record string literals ("..."), character literals ('A'), and function calls (func()) as lexemes.

4. Classify Tokens:

● Add tokens to the respective lists (keywords, operators, punctuation, constants) based on
their type.
● Add single alphabetical characters as identifiers.

5. Store Lexemes:

● Store any token that doesn't fit into keywords, operators, punctuation, constants, or
identifiers into the lexemes list.

6. Display Symbol Table:

● After processing all lines, print the contents of the symbol table including keywords,
identifiers, operators, punctuation, constants, and lexemes.
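
Steps 2 and 3 can also be approached with a single token pattern instead of a split. The following is only a minimal sketch under stated assumptions, not the method used in this exercise: the class name TokenSketch, the method tokenize, and the pattern itself are hypothetical and were introduced for illustration. It shows how string literals, character literals, decimal constants, and two-character operators could each be kept as one token.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch: one regular expression describes every token form.
class TokenSketch {
    // Alternatives are tried left to right, so longer forms are listed first.
    private static final Pattern TOKEN = Pattern.compile(
          "\"(?:\\\\.|[^\"\\\\])*\""           // string literal, escapes allowed
        + "|'(?:\\\\.|[^'\\\\])'"               // character literal
        + "|\\d+\\.\\d+|\\d+"                   // decimal or integer constant
        + "|[A-Za-z_]\\w*"                      // identifier or keyword
        + "|\\+\\+|--|==|!=|>=|<=|&&|\\|\\|"    // two-character operators
        + "|[-+*/=<>]"                          // single-character operators
        + "|[;,(){}\\[\\]]");                   // punctuation

    static List<String> tokenize(String line) {
        List<String> tokens = new ArrayList<>();
        if (line.trim().startsWith("#")) {
            return tokens; // skip the whole preprocessor line
        }
        Matcher m = TOKEN.matcher(line);
        while (m.find()) { // characters matching no alternative are skipped
            tokens.add(m.group());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("float b = 20.5;")); // [float, b, =, 20.5, ;]
        System.out.println(tokenize("printf(\"Hello, World!\\n\");"));
    }
}
```

With this approach the classification in step 4 would receive whole tokens such as 20.5 or "Hello, World!\n" rather than their individual characters.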

Code:

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Set;

public class LexicalAnalyzer2 {

    // LinkedHashSet keeps identifiers unique and in insertion order
    static Set<String> identifiers = new LinkedHashSet<>();
    static ArrayList<String> keywordsList = new ArrayList<>();
    static ArrayList<String> operatorsList = new ArrayList<>();
    static ArrayList<Character> punctuationList = new ArrayList<>();
    static ArrayList<String> constantsList = new ArrayList<>();
    // Function names and any other unclassified tokens
    static ArrayList<String> lexemes = new ArrayList<>();

    // Define initial keywords, operators, and punctuation symbols
    static Set<String> keywords = new HashSet<>(Arrays.asList(
        "int", "float", "char", "void", "if", "else", "while", "return",
        "for", "do", "switch", "case", "include", "stdio", "main"
    ));
    static Set<String> operators = new HashSet<>(Arrays.asList(
        "+", "-", "*", "/", "=", "++", "--", "==", "!=", ">", "<", ">=", "<=", "&&", "||"
    ));
    static Set<Character> punctuations = new HashSet<>(Arrays.asList(
        ';', ',', '(', ')', '{', '}', '[', ']'
    ));

    static void processLine(String line) {
        // Split before and after every non-word character,
        // so each symbol becomes a token of its own
        String[] tokens = line.split("(?=\\W)|(?<=\\W)");

        for (String token : tokens) {
            token = token.trim();
            if (token.isEmpty()) {
                continue; // Skip empty tokens
            }

            // Skip preprocessor directives
            if (token.startsWith("#")) {
                continue;
            }

            // Skip header files or anything in angle brackets (e.g., <stdio.h>)
            if (token.startsWith("<") && token.endsWith(">")) {
                continue;
            }

            // Handle string literals (e.g., "Hello, World!\n")
            if (token.startsWith("\"") && token.endsWith("\"")) {
                lexemes.add(token);
                continue;
            }

            // Handle character literals (e.g., 'A')
            if (token.startsWith("'") && token.endsWith("'") && token.length() == 3) {
                lexemes.add(token);
                continue;
            }

            // Handle function calls
            if (token.contains("(") && token.contains(")")) {
                lexemes.add(token);
                continue;
            }

            // Process other tokens
            if (keywords.contains(token)) {
                if (!keywordsList.contains(token)) {
                    keywordsList.add(token);
                }
            } else if (operators.contains(token)) {
                operatorsList.add(token);
            } else if (punctuations.contains(token.charAt(0))) {
                punctuationList.add(token.charAt(0));
            } else if (Character.isDigit(token.charAt(0))) {
                constantsList.add(token);
            } else if (isSingleAlphabetic(token)) {
                // Only single alphabetic tokens are added as identifiers
                identifiers.add(token);
            } else {
                // Anything else is stored as a lexeme
                lexemes.add(token);
            }
        }
    }

    // Helper method to check if a token is a single alphabetic character
    private static boolean isSingleAlphabetic(String token) {
        return token.length() == 1 && Character.isLetter(token.charAt(0));
    }

    public static void main(String[] args) {
        // Hardcoded file path
        String filePath = "C:\\4025 CSA\\dio2.c";

        try (BufferedReader br = new BufferedReader(new FileReader(filePath))) {
            String line;
            while ((line = br.readLine()) != null) {
                processLine(line);
            }
        } catch (IOException e) {
            System.out.println("An error occurred while reading the file.");
            e.printStackTrace();
        }

        // Display the symbol table after processing the entire file
        System.out.println("Symbol Table:");
        System.out.println("Keywords: " + String.join(", ", keywordsList));
        System.out.println("Identifiers: " + String.join(", ", identifiers));
        System.out.println("Operators: " + String.join(", ", operatorsList));
        System.out.println("Punctuations: " + punctuationList.toString());
        System.out.println("Constants: " + String.join(", ", constantsList));
        System.out.println("Lexemes: " + String.join(", ", lexemes));
    }
}
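
To see what the split pattern in processLine produces, the same expression can be run on its own. This is a small sketch for illustration only; the class name SplitDemo and the method tokens are hypothetical, and only the regular expression is taken from the code above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo: applies the split pattern from processLine to one line.
class SplitDemo {
    static List<String> tokens(String line) {
        List<String> out = new ArrayList<>();
        // Split before and after every non-word character
        for (String t : line.split("(?=\\W)|(?<=\\W)")) {
            String trimmed = t.trim();
            if (!trimmed.isEmpty()) {
                out.add(trimmed); // keep only non-blank pieces
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(tokens("b = 20.5;")); // [b, =, 20, ., 5, ;]
        System.out.println(tokens("a == b"));    // [a, =, =, b]
    }
}
```

Because every non-word character is isolated, 20.5 arrives as three tokens, and a two-character operator such as == arrives as two separate = tokens.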

Input file (dio2.c):
#include <stdio.h>
int main() {
    int a = 10;
    float b = 20.5;
    char c = 'A';

    a = a + 1;
    b = b * 2;
    printf("Hello, World!\n");

    return 0;
}

Output:

Symbol Table:
Keywords: include, stdio, int, main, float, char, return
Identifiers: h, a, b, c, A, n
Operators: <, >, =, =, =, =, +, =, *
Punctuations: [(, ), {, ;, ;, ;, ;, ;, (, ,, ), ;, ;, }]
Constants: 10, 20, 5, 1, 2, 0
Lexemes: ., ., ', ', printf, ", Hello, World, !, \, "
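
In the listing above, printf appears under Lexemes because only single-letter names are accepted as identifiers, and include, stdio and main are reported as keywords because they are in the predefined keyword set. A possible alternative classification is sketched below. It is an assumption made for illustration, not part of the exercise: the class name ClassifySketch, the method classify, and the category labels are hypothetical.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch: classify one whole token by its shape.
class ClassifySketch {
    // A subset of the reserved words of C; "include", "stdio" and "main"
    // are not reserved words, so they are left out here.
    static final Set<String> KEYWORDS = new HashSet<>(Arrays.asList(
        "int", "float", "char", "void", "if", "else", "while", "return",
        "for", "do", "switch", "case"));

    static String classify(String token) {
        if (KEYWORDS.contains(token)) {
            return "keyword";
        }
        if (token.matches("[A-Za-z_]\\w*")) {
            return "identifier"; // a letter or underscore, then word characters
        }
        if (token.matches("\\d+(\\.\\d+)?")) {
            return "constant";   // integer or simple decimal constant
        }
        return "other";
    }

    public static void main(String[] args) {
        System.out.println(classify("printf")); // identifier
        System.out.println(classify("return")); // keyword
        System.out.println(classify("20.5"));   // constant
    }
}
```

This only pays off if the tokens reach the classifier whole, which the split used in processLine does not guarantee for constants such as 20.5.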

Result:
Thus, a Lexical Analyzer that processes C code to identify and classify keywords, identifiers, operators, punctuation, constants, and lexemes was written and executed, and its output was verified.
