20IT811 PROJECT WORK
ANALYZING PERFORMANCE OF STUDENTS USING
DATA ENGINEERING AND MULTI-MODEL MACHINE
LEARNING SYSTEM
A PROJECT REPORT
Submitted by
ASHOK KUMAR A (111719205008)
DINESH KUMAR A (111719205013)
MATHAN KUMAR R (111719205024)
in partial fulfilment for the award of the degree
of
BACHELOR OF TECHNOLOGY
IN
INFORMATION TECHNOLOGY
R.M.K. ENGINEERING COLLEGE
(An Autonomous Institution)
R.S.M. Nagar, Kavaraipettai-601 206
ANNA UNIVERSITY: CHENNAI 600 025
MARCH 2024
R.M.K. ENGINEERING COLLEGE
(An Autonomous Institution)
R.S.M. Nagar, Kavaraipettai-601 206
BONAFIDE CERTIFICATE
Certified that this project report “ANALYZING PERFORMANCE OF
STUDENTS USING DATA ENGINEERING AND MULTI-MODEL
MACHINE LEARNING SYSTEM” is the bonafide work of
“ASHOK KUMAR.A (111719205008), DINESH KUMAR.A
(111719205013), MATHAN KUMAR.R (111719205024)” who carried out
the project work under my supervision.
SIGNATURE
Dr. M. SHEERIN BANU M.Tech., Ph.D.,
HEAD OF THE DEPARTMENT
PROFESSOR
Dept. of Information Technology
R.M.K. Engineering College,
R.S.M. Nagar, Kavaraipettai,
Tiruvallur District – 601 206.
SIGNATURE
Ms. M. REKHA M.Tech., Ph.D.,
SUPERVISOR
ASSISTANT PROFESSOR
Dept. of Information Technology
R.M.K. Engineering College,
R.S.M. Nagar, Kavaraipettai,
Tiruvallur District – 601 206.
CERTIFICATE OF EVALUATION
College Name : R.M.K. ENGINEERING COLLEGE
Department : INFORMATION TECHNOLOGY
Semester : 08
Title of Project : ANALYZING THE PERFORMANCE OF STUDENTS USING DATA ENGINEERING AND MULTI-MODEL MACHINE LEARNING SYSTEM
Name of the Students : ASHOK KUMAR.A, DINESH KUMAR.A, MATHAN KUMAR.R
Name of the Supervisor with designation : Ms. REKHA M (ASSISTANT PROFESSOR)
The report of the project work submitted by the above
students in partial fulfilment for the award of the Bachelor of
Technology degree in INFORMATION TECHNOLOGY of Anna
University was evaluated and confirmed to be the report of the
work done by the above students.
Submitted the project during the viva voce held on
…………………………………………
INTERNAL EXAMINER EXTERNAL EXAMINER
ACKNOWLEDGEMENT
We express our sincere gratitude and regards to our beloved
Founder Chairman, Thiru. R. S. MUNIRATHINAM for giving us the
infrastructure for conducting the project work and our Chairperson Tmt.
MANJULA MUNIRATHINAM for her blessings. We also thank our Vice
Chairman, Thiru. R. M. KISHORE and our Director, Thiru. R. JOTHI
NAIDU for their constant support and affection shown towards us throughout
the course.
We are extremely thankful to our Principal,
Dr. K. A. MOHAMED JUNAID, for being the source of inspiration in this
college.
We extend our sincere thanks to our Professor and Head of the
Department, Information Technology, Dr. M. SHEERIN BANU, for her
commendable support and encouragement for the completion of our project.
We would like to express our sincere gratitude to our project coordinator
Dr. K. SARAVANAN, and class in-charge Ms. S. SHOBANA
for their valuable suggestions towards the successful completion of this
project.
We wish to record our thanks to our project supervisor Ms. M. REKHA
for her valuable guidance and support during each stage of our project.
We take this opportunity to extend our thanks to all the faculty members
of the Department of Information Technology, and to our parents and friends, for their care and
support towards the completion of our project.
ABSTRACT
This project will utilize machine learning algorithms, including
regression analysis, decision trees, and neural networks, to develop predictive
models that identify students who may struggle or excel in their academic
performance. The project will focus on applying data engineering techniques
and machine learning algorithms to large datasets that contain academic and
non-academic factors. The goal is to analyze student performance by
identifying key factors that impact academic success, such as attendance,
study habits, and socioeconomic status. The project's findings can be used by
educators to identify at-risk students and develop targeted interventions to
improve their academic outcomes. This project is a contribution to the
emerging field of educational data mining, which has the potential to
revolutionize education by providing personalized learning, identifying
struggling students early, and tailoring teaching to individual needs.
The successful application of machine learning and data engineering
techniques in this project can provide valuable insights for educators in
making informed decisions and interventions to promote student academic
success.
TABLE OF CONTENTS
CHAPTER NO. TITLE PAGE NO.
- ABSTRACT i
LIST OF TABLES v
LIST OF FIGURES vi
LIST OF ABBREVIATIONS vii
1 INTRODUCTION
1.1 Purpose 1
1.2 Scope 2
1.3 Product Features 3
1.4 Existing System and its Limitations 3
1.5 Proposed System 5
2 SYSTEM STUDY AND ANALYSIS
2.1 Literature Review 6
2.2 Problem Description 11
2.3 Feasibility Study 12
2.4 Hardware and Software Requirement 14
2.5 Functional Requirement 15
3 SYSTEM DESIGN
3.1 System Overview 16
3.2 System Architecture 17
3.3 Block Diagram 20
3.4 Database Design
3.4.1 ER Diagram 21
3.4.2 Schema 22
3.5 State Transition Diagram 23
3.6 Class Diagram 24
3.7 Activity Diagram 25
3.8 Sequence Diagram 26
3.9 Collaboration Diagram 27
4 SYSTEM IMPLEMENTATION
4.1 Module Overview 28
4.2 Module Descriptions 29
4.3 Implementation Details 35
5 TESTING
5.1 Test Overview 41
5.2 Test Cases 47
5.3 Test Results 49
6 CONCLUSION 51
6.1 Future Enhancements 51
REFERENCES
APPENDICES
Sample Coding 56
Screenshots 74
LIST OF TABLES
Table No Title Page No
1 Test Case Validation 47
2 Accuracy of the Different Algorithms Used 48
LIST OF FIGURES
Figure No Title Page No
1 Block Diagram of Proposed work 16
2 System Architecture 17
3 Use Case Diagram 20
4 ER Diagram 21
5 Database Schema 22
6 State Transition Diagram 23
7 Class Diagram 24
8 Activity Diagram 25
9 Sequence Diagram 26
10 Collaboration Diagram 27
LIST OF ABBREVIATIONS
Abbreviation Expansion
API Application Programming Interface
ML Machine Learning
SVM Support Vector Machine
NB Naïve Bayes
RFC Random Forest Classifier
DB Database
SQL Structured Query Language
UI User Interface
CHAPTER 1
INTRODUCTION
Quality education plays a critical role in the Sustainable Development
Goals (the 17 SDGs) adopted by the UN. A crucial aspect to keep in mind
while working towards these goals is providing equal opportunities and
sharing them equitably. Students' access to tertiary education is a serious
matter that must be evaluated globally.
Education should be a priority for the improvement of our organizations.
Administrative and teaching staff should improve their productivity by
recognizing the development of programs and competencies that will provide
better learning opportunities for students. In this context, secondary and
intermediate schools should also concentrate on the development and
improvement of educational models through the integration of information and
communication technologies (ICT), which can be used as a tool for promoting social
responsibility and equal opportunities. From this perspective, ICT performance
in education systems is important because it can make a significant
CHAPTER 2
SYSTEM STUDY AND ANALYSIS
2.1 LITERATURE REVIEW
Authors: Chitra Jalota and Rashmi Agrawal. Title: Analysis of
Educational Data Mining using Classification. Publisher: International
Conference on Machine Learning, Big Data, Cloud and Parallel
Computing (COMITCon). Date: 2020.
Educational Data Mining is a new area of research that applies data mining
techniques to educational data. It uses machine learning algorithms and
statistical techniques to help the user interpret students' learning habits
and academic performance and to identify where improvement is needed. In this
paper, the authors discuss various data mining techniques that can be used to
predict student performance levels. The Kalboard 360 dataset was used and
loaded into Weka to analyse these techniques.
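As a rough illustration of the kind of classification such studies survey, the sketch below predicts a performance level with a k-nearest-neighbours vote. It is not the method used in the paper or in Weka. The feature names (raised hands, absence days), the labels and the six training rows are all hypothetical and are not taken from the Kalboard 360 dataset.

```python
from collections import Counter
from math import dist

# Hypothetical training rows: (raised_hands, absence_days) -> performance level.
# These values are invented for illustration, not taken from any real dataset.
TRAIN = [
    ((10, 12), "Low"),
    ((15, 10), "Low"),
    ((40, 6), "Medium"),
    ((45, 5), "Medium"),
    ((80, 1), "High"),
    ((90, 2), "High"),
]


def predict_level(features, train=TRAIN, k=3):
    """Return the majority label among the k training rows nearest to `features`."""
    nearest = sorted(train, key=lambda row: dist(row[0], features))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    print(predict_level((85, 1)))
    print(predict_level((12, 11)))
```

With real data the features would normally be scaled first, since an unscaled Euclidean distance lets the feature with the largest range dominate the vote.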
3.2 SYSTEM ARCHITECTURE
This diagram provides a concise and understandable description of all the
entities currently integrated into the system. It shows how the various
actions and decisions are linked together, giving a picture of the whole
process and how it is carried out. The figure below shows the functional
connections between the various entities.
Fig 3.2 – Architecture Diagram
5.2 TEST CASES:
Table 1. Tests performed on UI
Test Case ID : Login_Page_TC_001
Module : Home Page
Test Scenario : Verify student user able to login to dashboard
Test Data : http://192.168.43.32:8501/
Expected Result : Login page needs to open without error.
Actual Result : User could view the dashboard.
Status : Pass
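A test case like the one in Table 1 can also be checked automatically. The sketch below is an assumption about how that might look, not the procedure followed in this project: it treats the case as passing when the page at the Test Data URL answers with HTTP 200. The names `UiTestCase`, `http_status`, `run_case` and the injectable `fetch_status` parameter are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable
from urllib.request import urlopen


@dataclass
class UiTestCase:
    case_id: str
    module: str
    url: str


def http_status(url: str) -> int:
    """Fetch the URL and return its HTTP status code (needs the app to be running)."""
    with urlopen(url, timeout=5) as response:
        return response.status


def run_case(case: UiTestCase, fetch_status: Callable[[str], int] = http_status) -> str:
    """Return "Pass" if the page opens without error, otherwise "Fail"."""
    try:
        return "Pass" if fetch_status(case.url) == 200 else "Fail"
    except OSError:
        # URLError and HTTPError (connection failures, 4xx/5xx) both derive from OSError.
        return "Fail"


# The single case listed in Table 1.
login_case = UiTestCase("Login_Page_TC_001", "Home Page", "http://192.168.43.32:8501/")
```

Calling `run_case(login_case)` with the default fetcher contacts the address from Table 1, so it only passes where the application is reachable. Passing a stub for `fetch_status` lets the pass/fail logic be exercised without a running server.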
5.3 TEST RESULTS:
Fig. 5.3. Accuracy of Random Forest
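Accuracy, as reported for a classifier in a figure of this kind, is conventionally the fraction of predictions that match the true labels. The sketch below computes it in plain Python. The label lists are invented for illustration and are not the project's results, and the function name `accuracy` is an assumption.

```python
def accuracy(y_true, y_pred):
    """Fraction of positions where the predicted label equals the true label."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy of an empty sample")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical labels, for illustration only.
actual = ["High", "Low", "Medium", "High", "Low"]
predicted = ["High", "Low", "High", "High", "Low"]

if __name__ == "__main__":
    print(f"accuracy = {accuracy(actual, predicted):.2f}")
```

Accuracy alone can mislead when the performance levels are unevenly represented, so per-class measures are often reported alongside it.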
REFERENCES
[1] Juan L. Rastrollo-Guerrero, Juan A. Gómez-Pulido and Arturo Durán-Domínguez.
Analyzing and Predicting Students' Performance by Means of Machine
Learning, MDPI (2020).
[2] Abdallah Namoun and Abdullah Alshanqiti. Predicting Student Performance
Using Data Mining and Learning Analytics Techniques, MDPI (2020).
APPENDICES
SAMPLE CODING:
SCREENSHOTS:
Fig. 8.2.1. Entry of Details (1)