0% found this document useful (0 votes)

31 views16 pages

Machine Learning Glob (22241a1237)

The document presents a project on predicting student performance using machine learning algorithms, specifically Support Vector Machine (SVM), Decision Tree, and K-Nearest Neighbors (KNN). The study utilizes the Student Performance Dataset to identify factors influencing academic success, achieving an accuracy of 88% with the Decision Tree model. The findings emphasize the potential of machine learning to enhance educational outcomes by enabling proactive support for at-risk students.

Uploaded by

margammanisha7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views16 pages

Machine Learning Glob (22241a1237)

Uploaded by

margammanisha7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

A GLOB REPORT on “MACHINE LEARNING LAB”

AI-Driven Student Performance Prediction Using Machine Learning

Algorithms
Submitted in partial fulfilment of the requirements for the award of the
Bachelor of Technology
in
INFORMATION TECHNOLOGY
(2022-2026)

By
Manisha Margam 22241A1237
Under the Esteemed guidance of

Assistant Professor

Department of INFORMATION TECHNOLOGY

GOKARAJU RANGARAJU INSTITUTE OF ENGINEERING AND
TECHNOLOGY
(Approved by AICTE, Autonomous under JNTUH, Hyderabad)
Bachupally, Kukatpally, Hyderabad-500090
2024-2025
GOKARAJU RANGARAJU INSTITUTE OF ENGINEERING AND
TECHNOLOGY
(Autonomous)

Hyderabad-500090

CERTIFICATE

This is to certify that the GLOB entitled “AI-Driven Student Performance

Prediction Using Machine Learning Algorithms” is submitted by Margam
Margam(22241A1237) in partial fulfilment of the award of degree in BACHELOR OF
TECHNOLOGY in INFORMATION TECHNOLOGY during Academic year 2024-2025.

Internal Guide Head of Department

Dr. Y. J. Nagendra Kumar
ABSTRACT

Student academic performance plays a pivotal role in evaluating both individual

potential and the effectiveness of educational systems. Traditionally, assessing
student performance has been a subjective and reactive process, often relying on
manual evaluations and limited data. However, with the increasing availability
of student data and the advent of machine learning (ML), there is an opportunity
to develop predictive models that can identify students at risk of
underperforming, providing a proactive approach to improve academic
outcomes.

This project explores the use of machine learning to predict student

performance based on a variety of factors, including study time, previous
grades, failures, absences, family support, and other demographic and
behavioral attributes. The dataset used in this study is the Student
Performance Dataset from the UCI Machine Learning Repository, which
includes various features influencing a student's final grade (G3). The objective
of this research is to apply three distinct machine learning algorithms—Support
Vector Machine (SVM), Decision Tree, and K-Nearest Neighbors (KNN)—
to predict the final grade of students and evaluate the performance of each
model.

The findings demonstrate that all three algorithms provide valuable insights into
predicting academic performance, with the Decision Tree model outperforming
the others with an accuracy of 88%. The SVM model showed robustness in
handling complex data, achieving 84% accuracy, while KNN performed
decently with 80% accuracy but was sensitive to feature scaling. These models
not only predict performance but also offer insights into the most influential
factors contributing to student success, enabling educators to make data-driven
decisions.
I. INTRODUCTION

1.1 Problem Statement

Educational institutions strive to ensure student success, but early detection of

students at academic risk remains a challenge. Manual methods are inefficient
and often inaccurate. Machine learning can provide predictive models that
enable proactive academic support based on student attributes and past
performance.

1.2 Objective

The objective of this project is to build a machine learning model that can
predict student academic performance based on various factors such as study
time, previous failures, absences, and family support. The project will
implement the following objectives:

1. Preprocess the Data:

o Clean and prepare the dataset by handling any missing values,
encoding categorical variables, and performing feature scaling.
2. Apply Machine Learning Algorithms:
o Implement three distinct machine learning algorithms—Support
Vector Machine (SVM), Decision Tree, and K-Nearest
Neighbors (KNN)—to predict student performance based on the
input features.
3. Evaluate Model Performance:
o Evaluate the models' performance using accuracy for classification
tasks and Root Mean Squared Error (RMSE) for regression
tasks.
4. Compare the Models:
o Compare the performance of the models to identify the best-
performing algorithm for predicting student final grades.
5. Provide Insights for Educational Institutions:
o Use the results of the models to generate insights that can help
educators in identifying at-risk students, optimizing resource
allocation, and planning personalized interventions to improve
student outcomes.

1
1.3 Scope of the Project

This project focuses on predicting the final grade (G3) of students using a
variety of available features such as study time, failures, absences, family
support, and previous academic performance. The aim is to apply supervised
learning algorithms to model the relationship between these features and the
target variable (final grade). Both classification (predicting grade categories)
and regression (predicting exact grade scores) approaches will be utilized to
assess the different ways machine learning can predict student performance.

The project will evaluate and compare the effectiveness of three widely used
machine learning algorithms: Support Vector Machine (SVM), Decision
Tree, and K-Nearest Neighbors (KNN). The models will be compared based
on metrics such as accuracy and Root Mean Squared Error (RMSE) to
determine their performance in predicting student grades.

Finally, the findings will be discussed in the context of their implications in

educational settings, with a focus on how machine learning can help educators
identify at-risk students and implement data-driven strategies to improve
academic outcomes and retention rates.

2
II. Literature Review

Numerous studies have explored the use of machine learning (ML) techniques
to predict academic outcomes, leveraging the wealth of data available in
educational systems. Algorithms like Logistic Regression, Naive Bayes,
Random Forests, and Neural Networks have been extensively applied, each
showing promising results in various academic prediction tasks. These methods
are particularly beneficial in analyzing large datasets with multiple features such
as student demographics, behavior patterns, and academic performance.

Among these, Support Vector Machines (SVM) have excelled in classification

tasks, especially when distinguishing between different levels of academic
performance. SVM’s ability to maximize the margin between classes makes it
effective for complex, high-dimensional datasets. It is particularly well-suited to
predicting binary outcomes, such as identifying students who are at risk of
failing versus those who are likely to succeed.

Decision Trees, on the other hand, are favored for their interpretability and
transparency. Educators and researchers appreciate how Decision Trees split
data based on feature values, creating a clear path for understanding how
different attributes contribute to a student's final grade. This interpretability
allows for more actionable insights, which can be used to design targeted
interventions.

K-Nearest Neighbors (KNN) is often applied in educational data mining

because of its simplicity and effectiveness in handling small to moderate-sized
datasets. KNN is a non-parametric algorithm that makes predictions based on
the proximity of data points, which works well when the decision boundaries
are non-linear and there are clear similarities between data points. While KNN
is highly sensitive to feature scaling and the choice of k (number of neighbors),
it remains a valuable tool due to its simplicity and ease of implementation.

Furthermore, ensemble methods like Random Forests have shown robust

performance in predicting academic outcomes by combining multiple decision
trees to improve accuracy and reduce overfitting. Neural Networks,
particularly deep learning models, are also gaining traction in the field of
educational data mining due to their capacity to model complex relationships

3
within the data. However, these models tend to require larger datasets and
computational resources.

In conclusion, while there is no one-size-fits-all algorithm, each of these models

offers unique advantages and trade-offs, and the choice of model depends
largely on the dataset, the problem at hand, and the desired level of
interpretability. These machine learning algorithms provide an opportunity to
not only predict academic outcomes but also to gain deeper insights into the
factors influencing student success and failure. By incorporating these
predictive models, educational institutions can move towards more personalized
learning experiences and targeted interventions.

4
III.Methodology

3.1 Data Collection

The dataset used for this project is the Student Performance Dataset from the
UCI Machine Learning Repository. This dataset consists of 1,000 instances with
various input features related to student demographic, behavioral, and academic
factors. The target variable is the final grade (G3), which represents the
student's final grade on a scale from 0 to 20. The features in the dataset include:

 Study time
 Failures
 Absences
 Family support
 Health
 Monthly alcohol consumption
 Extracurricular activities
 Parental education
 Previous grades (G1 and G2)
 Gender

3.2 Data Preprocessing

Data preprocessing is a vital step in machine learning that helps ensure the
quality and suitability of the data for training. The following preprocessing steps
were carried out:

 Handling Missing Values: The dataset contains no missing values, so no

imputation or removal of missing data was necessary.
 Feature Encoding: Categorical variables, such as gender and family
support, were encoded using one-hot encoding or label encoding as
needed to prepare them for the machine learning models.
 Feature Scaling: Numerical features like study time, absences, and
previous grades were standardized using techniques like StandardScaler
(zero mean, unit variance) to ensure that each feature contributes equally
to the model's performance, especially for models like KNN and SVM.

5
 Data Splitting: The data was divided into training and testing sets, with
80% of the data used for training the models and 20% used for testing
their performance.

3.3 Algorithms Implemented

3.3.1 Support Vector Machine (SVM) (Supervised Learning)

Support Vector Machines (SVM) are supervised learning models that are highly
effective for both classification and regression tasks. SVM works by finding a
hyperplane that best separates the classes in the feature space. In this project, we
use SVM to predict whether a student will perform well (high grade) or poorly
(low grade) based on their attributes. The SVM model can handle non-linear
relationships through kernel tricks, making it well-suited for this kind of

educational data.

3.3.2 Decision Tree (Supervised Learning)

A Decision Tree is a model that splits the data into subsets based on the most
significant features. It works by recursively dividing the dataset into smaller
segments, making decisions at each node. For this project, we use the Decision
Tree model to predict students' final grades by classifying them into categories
like low, medium, and high performance. The model is chosen for its
interpretability, allowing educators to easily understand the factors influencing a
student's performance.

3.3.3 K-Nearest Neighbors (KNN) (Supervised Learning)

K-Nearest Neighbors (KNN) is a non-parametric, lazy learning algorithm that

predicts a student’s performance by considering the grades of the nearest
neighbors. The model finds the k most similar instances and assigns the grade
based on their majority. KNN is known for being simple yet effective,
especially in datasets where data points with similar attributes tend to belong to
the same class. It is sensitive to the scale of the data, which is why feature
scaling was applied prior to training.

6
IV. Implementation

4.1 Preprocessing the Data

[fig-1]

4.2 Support Vector Machine (SVM)

[fig-2]

7
4.3 Decision Tree

[fig-3]

8
4.4 K-Nearest Neighbors (KNN)

[fig-4]

9
V. Results

5.1 Support Vector Machine (SVM)

[fig-5]

5.2 Decision Tree

[fig-6]

10
[fig-7]

[fig-8]

11
5.3 K-Nearest Neighbors (KNN)

[fig-9]

VI. Conclusion

This project successfully demonstrated the application of machine learning

techniques to predict student academic performance using the Student
Performance Dataset from the UCI Machine Learning Repository. By
focusing on relevant features such as study time, past failures, absences, and
family support, three machine learning models—Support Vector Machine
(SVM), Decision Tree, and K-Nearest Neighbors (KNN)—were
implemented and evaluated.
Among the models, the Decision Tree algorithm emerged as the most
effective, achieving an accuracy of 88%, followed by SVM at 84% and
KNN at 80%. These results highlight the capability of machine learning to
model complex educational data and provide reliable predictions of student
outcomes. The models also offered insights into the most influential factors

12
affecting performance, which can support data-driven decision-making in
educational settings.
By identifying students at risk of underperforming early, educational
institutions can proactively implement personalized interventions and
allocate resources more strategically. Ultimately, the integration of
predictive analytics into academic environments can lead to improved
student retention, enhanced learning outcomes, and a more efficient and
equitable education system.

VII. References
1. Yadav, S., & Pal, S. (2012)
Used classification techniques to improve student performance
predictions.
2. Scikit-learn: Machine Learning in Python
3. Pandas Documentation
4. Peña-Ayala, A. (2014)
Analyzed recent work in educational data mining and how it's used
to help students.
5. K-Nearest Neighbors Explained – GeeksforGeeks

Leveraging Machine Learning Approaches For Predicting Students' Academic Success An Analytical Perspective
No ratings yet
Leveraging Machine Learning Approaches For Predicting Students' Academic Success An Analytical Perspective
16 pages
Lucky Mini Project
No ratings yet
Lucky Mini Project
32 pages
First Project
No ratings yet
First Project
34 pages
MiniProject XLSX Merged1
No ratings yet
MiniProject XLSX Merged1
37 pages
83 CD
No ratings yet
83 CD
6 pages
Computer Science Students Academic Performance Prediction Using Ai
No ratings yet
Computer Science Students Academic Performance Prediction Using Ai
68 pages
ML in Student Performance Analysis
No ratings yet
ML in Student Performance Analysis
15 pages
Student Score Prediction with ML
No ratings yet
Student Score Prediction with ML
24 pages
PredictingStudentSuccess-AutoML PrePrint
No ratings yet
PredictingStudentSuccess-AutoML PrePrint
23 pages
Jeml 0102005
No ratings yet
Jeml 0102005
7 pages
1822 B.E Cse Batchno 7
No ratings yet
1822 B.E Cse Batchno 7
60 pages
Academic Analytics Using Machine Learning
No ratings yet
Academic Analytics Using Machine Learning
26 pages
Project Interim
No ratings yet
Project Interim
13 pages
Predicting Student Academic Performanceusing Support Vector Machineand Random Forest
No ratings yet
Predicting Student Academic Performanceusing Support Vector Machineand Random Forest
9 pages
Machine Learning Approaches For Student Performance Prediction
No ratings yet
Machine Learning Approaches For Student Performance Prediction
6 pages
Predicting Student Performance
No ratings yet
Predicting Student Performance
38 pages
Final PPT Gruop 143k
No ratings yet
Final PPT Gruop 143k
26 pages
Học viện ngân hàng Banking Academy of Vietnam International School of Business
No ratings yet
Học viện ngân hàng Banking Academy of Vietnam International School of Business
9 pages
Academic Performance Prediction Using Machine Learning Approaches A Survey
No ratings yet
Academic Performance Prediction Using Machine Learning Approaches A Survey
18 pages
Predicting The Academic Performance of Industrial
No ratings yet
Predicting The Academic Performance of Industrial
12 pages
Paper Predicting Student Scores
No ratings yet
Paper Predicting Student Scores
10 pages
Advanced Machine Learning Models For Academic Performance Forecasting
No ratings yet
Advanced Machine Learning Models For Academic Performance Forecasting
38 pages
Ai-Based Early Prediction and Intervention For Student Academic Performance in Higher Education
No ratings yet
Ai-Based Early Prediction and Intervention For Student Academic Performance in Higher Education
19 pages
Journal Publications
No ratings yet
Journal Publications
13 pages
A Machine Learning Approach For Tracking and Predicting Student Performance in Degree Programs
No ratings yet
A Machine Learning Approach For Tracking and Predicting Student Performance in Degree Programs
2 pages
Proj Report 4
No ratings yet
Proj Report 4
12 pages
Evaluation of Literature Review
No ratings yet
Evaluation of Literature Review
2 pages
Irjet V7i2688 PDF
No ratings yet
Irjet V7i2688 PDF
4 pages
Applsci 11 10007 v3
No ratings yet
Applsci 11 10007 v3
22 pages
Predicting Student Success with ML
No ratings yet
Predicting Student Success with ML
30 pages
A Systematic Literature Review
No ratings yet
A Systematic Literature Review
28 pages
Predicting Student Success with ML
No ratings yet
Predicting Student Success with ML
5 pages
12 IV April 2024
No ratings yet
12 IV April 2024
8 pages
Seminal Review Paper
No ratings yet
Seminal Review Paper
23 pages
Prediction of Students Performance With Learning Coefficients Using Regression Based Machine Learning Models
No ratings yet
Prediction of Students Performance With Learning Coefficients Using Regression Based Machine Learning Models
11 pages
Sat - 7.Pdf - Predicting Student's Performance Based On Machine Learning
No ratings yet
Sat - 7.Pdf - Predicting Student's Performance Based On Machine Learning
11 pages
Prediction Model For Students PDF
No ratings yet
Prediction Model For Students PDF
4 pages
Article 4
No ratings yet
Article 4
9 pages
Ffirst Review
No ratings yet
Ffirst Review
18 pages
Machine Learning Based Student AcademicPerformance Prediction
No ratings yet
Machine Learning Based Student AcademicPerformance Prediction
6 pages
12058-Article Text-21417-1-10-20220201
No ratings yet
12058-Article Text-21417-1-10-20220201
7 pages
Bee Jay1
No ratings yet
Bee Jay1
11 pages
Major Project Report Sem 7
No ratings yet
Major Project Report Sem 7
23 pages
Evaluating Machine Learning Algorithms For Enhanced Prediction of Student Academic Performance
100% (1)
Evaluating Machine Learning Algorithms For Enhanced Prediction of Student Academic Performance
4 pages
AI-Powered Student Grade Prediction
No ratings yet
AI-Powered Student Grade Prediction
70 pages
Huang2021 Article AFeatureWeightedSupportVectorM
No ratings yet
Huang2021 Article AFeatureWeightedSupportVectorM
13 pages
SFA Paper 7
No ratings yet
SFA Paper 7
2 pages
1 Report
No ratings yet
1 Report
45 pages
Prediction of Student Exam Performance Using Data
No ratings yet
Prediction of Student Exam Performance Using Data
26 pages
Arasetv44 N1 PP105 119
No ratings yet
Arasetv44 N1 PP105 119
15 pages
Abstract Student Outcomes
No ratings yet
Abstract Student Outcomes
2 pages
Ijesrt: International Journal of Engineering Sciences & Research Technology
No ratings yet
Ijesrt: International Journal of Engineering Sciences & Research Technology
11 pages
Final22 INT254 Report
No ratings yet
Final22 INT254 Report
10 pages
Yash 21BSDS12 Perdictive Analysis Report
No ratings yet
Yash 21BSDS12 Perdictive Analysis Report
20 pages
AI in Education (Juan Cruz-Benito)
No ratings yet
AI in Education (Juan Cruz-Benito)
116 pages
CCS369 - TSS-Unit 2
No ratings yet
CCS369 - TSS-Unit 2
56 pages
DipanshuKhurana NERTask
No ratings yet
DipanshuKhurana NERTask
8 pages
Lora: Low-Rank Adaptation of Large Language Models
No ratings yet
Lora: Low-Rank Adaptation of Large Language Models
20 pages
Chapter 3 Artificial Intelligence (AI)
No ratings yet
Chapter 3 Artificial Intelligence (AI)
47 pages
AIML Unit 5
No ratings yet
AIML Unit 5
195 pages
A Water Behavior Dataset For An Image-Based Drowning Solution
100% (1)
A Water Behavior Dataset For An Image-Based Drowning Solution
5 pages
Curriculum Doc - BITSOM BA 2507 - Program Calendar
No ratings yet
Curriculum Doc - BITSOM BA 2507 - Program Calendar
3 pages
Nlp-Natural Language Processing
No ratings yet
Nlp-Natural Language Processing
11 pages
Yolo1 11
No ratings yet
Yolo1 11
38 pages
NLP Final Review
No ratings yet
NLP Final Review
32 pages
Review PAMI
No ratings yet
Review PAMI
20 pages
CNN for Vehicle Plate Detection
No ratings yet
CNN for Vehicle Plate Detection
5 pages
Brochure Special Issue
100% (1)
Brochure Special Issue
1 page
Generative AI For Beginners1
100% (3)
Generative AI For Beginners1
85 pages
AI Project Report
No ratings yet
AI Project Report
30 pages
Supervised and Unsupervised Machine Learning
No ratings yet
Supervised and Unsupervised Machine Learning
3 pages
Neural Network Concepts Quiz
No ratings yet
Neural Network Concepts Quiz
152 pages
Recurrent Neural Networks
No ratings yet
Recurrent Neural Networks
104 pages
Real Time Human Action Recognition Using Pose Estimation To Enhance Hospital Security
No ratings yet
Real Time Human Action Recognition Using Pose Estimation To Enhance Hospital Security
19 pages
Banglabert Language Model Pret
No ratings yet
Banglabert Language Model Pret
11 pages
Zero Shot Prompting Unlocking AIs Untapped Potential
No ratings yet
Zero Shot Prompting Unlocking AIs Untapped Potential
7 pages
Elective Course 5 Ai 2024 Solution 666666666666
No ratings yet
Elective Course 5 Ai 2024 Solution 666666666666
7 pages
JMLDL CFP
No ratings yet
JMLDL CFP
4 pages
CH2.3 - Resnet Backpropagation Well Explained
No ratings yet
CH2.3 - Resnet Backpropagation Well Explained
5 pages
Soft-Computing Notes
No ratings yet
Soft-Computing Notes
3 pages
Computer Project of AI
No ratings yet
Computer Project of AI
15 pages
Course Basic Level of Generative AI
No ratings yet
Course Basic Level of Generative AI
4 pages
15) EXPLAIN Fitted Q and Deep Q-Learning
No ratings yet
15) EXPLAIN Fitted Q and Deep Q-Learning
17 pages
Supply Chain
No ratings yet
Supply Chain
14 pages
AI for Vegetable Image Classification
No ratings yet
AI for Vegetable Image Classification
8 pages

Machine Learning Glob (22241a1237)

Uploaded by

Machine Learning Glob (22241a1237)

Uploaded by

A GLOB REPORT on “MACHINE LEARNING LAB”

AI-Driven Student Performance Prediction Using Machine Learning

Department of INFORMATION TECHNOLOGY

This is to certify that the GLOB entitled “AI-Driven Student Performance

Internal Guide Head of Department

Student academic performance plays a pivotal role in evaluating both individual

This project explores the use of machine learning to predict student

1.1 Problem Statement

Educational institutions strive to ensure student success, but early detection of

1. Preprocess the Data:

Finally, the findings will be discussed in the context of their implications in

Among these, Support Vector Machines (SVM) have excelled in classification

K-Nearest Neighbors (KNN) is often applied in educational data mining

Furthermore, ensemble methods like Random Forests have shown robust

In conclusion, while there is no one-size-fits-all algorithm, each of these models

3.1 Data Collection

3.2 Data Preprocessing

 Handling Missing Values: The dataset contains no missing values, so no

3.3 Algorithms Implemented

3.3.1 Support Vector Machine (SVM) (Supervised Learning)

3.3.2 Decision Tree (Supervised Learning)

3.3.3 K-Nearest Neighbors (KNN) (Supervised Learning)

K-Nearest Neighbors (KNN) is a non-parametric, lazy learning algorithm that

4.1 Preprocessing the Data

4.2 Support Vector Machine (SVM)

5.1 Support Vector Machine (SVM)

5.2 Decision Tree

This project successfully demonstrated the application of machine learning

You might also like