Capstone Project Topic
Submit by 09/03/2025
Select any one of the following.
Title Task Dataset
1. Linear Regression Model
Dataset: car_price dataset
• Exploratory data analysis for the given data
• Data Visualization
• Identify Features & labels
• Making a car price prediction ML model
• Training the model
• Evaluation of the model: MAE, RMSE, R² score
• Saving Model
• Plot regression line
• Testing on real-time data
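Because a car price is a continuous value, fitting and evaluating the model can be sketched with a least-squares line and error metrics. The block below is a minimal plain-Python illustration, not the expected solution: the feature (engine size), the toy numbers, and the helper names `fit_line` and `regression_metrics` are all assumptions, and in practice a library such as scikit-learn would typically be used.

```python
import math

def fit_line(xs, ys):
    """Ordinary least squares for a single feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def regression_metrics(y_true, y_pred):
    """Mean absolute error, root mean squared error and R² score."""
    n = len(y_true)
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    mean_t = sum(y_true) / n
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return {"mae": mae, "rmse": math.sqrt(ss_res / n), "r2": 1 - ss_res / ss_tot}

# Hypothetical toy data: engine size (litres) vs. price (thousands).
engine_size = [1.0, 1.5, 2.0, 2.5, 3.0]
price = [10.5, 14.0, 20.5, 24.5, 30.5]

slope, intercept = fit_line(engine_size, price)
predicted = [slope * x + intercept for x in engine_size]
metrics = regression_metrics(price, predicted)
print(slope, intercept, metrics)
```

The fitted `slope` and `intercept` are also what the regression line plot would draw over the scatter of the data.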
2. Logistic Regression
Dataset: Road Accident Survival Dataset
• Exploratory data analysis for the given data
• Data Visualization
• Identify Features & labels
• Develop a survival status prediction model
• Supervised Logistic Regression model
• Binary Class model (Yes/No)
• Confusion Matrix
• Classification Report
• Evaluation of the model
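The confusion matrix and classification report for a binary (Yes/No) model reduce to four counts and a few ratios. The following is a minimal plain-Python sketch of that arithmetic. The labels and predictions are made-up values, and the helper names are hypothetical; a library report would normally be used in the project itself.

```python
def confusion_counts(y_true, y_pred, positive="Yes"):
    """Count true/false positives and negatives for a binary problem."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for truth, guess in zip(y_true, y_pred):
        if guess == positive:
            counts["tp" if truth == positive else "fp"] += 1
        else:
            counts["fn" if truth == positive else "tn"] += 1
    return counts

def binary_report(counts):
    """Precision, recall, F1 and accuracy derived from the four counts."""
    tp, fp, fn, tn = counts["tp"], counts["fp"], counts["fn"], counts["tn"]
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}

# Made-up survival labels and model predictions, for illustration only.
y_true = ["Yes", "Yes", "No", "No", "Yes", "No", "Yes", "No"]
y_pred = ["Yes", "No", "No", "Yes", "Yes", "No", "Yes", "No"]

counts = confusion_counts(y_true, y_pred)
report = binary_report(counts)
print(counts, report)
```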
3. KNN
Dataset: Stroke Risk Prediction Dataset Based on Symptoms
• Exploratory data analysis for the given data
• Identify Features & labels
• Data Visualization
• Building a KNN model for predicting stroke risk based on the symptoms
• Training the model
• Evaluation of the model: Confusion matrix, Classification Report
• Plot the clustering curve
• Saving Model
• Testing on real-time data
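One common reading of the curve to plot for KNN is the test error rate against the number of neighbours K; that interpretation is an assumption here. The sketch below implements a tiny KNN classifier in plain Python and computes the error for several K values. The two numeric features and all data points are hypothetical stand-ins for symptom scores.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, point, k):
    """Majority vote among the k training points closest to `point`."""
    by_distance = sorted(zip(train_X, train_y),
                         key=lambda pair: math.dist(pair[0], point))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

def error_rate(train_X, train_y, test_X, test_y, k):
    """Fraction of test points that the k-NN vote gets wrong."""
    wrong = sum(knn_predict(train_X, train_y, x, k) != y
                for x, y in zip(test_X, test_y))
    return wrong / len(test_y)

# Hypothetical symptom scores; label 1 = at risk, 0 = not at risk.
train_X = [(1, 1), (1, 2), (2, 1), (8, 8), (8, 9), (9, 8)]
train_y = [0, 0, 0, 1, 1, 1]
test_X = [(2, 2), (9, 9)]
test_y = [0, 1]

# Odd K values avoid tied votes in a two-class problem.
errors = {k: error_rate(train_X, train_y, test_X, test_y, k) for k in (1, 3, 5)}
print(errors)
```

Plotting `errors` with K on the x-axis gives the curve; the K with the lowest error on held-out data is the usual choice.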
4. SVM
Dataset: Human Activity Recognition Using Smartphones - UCI Machine Learning Repository
• Exploratory data analysis for the given data
• Data Visualization
• Identify Features & labels
• Building an SVM model for identifying the human activity
• Training the model
• Evaluation of the model: Confusion matrix, Classification Report
• Saving Model
• Testing on real-time data
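Saving a model and testing it on new data can be sketched with Python's built-in `pickle` module. In the block below, the `model` dictionary is a hypothetical stand-in for a trained classifier (a linear decision rule with made-up weights), not a real SVM. A fitted scikit-learn estimator can generally be pickled and reloaded in the same way.

```python
import os
import pickle
import tempfile

# Hypothetical stand-in for a trained model: a linear decision rule.
model = {"weights": [0.4, -1.2, 0.7], "bias": 0.1,
         "labels": ["WALKING", "SITTING"]}

def predict(model, features):
    """Return the first label if the linear score is non-negative, else the second."""
    score = sum(w * x for w, x in zip(model["weights"], features)) + model["bias"]
    return model["labels"][0] if score >= 0 else model["labels"][1]

with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "activity_model.pkl")
    with open(path, "wb") as fh:      # save the model to disk
        pickle.dump(model, fh)
    with open(path, "rb") as fh:      # load it back, as a later session would
        restored = pickle.load(fh)

# A new, unseen sample (made-up sensor features) scored by the restored model.
sample = [1.0, 0.2, 0.5]
prediction = predict(restored, sample)
print(prediction)
```

Only unpickle files from a trusted source, since loading a pickle can execute arbitrary code.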
5. Decision Tree
Dataset: Fraud Detection Transactions Dataset
• Exploratory data analysis for the given data
• Data Visualization
• Building a decision tree model for predicting the IP address flag
• Training the model
• Evaluation of the model: Confusion matrix, Classification Report
• Plot the tree
• Saving Model
• Testing on real-time data
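A decision tree is built by repeatedly choosing the split that makes the child nodes purest. As a rough illustration of one such step, the sketch below scores candidate thresholds on a single numeric feature by weighted Gini impurity. The transaction amounts, the flags, and the function names are hypothetical; a library tree would search all features and recurse.

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 0.0 for a pure node, 0.5 for a 50/50 binary node."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def best_threshold(values, labels):
    """Try midpoints between sorted distinct values; return (threshold, weighted Gini)."""
    best = (None, gini(labels))
    distinct = sorted(set(values))
    for low, high in zip(distinct, distinct[1:]):
        threshold = (low + high) / 2
        left = [lab for val, lab in zip(values, labels) if val <= threshold]
        right = [lab for val, lab in zip(values, labels) if val > threshold]
        weighted = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if weighted < best[1]:
            best = (threshold, weighted)
    return best

# Hypothetical transaction amounts and fraud flags (1 = flagged).
amounts = [20, 35, 50, 400, 900, 1200]
flags = [0, 0, 0, 1, 1, 1]

threshold, impurity = best_threshold(amounts, flags)
print(threshold, impurity)
```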
6. K-means
Dataset: Online Retail - UCI Machine Learning Repository
• Exploratory data analysis for the given data
• Data Visualization
• Explain the feature set here
• Prepare K-means clustering unsupervised model for mall sales data
• Data cleaning and processing
• Removing outliers from the dataset using the box plot concept
• Finding the “K” value for cluster formation using the elbow method
• Implementing the K-means
• Plot the cluster
• Evaluating model
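The box plot rule for outliers and the elbow method can both be sketched on a single numeric column. The block below is a plain-Python illustration under stated assumptions: the spending values are invented, the 1.5 × IQR fences are the conventional box plot whiskers, and `kmeans_1d` is a simplified one-dimensional K-means with a deterministic start, not a library implementation.

```python
import statistics

def remove_outliers_iqr(values):
    """Keep values inside the box plot whiskers: [Q1 - 1.5*IQR, Q3 + 1.5*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if low <= v <= high]

def kmeans_1d(values, k, iterations=20):
    """Simplified 1-D K-means; returns (centres, inertia)."""
    ordered = sorted(values)
    # Deterministic start: centres spread evenly across the sorted data.
    centres = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda c: abs(v - centres[c]))
            clusters[nearest].append(v)
        # Move each centre to the mean of its cluster (keep it if the cluster is empty).
        centres = [sum(c) / len(c) if c else centres[i]
                   for i, c in enumerate(clusters)]
    inertia = sum(min((v - c) ** 2 for c in centres) for v in values)
    return centres, inertia

# Invented basket values with one obvious outlier.
raw = [12, 15, 14, 13, 16, 15, 14, 200]
cleaned = remove_outliers_iqr(raw)

# Invented spending scores with three natural groups.
spend = [10, 11, 12, 50, 51, 52, 90, 91, 92]
inertias = {k: kmeans_1d(spend, k)[1] for k in range(1, 5)}
print(cleaned, inertias)
```

Plotting `inertias` against K shows a sharp drop up to K = 3 and almost no gain afterwards; that bend is the "elbow".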
7. PCA
Dataset: Hierarchical Sales Data - UCI Machine Learning Repository
• Exploratory data analysis for the given data
• Data Visualization
• Explain the concept of features and clustering here
• Do Principal Component Analysis (PCA) for mall data to reduce the dimension of the dataset
• Compute PC1, PC2…
• Compute the explained variance ratio
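For two features, the principal components and their variance ratio can be worked out by hand from the 2 × 2 covariance matrix. The sketch below does this in plain Python as an illustration. The data points are invented and the function names are hypothetical; for real, higher-dimensional data a library PCA would be used.

```python
import math

def covariance_2d(points):
    """Sample covariance entries (sxx, sxy, syy) of 2-D points."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mean_x) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - mean_y) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points) / (n - 1)
    return sxx, sxy, syy

def pca_2d(points):
    """Return (explained variance ratios, unit vector of PC1)."""
    sxx, sxy, syy = covariance_2d(points)
    trace = sxx + syy
    # Eigenvalues of the symmetric matrix [[sxx, sxy], [sxy, syy]].
    gap = math.sqrt((sxx - syy) ** 2 + 4 * sxy ** 2)
    lam1, lam2 = (trace + gap) / 2, (trace - gap) / 2
    if sxy != 0:
        vx, vy = sxy, lam1 - sxx          # eigenvector belonging to lam1
    else:
        vx, vy = (1.0, 0.0) if sxx >= syy else (0.0, 1.0)
    norm = math.hypot(vx, vy)
    return (lam1 / trace, lam2 / trace), (vx / norm, vy / norm)

# Invented data lying close to the line y = x.
points = [(2.0, 1.9), (3.0, 3.1), (4.0, 4.2), (5.0, 4.8), (6.0, 6.0)]
ratios, pc1 = pca_2d(points)
print(ratios, pc1)
```

Here almost all of the variance falls on PC1, so the second dimension could be dropped with little loss of information.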
8. CNN
Dataset: MNIST Database of Handwritten Digits - UCI Machine Learning Repository
• Building an image multi-class classification model for MNIST data
• Data Visualization
• Reading the image data
• Processing of image data
• Preparing dataset with labels
• Label Encoding
• Preparing CNN layers
• Compiling the CNN
• Training the CNN model
• Analyzing Accuracy & Loss Curve
• Testing the Model
• Evaluating the model
• Saving & Loading the Model
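Three of the steps above (processing image data, label encoding, and what a convolution layer computes) can be illustrated without a deep learning framework. The block below is a plain-Python sketch on a tiny invented image, with hypothetical helper names. MNIST images are 28 × 28 grayscale with pixel values 0 to 255 and ten digit classes, and a real project would build the network with a framework such as Keras or PyTorch.

```python
def normalize_image(image):
    """Scale 0-255 pixel values into the range [0, 1]."""
    return [[pixel / 255.0 for pixel in row] for row in image]

def one_hot(label, num_classes=10):
    """Encode a digit label as a vector with a single 1.0 entry."""
    vector = [0.0] * num_classes
    vector[label] = 1.0
    return vector

def conv2d_valid(image, kernel):
    """Slide the kernel over the image without padding (cross-correlation,
    which is what deep learning 'convolution' layers compute)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]

# Tiny invented inputs standing in for a 28 x 28 digit image.
pixels = [[0, 255], [128, 64]]
scaled = normalize_image(pixels)
encoded = one_hot(3)

image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
kernel = [[1, 0], [0, -1]]
feature_map = conv2d_valid(image, kernel)
print(scaled, encoded, feature_map)
```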
Hint Sheet link
https://colab.research.google.com/drive/1ymwpji6K6st0_fyHYG9cZG77YHtv54V8?usp=sharing