Slide 15: Data Preprocessing
This bar graph illustrates the distribution of diseases in our dataset. Each bar represents a
distinct disease, with equal frequencies across 41 medical conditions. The balanced dataset
minimizes bias, ensuring fair and accurate predictions across all classes.
Slide 16: Feature Selection
We employed two methods for feature selection:
● Recursive Feature Elimination (RFE): Identifies the most impactful features by
iteratively removing less significant ones.
● Mutual Information: Measures the dependency between features and the target
variable, selecting the most predictive attributes.
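The two methods above can be sketched with scikit-learn on synthetic data; the dataset shape, base estimator, and number of selected features here are illustrative assumptions, not the project's actual values.

```python
# Sketch of both selection methods on synthetic data; dataset shape,
# base estimator, and k=4 are illustrative assumptions.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE, mutual_info_classif
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=300, n_features=10,
                           n_informative=4, random_state=0)

# RFE: repeatedly fit a model and drop the weakest feature.
rfe = RFE(LogisticRegression(max_iter=1000), n_features_to_select=4)
rfe.fit(X, y)
rfe_selected = np.flatnonzero(rfe.support_)

# Mutual information: score each feature's dependency on the target.
mi_scores = mutual_info_classif(X, y, random_state=0)
mi_selected = np.argsort(mi_scores)[::-1][:4]

print("RFE picked features:", rfe_selected)
print("MI picked features:", sorted(mi_selected))
```

In practice the two methods often agree on the strongest predictors, and their intersection is a reasonable shortlist.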
Slide 16 (continued): Correlation Heatmap
The heatmap visualizes feature relationships:
● Diagonal (Red): Perfect self-correlation (value = 1).
● Color Scale:
○ Red: Strong positive correlation.
○ Blue: Strong negative correlation.
○ White: Little to no correlation.
Identifying highly correlated features guides feature selection: redundant features can be dropped, reducing dimensionality and improving model performance.
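The correlation matrix behind such a heatmap can be computed with pandas; the symptom columns below are synthetic placeholders chosen to show strong positive, strong negative, and near-zero correlation.

```python
# Minimal sketch of the computation behind the heatmap; the symptom
# columns are synthetic placeholders, not the real dataset's values.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
base = rng.normal(size=200)
df = pd.DataFrame({
    "fatigue": base,
    "joint_pain": base + 0.1 * rng.normal(size=200),   # strong positive (red)
    "headache": -base + 0.1 * rng.normal(size=200),    # strong negative (blue)
    "high_fever": rng.normal(size=200),                # near zero (white)
})

corr = df.corr()
print(corr.round(2))
# The diagonal is exactly 1 (self-correlation); off-diagonal cells map to
# the red/blue/white color scale, e.g. via seaborn.heatmap(corr, cmap="coolwarm").
```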
Slide 17: Feature Importance Visualization
This bar chart ranks features by their importance scores:
● Top Feature: Fatigue has the highest influence on predictions.
● Other impactful features include joint_pain, headache, and high_fever.
● Importance scores decrease down the list, highlighting diminishing contributions.
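A ranking like this is typically read off a fitted tree ensemble's importance scores; the sketch below uses synthetic data and placeholder feature names, so the resulting order is illustrative only.

```python
# Sketch of producing an importance ranking with a tree-based model;
# data and feature names are placeholders, not the project's.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=400, n_features=6,
                           n_informative=3, random_state=0)
names = ["fatigue", "joint_pain", "headache",
         "high_fever", "nausea", "chills"]

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# Importances sum to 1; sorting them descending yields the bar chart order.
order = np.argsort(model.feature_importances_)[::-1]
for i in order:
    print(f"{names[i]:>10s}: {model.feature_importances_[i]:.3f}")
```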
Slide 19: Top 50 Features
This bar chart displays the top 50 important features based on importance scores:
● Top Feature: Muscle_pain is the most influential predictor.
● Other key features: Itching, altered_sensorium, dark_urine, and high_fever.
● Features were ranked using importance metrics derived from tree-based models; limiting
the model to the top 50 balances predictive performance against complexity.
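The trade-off above can be sketched by retraining on only the top-ranked features; the dataset, the 40-feature width, and the cutoff of 10 are assumptions for illustration.

```python
# Hedged sketch: keep only the top-k features by importance and retrain,
# illustrating the performance-vs-complexity trade-off on synthetic data.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=40,
                           n_informative=8, random_state=0)

rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
top_k = np.argsort(rf.feature_importances_)[::-1][:10]  # top 10 of 40

full_acc = cross_val_score(rf, X, y, cv=5).mean()
slim_acc = cross_val_score(
    RandomForestClassifier(n_estimators=100, random_state=0),
    X[:, top_k], y, cv=5).mean()
print(f"all 40 features: {full_acc:.3f}, top 10: {slim_acc:.3f}")
```

A much smaller feature set usually retains most of the accuracy while simplifying the model.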
Slide 20: Confusion Matrices
Confusion matrices compare Random Forest (left) and Gradient Boosting (right) models:
● Axes: Predicted labels (x-axis) vs. actual labels (y-axis).
● Diagonal Values: Correct predictions dominate, indicating high accuracy.
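The two matrices can be generated as follows; the models, split, and three-class synthetic data stand in for the real 41-disease setup.

```python
# Sketch of producing both confusion matrices; data and class count
# are illustrative stand-ins for the real 41-disease dataset.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=400, n_features=8, n_classes=3,
                           n_informative=5, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for Model in (RandomForestClassifier, GradientBoostingClassifier):
    preds = Model(random_state=0).fit(X_tr, y_tr).predict(X_te)
    cm = confusion_matrix(y_te, preds)
    print(Model.__name__)
    print(cm)  # rows = actual labels, columns = predicted labels
```

Each row sums to that class's true count, so a dominant diagonal reads directly as per-class accuracy.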
Slide 22: Algorithm Comparison
We tested four algorithms:
● Random Forest, Gradient Boosting, Support Vector Classifier, and K-Nearest
Neighbors.
● Result: Random Forest delivered the highest accuracy, which will be detailed in the
results section.
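A comparison like this is commonly run with cross-validation; the data below is synthetic, so the exact ranking it prints need not match the project's reported results.

```python
# Sketch of comparing the four algorithms via 5-fold cross-validation;
# synthetic data, so the ranking is illustrative only.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, n_features=12,
                           n_informative=6, random_state=0)

models = {
    "Random Forest": RandomForestClassifier(random_state=0),
    "Gradient Boosting": GradientBoostingClassifier(random_state=0),
    "Support Vector Classifier": SVC(),
    "K-Nearest Neighbors": KNeighborsClassifier(),
}
scores = {name: cross_val_score(m, X, y, cv=5).mean()
          for name, m in models.items()}
for name, acc in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{name:>26s}: {acc:.3f}")
```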