CLASSIFICATION ALGORITHMS
PRESENTED BY
DVEIN INNOVATIONS
LOGISTIC REGRESSION
INTRODUCTION
• Despite its name, Logistic Regression is a classification algorithm, not a
regression algorithm.
• It is used when the target variable is categorical (e.g., Yes/No, 0/1, Churn/Not Churn).
• Logistic Regression predicts the probability that an input belongs to a
particular class using a sigmoid function (also called logistic function).
WHEN TO USE LOGISTIC REGRESSION?
Use Case Example → Target Outcome
• Predicting customer churn → Yes or No
• Email spam detection → Spam or Not Spam
• Medical diagnosis → Disease or No Disease
WORKING
Step 1: Linear Combination of Inputs
• It starts like Linear Regression, with a weighted sum of the inputs:
  z = w1·x1 + w2·x2 + … + wn·xn + b
• xi: input features
• wi: learned weights
• b: bias term
Step 2: Apply the Sigmoid Function
• This linear output z is passed into the sigmoid function to squash the value between 0 and 1:
  σ(z) = 1 / (1 + e^(−z))
This gives a probability value:
• Closer to 0 → class 0
• Closer to 1 → class 1
Step 3: Make Prediction
• Using a threshold (typically 0.5), the predicted probability is converted into a class label:
• If probability ≥ 0.5 → predict class 1
• Else → predict class 0
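A minimal NumPy sketch of Steps 1–3; the feature values, weights, and bias below are made up purely for illustration:

    import numpy as np

    def sigmoid(z):
        # Squash any real number into the (0, 1) range
        return 1.0 / (1.0 + np.exp(-z))

    x = np.array([1.5, -0.3])       # input features x1, x2 (made up)
    w = np.array([0.8, 2.0])        # learned weights w1, w2 (made up)
    b = -0.5                        # bias term (made up)

    z = np.dot(w, x) + b            # Step 1: linear combination
    p = sigmoid(z)                  # Step 2: probability between 0 and 1
    y_pred = 1 if p >= 0.5 else 0   # Step 3: threshold at 0.5

    print(f"z = {z:.3f}, p = {p:.3f}, predicted class = {y_pred}")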
Step 4: Model Training (How it Learns)
• Logistic Regression uses a loss function to measure how good its predictions are.
• The most common is Binary Cross-Entropy Loss:
  L = −[ y·log(ŷ) + (1 − y)·log(1 − ŷ) ]
Where:
• y: actual label (0 or 1)
• ŷ: predicted probability
• This loss is minimized using Gradient Descent, adjusting weights to reduce error over
time.
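A small sketch of Step 4: minimizing the binary cross-entropy loss with gradient descent. The toy dataset, learning rate, and iteration count are made up for illustration:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    # Toy data: one feature, binary labels (made up)
    X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])
    y = np.array([0, 0, 0, 1, 1, 1])

    w, b, lr = np.zeros(X.shape[1]), 0.0, 0.1

    for _ in range(2000):
        p = sigmoid(X @ w + b)            # Steps 1-2: predicted probabilities
        grad_w = X.T @ (p - y) / len(y)   # gradient of the loss w.r.t. the weights
        grad_b = np.mean(p - y)           # gradient of the loss w.r.t. the bias
        w -= lr * grad_w                  # gradient descent update
        b -= lr * grad_b

    print("learned weights:", w, "bias:", b)
    print("P(class 1 | x = 2.0):", sigmoid(np.array([2.0]) @ w + b))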
PRO’S & CON’S
ADVANTAGES LIMITATIONS
• Simple and interpretable • Assumes linear relationship between
• Fast to train features and the log-odds of the outcome
• Doesn’t work well
• Good for baseline models
with nonlinear patterns unless features
• Works well with linearly separable data are transformed
• Sensitive to outliers and correlated inputs
K-NEAREST NEIGHBORS
INTRODUCTION
• KNN (K-Nearest Neighbors) is a supervised learning algorithm used
for classification and regression (mostly classification).
• It classifies a new data point based on the majority class of its K closest
neighbors in the training data.
CORE IDEA
“Tell me who your neighbors are, and I’ll tell you who you are.”
• For a new data point, KNN:
• Looks at the K nearest data points (neighbors)
• Finds out their labels
• Predicts the most frequent label among them
WORKING
Step 1: Choose the value of K
• K = number of neighbors to consider (typically an odd number such as 3, 5, or 7, to avoid tied votes)
• Small K → sensitive to noise
• Large K → smoother decision boundary, but may ignore local structure
Step 2: Measure the Distance
• KNN computes distance between the test point and all training points.
Common distance measures:
• Euclidean Distance:
  d(a, b) = √( (a1 − b1)² + (a2 − b2)² + … + (an − bn)² )
• Others: Manhattan, Minkowski, Cosine
Step 3: Find the K Nearest Neighbors
• Sort all training points by distance to the test point
• Select the top K closest ones
Step 4: Vote for the Majority Class
• For classification: The class most represented among the K neighbors is selected.
• For regression: The average value of the K neighbors is taken.
EXAMPLE
Imagine this situation:
• You're classifying a flower based on petal length and width
• K=3
• Among 3 closest flowers:
• 2 are “Setosa”
• 1 is “Versicolor”
• Prediction: Setosa (majority vote)
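A short from-scratch sketch of the whole procedure (distance, nearest neighbors, majority vote). The training points loosely mimic the petal example above and are made up:

    import numpy as np
    from collections import Counter

    def knn_predict(X_train, y_train, x_new, k=3):
        # Step 2: Euclidean distance from the new point to every training point
        distances = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))
        # Step 3: indices of the k closest training points
        nearest = np.argsort(distances)[:k]
        # Step 4: majority vote among their labels
        return Counter(y_train[i] for i in nearest).most_common(1)[0][0]

    # Hypothetical training data: [petal length, petal width] in cm
    X_train = np.array([[1.4, 0.2], [1.3, 0.2], [1.5, 0.3], [4.7, 1.4], [4.5, 1.5]])
    y_train = ["Setosa", "Setosa", "Setosa", "Versicolor", "Versicolor"]

    print(knn_predict(X_train, y_train, np.array([1.6, 0.4]), k=3))  # -> Setosa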
PRO’S & CON’S
ADVANTAGES LIMITATIONS
• Simple and Intuitive • Slow prediction
• No Training Phase (Lazy Learner) • Curse of dimensionality
• Naturally Handles Multiclass • Sensitive to scale
Classification
• Sensitive to noisy data
• Non-Parametric Algorithm
• Adapts to New Data Easily
SUPPORT VECTOR MACHINE
INTRODUCTION
• Support Vector Machine (SVM) is a supervised machine learning
algorithm used for both classification and regression (mostly
classification).
• SVM finds the best boundary (hyperplane) that separates classes with the
maximum margin.
WHEN TO USE SVM
Use Case Example → Target Outcome
• Face Recognition → Person A or B
• Document Classification → Spam or Not Spam
• Cancer Detection → Malignant or Benign
CORE IDEA
• SVM tries to find a hyperplane (a line in 2D, a plane in 3D, or a surface in
higher dimensions) that best separates the classes in the dataset.
• Not just any separation — it wants the one with the widest margin between
the two classes.
WORKING
Step 1: Find a Decision Boundary (Hyperplane)
• A hyperplane is a decision boundary (a line in 2D, a plane or surface in higher dimensions) that divides the dataset into different classes.
• There are many possible hyperplanes, but SVM picks the one with the maximum margin.
Step 2: Maximize the Margin
• Margin = Distance between the hyperplane and the nearest data points
from each class.
• The points closest to the hyperplane are called Support Vectors.
• SVM chooses the hyperplane that maximizes this margin.
• Larger margin = better generalization to new data
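• In the standard formulation, the hyperplane is the set of points where w·x + b = 0, the support vectors lie on w·x + b = +1 and w·x + b = −1, and the margin width is 2 / ||w||, so maximizing the margin is equivalent to minimizing ||w||.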
Step 3: Handle Nonlinear Separation (Using Kernels)
• Not all data is linearly separable (i.e., can't be split by a straight line).
SVM solves this using a Kernel Trick:
• Maps data to a higher dimension where it is linearly separable.
Common kernels:
• Linear
• Polynomial
• Radial Basis Function (RBF) or Gaussian
Step 4: Regularization Parameter (C)
• SVM uses a parameter C to control the trade-off:
• High C → tries to classify everything correctly → low bias, high variance
(overfit)
• Low C → allows some misclassifications → high bias, low variance
(better generalization)
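A minimal scikit-learn sketch tying the steps together; the synthetic dataset and the kernel/C values are illustrative, not tuned:

    from sklearn.datasets import make_classification
    from sklearn.model_selection import train_test_split
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=200, n_features=5, random_state=42)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

    # RBF kernel handles nonlinear boundaries; C controls the margin/error trade-off
    model = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0, gamma="scale"))
    model.fit(X_train, y_train)
    print("test accuracy:", model.score(X_test, y_test))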
CHARACTERISTICS
• Type: Supervised, binary or multiclass
• Nature: Margin-based classifier
• Strength: Based on support vectors only
• Decision: Works well even in high-dimensional spaces
• Limitations: Slower on large datasets; sensitive to noise and parameter tuning
DECISION TREE
INTRODUCTION
• A Decision Tree is a supervised learning algorithm used for classification
and regression.
• It models decisions as a tree-like structure where each internal
node represents a test on a feature, branches represent outcomes of the test,
and leaf nodes represent class labels (for classification) or values (for
regression).
WHEN TO USE DECISION TREES
Use Case Example → Target Outcome
• Loan approval → Approve / Reject
• Disease diagnosis → Illness categories
• Customer segmentation → Group A / B / C
CORE IDEA
• Decision Trees split the dataset into subsets based on feature values that
best differentiate the target variable. This process continues recursively, forming
a tree where:
• Each node → tests a condition
• Each branch → outcome of a decision
• Each leaf → final prediction
WORKING
Step 1: Start at the Root Node
• Begin with the entire dataset.
• Pick the best feature to split the data.
Step 2: Choose the Best Feature to Split
• To decide which feature is "best", we use impurity measures like:
1. Gini Impurity:
   Gini = 1 − Σ pi²
   (where pi is the probability of class i in the node)
2. Entropy (used in Information Gain):
   Entropy = − Σ pi · log2(pi)
• The goal is to choose the split that maximizes information gain (i.e., gives the purest child nodes).
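A small sketch computing both impurity measures for a hypothetical node containing three "Yes" labels and one "No" label:

    import numpy as np

    def gini(labels):
        # Gini = 1 - sum(pi^2)
        _, counts = np.unique(labels, return_counts=True)
        p = counts / counts.sum()
        return 1.0 - np.sum(p ** 2)

    def entropy(labels):
        # Entropy = -sum(pi * log2(pi))
        _, counts = np.unique(labels, return_counts=True)
        p = counts / counts.sum()
        return -np.sum(p * np.log2(p))

    node = ["Yes", "Yes", "Yes", "No"]
    print(gini(node))     # 1 - (0.75^2 + 0.25^2) = 0.375
    print(entropy(node))  # -(0.75*log2(0.75) + 0.25*log2(0.25)) ≈ 0.811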
Step 3: Recursively Split the Dataset
For each child node, repeat the process:
• Recalculate Gini or Entropy
• Choose best split
• Continue until stopping conditions are met
Step 4: Define Stopping Criteria
• All records belong to one class
• No more features to split
• Max depth is reached
• Minimum number of samples in a node
Step 5: Make Predictions
• For new data: start at root → follow decision rules → reach a leaf → return the value or
class at the leaf.
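A minimal scikit-learn sketch of training a tree and printing its learned rules; the Iris dataset and the max_depth value are illustrative choices:

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier, export_text

    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

    # criterion can be "gini" or "entropy"; max_depth acts as a stopping criterion
    tree = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=42)
    tree.fit(X_train, y_train)

    print("test accuracy:", tree.score(X_test, y_test))
    print(export_text(tree))  # readable view of the learned decision rules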
PRO’S & CON’S
ADVANTAGES LIMITATIONS
• Simple to Understand and Interpret • Overfitting
• No Need for Feature Scaling • Unstable
• Handles Both Numeric and Categorical • Biased toward features with more levels
Features • Greedy algorithm
• Can Model Nonlinear Relationships
RANDOM FOREST
INTRODUCTION
• Random Forest is a supervised ensemble learning algorithm used for
both classification and regression.
• It builds multiple decision trees and combines their predictions to improve
accuracy and avoid overfitting.
WHEN TO USE RANDOM FOREST
Use Case Example → Target Outcome
• Fraud detection → Fraud / Not Fraud
• Customer churn prediction → Churn / Not Churn
• Medical diagnosis → Disease class
CORE IDEA
“Don’t trust one decision tree — take a vote from many.”
• A Random Forest creates many decision trees, each trained on a random
subset of data and features, and then takes the majority vote (for
classification) or average (for regression).
WORKING
Step 1: Create Bootstrapped Datasets
• From the original dataset, randomly sample (with replacement) to create
many subsets (called bootstrap samples).
• Each tree will be trained on a different bootstrapped dataset.
Step 2: Grow a Decision Tree on Each Sample
For each bootstrapped dataset:
• A decision tree is built.
• At each split in the tree, only a random subset of features is considered
— not all features.
• This adds diversity and reduces correlation between trees.
Step 3: Make Predictions
• For classification:
• Each tree gives a class prediction.
• The forest chooses the class with the majority vote.
• For regression:
• Each tree gives a numerical prediction.
• The forest returns the average of all predictions.
Step 4: Aggregate the Output
• This combination of trees:
• Reduces overfitting
• Increases robustness
• Improves overall accuracy
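A minimal scikit-learn sketch showing the knobs described above; the synthetic dataset and the n_estimators/max_features values are illustrative:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import train_test_split

    X, y = make_classification(n_samples=500, n_features=10, random_state=42)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

    # n_estimators = number of trees; max_features = random feature subset per split;
    # bootstrap=True trains each tree on a bootstrapped sample of the data
    forest = RandomForestClassifier(n_estimators=100, max_features="sqrt",
                                    bootstrap=True, random_state=42)
    forest.fit(X_train, y_train)
    print("test accuracy:", forest.score(X_test, y_test))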
CHARACTERISTICS
• Type: Ensemble, supervised learning
• Works for: Classification & regression
• Base model: Decision Tree
• Reduces: Overfitting, variance
• Increases: Accuracy, robustness
• Preprocessing needed: Very little
PRO’S & CON’S
LIMITATIONS ADVANTAGES
• Slower Predictions • High Accuracy and Performance
• Less Interpretable • Reduces Overfitting from
• Large Memory Usage Decision Trees
• Works Well with Missing and
• Overfitting (if too many trees
Imbalanced Data
without pruning)
• Robust to Noise and Outliers