Supervised Machine Learning Algorithms
UNIT-III
Types of Machine Learning (ML)
Machine learning algorithms help computer systems learn without being
explicitly programmed.
These algorithms are broadly categorized as supervised or unsupervised.
Supervised machine learning algorithms
These are the most commonly used machine learning algorithms.
They are called supervised because the process of an algorithm learning from the
training dataset can be thought of as a teacher supervising the learning
process.
It can be understood as follows −
Suppose we have an input variable X and an output variable Y, and we apply
an algorithm to learn the mapping function from input to output:
Y = f(X)
The main goal is to approximate the mapping function so well that when we have
new input data (X), we can predict the output variable (Y) for that data.
Supervised learning problems can mainly be divided into the following two kinds of
problems −
Classification − A problem is called a classification problem when the output is
categorical, such as “black”, “teaching”, “non-teaching”, etc.
Regression − A problem is called a regression problem when the output is a real
value, such as “distance”, “kilograms”, etc.
Decision tree, random forest, k-NN, and logistic regression are examples of supervised
machine learning algorithms.
1. Decision Tree
Decision Trees are a class of very powerful Machine Learning models capable of
achieving high accuracy in many tasks while remaining highly interpretable.
What makes decision trees special among ML models is the clarity with which they
represent information.
The “knowledge” learned by a decision tree through training is directly formulated
into a hierarchical structure.
This structure holds and displays the knowledge in such a way that it can easily be
understood, even by non-experts.
The decision tree algorithm falls under the category of supervised learning.
It can be used to solve both regression and classification problems.
A decision tree uses a tree representation to solve the problem: each leaf
node corresponds to a class label, and attributes are tested at the internal nodes of
the tree.
We can represent any Boolean function on discrete attributes using a decision tree.
The most notable decision tree algorithms are ID3, C4.5, and CART.
What is a Decision Tree?
A Decision Tree is a supervised learning algorithm used for both classification and
regression tasks.
It mimics human decision-making by breaking down a dataset into smaller subsets
based on feature values, forming a tree-like structure.
Structure of a Decision Tree-
A Decision Tree consists of:
• Root Node: The starting point that represents the entire dataset.
• Branches: Connections between nodes that show decision paths.
• Internal Nodes: Points where decisions are made based on feature values.
• Leaf Nodes: The final output or classification.
How Does a Decision Tree Work?
1. Splitting: The dataset is split based on feature values to
create pure subsets.
2. Attribute Selection: The best feature for splitting is chosen
using metrics like the following (see the sketch after this list):
1. Information Gain (based on Entropy)
2. Gini Index (used in the CART algorithm)
3. Recursive Partitioning: The process continues until a
stopping criterion is met.
4. Pruning: To prevent overfitting, unnecessary branches are
removed.
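Both attribute-selection metrics can be computed directly from the class counts in a subset. Below is a minimal Python sketch (the function names entropy and gini are illustrative, not from any particular library) showing how a candidate split is scored by information gain:

import math
from collections import Counter

def entropy(labels):
    # Shannon entropy of a list of class labels
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def gini(labels):
    # Gini impurity of a list of class labels
    total = len(labels)
    return 1 - sum((c / total) ** 2 for c in Counter(labels).values())

# Class labels before a split, and the two subsets produced by a candidate split
parent = ["Yes", "Yes", "No", "No", "No"]
left, right = ["Yes", "Yes"], ["No", "No", "No"]

# Information gain = parent entropy minus the weighted entropy of the children
gain = entropy(parent) - (len(left) / len(parent)) * entropy(left) \
                       - (len(right) / len(parent)) * entropy(right)
print(round(entropy(parent), 3), round(gini(parent), 3), round(gain, 3))  # 0.971 0.48 0.971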
Example of a Decision Tree
Imagine predicting whether a customer will buy a product based on income,
age, and previous purchases:
1. Root Node (Income): "Is the person’s income greater than $50,000?"
1. If Yes, proceed to the next question.
2. If No, predict "No Purchase" (leaf node).
2. Internal Node (Age): "Is the person’s age above 30?"
1. If Yes, proceed to the next question.
2. If No, predict "No Purchase" (leaf node).
3. Internal Node (Previous Purchases): "Has the person made previous
purchases?"
1. If Yes, predict "Purchase" (leaf node).
2. If No, predict "No Purchase" (leaf node).
Advantages of Decision Trees
• Easy to interpret and visualize
• Handles both numerical and categorical data
• Requires minimal data preprocessing
• Works well with missing values
Disadvantages of Decision Trees
• Prone to overfitting if not pruned properly
• Can be biased toward dominant features
• Sensitive to noisy data
Example: Predicting Whether a Customer Will Buy a Product
Imagine we want to predict whether a customer will buy a product based on age,
income, and previous purchases.
Step 1: Dataset
Age | Income | Previous Purchases | Buy Product?
25  | Low    | No                 | No
45  | High   | Yes                | Yes
35  | Medium | No                 | No
50  | High   | Yes                | Yes
30  | Low    | Yes                | No
Step 2: Building the Decision Tree
1. Root Node (Income): "Is the person’s income High?"
• If Yes, proceed to the next question.
• If No, predict "No Purchase" (leaf node).
2. Internal Node (Age): "Is the person’s age above 40?"
• If Yes, proceed to the next question.
• If No, predict "No Purchase" (leaf node).
3. Internal Node (Previous Purchases): "Has the person made
previous purchases?"
• If Yes, predict "Purchase" (leaf node).
• If No, predict "No Purchase" (leaf node).
Step 3: Making Predictions
• If a new customer is 45 years old, has High income, and has made previous
purchases, the model predicts "Purchase".
• If a customer is 30 years old, has Low income, and has not made previous
purchases, the model predicts "No Purchase".
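A minimal scikit-learn sketch of this toy example is shown below; the numeric encoding of Income and previous purchases (and the column names) are assumptions made for illustration only.

import pandas as pd
from sklearn.tree import DecisionTreeClassifier

# Toy dataset from Step 1 (Income encoded as Low=0, Medium=1, High=2; Yes/No as 1/0)
data = pd.DataFrame({
    "Age":       [25, 45, 35, 50, 30],
    "Income":    [0, 2, 1, 2, 0],
    "PrevPurch": [0, 1, 0, 1, 1],
    "Buy":       [0, 1, 0, 1, 0],
})
X, y = data[["Age", "Income", "PrevPurch"]], data["Buy"]

# Fit a small tree; limiting max_depth is a simple way to keep it from overfitting
model = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

# New customer: 45 years old, High income, has made previous purchases
print(model.predict(pd.DataFrame([[45, 2, 1]], columns=X.columns)))  # expected: [1] -> "Purchase"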
2. Naive Bayes
Naive Bayes is a probabilistic classification algorithm based on Bayes' theorem,
which calculates the probability of a sample belonging to a particular class given
the probabilities of its features.
It is called "naive" because it assumes that all features are independent of each
other, which is often not the case in real-world scenarios.
It is widely used in machine learning due to its simplicity and efficiency.
Understanding Bayes' Theorem
Bayes' theorem states:
P(A|B) = [ P(B|A) × P(A) ] / P(B)
Where:
• P(A|B) is the posterior probability (the probability of class A given feature B).
• P(B|A) is the likelihood (the probability of feature B given class A).
• P(A) is the prior probability (the initial probability of class A).
• P(B) is the probability of feature B (the evidence).
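As a quick worked illustration (the numbers are invented for the example), suppose 30% of emails are spam, the word "free" appears in 80% of spam emails, and "free" appears in 35% of all emails:

# Hypothetical numbers, chosen only to illustrate Bayes' theorem
p_spam = 0.30              # P(A): prior probability of spam
p_free_given_spam = 0.80   # P(B|A): likelihood of the word "free" given spam
p_free = 0.35              # P(B): overall probability of the word "free"

# Posterior P(spam | "free") = P("free" | spam) * P(spam) / P("free")
p_spam_given_free = p_free_given_spam * p_spam / p_free
print(round(p_spam_given_free, 2))  # ≈ 0.69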
Why is it "Naive"?
• The algorithm assumes that all features are independent, meaning the presence
of one feature does not affect another.
• While this assumption is often unrealistic, Naive Bayes still performs well in
many applications.
Types of Naive Bayes Classifiers
1. Gaussian Naive Bayes: Used when features follow a normal distribution.
2. Multinomial Naive Bayes: Commonly used in text classification (e.g., spam
filtering).
3. Bernoulli Naive Bayes: Suitable for binary feature data (e.g., sentiment
analysis).
Applications
• Spam Filtering: Classifies emails as spam or not.
• Sentiment Analysis: Determines whether a review is positive or negative.
• Medical Diagnosis: Predicts diseases based on symptoms.
• Text Classification: Categorizes documents into predefined classes.
Advantages
• Fast & Efficient: Works well with large datasets.
• Requires Minimal Training Data: Performs well even with small datasets.
• Handles High-Dimensional Data: Useful for text classification.
Limitations
• Feature Independence Assumption: Not always realistic.
• Zero Probability Problem: If a feature never appears in the training data, it gets
assigned zero probability.
• Sensitive to Data Quality: Requires good feature selection for optimal
performance.
Example: Spam Detection
We'll classify messages as spam or not spam using Naive Bayes.
Step 1: Install Required Libraries
pip install scikit-learn pandas
Step 2: Import Libraries
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.model_selection import train_test_split
Step 3: Prepare Data
Step 4: Convert Text to Numerical Data
Step 5: Split Data into Training & Testing Sets
Step 6: Train Naive Bayes Model
Step 7: Make Predictions
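The bodies of Steps 3–7 are not shown above; the following is a minimal sketch of what they might look like (it continues from the imports in Step 2; the sample messages and variable names are assumptions for illustration).

# Step 3: Prepare data (tiny hand-made sample; 1 = spam, 0 = not spam)
df = pd.DataFrame({
    "message": ["Win a free prize now", "Limited offer, click here",
                "Are we meeting tomorrow?", "Please review the attached report",
                "Free entry in a weekly draw", "Lunch at noon?"],
    "label":   [1, 1, 0, 0, 1, 0],
})

# Step 4: Convert text to numerical word-count features
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(df["message"])
y = df["label"]

# Step 5: Split data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=42)

# Step 6: Train the Naive Bayes model
model = MultinomialNB()
model.fit(X_train, y_train)

# Step 7: Make predictions on new messages
new_msgs = vectorizer.transform(["Free prize waiting for you", "See you at the meeting"])
print(model.predict(new_msgs))  # e.g. [1 0] -> spam, not spam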
This will classify the sample messages as spam or not spam based
on the trained model.
3. Support Vector Machine (SVM)
SVM (Support Vector Machine) is a supervised algorithm, effective for
both regression and classification, though it excels in classification
tasks.
Popular since the 1990s, it performs well on smaller or complex
datasets with minimal tuning.
What is a Support Vector Machine (SVM)?
A Support Vector Machine (SVM) is a machine learning
algorithm used for classification and regression.
It finds the best line (or hyperplane) to separate data into groups,
maximizing the distance between the closest points (support vectors)
of each group.
Types of Support Vector Machine (SVM) Algorithms
• Linear SVM: Linear SVM can be used only when the data is perfectly
linearly separable.
• Perfectly linearly separable means that the data points can be classified
into 2 classes using a single straight line (if 2D).
• Non-Linear SVM: When the data is not linearly separable, we can use
Non-Linear SVM.
• This happens when the data points cannot be separated into two classes
using a straight line (if 2D).
• In such cases, we use advanced techniques like kernel tricks to classify
them.
How Does Support Vector Machine Algorithm Work?
SVM is defined in terms of the support vectors only: we do not
have to worry about other observations, because the margin is
made using the points which are closest to the hyperplane
(the support vectors), whereas in logistic regression the
classifier is defined over all the points.
Let’s understand the working of SVM using an example.
To classify these points, we can have many decision
boundaries, but the question is which is the best and
how do we find it?
NOTE: Since we are plotting the data points in a 2-dimensional graph
we call this decision boundary a straight line but if we have more
dimensions, we call this decision boundary a “hyperplane”
This diagram illustrates:
• Hyperplane: The decision boundary that separates different classes.
• Support Vectors: The closest data points to the hyperplane, which
influence its position.
• Margin: The distance between the hyperplane and the nearest
support vectors.
• A larger margin improves classification accuracy.
Advantages of Support Vector Machine
1.Works well with complex data: SVM is great for datasets
where the separation between categories is not clear. It can
handle both linear and non-linear data effectively.
2.Effective in high-dimensional spaces: SVM performs
well even when there are more features (dimensions) than
samples, making it useful for tasks like text classification or
image recognition.
3.Avoids overfitting: SVM focuses on finding the best decision
boundary (margin) between classes, which helps in reducing the risk
of overfitting, especially in high-dimensional data.
4.Versatile with kernels: By using different kernel functions (like
linear, polynomial, or radial basis function), SVM can adapt to various
types of data and solve complex problems.
5.Robust to outliers: SVM is less affected by outliers because it
focuses on the support vectors (data points closest to the margin),
which helps in creating a more generalized model.
Disadvantages of Support Vector Machine
1.Slow with large datasets: SVM can be computationally
expensive and slow to train, especially when the dataset is very
large.
2.Difficult to tune: Choosing the right kernel and parameters
(like C and gamma) can be tricky and often requires a lot of trial
and error.
3.Not suitable for noisy data: If the dataset has too many
overlapping classes or noise, SVM may struggle to perform well
because it tries to find a perfect separation.
4.Hard to interpret: Unlike some other algorithms, SVM models
are not easy to interpret or explain, especially when using
non-linear kernels.
How SVMs Solve Classification Problems
1. Finding the Best Hyperplane: SVM identifies a hyperplane that maximizes
the margin between different classes.
The larger the margin, the better the generalization.
2. Support Vectors: These are the data points closest to the hyperplane and
play a crucial role in defining its position.
3. Kernel Trick: If data is not linearly separable, SVM uses kernel functions to
map it into a higher-dimensional space where separation becomes possible.
Common kernels include:
• Linear Kernel (for simple separable data)
• Polynomial Kernel (for complex boundaries)
• Radial Basis Function (RBF) Kernel (for intricate patterns)
Real-World Applications
• Spam Detection: Classifying emails as spam or not spam.
• Handwritten Digit Recognition: Identifying numbers from
images.
• Medical Diagnosis: Predicting whether a tumor is benign or
malignant.
• Stock Market Prediction: Classifying stocks based on trends.
Support Vector Machines-
Hyperplane
A hyperplane is a decision boundary which separates between given set of data points
having different class labels.
The SVM classifier separates data points using a hyperplane with the maximum amount
of margin.
This hyperplane is known as the maximum margin hyperplane and the linear classifier it
defines is known as the maximum margin classifier.
Support Vectors
Support vectors are the sample data points that are closest to the hyperplane. These
data points help define the separating line or hyperplane by determining the margin.
Margin
A margin is the separation gap between the two parallel lines drawn through the
closest data points of each class.
It is calculated as the perpendicular distance from the line to the support vectors
(the closest data points).
In SVMs, we try to maximize this separation gap so that we get maximum margin.
The following diagram illustrates these concepts visually (diagram: Margin in SVM).
Linear Support Vector Machine (SVM) is a type of SVM used when data is linearly
separable, meaning it can be divided using a straight line (or hyperplane in higher
dimensions).
How Linear SVM Works
1. Finds the Best Hyperplane: It identifies the optimal boundary that separates
different classes while maximizing the margin between them.
2. Uses Support Vectors: The closest data points to the hyperplane influence its
position and help define the separation.
3. Maximizes Margin: A larger margin improves classification accuracy and
generalization.
Example of Linear SVM
Imagine classifying emails as spam or not spam based on word frequency.
If spam emails contain words like "free," "win," or "offer" more frequently, Linear
SVM can draw a straight boundary separating spam from non-spam emails.
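A minimal scikit-learn sketch of this idea (the example messages are made up for illustration):

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

# Tiny made-up corpus: 1 = spam, 0 = not spam
messages = ["free offer win now", "win a free prize",
            "project meeting at ten", "see the agenda attached"]
labels = [1, 1, 0, 0]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(messages)   # word-count features

clf = LinearSVC()                        # linear SVM: a straight separating boundary
clf.fit(X, labels)
print(clf.predict(vectorizer.transform(["claim your free offer"])))  # e.g. [1] -> spam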
A Non-Linear Support Vector Machine (SVM) is used when data cannot be separated
by a straight line.
Instead of finding a simple boundary, it uses a technique called the kernel trick to
transform data into a higher-dimensional space where separation becomes
possible.
How Non-Linear SVM Works
1. Data Transformation: If the data is scattered in a way that a straight line cannot
separate it, SVM applies a mathematical function (kernel) to map it into a higher
dimension.
2. Kernel Trick: Instead of manually transforming the data, SVM uses kernel functions
to make separation easier. Common kernels include:
o Radial Basis Function (RBF): Helps separate circular or complex patterns.
o Polynomial Kernel: Useful for curved boundaries.
o Sigmoid Kernel: Mimics neural networks for specific cases.
3. Finding the Best Boundary: Once transformed, SVM finds the optimal hyperplane
that separates different classes.
Example
Imagine classifying red and blue dots arranged in a circular pattern.
A straight line cannot separate them, but using an RBF kernel, SVM
transforms the data into a higher dimension where a clear boundary can
be drawn.
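A minimal sketch of this circular-pattern case, using scikit-learn's make_circles data generator and an RBF-kernel SVM (the dataset and parameter values are illustrative choices):

from sklearn.datasets import make_circles
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split

# Two classes arranged as concentric circles: not separable by a straight line
X, y = make_circles(n_samples=300, factor=0.4, noise=0.05, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The RBF kernel implicitly maps the points into a higher-dimensional space
clf = SVC(kernel="rbf", gamma="scale").fit(X_train, y_train)
print("Test accuracy:", round(clf.score(X_test, y_test), 3))  # typically close to 1.0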
4. Random Forest
The Random Forest algorithm is a powerful machine learning technique that
combines multiple decision trees to improve accuracy and reduce overfitting.
It works by creating many decision trees, each trained on a random subset of
the data, and then aggregating their predictions through majority voting (for
classification) or averaging (for regression).
Key Features:
• Handles Missing Data: Works even if some data is missing.
• Feature Importance: Identifies the most useful features for predictions.
• Versatile: Used for both classification (predicting categories) and regression
(predicting numerical values).
• Robust to Overfitting: Since it averages multiple trees, it avoids overfitting
better than individual decision trees.
How Random Forest Works-
1. Bootstrapping (Random Sampling): The algorithm selects random
subsets of the training data to build multiple decision trees.
2. Feature Selection: Each tree is trained on a random subset of features,
ensuring diversity among trees.
3. Decision Trees Construction: Each tree learns patterns from its
subset of data and makes predictions.
4. Aggregation of Predictions:
• Classification: The final prediction is determined by majority voting
among all trees.
• Regression: The final prediction is the average of all tree predictions.
Advantages of Random Forest
• Handles Missing Data: Works well even if some data is missing.
• Reduces Overfitting: Since multiple trees are used, it avoids overfitting better
than a single decision tree.
• Feature Importance: Identifies the most influential features in the dataset.
• Works Well with Large Datasets: Can handle high-dimensional data efficiently.
Applications of Random Forest
• Medical Diagnosis: Used to predict diseases based on patient data.
• Fraud Detection: Helps banks and financial institutions detect fraudulent
transactions.
• Stock Market Prediction: Used to analyze trends and predict stock prices.
• Customer Churn Prediction: Businesses use it to identify customers likely to leave.
A typical Random Forest diagram shows:
• Multiple decision trees trained on different parts of the dataset.
• Each tree making its own prediction.
• The final result being determined by majority voting (for
classification) or averaging (for regression).
Random Forest is a machine learning algorithm that builds multiple
decision trees and combines their predictions to improve accuracy.
1. Data Splitting: The algorithm randomly selects different parts of the
dataset to create multiple decision trees.
2. Tree Building: Each tree learns patterns from its subset of data and
makes predictions.
3. Voting/Averaging:
o For classification, the final result is based on majority voting
(the most common prediction among trees).
o For regression, the final result is the average of all tree
predictions.
4. Final Prediction: The combined result from all trees gives a more
accurate and stable prediction.
Step-by-Step Implementation
1. Import Libraries
2. Load Dataset
3. Split Data into Training and Testing Sets
4. Train the Random Forest Model
5. Make Predictions
6. Evaluate Model Performance
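A minimal sketch of these six steps using scikit-learn and its built-in Iris dataset (the dataset choice and parameter values are assumptions made for illustration):

# 1. Import libraries
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# 2. Load dataset
X, y = load_iris(return_X_y=True)

# 3. Split data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# 4. Train the Random Forest model (100 trees, each on a bootstrap sample of the data)
model = RandomForestClassifier(n_estimators=100, random_state=42)
model.fit(X_train, y_train)

# 5. Make predictions
y_pred = model.predict(X_test)

# 6. Evaluate model performance
print("Accuracy:", accuracy_score(y_test, y_pred))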
5. Linear Regression
• Linear regression is a statistical regression method which is used for
predictive analysis.
• It is one of the simplest and easiest algorithms; it works on
regression and shows the relationship between continuous
variables.
• It is used for solving regression problems in machine learning.
• Linear regression shows the linear relationship between the
independent variable (X-axis) and the dependent variable (Y-axis),
hence the name linear regression.
Types of Linear Regression
Linear regression is of the following two types −
• Simple linear regression − A linear regression algorithm is called
simple linear regression if it is having only one independent
variable.
• Multiple linear regression − A linear regression algorithm is
called multiple linear regression if it is having more than one
independent variable.
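In equation form, simple linear regression fits Y = b0 + b1·X, while multiple linear
regression fits Y = b0 + b1·X1 + b2·X2 + … + bn·Xn, where b0 is the intercept and
b1…bn are the coefficients learned from the training data.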
Linear regression is mainly used to estimate the real values based on
continuous variable(s). For example, the total sale of a shop in a day, based on
real values, can be estimated by linear regression.
Advantages of Linear Regression
• Interpretable & Simple: Easy to understand and explain
• Efficient: Works well for datasets with linear relationships
• Fast Training: Computationally inexpensive
Python Implementation Example
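A minimal scikit-learn sketch (the daily-sales numbers below are made up for illustration):

import numpy as np
from sklearn.linear_model import LinearRegression

# Made-up data: number of customers in a day vs. total sales of the shop
X = np.array([[10], [20], [30], [40], [50]])   # independent variable
y = np.array([120, 230, 310, 410, 500])        # dependent variable (total sales)

model = LinearRegression().fit(X, y)

print("Intercept (b0):", round(model.intercept_, 2))
print("Coefficient (b1):", round(model.coef_[0], 2))
print("Predicted sales for 60 customers:", round(model.predict([[60]])[0], 2))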
6. Logistic Regression
• It is a classification algorithm, also known as logit regression.
• Mainly, logistic regression is a classification algorithm used to estimate discrete
values like 0 or 1, true or false, yes or no, based on a given set of independent variables.
Basically, it predicts a probability, hence its output lies between 0 and 1.
• It helps predict whether something belongs to one group or another, like spam vs. not
spam, sick vs. healthy, or pass vs. fail.
• Instead of predicting a number, it calculates the probability that something belongs to a
certain category.
• It uses a special mathematical function called the sigmoid function to keep values
between 0 and 1, representing probability.
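The sigmoid function is σ(z) = 1 / (1 + e^(−z)); a quick sketch of how it maps any real number into the (0, 1) range:

import numpy as np

def sigmoid(z):
    # Squashes any real number into (0, 1), so the output can be read as a probability
    return 1 / (1 + np.exp(-z))

print(sigmoid(-3), sigmoid(0), sigmoid(3))  # ≈ 0.047, 0.5, 0.953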
Example: Predicting if a Student Passes an Exam
Imagine you are a teacher who wants to predict whether a student will pass or
fail based on their study hours.
•Independent Variable (X): Hours spent studying
•Dependent Variable (Y): Pass (1) or Fail (0)
How Logistic Regression Works Here
•If a student studies for many hours, their probability of passing is high.
•If a student studies little, their probability of passing is low.
•The logistic regression model calculates this probability and assigns a
result:
•If probability > 0.5, the student is predicted to pass (1)
•If probability < 0.5, the student is predicted to fail (0)
Example Prediction
Let’s assume the model gives the following results:
Study Hours | Probability of Passing | Predicted Outcome
2 hours     | 0.30                   | Fail (0)
5 hours     | 0.70                   | Pass (1)
8 hours     | 0.90                   | Pass (1)
So, a student studying for 8 hours has a 90% chance of passing
according to the logistic regression model.
This is a simple example of how logistic regression helps make binary
classifications based on probabilities.
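A minimal scikit-learn sketch of this study-hours example (the training data is made up so that the fitted probabilities roughly match the table above):

import numpy as np
from sklearn.linear_model import LogisticRegression

# Made-up training data: hours studied -> pass (1) / fail (0)
X = np.array([[1], [2], [3], [4], [5], [6], [7], [8]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

model = LogisticRegression().fit(X, y)

# Probability of passing for 2, 5 and 8 hours of study
print(np.round(model.predict_proba([[2], [5], [8]])[:, 1], 2))  # probabilities of passing
print(model.predict([[2], [5], [8]]))                           # predicted outcomes (0 = fail, 1 = pass)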