UNIT-I
1 The Ingredients of Machine Learning:
1.1 Introduction to Machine Learning
1.2 Types of Machine Learning Models
1.3 The output of Machine Learning
2 Binary Classification and related tasks:
2.1 Classification
2.2 Calculating accuracy in classification.
1.1 Introduction to Machine Learning
What is Machine Learning?
• Machine learning is an application of artificial intelligence in which algorithms analyse data and make
decisions automatically, without explicit human intervention.
• It describes how computers perform tasks on their own by learning from previous experience.
• In machine learning, therefore, intelligent behaviour is generated on the basis of
experience.
How Machine Learning Works
Consider a system with input data that contains photos of various kinds of fruits. You want the system to
group the data according to the different types of fruits.
First, the system will analyze the input data. Next, it tries to find patterns, like shapes, size, and color.
Based on these patterns, the system will try to predict the different types of fruit and segregate them.
Finally, it keeps track of all the decisions it made during the process to ensure it is learning. The next
time you ask the same system to predict and segregate the different types of fruits, it won't have to go
through the entire process again. That's how machine learning works.
1.2 Types of Machine Learning
Supervised machine learning: You supervise the machine while training it to
work on its own. This requires labeled training data.
Unsupervised learning: There is training data, but it won't be labeled.
Reinforcement learning: The system learns on its own through trial and error, using feedback (rewards and penalties) on its actions.
Supervised Learning :
To understand how supervised learning works, look at the example below, where you
have to train a model or system to recognize an apple.
First, you have to provide a data set that contains pictures of a kind of fruit, e.g.,
apples.
Then, provide another data set that lets the model know that these are pictures of
apples. This completes the training phase.
Next, provide a new set of data that only contains pictures of apples. At this point, the
system can recognize what fruit it is and will remember it.
That's how supervised learning works. You are training the model to perform a specific
operation on its own. This kind of model is often used in filtering spam mail from your
email accounts.
Here are some different supervised learning algorithms:
Decision trees: Uses labeled data to train and predict on unlabeled data
Random forest: Combines multiple decision trees to create a more accurate prediction
Support vector machines (SVM): A well-known supervised learning algorithm with
many applications and variations
Logistic regression: A classification algorithm that uses a model to determine the
probability of an event
Linear regression: Learns a linear relationship between inputs and a continuous output
from training data with correct outputs
Naive Bayes: A supervised learning technique based on Bayes' Theorem
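As a minimal illustration of supervised learning, the sketch below trains a 1-nearest-neighbour classifier (the simplest member of the k-Nearest Neighbors family listed above) on a few labeled fruit measurements. The feature values (weight in grams, diameter in cm) and labels are invented for illustration, not real data.

```python
# A minimal 1-nearest-neighbour classifier: predict the label of the
# closest training example (squared Euclidean distance).
def predict_1nn(train, x):
    """train: list of ((features), label) pairs; x: a feature tuple."""
    def dist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    nearest = min(train, key=lambda pair: dist(pair[0], x))
    return nearest[1]

# Labeled training data -- this is the "supervision".
train = [((150, 7), "apple"), ((160, 8), "apple"),
         ((300, 12), "mango"), ((320, 13), "mango")]

print(predict_1nn(train, (155, 7)))   # closest to the apples
print(predict_1nn(train, (310, 12)))  # closest to the mangoes
```

The labels in the training set play exactly the role of the "second data set" in the apple example above: they tell the model what each picture (here, each measurement) is.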
Unsupervised Learning
Consider an unlabeled dataset: a collection of pictures of different fruits. You feed this data
to the model, and the model analyzes it to find patterns. In the end, the
machine categorizes the photos into groups based on their
similarities. Flipkart uses this kind of model to find and recommend products that are well suited
for you.
Types of Unsupervised Learning:
2.1 Clustering
2.2 Dimensionality Reduction
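A rough sketch of what clustering does, using a tiny hand-rolled k-means on one-dimensional data; the values are invented and the initialisation is deliberately simple (it only works for k = 2).

```python
# Tiny k-means for 1-D data: repeatedly assign each point to the
# nearest centre, then move each centre to the mean of its points.
def kmeans_1d(points, k=2, iters=10):
    centers = [min(points), max(points)]  # simple initialisation for k=2
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda c: abs(p - centers[c]))
            clusters[i].append(p)
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers, clusters

# Two obvious groups of "fruit sizes"; the algorithm finds them
# without being given any labels.
centers, clusters = kmeans_1d([1, 2, 3, 10, 11, 12])
print(centers)   # [2.0, 11.0]
print(clusters)  # [[1, 2, 3], [10, 11, 12]]
```

Note that, unlike the supervised example, no labels were supplied: the grouping emerges purely from the similarity of the data points.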
Reinforcement Learning
You provide a machine with a data set and ask it to identify a particular kind of fruit (in
this case, an apple). The machine tells you that it's a mango, but that's the wrong answer.
As feedback, you tell the system that it's wrong; it's not a mango, it's an apple. The
machine then learns from the feedback and keeps that in mind. The next time you ask
the same question, the system gives you the right answer; it is able to tell you that it’s
an apple. That is a reinforced response.
That's how reinforcement learning works; the system learns from its mistakes and
experiences. This model is used in games like Prince of Persia, Assassin’s Creed, and
FIFA, wherein the level of difficulty increases as you get better with the games.
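The feedback loop described above can be caricatured in a few lines: a toy "agent" keeps a score for each possible answer, guesses the answer with the highest score, and is rewarded or penalised until it settles on the right one. This is only a sketch of the reward idea, not a real reinforcement learning algorithm.

```python
# Toy reward loop: the agent's belief starts out wrong (it favours
# "mango"), and negative feedback steers it towards "apple".
scores = {"mango": 1, "apple": 0}    # the agent initially prefers mango
true_label = "apple"

for step in range(5):
    guess = max(scores, key=scores.get)        # act on current belief
    reward = 1 if guess == true_label else -1  # feedback from the teacher
    scores[guess] += reward                    # learn from the feedback

print(max(scores, key=scores.get))  # the agent now answers "apple"
```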
Output of machine learning:
The output of machine learning is a machine learning model, which is a computer
program that contains data and guidelines for making predictions. The model is created
by a machine learning algorithm that analyzes data to find patterns and make
predictions.
The output of a machine learning model depends on the type of learning:
Supervised learning: The model output is a predicted target value for a given input.
Unsupervised learning: The model output may include cluster assignments or other
learned patterns in the data.
Machine learning models can be used to perform classification and prediction tasks on
various types of data, including documents, images, and numbers. For example, a
financial institution might use a machine learning model to classify transactions as
fraudulent or genuine.
Machine learning models can be very accurate, but they are only as accurate as the
data used to train them. The data should be clean, unbiased, and representative of
different scenarios.
CLASSIFICATION:
Definition of Classification
In machine learning, Classification, as the name suggests, assigns data into different parts/classes/groups. It is
used to predict which class the input data belongs to.
Classification is the process of assigning new input variables (X) to the class they most likely belong to, based on a
classification model constructed from previously labeled training data.
Data with labels is used to train a classifier so that it can perform well on data without labels (not yet labeled). This
process of repeated classification into previously known classes trains the machine. The classes in classification are
discrete; if the target variable is continuous, the task is regression rather than classification.
Types of Classification
There are two types of classifications
Binary classification
Multi-class classification
Binary Classification
It is a classification task in which the given data is classified into two classes. It is basically a kind
of prediction about which of two groups a thing belongs to.
Examples include:
Email spam detection (spam or not).
Churn prediction (churn or not). [Churn is a measure of the percentage of accounts that cancel or choose
not to renew their subscriptions.]
Conversion prediction (buy or not).
Typically, binary classification tasks involve one class that is the normal state and another class that is the abnormal
state.
For example “not spam” is the normal state and “spam” is the abnormal state. Another example is “cancer not
detected” is the normal state of a task that involves a medical test and “cancer detected” is the abnormal state.
The class for the normal state is assigned the class label 0 and the class with the abnormal state is assigned the
class label 1.
Let us suppose, two emails are sent to you, one is sent by an insurance company that keeps sending their ads, and
the other is from your bank regarding your credit card bill. The email service provider will classify the two emails, the
first one will be sent to the spam folder and the second one will be kept in the primary one.
This process is known as binary classification, as there are two discrete classes, one is spam and the other is
primary. So, this is a problem of binary classification.
Binary classification uses some algorithms to do the task, some of the most common
algorithms used by binary classification are
Logistic Regression
k-Nearest Neighbors
Decision Trees
Support Vector Machine
Naive Bayes
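To make the list concrete, here is a hand-rolled one-feature Gaussian Naive Bayes (one of the algorithms above) applied to an invented spam signal, the number of exclamation marks in an email. The training numbers are illustrative only, not real data.

```python
import math

# One-feature Gaussian Naive Bayes: model each class (0 = not spam,
# 1 = spam) as a normal distribution over the feature, then predict
# the class with the higher posterior log-likelihood.
def fit(xs, ys):
    params = {}
    for c in set(ys):
        vals = [x for x, y in zip(xs, ys) if y == c]
        mu = sum(vals) / len(vals)
        var = sum((v - mu) ** 2 for v in vals) / len(vals) or 1e-9
        params[c] = (mu, var, len(vals) / len(xs))  # mean, variance, prior
    return params

def predict(params, x):
    def log_posterior(c):
        mu, var, prior = params[c]
        return (math.log(prior)
                - 0.5 * math.log(2 * math.pi * var)
                - (x - mu) ** 2 / (2 * var))
    return max(params, key=log_posterior)

# Hypothetical training data: exclamation-mark counts and spam labels.
params = fit([0, 1, 1, 8, 9, 10], [0, 0, 0, 1, 1, 1])
print(predict(params, 7))  # many exclamation marks -> spam (1)
print(predict(params, 2))  # few -> not spam (0)
```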
Binary classification vs. Multi-class classification

No. of classes:
Binary classification: It is a classification into two groups, i.e. it classifies objects into at most two classes.
Multi-class classification: There can be any number of classes, i.e. it classifies the object into more than two classes.

Algorithms used:
Binary classification: Logistic Regression, k-Nearest Neighbors, Decision Trees, Support Vector Machine, Naive Bayes.
Multi-class classification: k-Nearest Neighbors, Decision Trees, Naive Bayes, Random Forest, Gradient Boosting.

Examples:
Binary classification: email spam detection (spam or not), churn prediction (churn or not), conversion prediction (buy or not).
Multi-class classification: face classification, plant species classification, optical character recognition.
Evaluation of binary classifiers
Consider a binary classifier that predicts whether patients are diseased (positive) or healthy (negative).
If the model correctly predicts a diseased patient as positive, this case is called True Positive (TP).
If the model correctly predicts a healthy patient as negative, this is called True Negative (TN).
The binary classifier may misdiagnose some patients as well. If a diseased patient is classified as healthy by a
negative test result, this error is called False Negative (FN).
Similarly, if a healthy patient is classified as diseased by a positive test result, this error is called False Positive (FP).
We can evaluate a binary classifier based on the following parameters:
True Positive (TP): The patient is diseased and the model predicts "diseased"
False Positive (FP): The patient is healthy but the model predicts "diseased"
True Negative (TN): The patient is healthy and the model predicts "healthy"
False Negative (FN): The patient is diseased and the model predicts "healthy"
These four outcomes are usually arranged in a confusion matrix, with the actual classes as rows and the predicted classes as columns.
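The four counts can be computed directly from a list of actual and predicted labels; the labels below are a made-up diagnosis example (1 = diseased/positive, 0 = healthy/negative).

```python
# Count TP, FP, TN, FN by comparing actual and predicted labels pairwise.
def confusion_counts(actual, predicted):
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    return tp, fp, tn, fn

actual    = [1, 1, 1, 0, 0, 0, 1, 0]
predicted = [1, 1, 0, 0, 0, 1, 1, 0]
print(confusion_counts(actual, predicted))  # (3, 1, 3, 1)
```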
CALCULATING ACCURACY IN CLASSIFICATION
Accuracy is perhaps the best-known Machine Learning model validation method used in classification
problems. One reason for its popularity is its relative simplicity. It is easy to understand and easy to
implement. Accuracy is a good metric to assess model performance for simple cases.
What is Accuracy?
Accuracy is a metric used in classification problems to tell the percentage of accurate predictions.
We calculate it by dividing the number of correct predictions by the total number of predictions.
In the binary classification case, we can express accuracy in terms of the True Positive / False Positive /
True Negative / False Negative counts:
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Where
TP : True Positives
FP : False Positives
TN : True Negatives
FN : False Negatives
What is Precision?
Precision is defined as the ratio of correctly classified positive samples (True Positives) to the total
number of samples classified as positive (whether correctly or incorrectly).
or
The percentage of correct predictions for the positive class
Precision = True Positive / (True Positive + False Positive)
Precision = TP / (TP + FP)   (or)
Precision = TP / Predicted(positive)
Hence, precision tells us how reliable the machine learning model is when it
classifies a sample as positive.
What is Recall?
Recall is calculated as the ratio of the number of positive samples correctly classified as
positive to the total number of positive samples. Recall measures the model's ability to
detect positive samples: the higher the recall, the more positive samples are detected.
Or
The percentage of actual positive class samples that were identified by the model
Recall = True Positive / (True Positive + False Negative)
Recall = TP / (TP + FN)   (or)
Recall = TP / Actual(positive)
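The two formulas translate directly into code; the counts below are arbitrary illustrative numbers, not taken from any real model.

```python
# Precision: of everything predicted positive, how much was right.
# Recall: of everything actually positive, how much was found.
def precision(tp, fp):
    return tp / (tp + fp)

def recall(tp, fn):
    return tp / (tp + fn)

# Say the model made 8 positive predictions, 6 of them correct,
# and missed 2 actual positives.
print(round(precision(tp=6, fp=2), 2))  # 6 / 8 -> 0.75
print(round(recall(tp=6, fn=2), 2))     # 6 / 8 -> 0.75
```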
EXAMPLE:1
Total = 165        Predicted Negative    Predicted Positive
Actual Negative    TN = 50               FP = 10               60
Actual Positive    FN = 5                TP = 100              105
                   55                    110
ACCURACY = (TN + TP) / TOTAL = (50 + 100) / 165 = 0.91
ERROR RATE = 1 - ACCURACY or (FP + FN) / TOTAL = 0.09
PRECISION = TP / PREDICTED(POSITIVE) = 100/110 = 0.91
RECALL = TP / ACTUAL(POSITIVE) = 100/105 = 0.95 or RECALL = TPR
TPR = TP / (TP + FN) = 100/105 = 0.95 (SENSITIVITY)
FPR = FP / (FP + TN) or 1 - TNR = 10/60 = 0.17
FNR = FN / (FN + TP) or 1 - TPR = 5/105 = 0.05
TNR = TN / (TN + FP) = 50/60 = 0.83 (SPECIFICITY)
The F-Measure or F1-score is a way of combining the precision and recall of the model, and it is
defined as the harmonic mean of the model’s precision and recall.
F-MEASURE or F1-SCORE = 2PR / (P + R) = 1.729 / 1.86 = 0.93
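The figures from Example 1 can be checked with a few lines of arithmetic (TP = 100, TN = 50, FP = 10, FN = 5, total = 165):

```python
# Recompute the Example 1 metrics from the four confusion-matrix counts.
tp, tn, fp, fn = 100, 50, 10, 5
total = tp + tn + fp + fn            # 165

accuracy  = (tp + tn) / total        # 150 / 165
precision = tp / (tp + fp)           # 100 / 110
recall    = tp / (tp + fn)           # 100 / 105
f1        = 2 * precision * recall / (precision + recall)

print(round(accuracy, 2), round(precision, 2),
      round(recall, 2), round(f1, 2))
# 0.91 0.91 0.95 0.93
```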
Difference between Precision and Recall in Machine Learning
Precision:
It helps us to measure the ability of the model to classify positive samples correctly.
While calculating precision, we consider every sample the model classified as positive, whether it is actually positive or negative.
A model that classifies most of the positive samples correctly but also produces many false positives is said to be a high recall, low precision model.
Precision depends on both the negative and the positive samples.
Precision considers all samples that are classified as positive, whether correctly or incorrectly.

Recall:
It helps us to measure how many of the actual positive samples were correctly classified by the ML model.
While calculating recall, we only need the actual positive samples; all negative samples are neglected.
A model that classifies only a few samples as positive, but classifies them correctly, is said to be a high precision, low recall model.
Recall depends only on the positive samples and is independent of the negative samples.
Recall cares about correctly classifying all positive samples; it does not consider whether any negative sample is classified as positive.
Why use Precision and Recall in Machine Learning models?
This question is very common among machine learning engineers and data researchers. The use of
Precision and Recall varies according to the type of problem being solved.
o If it matters that the samples the model labels as positive really are positive, i.e. false positives
are costly, then use Precision.
o On the other hand, if the goal is to detect all positive samples, i.e. missing a positive (a false
negative) is costly, then use Recall. Here, we do not care whether negative samples are
classified correctly or incorrectly.
https://www.kaggle.com/datasets/chepkoyallan/datapreprocessing