UNIT 2
SUBJECT: MACHINE LEARNING TECHNIQUES
What is Regression
Regression is a technique in machine learning and statistics used to find relationships between variables
and make predictions. It helps us understand how the value of one thing (called the dependent variable)
changes when other things (called independent variables) change.
For example, if you want to predict the price of a house based on its size, regression can help create a
model that shows how the house price changes as the size (in square feet) increases.
This way, you can estimate the price of a house just by knowing its size.
In simple terms, regression is like drawing a line (or curve) through data points to see how one thing affects
another and make predictions about future outcomes.
Types of Regression
1. Linear Regression
Linear Regression is a simple statistical method used to predict a continuous outcome (like price or
temperature) based on the relationship between two variables, where one is the input (independent
variable) and the other is the output (dependent variable). It models this relationship using a straight line
to make predictions.
The formula for linear regression is:
Y = mX + b
Where:
Y is the predicted output (dependent variable).
X is the input (independent variable).
m is the slope of the line (how much Y changes with X).
b is the intercept (the value of Y when X is 0).
Key Points:
Linear regression assumes a linear relationship between the input and output.
It finds the line that minimizes the sum of squared differences between the actual data points and the
line's predictions (a method called least squares).
Example:
If you want to predict a house's price based on its size, linear regression can help you find a straight line
that best fits the data points representing different house sizes and their corresponding prices.
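The least-squares fit described above can be illustrated with a short sketch. The snippet below uses NumPy's polyfit to estimate the slope m and intercept b; the house sizes and prices are made-up values used purely for illustration.

# Minimal sketch: fitting Y = mX + b by least squares on made-up house data.
import numpy as np

sizes = np.array([800, 1000, 1200, 1500, 1800])   # X: house size in square feet (illustrative values)
prices = np.array([150, 180, 210, 260, 300])      # Y: price in thousands (illustrative values)

m, b = np.polyfit(sizes, prices, deg=1)           # slope and intercept of the best-fit line
print(f"price = {m:.3f} * size + {b:.1f}")
print("predicted price for 1300 sq ft:", m * 1300 + b)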
2. Logistic Regression
Logistic Regression is a statistical method used in machine learning to predict a categorical outcome (like
yes/no, true/false) based on one or more input variables. Unlike linear regression, it predicts probabilities
and uses a logistic function to model a binary outcome (0 or 1).
In simple terms, it's used when the result is something like "Will it happen or not?" instead of predicting
a continuous value.
Example:
If you want to predict whether an email is spam or not based on features like the presence of certain
words, the sender's address, or the email length, logistic regression can help you estimate the probability
that an email is spam.
The formula for Logistic Regression (the sigmoid function) is:
P(Y=1|X) = 1 / (1 + e^-(b0 + b1X))
Where:
P(Y=1|X) is the predicted probability that the outcome is 1 (e.g., spam).
b0 is the intercept and b1 is the coefficient of the input X.
The sigmoid squashes any input into a value between 0 and 1, which can be read as a probability.
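As a rough sketch of the spam example above, the snippet below fits a logistic regression with scikit-learn (assumed to be installed); the word-count features and labels are made up for illustration only.

# Minimal sketch: logistic regression for spam vs. not spam on made-up word counts.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Each row: [count of "free", count of "offer"]; label 1 = spam, 0 = not spam (illustrative data)
X = np.array([[3, 2], [0, 0], [4, 1], [0, 1], [5, 3], [1, 0]])
y = np.array([1, 0, 1, 0, 1, 0])

model = LogisticRegression().fit(X, y)
print(model.predict_proba([[2, 2]]))   # [P(not spam), P(spam)] for a new email
print(model.predict([[0, 0]]))         # predicted class label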
What is Bayesian Learning
Bayesian Learning is a statistical approach to machine learning that applies Bayes' theorem to update the
probability of a hypothesis as new evidence is acquired. It focuses on incorporating prior knowledge and
evidence to improve predictions and decision-making.
1. Prior Knowledge: Bayesian learning allows the incorporation of prior beliefs or information about
a problem, which can influence the learning process.
2. Updating Beliefs: As new data becomes available, Bayesian learning updates the prior beliefs to
form a new posterior belief, allowing for continuous learning and adaptation.
3. Probabilistic Models: Bayesian learning often employs probabilistic models, such as Bayesian
networks or Gaussian processes, to represent uncertainties and make predictions.
Applications:
Medical Diagnosis: Updating the probability of diseases based on symptoms and test results.
Spam Detection: Classifying emails as spam or not spam by updating beliefs based on previous
data.
Recommender Systems: Personalizing recommendations by incorporating user preferences and
behavior.
In summary, Bayesian Learning is a powerful framework for modeling uncertainty and making informed
predictions by combining prior knowledge with observed data.
What is Bayes Theorem
Bayes' Theorem is a fundamental concept in probability theory that describes how to update the
probability of a hypothesis based on new evidence. It provides a way to calculate the conditional
probability of an event based on prior knowledge and observed data. For a hypothesis H and evidence D, the theorem states:
P(H|D) = [P(D|H) * P(H)] / P(D)
Explanation:
Prior Probability P(H) reflects your belief about the hypothesis before considering the evidence.
Likelihood P(D|H) measures how well the hypothesis explains the observed data.
Posterior Probability P(H|D) is the updated belief after taking the evidence into account.
Example:
Suppose you want to determine the probability that someone has a disease (hypothesis H) after testing
positive for it (evidence D).
You would use Bayes' theorem to update your belief about the disease's probability based on the test's
accuracy and the general prevalence of the disease in the population.
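A small worked sketch of this calculation is shown below; the disease prevalence and test accuracy numbers are assumed purely for illustration.

# Worked sketch of Bayes' theorem: P(H|D) = P(D|H) * P(H) / P(D).
# Assumed numbers: 1% prevalence, 95% true-positive rate, 5% false-positive rate.
p_disease = 0.01                     # prior P(H)
p_pos_given_disease = 0.95           # likelihood P(D|H)
p_pos_given_healthy = 0.05           # P(D|not H), the false-positive rate

# Total probability of a positive test, P(D)
p_pos = p_pos_given_disease * p_disease + p_pos_given_healthy * (1 - p_disease)

# Posterior P(H|D): probability of having the disease given a positive test
posterior = p_pos_given_disease * p_disease / p_pos
print(round(posterior, 3))           # about 0.161, much lower than the test accuracy suggests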
Importance:
Bayes' theorem is widely used in various fields, including statistics, machine learning, finance, and
medicine, as it provides a coherent method for reasoning about uncertainty and making decisions based
on incomplete information.
What is Concept Learning
Concept Learning is a fundamental aspect of machine learning, where a machine identifies and
understands general rules or concepts from specific examples. This process allows the machine to classify
new instances based on learned patterns. Here’s a breakdown of the key elements and processes involved
in concept learning:
In simple terms, concept learning is about teaching a machine to recognize a category based on a set of
examples. For instance, if we want the machine to learn the concept of “fruit,” we provide it with examples
of fruits (like apples and bananas) and non-fruits (like carrots and potatoes).
Components of Concept Learning:
1. Instances: These are the individual examples being classified. For example, instances can be specific
fruits or animals.
2. Target Concept: This is the actual concept we want the machine to learn, such as "fruit" or "bird."
3. Hypothesis Space: The set of all possible rules or hypotheses that the machine can consider based on
the given examples.
4. Hypothesis: The final rule or concept that the machine learns to classify the examples correctly.
5. Positive and Negative Examples:
Positive Examples: Instances that belong to the target concept (e.g., fruits like apple, banana).
Negative Examples: Instances that do not belong to the target concept (e.g., vegetables like carrot,
cucumber).
Example of Concept Learning:
Consider teaching a machine the concept of a “bird.”
Positive examples: Sparrow, Eagle, Parrot (they are birds).
Negative examples: Bat, Dog, Airplane (they are not birds).
The machine learns that birds usually have features such as wings and feathers, and many can fly. Its goal
is to develop a hypothesis (a rule) that can help it predict if a new animal (like a Penguin) is a bird.
Process of Concept Learning:
1. Representation of the Hypothesis Space: The machine begins by defining all potential rules or
hypotheses based on features of the instances (e.g., wings, size).
2. Generalization and Specialization:
Generalization: The machine looks for common features in positive examples.
Specialization: The machine refines its hypothesis to ensure it excludes negative examples.
This process is iterative and continues until the machine finds the best hypothesis (a code sketch of the generalization step appears after this list).
3. Evaluating the Hypothesis: Once a hypothesis is formed, the machine evaluates it based on its
effectiveness in classifying the training examples and its ability to generalize to new instances.
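One classical instance of the generalization step is the Find-S idea: start with the most specific hypothesis and relax every attribute that disagrees with a new positive example. The sketch below uses made-up bird attributes (has_wings, has_feathers, can_fly) purely for illustration.

# Minimal sketch of Find-S style generalization over positive examples only.
# Attribute order: (has_wings, has_feathers, can_fly); values are made up for illustration.
positive_examples = [
    ("yes", "yes", "yes"),   # Sparrow
    ("yes", "yes", "yes"),   # Eagle
    ("yes", "yes", "no"),    # Penguin: a bird that cannot fly
]

# Start with the first positive example as the most specific hypothesis, then
# replace any attribute that disagrees with a later positive example by "?" (any value).
hypothesis = list(positive_examples[0])
for example in positive_examples[1:]:
    for i, value in enumerate(example):
        if hypothesis[i] != value:
            hypothesis[i] = "?"

print(hypothesis)   # ['yes', 'yes', '?'] -> wings and feathers matter, flying does not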
Key Challenges in Concept Learning:
Overfitting: This occurs when the hypothesis is too specific, making it perform well on training data but
poorly on new, unseen examples.
Underfitting: This happens when the hypothesis is too broad and fails to capture the underlying structure
of the data, leading to inaccurate classifications.
Importance of Concept Learning in Machine Learning:
Concept learning is crucial because it forms the basis for many classification tasks in machine learning. It
enables machines to:
Recognize patterns in data.
Make decisions based on learned concepts.
Generalize knowledge to new data, which is essential for real-world applications such as spam detection,
image recognition, and medical diagnostics.
What is Bayes Optimal Classifier
The Bayes Optimal Classifier is a probabilistic model used in machine learning for classification tasks. It
applies Bayes' theorem to compute the posterior probability of each class given the observed features and
predicts the class with the highest posterior; when the true probabilities are known, no other classifier can
achieve a lower average error.
Example:
In spam email classification, it would compute the probability of an email being spam or not spam based
on the presence of certain keywords, choosing the class with the highest probability.
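A tiny sketch of this decision rule is given below; the prior and likelihood numbers are assumed for illustration only.

# Minimal sketch: pick the class with the highest posterior probability.
# Priors and likelihoods (e.g., P(observed keywords | class)) are made-up illustrative values.
priors = {"spam": 0.4, "not_spam": 0.6}
likelihoods = {"spam": 0.030, "not_spam": 0.004}   # P(features | class)

# Unnormalized posteriors: P(class | features) is proportional to P(features | class) * P(class)
posteriors = {c: likelihoods[c] * priors[c] for c in priors}
prediction = max(posteriors, key=posteriors.get)
print(prediction, posteriors)    # -> spam, since 0.012 > 0.0024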
Limitations:
Computationally Intensive: Requires significant computation, especially with many features or
classes.
Needs the True Distributions: It requires the exact class priors and class-conditional probabilities
P(features | class), which are rarely known and are hard to estimate from limited data; practical methods
such as Naive Bayes approximate them by assuming feature independence.
Importance:
It serves as a theoretical benchmark for evaluating other classification algorithms, such as Naive Bayes,
which approximates it by adding a simplifying independence assumption.
What is Naïve Bayes Classifier
The Naïve Bayes Classifier is a simple yet powerful classification algorithm based on Bayes' theorem. It is
called "naïve" because it assumes that the features (or attributes) used for classification are independent
of each other, which is often not the case in real-world data.
Key Features:
1. Bayes' Theorem: It calculates the probability of a class C given features X = (x1, x2, ..., xn):
P(C|X) = [P(X|C) * P(C)] / P(X)
2. Independence Assumption: The classifier assumes that the presence of a feature in a class is
independent of the presence of any other feature. This simplifies calculations and leads to:
P(X|C) = P(x1|C) * P(x2|C) * ... * P(xn|C)
3. Classification Rule: It predicts the class that maximizes the posterior probability:
Predicted class = argmax over C of P(C) * P(x1|C) * P(x2|C) * ... * P(xn|C)
Types of Naïve Bayes Classifiers:
1. Gaussian Naïve Bayes: Assumes that features follow a Gaussian (normal) distribution.
2. Multinomial Naïve Bayes: Suitable for discrete data, often used in text classification.
3. Bernoulli Naïve Bayes: Assumes binary features (e.g., presence/absence of a feature).
Example:
In email spam detection, the Naïve Bayes classifier would evaluate the presence of certain words in an
email to classify it as "spam" or "not spam." Each word is treated as an independent feature, and the
classifier calculates the probabilities accordingly.
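A rough sketch of this spam example using scikit-learn's MultinomialNB (assumed to be installed) is shown below; the vocabulary and word counts are made up for illustration.

# Minimal sketch: Multinomial Naive Bayes on made-up word counts.
import numpy as np
from sklearn.naive_bayes import MultinomialNB

# Each row: counts of the words ["free", "win", "meeting"]; label 1 = spam, 0 = not spam
X = np.array([[3, 2, 0], [0, 0, 2], [2, 3, 0], [0, 1, 3], [4, 1, 0], [0, 0, 1]])
y = np.array([1, 0, 1, 0, 1, 0])

clf = MultinomialNB(alpha=1.0)          # alpha=1.0 applies Laplace smoothing (see the zero-probability note below)
clf.fit(X, y)
print(clf.predict([[2, 1, 0]]))         # -> [1], classified as spam
print(clf.predict_proba([[0, 0, 2]]))   # class probabilities for a meeting-heavy email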
Advantages:
Simplicity: Easy to understand and implement.
Efficiency: Works well with large datasets and is computationally efficient.
Performance: Often performs surprisingly well, even with the independence assumption.
Limitations:
Independence Assumption: The assumption of feature independence is rarely true, which can lead to
inaccuracies.
Zero Probability Problem: If a feature value never occurs with a particular class in the training data, its
estimated conditional probability is zero, which forces the whole product (and hence that class's posterior)
to zero. This can be mitigated with techniques like Laplace smoothing, which adds a small count to every
feature-class combination.
Importance:
The Naïve Bayes classifier is widely used in various applications, including text classification (spam
detection, sentiment analysis), document categorization, and recommendation systems, due to its
efficiency and effectiveness in handling large datasets.
What are Bayesian Belief Networks (BBNs)
Bayesian Belief Networks (BBNs), also known as Bayesian Networks, are graphical models that represent
the probabilistic relationships among a set of variables using a directed acyclic graph (DAG).
Key Features:
1. Graph Structure:
Nodes represent random variables.
Directed edges indicate dependencies (causal relationships) between variables.
2. Conditional Probability Tables (CPT):
Each node has a CPT that defines the probability of the variable given its parent nodes.
3. Joint Probability Distribution:
The joint distribution of all variables can be expressed as the product of each variable's conditional
probability given its parents:
P(X1, X2, ..., Xn) = P(X1 | Parents(X1)) * P(X2 | Parents(X2)) * ... * P(Xn | Parents(Xn))
Example:
In a BBN involving Rain, Traffic Jam, and Accident:
Rain affects Traffic Jam, and Traffic Jam affects Accident.
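For the Rain -> Traffic Jam -> Accident chain above, the joint probability factorizes as P(Rain) * P(Traffic Jam | Rain) * P(Accident | Traffic Jam). The sketch below evaluates it with conditional probability tables whose numbers are assumed purely for illustration.

# Minimal sketch: joint probability in the chain Rain -> Traffic Jam -> Accident.
# All CPT values below are made up for illustration.
p_rain = {True: 0.2, False: 0.8}                                   # P(Rain)
p_jam_given_rain = {True: {True: 0.7, False: 0.3},                 # P(Traffic Jam | Rain)
                    False: {True: 0.2, False: 0.8}}
p_acc_given_jam = {True: {True: 0.3, False: 0.7},                  # P(Accident | Traffic Jam)
                   False: {True: 0.05, False: 0.95}}

def joint(rain, jam, accident):
    # The joint probability factorizes along the edges of the graph.
    return p_rain[rain] * p_jam_given_rain[rain][jam] * p_acc_given_jam[jam][accident]

print(joint(True, True, True))    # 0.2 * 0.7 * 0.3 = 0.042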
Applications:
Used in medical diagnosis, decision-making systems, and predictive analytics.
Advantages:
Handles uncertainty well and shows causal relationships clearly.
What is the EM Algorithm (Expectation-Maximization Algorithm)
The Expectation-Maximization (EM) Algorithm is a statistical method used for estimating the parameters
of probabilistic models, particularly when dealing with incomplete or missing data. It is commonly applied
in various fields like machine learning, computer vision, and natural language processing.
Key Concepts:
1. Latent Variables: These are variables that are not directly observed but are inferred from the observed
data. The EM algorithm is particularly useful in scenarios involving latent variables.
2. Steps of the Algorithm:
Initialization: Start with initial guesses for the parameters.
E-Step (Expectation Step): Calculate the expected value of the log-likelihood function based on the
current parameter estimates. This step involves estimating the distribution of the latent variables given
the observed data.
M-Step (Maximization Step): Update the parameter estimates to maximize the expected log-likelihood
calculated in the E-step (a minimal code sketch of both steps appears after this list).
3. Convergence: The algorithm iterates between the E-step and M-step until the parameter estimates
converge, meaning they stabilize and do not change significantly.
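The sketch below runs these two steps on a two-component, one-dimensional Gaussian mixture; the data, the fixed variance, and the equal mixing weights are simplifying assumptions chosen only to keep the illustration short.

# Minimal EM sketch: two 1D Gaussian components with fixed variance and equal weights.
import numpy as np

data = np.array([1.0, 1.2, 0.8, 5.0, 5.3, 4.9])     # made-up data with two obvious clusters
mu = np.array([0.0, 6.0])                            # initial guesses for the two means
sigma, weights = 1.0, np.array([0.5, 0.5])           # held fixed for simplicity

for _ in range(20):
    # E-step: responsibility of each component for each point (posterior over the latent label)
    dens = weights * np.exp(-(data[:, None] - mu) ** 2 / (2 * sigma ** 2))
    resp = dens / dens.sum(axis=1, keepdims=True)
    # M-step: update the means to maximize the expected log-likelihood
    mu = (resp * data[:, None]).sum(axis=0) / resp.sum(axis=0)

print(mu)   # converges near [1.0, 5.07]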
Applications:
Clustering: Used in Gaussian Mixture Models to find clusters in data.
Image Processing: Helps in segmenting images with incomplete information.
Natural Language Processing: Useful for training models with missing data points.
Support Vector Machine (SVM)
Support Vector Machine (SVM) is a supervised learning algorithm used for both classification and
regression tasks, though it is mostly used for classification problems. The basic idea of SVM is to find a
hyperplane that separates the data points of the two classes.
Key Concepts of SVM:
1. Hyperplane: The main aim of SVM is to find a hyperplane that can efficiently separate the classes.
In a two-dimensional space, a hyperplane is a line, while in higher-dimensional spaces, it can be a
plane or a higher-dimensional surface.
2. Support Vectors: The data points that are closest to the hyperplane are called "Support Vectors."
These points are critical to SVM as they determine the position of the hyperplane and help
differentiate between the classes.
3. Margin: The distance between the hyperplane and the nearest support vectors on both sides is
called the margin. SVM aims to find a hyperplane with the maximum margin, as a larger margin
results in more robust classification.
4. Linear and Non-Linear SVM:
Linear SVM: When the data can be linearly separated (by a straight line), linear SVM is
used.
Non-Linear SVM: Sometimes the data is not linearly separable, and SVM uses the “kernel
trick” to map the data into a higher-dimensional space where linear separation is possible.
5. Kernel Trick: When data is not linearly separable, SVM applies kernel functions to project the data
into higher dimensions, where it can be separated easily. Popular kernel functions
include the Linear kernel, the Polynomial kernel, and the Radial Basis Function (RBF), also called the
Gaussian kernel.
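The effect of the kernel trick can be sketched with scikit-learn (assumed to be installed); make_circles generates a toy dataset where one class sits inside the other, so a straight line cannot separate them.

# Minimal sketch: linear vs. RBF-kernel SVM on a toy non-linear dataset.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

linear_clf = SVC(kernel="linear").fit(X, y)   # a straight boundary struggles here
rbf_clf = SVC(kernel="rbf").fit(X, y)         # the kernel trick handles the curved boundary

print("linear accuracy:", linear_clf.score(X, y))
print("rbf accuracy:", rbf_clf.score(X, y))
print("support vectors per class (rbf):", rbf_clf.n_support_)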
Types of SVM Kernels:
1. Linear Kernel:
Definition: The linear kernel is used when the data can be separated by a straight line (or hyperplane in
higher dimensions). It is the simplest form of kernel.
Usage: It is typically used when the data is linearly separable, meaning the two classes can be separated
by a straight boundary.
Example: Classifying spam and non-spam emails when the features (like words) can be separated by a
linear boundary.
2. Polynomial Kernel:
Definition: The polynomial kernel is useful for datasets that are not linearly separable but can be
separated using polynomial decision boundaries. It adds more complexity by creating curved boundaries.
Usage: It is used when the relationship between the data points is non-linear but can be captured by
polynomial functions.
Example: When the data points form a circular or curved pattern, a polynomial kernel can be used to
classify them.
3. Gaussian Kernel (RBF - Radial Basis Function):
Definition: The Gaussian kernel (or RBF) is one of the most commonly used kernels for SVMs. It maps
data into an infinite-dimensional space, allowing complex, non-linear boundaries to separate the classes.
Usage: It is used when the data is not linearly separable, and we need a flexible decision boundary to
handle complex patterns.
Example: It is ideal for tasks like face recognition or handwriting classification, where the data cannot be
separated by a simple linear boundary.
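For reference, the kernel functions themselves can be written in a few lines; the degree, constant c, and gamma values below are arbitrary illustrative choices.

# Minimal sketch of the three kernel functions (x and y are feature vectors).
import numpy as np

def linear_kernel(x, y):
    return np.dot(x, y)                            # K(x, y) = x . y

def polynomial_kernel(x, y, degree=3, c=1.0):
    return (np.dot(x, y) + c) ** degree            # K(x, y) = (x . y + c)^d

def rbf_kernel(x, y, gamma=0.5):
    return np.exp(-gamma * np.sum((x - y) ** 2))   # K(x, y) = exp(-gamma * ||x - y||^2)

x, y = np.array([1.0, 2.0]), np.array([2.0, 0.5])
print(linear_kernel(x, y), polynomial_kernel(x, y), rbf_kernel(x, y))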
Polynomial Kernel vs Gaussian Kernel (RBF)
Feature space: the polynomial kernel maps data into a finite higher-dimensional space, while the RBF
kernel corresponds to an infinite-dimensional feature space.
Decision boundary: the polynomial kernel produces curved boundaries of a fixed degree, while the RBF
kernel can form highly flexible, localized boundaries.
Main parameters: the polynomial kernel is tuned chiefly through its degree (and constant term), while the
RBF kernel is tuned chiefly through gamma, which controls the width of the Gaussian.
Typical use: the polynomial kernel suits data with moderate, polynomial-shaped structure, while the RBF
kernel is the common default for complex non-linear patterns.
Hyperplane – Decision Surface
The Hyperplane is a crucial concept in SVM (Support Vector Machine). Its function is to classify data points
into different categories. The main goal of SVM is to create a hyperplane that serves as the optimal
boundary between data points belonging to different categories.
Hyperplane: It is a line or plane that separates data points of different classes (categories).
If the data is in a 2D space, the hyperplane is a line.
If the data is in a 3D space, the hyperplane becomes a plane.
In higher dimensions, the hyperplane becomes a higher-dimensional surface.
Example:
Imagine you have red and blue points. The hyperplane creates a straight line or plane that separates these
red and blue points.
Decision Surface:
The hyperplane is also called the decision surface because it helps SVM decide which category the points
belong to.
Best Hyperplane: SVM's task is to find the hyperplane that creates the maximum margin between the
classes, meaning it tries to keep the points of different classes as far away from the hyperplane as possible
for more accurate classification.
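As a tiny sketch of the decision surface idea, the snippet below classifies points by the sign of w . x + b; the weight vector and intercept are made-up values.

# Minimal sketch: a hyperplane w . x + b = 0 used as a decision surface.
import numpy as np

w, b = np.array([2.0, -1.0]), 0.5              # made-up weights and intercept
def classify(point):
    return "red" if np.dot(w, point) + b >= 0 else "blue"

print(classify(np.array([1.0, 1.0])))          # 2 - 1 + 0.5 = 1.5 >= 0 -> red
print(classify(np.array([-1.0, 1.0])))         # -2 - 1 + 0.5 = -2.5 < 0 -> blue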
Some Properties of SVM:
1. Maximum Margin Classifier:
SVM's objective is to find the maximum margin between classes to separate them as effectively as
possible. The farther the hyperplane is from the class points, the better the classification.
2. Works Well with High-Dimensional Data:
SVM performs well even with high-dimensional data. If you have many features, SVM is efficient in
understanding the relationship between them.
3. Effective with Non-Linear Data:
SVM can efficiently classify non-linear data by using kernels (like Polynomial or Gaussian), which map the
data into higher dimensions where it becomes linearly separable.
4. Robustness to Overfitting:
If the data is well-separated, SVM reduces the chances of overfitting. Overfitting occurs when a model fits
the training data too closely and therefore performs poorly on unseen test data.
5. Support Vectors:
In SVM, only a few important points (called support vectors) are used to define the hyperplane. These
points lie very close to the hyperplane and help form the decision boundary.
Issues in SVM (Challenges with SVM)
1. Computational Complexity:
If you have a very large dataset, the training process of SVM can be slow, as finding the best hyperplane
requires significant computation.
2. Choice of Kernel:
For non-linear data, selecting the right kernel function is crucial. If the wrong kernel is chosen, SVM’s
performance can degrade. It’s often difficult to determine which kernel (Polynomial, Gaussian, or others)
is the best fit for the data.
3. Overfitting with Noisy Data:
If the data is very noisy (with many outliers), the hyperplane generated by SVM may not be accurate. SVM
is sensitive to noisy data, which can lead to incorrect decision boundaries.
4. Interpretability:
The results of SVM can be harder to interpret. When working in high dimensions, it’s not always easy to
understand how or why the hyperplane is being formed.
5. Not Suitable for Large Datasets:
SVM works best with small to medium-sized datasets. For very large datasets, it becomes computationally
expensive, and other algorithms like Decision Trees or Random Forests may perform better.