Classification: Bayesian Classification
Dr. Manish Kumar
Associate Professor
Chair: Data Analytics Lab & M.Tech (Data Engg.)
Department of Information Technology
Indian Institute of Information Technology-Allahabad, Prayagraj
Bayesian Classification
What are Bayesian Classifiers?
▪ Statistical Classifiers
▪ Predict class membership probabilities
▪ Based on Bayes Theorem
▪ Naïve Bayesian Classifier
▪ Computationally Simple
▪ Performance comparable to decision tree (DT) and neural network (NN) classifiers
Bayesian Classification
▪ Probabilistic learning: calculates explicit probabilities for hypotheses; among the most practical approaches to certain types of learning problems
▪ Incremental: each training example can incrementally increase or decrease the probability that a hypothesis is correct; prior knowledge can be combined with observed data
Bayes Theorem
▪ Let X be a data sample whose class label is unknown
▪ Let H be the hypothesis that X belongs to a class C
▪ For classification, determine P(H|X)
▪ P(H|X) is the probability that H holds given the observed data sample X
▪ P(H|X) is the posterior probability of H conditioned on X
Bayes Theorem
Example: Sample space: all fruits, described by their color and shape
X is "round" and "red"
H = hypothesis that X is an apple
P(H|X) is our confidence that X is an apple given that X is "round" and "red"
▪ P(H) is the prior probability of H, i.e., the probability that any given data sample is an apple, regardless of how it looks
▪ P(H|X) is based on more information than P(H)
▪ Note that P(H) is independent of X
Bayes Theorem
Example: Sample space: all fruits
▪ P(X|H)?
▪ It is the probability that X is round and red, given that we know X is an apple
▪ P(X) is the prior probability of X = P(a data sample from our set of fruits is red and round)
Estimating Probabilities
▪ P(X), P(H), and P(X|H) may be estimated from the given data
▪ Bayes Theorem: P(H|X) = P(X|H) P(H) / P(X)
▪ Use of Bayes Theorem in the Naïve Bayesian Classifier!!
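As a minimal Python sketch of how these probabilities can be estimated from counts and combined via Bayes Theorem, using the fruit example above with purely hypothetical counts:

# Bayes Theorem on the fruit example; the counts are hypothetical, for illustration only.
total = 1000              # fruits in the sample space
apples = 200              # fruits that are apples
round_red = 300           # fruits that are round and red
round_red_apples = 180    # apples that are also round and red

p_h = apples / total                      # prior P(H)
p_x = round_red / total                   # prior P(X)
p_x_given_h = round_red_apples / apples   # likelihood P(X|H)

# Bayes Theorem: P(H|X) = P(X|H) * P(H) / P(X)
p_h_given_x = p_x_given_h * p_h / p_x
print(round(p_h_given_x, 2))              # 0.9 * 0.2 / 0.3 = 0.6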
Naïve Bayesian Classification
▪ Also called the Simple Bayesian Classifier
▪ Assumes class conditional independence: the effect of an attribute value on a given class is independent of the values of the other attributes
▪ This assumption simplifies computations
Naïve Bayesian Classification
Steps Involved
1. Each data sample is of the form X = (x1, x2, …, xn), where xi is the value of X for attribute Ai
2. Suppose there are m classes C1, C2, …, Cm.
   X ∈ Ci iff
   P(Ci|X) > P(Cj|X) for 1 ≤ j ≤ m, j ≠ i
   i.e., the classifier assigns X to the class Ci having the highest posterior probability conditioned on X
Naïve Bayesian Classification
The class Ci for which P(Ci|X) is maximized is called the maximum posteriori hypothesis.
From Bayes Theorem:
P(Ci|X) = P(X|Ci) P(Ci) / P(X)
3. P(X) is constant for all classes, so only P(X|Ci) P(Ci) need be maximized.
▪ If the class prior probabilities are not known, assume all classes to be equally likely, i.e., P(C1) = P(C2) = … = P(Cm), and therefore maximize P(X|Ci)
▪ Otherwise, estimate P(Ci) = si/s, where si is the number of training samples of class Ci and s is the total number of training samples, and maximize P(X|Ci) P(Ci)
Problem: computing P(X|Ci) directly is computationally expensive (may be infeasible)
Naïve Bayesian Classification
4. Naïve assumption: attribute independence (class conditional independence)
   P(X|Ci) = P(x1, …, xn|Ci) = Π P(xk|Ci), the product taken over k = 1 to n
5. To classify an unknown sample X, evaluate P(X|Ci) P(Ci) for each class Ci. Sample X is assigned to the class Ci iff
   P(X|Ci) P(Ci) > P(X|Cj) P(Cj) for 1 ≤ j ≤ m, j ≠ i
Naïve Bayesian Classification
Example
Age     Income   Student  Credit_rating  Class: Buys_comp
<=30    HIGH     N        FAIR           N
<=30    HIGH     N        EXCELLENT      N
31…40   HIGH     N        FAIR           Y
>40     MEDIUM   N        FAIR           Y
>40     LOW      Y        FAIR           Y
>40     LOW      Y        EXCELLENT      N
31…40   LOW      Y        EXCELLENT      Y
<=30    MEDIUM   N        FAIR           N
<=30    LOW      Y        FAIR           Y
>40     MEDIUM   Y        FAIR           Y
<=30    MEDIUM   Y        EXCELLENT      Y
31…40   MEDIUM   N        EXCELLENT      Y
31…40   HIGH     Y        FAIR           Y
>40     MEDIUM   N        EXCELLENT      N
Naïve Bayesian Classification
Example
X = (age<=30, income=MEDIUM, student=Y, credit_rating=FAIR); class Buys_comp = ?
We need to maximize
P(X|Ci) P(Ci) for i = 1, 2.
P(Ci) is computed from the training samples:
P(buys_comp=Y) = 9/14 = 0.643
P(buys_comp=N) = 5/14 = 0.357
How to calculate P(X|Ci) P(Ci) for i = 1, 2?
P(X|Ci) = P(x1, x2, x3, x4|Ci) = Π P(xk|Ci)
Naïve Bayesian Classification
Example
P(age<=30 | buys_comp=Y)=2/9=0.222
P(age<=30 | buys_comp=N)=3/5=0.600
P(income=medium | buys_comp=Y)=4/9=0.444
P(income=medium | buys_comp=N)=2/5=0.400
P(student=Y | buys_comp=Y)=6/9=0.667
P(student=Y | buys_comp=N)=1/5=0.200
P(credit_rating=FAIR | buys_comp=Y)=6/9=0.667
P(credit_rating=FAIR | buys_comp=N)=2/5=0.400
Naïve Bayesian Classification
Example
P(X | buys_comp=Y)=0.222*0.444*0.667*0.667=0.044
P(X | buys_comp=N)=0.600*0.400*0.200*0.400=0.019
P(X | buys_comp=Y)P(buys_comp=Y) = 0.044*0.643=0.028
P(X | buys_comp=N)P(buys_comp=N) = 0.019*0.357=0.007
CONCLUSION: The Naïve Bayesian classifier predicts buys_comp=Y for sample X, i.e., X buys a computer.
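The hand computation above can be reproduced with a short Python sketch (the table and attribute order follow the slides; the code itself is only illustrative):

# Naive Bayes on the buys_comp training data from the slides.
data = [
    ("<=30",  "HIGH",   "N", "FAIR",      "N"),
    ("<=30",  "HIGH",   "N", "EXCELLENT", "N"),
    ("31…40", "HIGH",   "N", "FAIR",      "Y"),
    (">40",   "MEDIUM", "N", "FAIR",      "Y"),
    (">40",   "LOW",    "Y", "FAIR",      "Y"),
    (">40",   "LOW",    "Y", "EXCELLENT", "N"),
    ("31…40", "LOW",    "Y", "EXCELLENT", "Y"),
    ("<=30",  "MEDIUM", "N", "FAIR",      "N"),
    ("<=30",  "LOW",    "Y", "FAIR",      "Y"),
    (">40",   "MEDIUM", "Y", "FAIR",      "Y"),
    ("<=30",  "MEDIUM", "Y", "EXCELLENT", "Y"),
    ("31…40", "MEDIUM", "N", "EXCELLENT", "Y"),
    ("31…40", "HIGH",   "Y", "FAIR",      "Y"),
    (">40",   "MEDIUM", "N", "EXCELLENT", "N"),
]
x = ("<=30", "MEDIUM", "Y", "FAIR")   # the unknown sample X

scores = {}
for c in ("Y", "N"):
    rows = [r for r in data if r[-1] == c]
    prior = len(rows) / len(data)                        # P(Ci)
    likelihood = 1.0
    for k, value in enumerate(x):                        # P(xk|Ci) for each attribute
        likelihood *= sum(r[k] == value for r in rows) / len(rows)
    scores[c] = likelihood * prior                       # P(X|Ci) * P(Ci)

print(scores)                         # {'Y': ~0.028, 'N': ~0.007}
print(max(scores, key=scores.get))    # 'Y' -> X buys a computer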
Bayesian Belief Networks
▪ The Naïve BC assumes class conditional independence
▪ This assumption simplifies computations
▪ When the assumption holds true, the Naïve BC is the most accurate classifier in comparison with all other classifiers
▪ In real problems, dependencies do exist between variables
▪ Two approaches to overcome this limitation of the NBC:
▪ Bayesian networks, which combine Bayesian reasoning with causal relationships between attributes
▪ Decision trees, which reason on one attribute at a time, considering the most important attributes first
Bayesian Belief Networks
Also known as:
▪ Belief Networks
▪ Bayesian Networks
▪ Probabilistic Networks
A Bayesian belief network has two components:
▪ a Directed Acyclic Graph (DAG)
▪ a Conditional Probability Table (CPT) for each variable
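A minimal Python sketch of these two components for a hypothetical two-variable network (Smoker → LungCancer); the structure and numbers are invented purely to illustrate the DAG + CPT representation:

# Hypothetical belief network, purely illustrative.
# DAG: each variable lists its parents.
dag = {"Smoker": [], "LungCancer": ["Smoker"]}

# CPT: P(variable = True | parent configuration), one entry per configuration.
cpt = {
    "Smoker":     {(): 0.3},                          # P(Smoker=T)
    "LungCancer": {(True,): 0.1, (False,): 0.01},     # P(LungCancer=T | Smoker)
}

# The joint probability factorizes along the DAG:
# P(Smoker=T, LungCancer=T) = P(Smoker=T) * P(LungCancer=T | Smoker=T)
p_joint = cpt["Smoker"][()] * cpt["LungCancer"][(True,)]
print(p_joint)   # 0.03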
The k-Nearest Neighbor Algorithm
▪ All instances correspond to points in the n-dimensional space.
▪ The nearest neighbors are defined in terms of Euclidean distance.
▪ The Euclidean distance between two points X = (x1, x2, …, xn) and Y = (y1, y2, …, yn) is
  d(X, Y) = √( Σ_{i=1}^{n} (xi − yi)² )
▪ The target function may be discrete- or real-valued.
▪ For a discrete-valued target, k-NN returns the most common value among the k training examples nearest to the query point xq.
▪ For continuous-valued target functions, the k-NN algorithm returns the mean value of the k nearest neighbors.
The k-Nearest Neighbor Algorithm
▪ Distance-weighted nearest neighbor algorithm
▪ Weight the contribution of each of the k neighbors according to their distance to the query point xq, giving greater weight to closer neighbors
▪ Nearest neighbor classifiers are lazy learners: they store all of the training samples and do not build a classifier until a new sample needs to be classified (a code sketch follows below)
▪ Robust to noisy data, since the prediction averages over the k nearest neighbors
▪ Curse of dimensionality: the distance between neighbors can be dominated by irrelevant attributes
▪ To overcome it, eliminate the least relevant attributes (or weight attributes by relevance)
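A minimal Python sketch of the k-NN classifier described above, for a discrete-valued target with optional distance weighting (the training points, query points, and the 1/d² weighting scheme are illustrative choices):

# k-NN for a discrete-valued target, following the slides.
import math
from collections import Counter

def euclidean(x, y):
    # Euclidean distance d(X, Y) = sqrt(sum_i (xi - yi)^2)
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))

def knn_classify(training, xq, k=3, distance_weighted=False):
    # training: list of (point, label); xq: query point
    neighbors = sorted(training, key=lambda p: euclidean(p[0], xq))[:k]
    votes = Counter()
    for point, label in neighbors:
        d = euclidean(point, xq)
        # plain majority vote, or weight each vote by 1/d^2 (closer neighbors count more)
        votes[label] += 1.0 / (d ** 2 + 1e-9) if distance_weighted else 1.0
    return votes.most_common(1)[0][0]

training = [((1.0, 1.0), "A"), ((1.2, 0.8), "A"), ((4.0, 4.2), "B"), ((3.8, 4.0), "B")]
print(knn_classify(training, (1.1, 1.0), k=3))                           # "A"
print(knn_classify(training, (3.9, 4.1), k=3, distance_weighted=True))   # "B"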
Case-Based Reasoning
▪ Also uses lazy evaluation and analysis of similar instances
▪ Difference: instances are not "points in a Euclidean space"
▪ Methodology
▪ Instances are represented by rich symbolic descriptions (e.g., function graphs)
▪ Multiple retrieved cases may be combined
▪ Tight coupling between case retrieval, knowledge-based reasoning, and problem solving
▪ Research issues
▪ Indexing based on syntactic similarity measures and, when this fails, backtracking and adapting to additional cases
Classifier Accuracy
▪ How can it be measured?
▪ Holdout Method (Random Subsampling)
▪ k-fold Cross-Validation (a sketch follows below)
▪ Bootstrapping
▪ How can we improve classifier accuracy?
▪ Bagging
▪ Boosting
▪ Is accuracy enough to judge a classifier?
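A minimal sketch of k-fold cross-validation as an accuracy estimate; `classify(train, sample)` is a hypothetical stand-in for any classifier (e.g., the Naïve Bayes sketch above), and assigning folds by row index is one simple choice:

# k-fold cross-validation: each fold is held out once for testing.
def k_fold_accuracy(data, classify, k=5):
    # data: list of rows, each row = (attribute values..., class label)
    correct = 0
    for i in range(k):
        test = [row for j, row in enumerate(data) if j % k == i]    # held-out fold
        train = [row for j, row in enumerate(data) if j % k != i]   # remaining folds
        for row in test:
            sample, label = row[:-1], row[-1]
            if classify(train, sample) == label:
                correct += 1
    return correct / len(data)   # overall accuracy across all k folds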
Classification of Class-Imbalanced Data Sets
• Class-imbalance problem: rare positive examples but numerous negative ones, e.g., medical diagnosis, fraud, oil-spill, fault detection, etc.
• Traditional methods assume a balanced distribution of classes and equal error costs: not suitable for class-imbalanced data
• Typical methods for imbalanced data in 2-class classification (a sketch of the resampling methods follows below):
– Oversampling: re-sample data from the positive (rare) class
– Under-sampling: randomly eliminate tuples from the negative class
– Threshold-moving: move the decision threshold t so that rare-class tuples are easier to classify, and hence there is less chance of false-negative errors
– Ensemble techniques: ensembles of the multiple classifiers introduced above
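A minimal sketch of random oversampling and under-sampling for a 2-class problem; the labels "pos"/"neg" and the `label_of` accessor are illustrative assumptions, not part of the original material:

# Random resampling to rebalance a 2-class data set.
import random

def rebalance(data, label_of, mode="oversample", seed=0):
    rng = random.Random(seed)
    pos = [r for r in data if label_of(r) == "pos"]   # rare (positive) class
    neg = [r for r in data if label_of(r) == "neg"]   # numerous (negative) class
    if mode == "oversample":
        # re-sample (with replacement) from the rare class until the classes match
        pos = pos + [rng.choice(pos) for _ in range(len(neg) - len(pos))]
    else:
        # under-sample: randomly keep only as many negative tuples as there are positive ones
        neg = rng.sample(neg, len(pos))
    return pos + neg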