Bayesian Classification
A statistical classifier
Probabilistic prediction
Predict class membership probabilities
Based on Bayes’ Theorem
Naïve Bayesian classifier
Comparable performance with decision tree and selected neural network classifiers
Good accuracy and speed when applied to large databases
Incremental: each training example can incrementally update the estimated probabilities
Bayesian Classification
Naïve Bayesian Classifier
Class conditional independence: the effect of an attribute value on a given class is independent of the values of the other attributes
Simplifies computations
Bayesian Belief Networks
Graphical models
Represent dependencies among subsets of attributes
Bayesian Theorem: Basics
Let X be a data sample whose class label is unknown
Let H be a hypothesis that X belongs to class C
Classification is to determine P(H|X), the probability that the hypothesis holds given the observed data sample X
P(H|X): posterior probability of H conditioned on X
P(H) (prior probability): the initial probability of H, before X is observed
P(X): probability that the sample data is observed
P(X|H) (likelihood): the probability of observing the sample X, given that the hypothesis holds
Example: X is a round, red fruit and H is the hypothesis that X is an apple
Bayesian Theorem
Given training data X, the posterior probability of a hypothesis H, P(H|X), follows Bayes' theorem:
P(H|X) = P(X|H) P(H) / P(X)
Predicts that X belongs to Ci iff the probability P(Ci|X) is the highest among all the P(Ck|X) for the k classes
Practical difficulty: requires initial knowledge of many probabilities and incurs significant computational cost
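To make the update rule concrete, here is a minimal Python sketch applying Bayes' theorem to the round-red-fruit / apple example; the numeric prior, likelihood, and evidence values are illustrative assumptions, not values from the slides.

# Minimal sketch of Bayes' theorem: P(H|X) = P(X|H) * P(H) / P(X).
# The numbers below are illustrative assumptions, not values from the slides.

def posterior(p_x_given_h, p_h, p_x):
    """Return P(H|X) from the likelihood, prior, and evidence."""
    return p_x_given_h * p_h / p_x

# H: the fruit is an apple; X: the fruit is round and red (hypothetical values)
p_h = 0.30          # prior: 30% of fruits are apples
p_x_given_h = 0.90  # most apples are round and red
p_x = 0.40          # 40% of all fruits are round and red

print(posterior(p_x_given_h, p_h, p_x))  # ≈ 0.675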
Naïve Bayesian Classifier
Let D be a training set of tuples and their associated class labels, where each tuple is represented by an n-dimensional attribute vector X = (x1, x2, …, xn)
Suppose there are m classes C1, C2, …, Cm
Classification derives the maximum a posteriori class, i.e., the class Ci with maximal P(Ci|X)
This can be derived from Bayes' theorem:
P(Ci|X) = P(X|Ci) P(Ci) / P(X)
Naïve Bayesian Classifier
Since P(X) is constant for all classes, only P(X|Ci) P(Ci) needs to be maximized
If the classes are assumed equally likely, it suffices to maximize P(X|Ci)
A simplifying assumption: attributes are conditionally independent given the class (i.e., no dependence relation between attributes):
P(X|Ci) = ∏(k=1..n) P(xk|Ci) = P(x1|Ci) × P(x2|Ci) × … × P(xn|Ci)
Derivation of Naïve Bayes Classifier
This greatly reduces the computation cost: only the class distribution needs to be counted
If Ak is categorical, P(xk|Ci) = sik / si, where sik is the number of tuples in Ci having value xk for Ak and si is the number of training tuples belonging to Ci
If Ak is continuous-valued, P(xk|Ci) is usually computed from a Gaussian distribution with mean μ and standard deviation σ: P(xk|Ci) = g(xk, μCi, σCi), where
g(x, μ, σ) = (1 / (√(2π) σ)) · e^(−(x−μ)² / (2σ²))
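As a small illustration of the continuous case, the sketch below estimates the class-conditional density of one attribute value with the Gaussian g(x, μ, σ); the helper names and the sample ages are assumptions for illustration, with μ and σ computed from the training values of class Ci.

import math

def gaussian(x, mu, sigma):
    """Gaussian density g(x, mu, sigma) used for continuous attributes."""
    return (1.0 / (math.sqrt(2 * math.pi) * sigma)) * math.exp(-((x - mu) ** 2) / (2 * sigma ** 2))

def continuous_likelihood(x_k, values_in_class):
    """Estimate P(x_k | Ci) from the attribute values observed in class Ci."""
    n = len(values_in_class)
    mu = sum(values_in_class) / n
    var = sum((v - mu) ** 2 for v in values_in_class) / n
    return gaussian(x_k, mu, math.sqrt(var))

# Hypothetical ages of customers in class buys_computer = 'yes'
ages_yes = [25, 32, 45, 38, 29, 41, 35, 30, 44]
print(continuous_likelihood(33, ages_yes))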
Example
Class: C1: buys_computer = ‘yes’; C2: buys_computer = ‘no’
Data sample X = (age <= 30, income = medium, student = yes, credit_rating = fair)

age      income   student   credit_rating   buys_computer
<=30     high     no        fair            no
<=30     high     no        excellent       no
31…40    high     no        fair            yes
>40      medium   no        fair            yes
>40      low      yes       fair            yes
>40      low      yes       excellent       no
31…40    low      yes       excellent       yes
<=30     medium   no        fair            no
<=30     low      yes       fair            yes
>40      medium   yes       fair            yes
<=30     medium   yes       excellent       yes
31…40    medium   no        excellent       yes
31…40    high     yes       fair            yes
>40      medium   no        excellent       no
Example
P(Ci): P(buys_computer = “yes”) = 9/14 = 0.643
P(buys_computer = “no”) = 5/14 = 0.357
Compute P(X|Ci) for each class
P(age = “<=30” | buys_computer = “yes”) = 2/9 = 0.222
P(age = “<= 30” | buys_computer = “no”) = 3/5 = 0.6
P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
P(income = “medium” | buys_computer = “no”) = 2/5 = 0.4
P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
P(student = “yes” | buys_computer = “no”) = 1/5 = 0.2
P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.4
X = (age <= 30, income = medium, student = yes, credit_rating = fair)
P(X|Ci) : P(X|buys_computer = “yes”) = 0.222 x 0.444 x 0.667 x 0.667 = 0.044
P(X|buys_computer = “no”) = 0.6 x 0.4 x 0.2 x 0.4 = 0.019
P(X|Ci)*P(Ci) : P(X|buys_computer = “yes”) * P(buys_computer = “yes”) = 0.028
P(X|buys_computer = “no”) * P(buys_computer = “no”) = 0.007
Therefore, X belongs to class buys_computer = “yes”
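A minimal sketch of the same computation, assuming the 14-tuple table above is encoded as a list of (age, income, student, credit_rating, buys_computer) tuples; the names data, X, and naive_bayes_predict are illustrative, not from the slides.

# Naive Bayes on the buys_computer example above (plain counting, no smoothing).
data = [
    ("<=30", "high", "no", "fair", "no"),
    ("<=30", "high", "no", "excellent", "no"),
    ("31…40", "high", "no", "fair", "yes"),
    (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"),
    (">40", "low", "yes", "excellent", "no"),
    ("31…40", "low", "yes", "excellent", "yes"),
    ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"),
    (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excellent", "yes"),
    ("31…40", "medium", "no", "excellent", "yes"),
    ("31…40", "high", "yes", "fair", "yes"),
    (">40", "medium", "no", "excellent", "no"),
]

X = ("<=30", "medium", "yes", "fair")  # age, income, student, credit_rating

def naive_bayes_predict(data, x):
    scores = {}
    for c in {row[-1] for row in data}:
        rows_c = [row for row in data if row[-1] == c]
        score = len(rows_c) / len(data)               # prior P(Ci)
        for k, value in enumerate(x):                 # product of P(xk|Ci)
            score *= sum(1 for row in rows_c if row[k] == value) / len(rows_c)
        scores[c] = score
    return max(scores, key=scores.get), scores

print(naive_bayes_predict(data, X))  # 'yes' wins: ≈ 0.028 vs ≈ 0.007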
Avoiding the 0-Probability Problem
Naïve Bayesian prediction requires each conditional probability to be non-zero; otherwise, the predicted probability will be zero:
P(X|Ci) = ∏(k=1..n) P(xk|Ci)
Example: suppose a dataset with 1000 tuples has income = low (0 tuples), income = medium (990 tuples), and income = high (10 tuples)
Use the Laplacian correction (or Laplacian estimator): add 1 to each case
Prob(income = low) = 1/1003
Prob(income = medium) = 991/1003
Prob(income = high) = 11/1003
The “corrected” probability estimates are close to their “uncorrected” counterparts
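A small sketch of the correction under the slide's scenario (the function name laplace_estimate is illustrative): add 1 to each value count and add the number of distinct attribute values to the denominator.

def laplace_estimate(count, total, num_values):
    """Laplacian-corrected estimate of P(value | class)."""
    return (count + 1) / (total + num_values)

# income over 1000 tuples: low = 0, medium = 990, high = 10 (three distinct values)
for name, count in [("low", 0), ("medium", 990), ("high", 10)]:
    print(name, laplace_estimate(count, 1000, 3))  # 1/1003, 991/1003, 11/1003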
Naïve Bayesian Classifier
Advantages
Easy to implement
Good results obtained in most cases
Disadvantages
Relies on the assumption of class conditional independence, which can cause a loss of accuracy
In practice, dependencies exist among variables
E.g., hospital patient data: profile (age, family history, etc.), symptoms (fever, cough, etc.), disease (lung cancer, diabetes, etc.)
Dependencies among these cannot be modeled by the Naïve Bayesian Classifier
Bayesian Belief Networks
Models dependencies between variables
Defined by two components:
Directed Acyclic Graph
Conditional Probability Table (CPT) for each variable
A Bayesian belief network allows a subset of the variables to be conditionally independent
Bayesian Belief Networks
A graphical model of causal relationships
Represents dependency among the variables
Gives a specification of joint probability distribution
Nodes: random variables
Links: dependency
Example with four nodes X, Y, Z, P: X and Y are the parents of Z, and Y is the parent of P; there is no direct dependency between Z and P
The graph has no loops or cycles
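One possible in-memory representation of this example graph, as a mapping from each node to its parents; a minimal sketch whose structure comes from the bullet above.

# Parents of each node in the example DAG above: X and Y are parents of Z, Y is the parent of P.
parents = {
    "X": [],
    "Y": [],
    "Z": ["X", "Y"],
    "P": ["Y"],
}

# The joint distribution then factors as P(X, Y, Z, P) = P(X) P(Y) P(Z | X, Y) P(P | Y).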
Bayesian Belief Network: An Example
Variables: FamilyHistory (FH), Smoker (S), LungCancer (LC), Emphysema, PositiveXRay, Dyspnea; FamilyHistory and Smoker are the parents of LungCancer
The conditional probability table (CPT) for the variable LungCancer:

        (FH, S)   (FH, ~S)   (~FH, S)   (~FH, ~S)
LC      0.8       0.5        0.7        0.1
~LC     0.2       0.5        0.3        0.9

The CPT shows the conditional probability of the variable for each possible combination of the values of its parents
Derivation of the probability of a particular combination of values x1, …, xn of the variables from the CPT:
P(x1, …, xn) = ∏(i=1..n) P(xi | Parents(Yi))
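As an illustration of evaluating the factored joint probability, here is a minimal sketch for a reduced network containing only FamilyHistory, Smoker, and LungCancer; the priors for FamilyHistory and Smoker are made-up values, and only the LungCancer CPT comes from the slide.

# Joint probability P(fh, s, lc) = P(fh) * P(s) * P(lc | fh, s) for a 3-node network.
# Priors for FamilyHistory and Smoker are assumed for illustration.
p_fh = {True: 0.2, False: 0.8}
p_s = {True: 0.3, False: 0.7}

# CPT for LungCancer from the slide, indexed by (FamilyHistory, Smoker).
p_lc_given_parents = {
    (True, True): 0.8,
    (True, False): 0.5,
    (False, True): 0.7,
    (False, False): 0.1,
}

def joint(fh, s, lc):
    p_lc = p_lc_given_parents[(fh, s)]
    return p_fh[fh] * p_s[s] * (p_lc if lc else 1.0 - p_lc)

# Probability that family history, smoker, and lung cancer are all true
print(joint(True, True, True))  # 0.2 * 0.3 * 0.8 = 0.048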
Training Bayesian Networks
Several scenarios:
Given both the network structure and all variables observable: learn only the CPTs (see the counting sketch after this list)
Network structure known, some variables hidden: gradient descent (greedy hill-climbing) method, analogous to neural network learning
Network structure unknown, all variables observable: search through the model space to reconstruct the network topology
Unknown structure, all variables hidden: no good algorithms known for this purpose
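When the structure is known and every variable is observed, each CPT entry is just a conditional relative frequency. A minimal counting sketch under that assumption; the variable names and sample data are hypothetical.

from collections import Counter

# Fully observed samples of (FamilyHistory, Smoker, LungCancer); values are hypothetical.
samples = [
    (True, True, True), (True, True, False), (True, False, False),
    (False, True, True), (False, True, False), (False, False, False),
    (False, False, False), (True, True, True),
]

def learn_cpt(samples):
    """Estimate P(LungCancer = True | FamilyHistory, Smoker) by counting."""
    totals, positives = Counter(), Counter()
    for fh, s, lc in samples:
        totals[(fh, s)] += 1
        if lc:
            positives[(fh, s)] += 1
    return {parents: positives[parents] / n for parents, n in totals.items()}

print(learn_cpt(samples))  # e.g. {(True, True): 0.666..., (True, False): 0.0, ...}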