BAYESIAN NETWORKS
Bayesian Networks
• A Bayesian network specifies a joint distribution in a structured form
• Represent dependence/independence via a directed graph
– Nodes = random variables
– Edges = direct dependence
• Structure of the graph => conditional independence relations
In general,
p(X1, X2, ..., XN) = Πi p(Xi | parents(Xi))
(on the left, the full joint distribution; on the right, the graph-structured approximation)
• Requires that graph is acyclic (no directed cycles)
• 2 components to a Bayesian network
– The graph structure (conditional independence assumptions)
– The numerical probabilities (for each variable given its parents)
Example of a simple Bayesian network
(Graph: A → C ← B)
p(A,B,C) = p(C|A,B)p(A)p(B)
• Probability model has simple factored form
• Directed edges => direct dependence
• Absence of an edge => conditional independence
• Also known as belief networks, graphical models, causal networks
• Other formulations, e.g., undirected graphical models
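To make this factored form concrete, here is a minimal Python sketch of the three-node network above (the CPT numbers are illustrative assumptions, not from the slides): the structure is a parent list per node, the numbers are one table per node, and any joint probability is the product of one factor per node.

parents = {
    "A": [],          # no parents
    "B": [],
    "C": ["A", "B"],  # C depends directly on A and B
}

cpt = {
    "A": {(): 0.3},   # P(A=T)
    "B": {(): 0.6},   # P(B=T)
    "C": {            # P(C=T | A, B)
        (True, True): 0.9,  (True, False): 0.7,
        (False, True): 0.5, (False, False): 0.1,
    },
}

def joint(assignment):
    # p(A,B,C) = p(C|A,B) p(A) p(B): one factor per node
    p = 1.0
    for var, pars in parents.items():
        p_true = cpt[var][tuple(assignment[q] for q in pars)]
        p *= p_true if assignment[var] else 1.0 - p_true
    return p

print(joint({"A": True, "B": False, "C": True}))  # 0.3 * 0.4 * 0.7 = 0.084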
Examples of 3-way Bayesian Networks
(Graph: A, B, C with no edges between them)
Marginal independence:
p(A,B,C) = p(A) p(B) p(C)
Examples of 3-way Bayesian Networks
(Graph: B ← A → C)
Conditionally independent effects:
p(A,B,C) = p(B|A) p(C|A) p(A)
B and C are conditionally independent given A
e.g., A is a disease, and we model B and C as
conditionally independent symptoms given A
Examples of 3-way Bayesian Networks
(Graph: A → C ← B)
Independent causes:
p(A,B,C) = p(C|A,B) p(A) p(B)
“Explaining away” effect:
given C, observing A makes B less likely
e.g., the earthquake/burglary/alarm example
A and B are (marginally) independent,
but become dependent once C is known
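The explaining-away effect can be checked numerically. Below is a small sketch with made-up CPT numbers (ours, not from the slides): observing C raises the probability of B, but additionally observing A drives it back toward the prior.

from itertools import product

pA, pB = 0.1, 0.1                                  # marginally independent causes
pC = {(True, True): 0.95, (True, False): 0.9,      # P(C=T | A, B)
      (False, True): 0.9,  (False, False): 0.01}

def joint(a, b, c):
    pc = pC[(a, b)]
    return ((pA if a else 1 - pA) * (pB if b else 1 - pB)
            * (pc if c else 1 - pc))

def prob_b(**evidence):
    # P(B=T | evidence) by brute-force summation over the joint
    worlds = [dict(a=a, b=b, c=c) for a, b, c in product([True, False], repeat=3)]
    match = [w for w in worlds if all(w[k] == v for k, v in evidence.items())]
    den = sum(joint(w["a"], w["b"], w["c"]) for w in match)
    num = sum(joint(w["a"], w["b"], w["c"]) for w in match if w["b"])
    return num / den

print(prob_b(c=True))          # ~0.50: observing C raises belief in B
print(prob_b(c=True, a=True))  # ~0.10: observing A as well explains C away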
Examples of 3-way Bayesian Networks
(Graph: A → B → C)
Markov dependence:
p(A,B,C) = p(C|B) p(B|A) p(A)
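A quick numerical check of the Markov property, again with made-up numbers: P(C | A, B) does not depend on A, so C is conditionally independent of A given B.

pA = 0.3
pB = {True: 0.8, False: 0.2}    # P(B=T | A)
pC = {True: 0.6, False: 0.1}    # P(C=T | B)

def joint(a, b, c):
    return ((pA if a else 1 - pA) * (pB[a] if b else 1 - pB[a])
            * (pC[b] if c else 1 - pC[b]))

# P(C=T | A=a, B=T) for both values of a
for a in (True, False):
    num = joint(a, True, True)
    den = num + joint(a, True, False)
    print(a, num / den)         # both lines print 0.6 = P(C=T | B=T)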
Example
• Consider the following 5 binary variables:
– B = a burglary occurs at your house
– E = an earthquake occurs at your house
– A = the alarm goes off
– J = John calls to report the alarm
– M = Mary calls to report the alarm
– What is P(B | M, J) ? (for example)
– We can use the full joint distribution to answer this question
• Requires 2^5 = 32 probabilities
• Can we use prior domain knowledge to come up with a Bayesian
network that requires fewer probabilities?
Constructing a Bayesian Network: Step 1
• Order the variables in terms of causality (may be a partial order)
e.g., {E, B} -> {A} -> {J, M}
• P(J, M, A, E, B) = P(J, M | A, E, B) P(A | E, B) P(E, B)
≈ P(J, M | A) P(A | E, B) P(E) P(B)
≈ P(J | A) P(M | A) P(A | E, B) P(E) P(B)
These conditional independence (CI) assumptions are reflected in the
graph structure of the Bayesian network
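The slides give no CPT values; as a hedged illustration, the sketch below fills in the values commonly quoted for this alarm example in Russell & Norvig and answers the earlier query P(B | M, J) by brute-force summation over the factored joint.

from itertools import product

P_B, P_E = 0.001, 0.002                              # priors P(B=T), P(E=T)
P_A = {(True, True): 0.95, (True, False): 0.94,
       (False, True): 0.29, (False, False): 0.001}   # P(A=T | B, E)
P_J = {True: 0.90, False: 0.05}                      # P(J=T | A)
P_M = {True: 0.70, False: 0.01}                      # P(M=T | A)

def pr(p, x):                 # P(X = x) given P(X = True) = p
    return p if x else 1 - p

def joint(b, e, a, j, m):     # P(J|A) P(M|A) P(A|E,B) P(E) P(B)
    return (pr(P_B, b) * pr(P_E, e) * pr(P_A[(b, e)], a)
            * pr(P_J[a], j) * pr(P_M[a], m))

# P(B=T | J=T, M=T) = P(B,J,M) / P(J,M), summing out E and A
num = sum(joint(True, e, a, True, True) for e, a in product([True, False], repeat=2))
den = sum(joint(b, e, a, True, True) for b, e, a in product([True, False], repeat=3))
print(num / den)              # ~0.284 with these numbers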
The Resulting Bayesian Network
(Graph: E → A ← B; A → J, A → M)
Constructing this Bayesian Network: Step 2
• P(J, M, A, E, B) =
P(J | A) P(M | A) P(A | E, B) P(E) P(B)
• There are 3 conditional probability tables (CPTs) to be determined:
P(J | A), P(M | A), P(A | E, B)
– Requiring 2 + 2 + 4 = 8 probabilities
• And 2 marginal probabilities P(E), P(B) -> 2 more probabilities
• Where do these probabilities come from?
– Expert knowledge
– From data (relative frequency estimates)
– Or a combination of both – see discussion in Sections 20.1 and 20.2 (optional)
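A minimal sketch of the "from data" option: estimating P(A | E, B) by relative frequencies over complete records (the records below are fabricated for illustration; in practice one would usually smooth these counts).

from collections import Counter

# Each record is a (b, e, a) triple of booleans; counts are made up
records = ([(False, False, False)] * 950 + [(False, False, True)] * 2
           + [(True, False, True)] * 45 + [(True, False, False)] * 3)

counts, trues = Counter(), Counter()
for b, e, a in records:
    counts[(b, e)] += 1       # how often each (B, E) configuration occurs
    trues[(b, e)] += a        # how often A was true in that configuration

cpt_A = {k: trues[k] / counts[k] for k in counts}
print(cpt_A)                  # e.g. P(A=T | B=T, E=F) = 45/48 ≈ 0.94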
The Bayesian Network
(figure: the network above, annotated with its conditional probability tables)
Number of Probabilities in Bayesian Networks
• Consider n binary variables
• Unconstrained joint distribution requires O(2^n) probabilities
• If we have a Bayesian network, with a maximum of k parents for
any node, then we need O(n 2^k) probabilities
• Example
– Full unconstrained joint distribution
• n = 30: need 2^30 ≈ 10^9 probabilities for the full joint distribution
– Bayesian network
• n = 30, k = 4: need 30 × 2^4 = 480 probabilities
The Bayesian Network from a Different Variable Ordering
(figures omitted: a different ordering of the variables can yield a different,
and typically less compact, network)
Given a graph, can we “read off” conditional independencies?
A node is conditionally independent of all other nodes in the network
given its Markov blanket: its parents, its children,
and its children’s other parents
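A small helper (our own sketch) reads a node's Markov blanket directly off the parent lists:

def markov_blanket(node, parents):
    # blanket = parents + children + the children's other parents
    children = [v for v, ps in parents.items() if node in ps]
    blanket = set(parents[node]) | set(children)
    for c in children:
        blanket |= set(parents[c])
    blanket.discard(node)
    return blanket

# The alarm network: B and E are parents of A; A is parent of J and M
parents = {"B": [], "E": [], "A": ["B", "E"], "J": ["A"], "M": ["A"]}
print(markov_blanket("A", parents))   # {'B', 'E', 'J', 'M'}
print(markov_blanket("B", parents))   # {'A', 'E'}: E is A's other parent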
Inference (Reasoning) in Bayesian Networks
• Consider answering a query in a Bayesian Network
– Q = set of query variables
– e = evidence (set of instantiated variable-value pairs)
– Inference = computation of conditional distribution P(Q | e)
• Examples
– P(burglary | alarm)
– P(earthquake | JCalls, MCalls)
– P(JCalls, MCalls | burglary, earthquake)
• Can we use the structure of the Bayesian Network
to answer such queries efficiently? Answer = yes
– Generally speaking, the sparser the graph, the more efficient the inference
Example: Tree-Structured Bayesian Network
(Graph: D at the root; D → B, D → E; B → A, B → C; E → F, E → G)
p(a, b, c, d, e, f, g) is modeled as p(a|b)p(c|b)p(f|e)p(g|e)p(b|d)p(e|d)p(d)
Example
(Same tree; the evidence nodes c and g are observed, shown in lowercase)
Say we want to compute p(a | c, g)
Example
Direct calculation: p(a|c,g) = Σbdef p(a,b,d,e,f | c,g)
Complexity of the sum is O(m^4), where m is the arity of each variable
Example
Reordering:
Σb p(a|b) Σd p(b|d,c) Σe p(d|e) Σf p(e,f|g)
Example
Reordering:
Σb p(a|b) Σd p(b|d,c) Σe p(d|e) Σf p(e,f|g)
The innermost sum gives: Σf p(e,f|g) = p(e|g)
Example
Reordering:
Σb p(a|b) Σd p(b|d,c) Σe p(d|e) p(e|g)
The next sum gives: Σe p(d|e) p(e|g) = p(d|g)
Example
Reordering:
Σb p(a|b) Σd p(b|d,c) p(d|g)
The next sum gives: Σd p(b|d,c) p(d|g) = p(b|c,g)
Example
Reordering:
Σb p(a|b) p(b|c,g) = p(a|c,g)
Complexity is O(m), compared to O(m^4) for the direct calculation
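This speedup can be checked end to end. The sketch below rebuilds the same tree with randomly generated binary CPTs (our own code; it works with the equivalent unnormalized sums and divides by p(c,g) at the end, rather than the conditional forms written on the slides), and confirms the reordered computation matches the direct one.

import random
from itertools import product

random.seed(0)
V = [0, 1]

def rand_cpt(n_par):
    # a table of P(X=1 | parent values) for every parent combination
    return {k: random.random() for k in product(V, repeat=n_par)}

pD = rand_cpt(0); pB = rand_cpt(1); pE = rand_cpt(1)
pA = rand_cpt(1); pC = rand_cpt(1); pF = rand_cpt(1); pG = rand_cpt(1)

def pr(table, x, *pars):
    p1 = table[pars]
    return p1 if x == 1 else 1 - p1

def joint(a, b, c, d, e, f, g):
    # p(d) p(b|d) p(e|d) p(a|b) p(c|b) p(f|e) p(g|e)
    return (pr(pD, d) * pr(pB, b, d) * pr(pE, e, d) * pr(pA, a, b)
            * pr(pC, c, b) * pr(pF, f, e) * pr(pG, g, e))

c_obs, g_obs = 1, 1

def brute(a):
    # direct summation over b, d, e, f: O(m^4) terms
    return sum(joint(a, b, c_obs, d, e, f, g_obs)
               for b, d, e, f in product(V, repeat=4))

def reordered(a):
    # Σb p(a|b)p(c|b) Σd p(b|d)p(d) Σe p(e|d)p(g|e) Σf p(f|e)
    total = 0.0
    for b in V:
        inner_d = 0.0
        for d in V:
            inner_e = sum(pr(pE, e, d) * pr(pG, g_obs, e)
                          * sum(pr(pF, f, e) for f in V)
                          for e in V)
            inner_d += pr(pB, b, d) * pr(pD, d) * inner_e
        total += pr(pA, a, b) * pr(pC, c_obs, b) * inner_d
    return total

z = sum(brute(a) for a in V)
print([brute(a) / z for a in V])      # p(a|c,g) by direct summation
print([reordered(a) / z for a in V])  # identical values, cheaper sum order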
General Strategy for inference
• Want to compute P(q | e)
Step 1:
P(q | e) = P(q,e)/P(e) = α P(q,e), since P(e) is constant with respect to Q
Step 2:
P(q,e) = Σa..z P(q, e, a, b, …, z), by the law of total probability
Step 3:
Σa..z P(q, e, a, b, …, z) = Σa..z Πi P(variable i | parents of variable i)
(using the Bayesian network factoring)
Step 4:
Distribute summations across product terms for efficient computation
Complexity of Bayesian Network Inference
• Assume the network is a polytree
– At most one (undirected) path between any 2 nodes
• Complexity scales as O(n m^(K+1))
• n = number of variables
• m = arity of variables
• K = maximum number of parents for any node
– Compare to O(m^(n-1)) for the brute-force method
• Network is not a polytree?
– Can cluster variables to render the new graph a tree
– Very similar to the tree-decomposition methods used for constraint
satisfaction problems (CSPs)
– Complexity is O(n m^(W+1)), where W = number of variables in the largest cluster
Real-valued Variables
• Can Bayesian networks handle real-valued variables?
– If we can assume the variables are Gaussian, then the inference and theory
for Bayesian networks are well-developed
• E.g., the conditionals of a joint Gaussian are still Gaussian, etc.
• In inference we replace sums with integrals
– For other density functions it depends…
• Can often include a univariate variable at the “edge” of a graph, e.g., a Poisson
conditioned on day of week
– But for many variables little is known beyond their univariate
properties, e.g., what would be the joint distribution of a Poisson and a
Gaussian? (it is not defined)
– Common approaches in practice
• Put real-valued variables at “leaf nodes” (so nothing is conditioned on them)
• Assume real-valued variables are Gaussian or discrete
• Discretize real-valued variables
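A sketch of the discretization approach (the variable and the bin choices here are assumptions for illustration): replace a real-valued measurement with the index of the quantile bin it falls in, so it can enter a discrete network.

import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=37.0, scale=0.8, size=1000)   # e.g. a temperature reading

edges = np.quantile(x, [0.25, 0.5, 0.75])        # equal-frequency bin boundaries
labels = np.digitize(x, edges)                   # discrete symbol 0..3 per sample
print(np.bincount(labels) / len(x))              # roughly 0.25 each, by design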
Other Aspects of Bayesian Network Inference
• The problem of finding an optimal (for inference) ordering and/or
clustering of variables for an arbitrary graph is NP-hard
– Various heuristics are used in practice
– Efficient algorithms and software now exist for working with large
Bayesian networks
• E.g., work in Professor Rina Dechter’s group
• Other types of queries?
– E.g., finding the most likely values of a variable given evidence
– arg maxq P(Q = q | e) = “most probable explanation”
or maximum a posteriori (MAP) query
– Can also leverage the graph structure in the same manner as for
inference – essentially replaces the “sum” operator with “max”
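On a small network the MPE query can be written directly as an arg max over the unobserved variables; the sketch below uses a made-up three-node chain A → B → C. A structured version would reuse the elimination machinery above with each Σ replaced by a max.

from itertools import product

pA = 0.2
pB = {True: 0.9, False: 0.1}    # P(B=T | A)
pC = {True: 0.7, False: 0.2}    # P(C=T | B)

def joint(a, b, c):
    return ((pA if a else 1 - pA) * (pB[a] if b else 1 - pB[a])
            * (pC[b] if c else 1 - pC[b]))

# Most probable explanation: arg max over (A, B) of P(A, B, C=evidence)
evidence_c = True
best = max(product([True, False], repeat=2),
           key=lambda ab: joint(ab[0], ab[1], evidence_c))
print(best)                     # the most likely (A, B) given C = True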
Naïve Bayes Model
(Graph: the class variable C points to each feature Y1, Y2, …, Yn)
P(C | Y1, …, Yn) = α Πi P(Yi | C) P(C)
Features Y are conditionally independent given the class variable C
Widely used in machine learning
e.g., spam email classification: Y’s = counts of words in emails
Conditional probabilities P(Yi | C) can easily be estimated from labeled data
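A minimal naive Bayes sketch along these lines (the toy documents and add-one smoothing are our own assumptions): each P(Yi | C) is a smoothed relative word frequency per class, and classification compares the two log scores.

import math
from collections import Counter

spam = ["win money now", "free money offer", "win offer now"]
ham  = ["meeting at noon", "project meeting notes", "lunch at noon"]

def fit(docs):
    # word counts and total word count for one class
    counts = Counter(w for d in docs for w in d.split())
    return counts, sum(counts.values())

spam_counts, spam_total = fit(spam)
ham_counts, ham_total = fit(ham)
vocab = set(spam_counts) | set(ham_counts)

def log_score(doc, counts, total, prior):
    # log P(C) + sum of log P(word | C), with add-one (Laplace) smoothing
    s = math.log(prior)
    for w in doc.split():
        s += math.log((counts[w] + 1) / (total + len(vocab)))
    return s

doc = "free money meeting"
print(log_score(doc, spam_counts, spam_total, 0.5))  # spam score (higher wins)
print(log_score(doc, ham_counts, ham_total, 0.5))    # ham score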
APPLICATIONS OF BAYESIAN NETWORKS
• Bayesian networks are used for modelling knowledge in computational
biology and bioinformatics (gene regulatory networks, protein structure,
gene expression analysis)
• Sports betting, learning epistasis from GWAS data sets
• Medicine
• Bio-monitoring
• Document classification, information retrieval
• Semantic search
• Image processing, data fusion, decision support systems
• Engineering, gaming, law, and risk analysis.
• There are texts applying Bayesian networks to bioinformatics, and to
financial and marketing informatics.
Summary
• Bayesian networks represent a joint distribution using a
graph
• The graph encodes a set of conditional independence
assumptions
• Answering queries (or inference or reasoning) in a Bayesian
network amounts to efficient computation of appropriate
conditional probabilities
• Probabilistic inference is intractable in the general case
– But can be carried out in linear time for certain classes of
Bayesian networks