Graphical Models Graphical Models (ch 8.1-2)

Graphical Models
(ch 8.1-2)
from “Pattern Recognition and Machine
Learning” by C
C. M
M. Bishop
Contents
ƒ 8.1 Bayesian networks
Š
Š
Š
Š
Example: polynomial regression
Generative models
Discrete variables
Linear Gaussian models
ƒ 8.2
8 2 Conditional independence
Š Three example graphs
Š D-separation
D separation
Bayesian Networks
ƒ A simple way to visualize the structure of a probabilistic model.
ƒ Insights (including conditional independence properties) can be
obtained
ƒ Complex computations can be expressed in terms of graphical
manipulations
ƒ Each
h node
d represents a random
d
variable
i bl and
d lilinks
k represent
probabilistic relationships between these variables
ƒ The graph captures the decomposition of the joint distributions
over the set of random variables into a product of factors
Š Directed graphical models: Bayesian networks
Š Undirected graphical models: Markov random fields
Bayesian Networks
ƒ
ƒ
Example: Polynomial regression
ƒ
ƒ Addition
Addi i
off a new iinput value
l
d the
h
x̂ˆ and
corresponding variable tˆ
Observed variable
Hidden variable
Discrete variables
ƒ Discrete variables with K possible states
ƒ Number of independent parameters:
Š
Š
Š
M independent
p
discrete variables: M(K-1)
M fully connected discrete variables: KM-1
A chain of M discrete nodes: K-1+(M-1)K(K-1)
ƒ Controlling the number of parameters:
Š 2M vs logistic sigmoid function (linear with M)
Linear Gaussian models
ƒ
ƒ Mean and covariance of the joint distribution:
distribution
Linear Gaussian models
ƒ Two extreme cases, intermediate case
Š no links: D isolated nodes, total of D+D parameters (diagonal
covariance)
Š Fully connected: each node has all lower numbered nodes as
parents. D(D-1)/2 + D parameters
Š Intermediate case: partially connected
ƒ Conditional distribution of node i:
Conditional independence
ƒ Conditional independence
( a is conditionally independent of b given c )
ƒ a and b are statistically independent given c :
ƒ D-separation:
D separation: conditional independence properties can be
checked without analytical manipulations
Three example graphs
ƒ Example 1 (tail to tail)
Š Without conditions:
Š With conditions:
Three example graphs
ƒ Example 2 (head to tail)
Š With conditions:
ƒ Example 3 (head to head)
Š Without conditions:
Š With conditions:
Three example graphs
ƒ
Ah
head
d tto h
head
d node
d bl
blocks
k a path
th if it iis u
unobserved,
b
d but once th
the
node or at least one of its descendents is observed, the path becomes
unblocked
ƒ
B state off battery,
B:
b
F:
F state off ffuell tank,
k G
G: state off ffuell gauge
the state of fuel tank
and battery are dependent
after observing fuel gage
‹
D separation
D-separation
ƒ Conditions for D-separation : if all paths are blocked, then A is
d-separated from B by C
Š In (a), a is not d-separated from b by f
Š In (b), a is d-separated from b by f
ƒ Example of i.i.d. samples
Tail to tail path
ƒ
Graphical model as a filter (equivalence between d-separation
property and graph factorization)
ƒ Markov blanket, Markov boundary: set of nodes for parents,
children, coparents
Š Minimal set of nodes to isolate xi from the rest of the g
graph
p