Professional Documents
Culture Documents
Vikas Kedia
Outline
Inferencing Algorithms
Application Example
Graphical Models
Introduction
I
I
Graphical Models
I
Markov Property
I
YV
CQC (YVC )
C
C (YVC )
Example Graph
p(XV ) =
Inferencing Algorithm
p(XV )
p(XV )
Naive Approach
Example
P
p(x1 )
=
=
=
=
=
=
=
(x1 , x2 )
(x1 , x3 )
PZ
(x1 , x2 )
(x1 , x3 )
(x1 , x2 )
Z
P
(x1 , x3 )m5 (x2 , x3 ) x4 (x2 , x4 )
Z
(x1 , x2 )m4 (x2 )m3 (x1 , x2 )
x2
x2
x2
x2
x2
x3
x3
x3
Z
m2 (x1 )
Z
x4
x4
(x2 , x4 )
x5
(x3 , x5 )
x6
(x2 , x5 , x6 )
Z
P
(x2 , x4 ) x5 (x3 , x5 )m6 (x2 , x5 )
P Z
x3 (x1 , x3 )m5 (x2 , x3 )
Some Observations
Complexity is therefore r 3
Treewidth
Limitation of HMM
Linear CRF
p(y |x) =
I
I
P
P
exp( eE ,k k fk (ye , x) + v V ,k k gk (yv , x))
Z (x)
L=
X X
X
k fk (ye , x) +
k gk (yv , x) log Z (x))
(
j
eE ,k
v V ,k
Reconciliation
Example
Record
b1
b2
b3
b4
Title
Record Linkage using CRFs
Record Linkage using CRFs
Learning Boolean Formulas
Learning of Boolean Expressions
Author
Linda Stewart
Linda Stewart
Bill Johnson
William Johnson
Venue
KDD-2003
9th SIGKDD
KDD-2003
9th SIGKDD
Collective Model
Nodes for all possible pairs of values for each attribute called
evidence nodes
References I
M. I. Jordan.
Graphical models.
In Statistical Science (Special Issue on Bayesian Statistics),
pages 140155, 2004.
John Lafferty, Andrew McCallum, and Fernando Pereira.
Conditional random fields: Probabilistic models for segmenting
and labeling sequence data.
In Proc. 18th International Conf. on Machine Learning, pages
282289. Morgan Kaufmann, San Francisco, CA, 2001.
Parag and P. Domingos.
Multi-relational record linkage.
In Proceedings of the KDD-2004 Workshop on
Multi-Relational Data Mining, pages 3148, 2004.