EM algorithm-ppt
EM algorithm-ppt
case A B C D E F
1 a1 b1 ∗ d1 e1 *
2 a2 ∗ c2 d2 e2 ∗
.. .. .. .. .. .. ..
. . . . . . .
n an bn cn ∗ ∗ ∗
• non-reply in surveys;
• non-reply for specific questions: ”missing” ∼ don’t
know, essentially an additional state for the variable
in question
• recording error
• variable out of range
• just not recorded (e.g. too expensive)
f (M | Y, θ) = f (M | θ), i.e. M ⊥
⊥ Y | θ.
6
KL(fθy∗ : fθy ) ≥ 0
∇ly (θ∗ )
ly (θ) − ly (θ∗ )
q(θ | θ∗ ) − q(θ∗ | θ∗ )
- θ
θ∗
ly (θ) − ly (θ∗ ) = q(θ | θ∗ ) + KL(fθy∗ : fθy )
∂ ∂
∇ly (θ∗ ) = ly (θ) = q(θ | θ∗ ) .
∂θ θ=θ ∗ ∂θ θ=θ ∗
Kullback-Leibler divergence
Then
where
and
n3∗ 3 3 2
ij = E(Nij | p, n+j ) = pi | j n+j .
We thus get
pij pij
n2∗
ij = n2 , n3∗
ij = n3 . (3)
pi0 + pi1 i+ p0j + p1j +j
The M-step now maximizes log L(p) = ij n∗ij log pij by
P
letting
pij = (n1ij + n2∗ 3∗
ij + nij )/n (4)
where n is the total number of observations.
The EM algorithm alternates between (3) and (4) until
convergence.