Professional Documents
Culture Documents
∑ λ Xi =∑ λ Yj = 0
i=1 j=1
we can write the abaove model in matrix form as follows:
logm 11 logm 1 1 1 1
μ
logm 12 logm 2 1 1 −1
logm = = = λ Xi = Xβ
logm 21 logm 3 1 −1 1
λ Yj
logm 22 logm 4 1 −1 −1
where
β1 μ
β = β 2 = λ Xi
β3 λ Yj
Thus all loglinear models can bae written in the form logm = Xβ where β is a model
parameter column vector containing t parameters.
ni
n
e =m i m i i
For Poisson sampling, the likelihood of the data is Π ni!
so that the kernel is Π e =m i m ni i .
i i
The log likelihood of the kernel is
Lm =∑ n i logm i −∑ m i =∑ n i ∑ x ij β j −∑ exp∑ x ij β j
i i i j i j
Since m i = exp ∑ x ij β j
j
∂Lm
=∑ n i x ij −∑ m i x ij , j = 1, ..., t
∂β j
i i
The likelihood equations equate these derivataives to zero and have form X ′ n = X ′ m.
The matrix of second partial derivatives of the log likelihood has elements
1
∂ 2 Lm
∂β j ∂β k
=− ∑ x ij ∂m i
∂β k
=− ∑ x ij x ik m i
i i
X′DiagmX
As n → ∞, the maximum likelihood estimator β of β has an asymptotic normal distribution
with mean β and covariance matrix equal to the inverse
of the information matrix. Thus, for
Poisson sampling, the estimated covariance matrix of β is
−1
Covβ = X′DiagmX
Similar results hold when we use maximum likelihood estimation after conditioning on
n.When the n i are independent Poisson random variables (i = 1, ..., N) the conditional
distribution of m given n is multinomial with parameters π i = m i / ∑ m i . It is not necessary to
develop separate inferential theory for multinomial loglinear models,as shown by the
following (see Birch 1963).
= nμ + ∑ n i x i β τ − τ
where
τ= ∑ m i = ∑ expμ + x i β τ
Since
logτ = μ + log∑ expx i β τ
this log likelihood has the form
Lτ, β τ = ∑ n i x i β τ − n log∑ expx i β τ + n logτ − τ
Now
π i = m i / ∑ m i = expμ + x i β r / ∑ expμ + x i β τ
and exp μ cancels in the numerator and the denominator.
2
Thus the first term in Lτ, β τ is
∑ n i x i β τ − n log∑ expx i β τ = ∑ n logπ i
which is the multinomial log likelihood conditional on n.
A similar argument applies when there are several independent multinomial samples rather
than a single one. Each log likelihood term is a sum of components from diferent samples,
but the Poisson log likelihood again decomposes into two parts- one part is a Poisson log
likelihood for the independent sample sizes, and the other part is the sum of the
independent multinomial log likelihoods.