Professional Documents
Culture Documents
∂ℓ 1
= − 2 X T (Xβ − y) = 0,
∂β σ
∂ℓ n 1 T
2
=− 2 + 2 (y − Xβ) (y − Xβ) = 0,
∂σ 2σ 2 (σ 2 )
we obtain
−1 T
β̂ = X T X X y, (1)
1 T 1
σ̂ 2 = y − X β̂ y − X β̂ = y T I − X(X T X)−1 X T y. (2)
n n
1X
n
1 T 2
σ̂ 2 = (y − ŷ) (y − ŷ) = (ŷi − yi ) . (3)
n n i=1
b
n 1 T
2
β̂(0) , σ̂(0) = arg max ℓ β(0) , σ = arg min log(σ 2 ) + 2 y − Xβ(0) (y − Xβ(0) )
2 2σ
implies that
T
β̂(0) = arg min y − Xβ(0) (y − Xβ(0) )
= arg min (y − β0 1) (y − β0 1).
T
Hence
−1 1X
n
β̂0 = 1 1
T
1 T
y= yi = ȳ, (4)
n i=1
1 T 1X
n
2
σ̂(0) = y − X β̂(0) y − X β̂(0) = (yi − ȳ)2 . (5)
n n i=1
Li Huiteng Homework 1 April 25, 2023
Since
n n
ℓ β̂, σ̂ = − log 2πσ̂ 2 − ,
2 2n
n
ℓ β̂(0) , σ̂(0) = − log 2πσ̂(0) − ,
2
2 2
we obtain
!
2
σ̂(0)
W = 2 ℓ β̂, σ̂ − ℓ β̂(0) , σ̂(0) = n log .
σ̂ 2
The F -test for regression is denoted as the quotient of Mean of Squares for
Model(MSM) and Mean of Squares for Error(MSE), i.e.,
∑
(ŷi −ȳ)2 P P 2
!
M SM n−p (yi − ȳ)2 − (ŷi − yi )2 n−p σ̂(0)
F = = ∑
n−1
= · P = −1 .
M SE (ŷi −yi )2 n−1 (ŷi − yi )2 n−1 σ̂ 2
n−p
c
1 T 1 T
f (β | y) ∝ f (y | β)f (β) ∝ exp − 2 (y − Xβ) (y − Xβ) − 2 β β
2σ 2τ
1
= exp − 2 β Aβ − 2β X y + y y
T T T T
2σ
1
−1 T
= exp − 2 ⟨β, β⟩ − 2 β, A X y + y y T
2σ
1
= C exp − 2 ⟨β − µ, β − µ⟩
2σ
1
= C exp − 2 (β − µ) A(β − µ) ,
T
2σ
where
A = X T X + λI,
⟨x1 , x2 ⟩ = xT1 Ax2 , ∀x1 , x2 ∈ Rp ,
µ = A−1 X T y,
and C is independent of β. The last three steps hold since A is a positive definite
matrix and thus induces a well-posed inner product ⟨·, ·⟩.
Li Huiteng Homework 1 April 25, 2023
d
(TBC)
we obtain
λ T 1 T
p(θ | y) ∝ exp − 2 θ Ωθ − 2 (y − Nθ) (y − Nθ)
2σ 2σ
1
= exp − 2 θ Aθ − 2θ N y + y y
T T T T
2σ
1
= C exp − 2 (θ − µ) A(θ − µ) ,
T
2σ
where
A = NT N + λΩ,
µ = A−1 NT y,
E(θ | y) = µ,
cov(θ | y) = A−1 σ 2 .
And f (Nθ | y) ∼ N Nµ, σ 2 NA−1 NT , that is
−1 T
E(Nθ | y) = N NT N + λΩ N y,
−1
cov(Nθ | y) = σ 2 N NT N + λΩ NT .