Maximum Likelihood Estimation: Numerical Solution for Gamma Distribution¹

The Gamma distribution has the probability density function:

f(y \mid \alpha, \beta) = \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, y^{\alpha-1} \exp\left(-\frac{y}{\beta}\right)  (1)

where y ≥ 0, β is the scale parameter, and α is the shape parameter (assumed known).

Maximum Likelihood Estimation


Let y_1, \ldots, y_n denote the data. Assume the y_i are independent random variables sharing the same parameters, each following the Gamma distribution described in (1):

f(y_i \mid \beta) = \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, y_i^{\alpha-1} \exp\left(-\frac{y_i}{\beta}\right)  (2)
Then their joint probability distribution is:

f(y_1, \ldots, y_n \mid \beta) = \prod_{i=1}^{n} f(y_i \mid \beta) = \prod_{i=1}^{n} \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, y_i^{\alpha-1} \exp\left(-\frac{y_i}{\beta}\right)  (3)

The likelihood function is:

L(\beta \mid y_1, \ldots, y_n) = f(y_1, \ldots, y_n \mid \beta)  (4)

The log-likelihood function is:

\ell(\beta \mid y_1, \ldots, y_n) = \log\left( L(\beta \mid y_1, \ldots, y_n) \right)

= \log \prod_{i=1}^{n} \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, y_i^{\alpha-1} \exp\left(-\frac{y_i}{\beta}\right)

= \sum_{i=1}^{n} \log\left( \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, y_i^{\alpha-1} \exp\left(-\frac{y_i}{\beta}\right) \right)

= \sum_{i=1}^{n} \left( -\log \Gamma(\alpha) - \alpha \log \beta + (\alpha - 1) \log y_i - \frac{y_i}{\beta} \right)  (5)

The maximum likelihood estimator (MLE), denoted by β̂, is such that:

\hat{\beta} = \arg\max_{\beta} \left\{ \ell(\beta \mid y_1, \ldots, y_n) \right\}  (6)
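As a quick sketch, the log-likelihood (5) can be transcribed directly into R. The function name gamma_loglik is ours for illustration; it is not part of the paper's distributed code.

```r
# log-likelihood of the Gamma scale parameter beta (shape alpha known),
# as in equation (5); gamma_loglik is an illustrative name only
gamma_loglik <- function(beta, y, alpha = 2) {
  sum(-lgamma(alpha) - alpha * log(beta) + (alpha - 1) * log(y) - y / beta)
}
```

It agrees with sum(dgamma(y, shape = alpha, scale = beta, log = TRUE)), which offers a convenient check.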


¹ To cite this document and the implemented code: Alvarado, M. (2020, September 8). Maximum Likelihood Estimation: Numerical Solution for Gamma Distribution (Version v1.0.0). Zenodo. http://doi.org/10.5281/zenodo.4019900. Also at GitHub: https://github.com/miguel-alvarado-stats/MLE_Gamma.
To maximize the log-likelihood function (5), we require its derivative with respect to β. The resulting function is called the score function, denoted by U(β | y_1, …, y_n):

U(\beta \mid y_1, \ldots, y_n) = \frac{\partial \ell(\beta \mid y_1, \ldots, y_n)}{\partial \beta} = \sum_{i=1}^{n} \left( -\frac{\alpha}{\beta} + \frac{y_i}{\beta^2} \right) = -\frac{n\alpha}{\beta} + \frac{1}{\beta^2} \sum_{i=1}^{n} y_i  (7)

Then the MLE β̂ is the solution of:

U(\beta = \hat{\beta} \mid y_1, \ldots, y_n) = 0  (8)
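The score (7) is likewise a one-liner in R. The name gamma_score is ours for illustration, not part of the paper's code.

```r
# score function U(beta) from equation (7): -n*alpha/beta + sum(y)/beta^2
# gamma_score is an illustrative name, not part of the distributed code
gamma_score <- function(beta, y, alpha = 2) {
  n <- length(y)
  -n * alpha / beta + sum(y) / beta^2
}
```

Note that for this model, setting (7) to zero also gives the closed form β̂ = Σ y_i / (nα) = ȳ/α, which provides a handy check on the iterative results.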

Maximum Likelihood Estimation: Newton-Raphson Method

Purely for notation, let us write equation (8) as:

U(\beta^{*}) = 0  (9)

Equation (9) is, in general, a nonlinear equation, which can be approximated by a Taylor series:

U(\beta^{*}) \approx U(\beta^{(t)}) + U'(\beta^{(t)}) \left( \beta^{*} - \beta^{(t)} \right)  (10)

Then, substituting (10) into (9) and solving for β*:

U(\beta^{(t)}) + U'(\beta^{(t)}) \left( \beta^{*} - \beta^{(t)} \right) = 0

\beta^{*} = \beta^{(t)} - \frac{U(\beta^{(t)})}{U'(\beta^{(t)})}  (11)
where U' is the derivative of the score function (7) with respect to β:

U'(\beta \mid y_1, \ldots, y_n) = \frac{\partial U(\beta \mid y_1, \ldots, y_n)}{\partial \beta} = \frac{n\alpha}{\beta^2} - \frac{2}{\beta^3} \sum_{i=1}^{n} y_i  (12)

Then, with the Newton-Raphson method, starting from an initial guess β^{(1)}, successive approximations are obtained using (13) until the iterative process converges:

\beta^{(t+1)} = \beta^{(t)} - \frac{U(\beta^{(t)})}{U'(\beta^{(t)})}  (13)
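Since MLE_NR_Gamma is distributed only as an .RData object, here is a minimal sketch of what iteration (13) could look like, assuming the first guess is min(y); nr_gamma is our illustrative name, not the paper's implementation.

```r
# Newton-Raphson iteration (13) for the Gamma scale parameter;
# a sketch under our own assumptions, not the paper's MLE_NR_Gamma
nr_gamma <- function(y, shape = 2, tol = 1e-8, max_iter = 100) {
  n <- length(y); S <- sum(y)
  U  <- function(b) -n * shape / b + S / b^2        # score, equation (7)
  dU <- function(b)  n * shape / b^2 - 2 * S / b^3  # its derivative, equation (12)
  beta <- min(y)                                    # first guess
  for (t in seq_len(max_iter)) {
    step <- U(beta) / dU(beta)
    beta <- beta - step                             # update, equation (13)
    if (abs(step) < tol) break
  }
  beta
}
```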
To illustrate the use of the Newton-Raphson method, we simulate a data set (N = 15) from a Gamma distribution whose shape parameter α (assumed known) equals 2, together with some positive value for the scale parameter β, which for this example we take equal to 2.²

² The package extraDistr provides a toolkit for the Gamma distribution. In particular, the scale parameter β of the Gamma distribution is supplied in the form scale = 1/rate.
# load packages required
library(extraDistr)

shape <- 2
rate <- 0.5
N <- 15

set.seed(8257)
Y <- rgamma(N, shape, scale = 1/rate)
Y

## [1] 7.9405506 6.8116147 4.6820307 6.3833320 4.4570320 1.2385520 0.5243205


## [8] 4.2149631 7.7965736 1.0916002 1.7835166 3.1615198 3.2806975 1.2061107
## [15] 4.5700020

We load the code, developed into our R function MLE_NR_Gamma, which is stored in an R object of the same name.

# load the function to solve by Newton-Raphson


load("MLE_NR_Gamma.RData")

The function MLE_NR_Gamma only needs the data vector Y; it takes the minimum value of the sample, β = min{y}, as the first guess for the iterative process, along with other default arguments that can be modified, such as the shape parameter α (i.e. shape = 2).

# MLE by Newton-Raphson (NR) for Gamma distribution


MLE_NR_Gamma(Y)

## ML Estimator Likelihood Log-Likelihood


## [1,] "0.5243205" "4.54775195701934e-34" "-76.7732601263105"
## [2,] "0.7462714" "4.2497602616121e-24" "-53.8151796595443"
## [3,] "1.0322944" "8.66065680277306e-19" "-41.5903262039121"
## [4,] "1.3653770" "2.31178080438159e-16" "-36.0033433493826"
## [5,] "1.6864185" "1.56223856568787e-15" "-34.0926566242412"
## [6,] "1.8994213" "2.24929259750235e-15" "-33.7281606292388"
## [7,] "1.9663410" "2.29684276062607e-15" "-33.7072409277801"
## [8,] "1.9713878" "2.29707167336356e-15" "-33.7071412686589"
## [9,] "1.9714139" "2.29707167937519e-15" "-33.7071412660419"

Then, the MLE by the Newton-Raphson method is β̂ = 1.9714139.

Maximum Likelihood Estimation: Fisher-Scoring Method

A distribution belongs to the exponential family if it can be written in the form:

f(y \mid \beta) = \exp\left( \frac{a(y)\, b(\beta) - c(\beta)}{\phi} + d(y, \phi) \right)  (14)

Since (1) can be written as a member of the exponential family as in (14):

f(y \mid \beta) = \exp\left( \log\left( \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, y^{\alpha-1} \exp\left(-\frac{y}{\beta}\right) \right) \right) = \exp\left( y \left(-\frac{1}{\beta}\right) - \alpha \log \beta + (\alpha - 1) \log y - \log \Gamma(\alpha) \right)  (15)

where a(y) = y, b(β) = −1/β, c(β) = α log β, d(y, φ) = (α − 1) log y − log Γ(α), and φ = 1.
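The decomposition (15) can be verified numerically against the density (1); gamma_expfam is an illustrative name of ours, not from the paper's code.

```r
# Gamma density rebuilt from the exponential-family form (15);
# gamma_expfam is an illustrative name, not from the paper's code
gamma_expfam <- function(y, beta, alpha = 2) {
  exp(y * (-1 / beta) - alpha * log(beta) + (alpha - 1) * log(y) - lgamma(alpha))
}
```

For any y > 0 it matches dgamma(y, shape = alpha, scale = beta).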

Then, since the Gamma distribution belongs to the exponential family, it can be shown that the variance of U, denoted by \mathcal{J}, is:

\mathcal{J} = \mathrm{Var}\{U\} = -\mathrm{E}\{U'\}  (16)

where:

\mathrm{E}\{U'\} = \frac{1}{\phi} \left( b''(\beta)\, \frac{c'(\beta)}{b'(\beta)} - c''(\beta) \right)  (17)
For MLE, it is common to approximate U' by its expected value E{U'}. In this case:

\mathcal{J} = -\mathrm{E}\{U'\} = \mathrm{E}\{-U'\} = \mathrm{E}\left\{ -\sum_{i=1}^{n} U_i' \right\} = \sum_{i=1}^{n} -\mathrm{E}\{U_i'\} = \sum_{i=1}^{n} -\frac{1}{\phi} \left( b''(\beta)\, \frac{c'(\beta)}{b'(\beta)} - c''(\beta) \right)  (18)
where, using (1), the required derivatives are:

b'(\beta) = \frac{1}{\beta^2}, \qquad b''(\beta) = -\frac{2}{\beta^3}, \qquad c'(\beta) = \frac{\alpha}{\beta}, \qquad c''(\beta) = -\frac{\alpha}{\beta^2}, \qquad \frac{1}{\phi} = 1
Then, replacing them into (18):

\mathcal{J} = \sum_{i=1}^{n} -\left( \left(-\frac{2}{\beta^3}\right) \frac{\alpha/\beta}{1/\beta^2} - \left(-\frac{\alpha}{\beta^2}\right) \right) = \sum_{i=1}^{n} \frac{\alpha}{\beta^2} = \frac{n\alpha}{\beta^2}  (19)

Finally:

\mathcal{J} = -\mathrm{E}\{U'\} = \frac{n\alpha}{\beta^2}, \qquad -\mathcal{J} = \mathrm{E}\{U'\} = -\frac{n\alpha}{\beta^2}  (20)
Then, approximating U' by its expected value E{U'} = −\mathcal{J}, equation (13) becomes:

\beta^{(t+1)} = \beta^{(t)} + \frac{U(\beta^{(t)})}{\mathcal{J}(\beta^{(t)})}  (21)
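Swapping −U' for \mathcal{J} in the Newton-Raphson scheme gives a sketch of iteration (21); fs_gamma is our illustrative name, not the distributed MLE_FS_Gamma.

```r
# Fisher-Scoring iteration (21) with J = n*shape/beta^2 from equation (19);
# a sketch under our own assumptions, not the paper's MLE_FS_Gamma
fs_gamma <- function(y, shape = 2, tol = 1e-8, max_iter = 100) {
  n <- length(y); S <- sum(y)
  U <- function(b) -n * shape / b + S / b^2  # score, equation (7)
  J <- function(b)  n * shape / b^2          # expected information, equation (19)
  beta <- min(y)                             # first guess
  for (t in seq_len(max_iter)) {
    step <- U(beta) / J(beta)
    beta <- beta + step                      # note the plus sign, equation (21)
    if (abs(step) < tol) break
  }
  beta
}
```

For this particular model the update simplifies: β + U(β)/\mathcal{J}(β) = Σ y_i / (nα) for any β, so Fisher-Scoring lands on the maximizer in a single step, which is why the output below already reaches the final value on its second row.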
To illustrate the use of the Fisher-Scoring method, we use the same data as in the Newton-Raphson example. We load the code, developed into our R function MLE_FS_Gamma, which is stored in an R object of the same name.

# load the function to solve by Fisher-Scoring


load("MLE_FS_Gamma.RData")

The function MLE_FS_Gamma only needs the data vector Y; it takes the minimum value of the sample, β = min{y}, as the first guess for the iterative process, along with other default arguments that can be modified, such as the shape parameter α (i.e. shape = 2).

# MLE by Fisher-Scoring (FS) for Gamma distribution


MLE_FS_Gamma(Y)

## ML Estimator Likelihood Log-Likelihood


## [1,] "0.5243205" "4.54775195701934e-34" "-76.7732601263105"
## [2,] "1.9714139" "2.29707167937518e-15" "-33.7071412660419"
## [3,] "1.9714139" "2.29707167937519e-15" "-33.7071412660419"

Then, the MLE by the Fisher-Scoring method is β̂ = 1.9714139.

In summary, both approaches, Newton-Raphson and Fisher-Scoring, reach the same estimate for the MLE; however, the latter does so more efficiently.

Naive Approach

The main idea behind the Maximum Likelihood (ML) method is to choose the estimates of the unknown parameters that maximize the joint probability of our observed data (our sample). Keeping this idea in mind, if we want to obtain the MLE while avoiding a numerical solution, a naive approach is to set a large range of possible values for the unknown parameters, evaluate the log-likelihood function (or, equivalently, the likelihood function) at each point, and take the point at which it reaches its maximum value as our MLE.

We build a set of values for the scale parameter β by letting rate run from 0.2 to 1 in steps of 0.001 (since scale = 1/rate), and evaluate (4) and (5) at each point of this set.

# package dplyr is required for the pipe operator and filter()
library(dplyr)

# set a large range of values for the parameter


rate <- seq(0.2, 1, 0.001)
tabla <- matrix(NA, nrow = length(rate), ncol = 4)
tabla[,1] <- rate
tabla[,2] <- 1/tabla[,1]

# evaluate the likelihood and the log-likelihood at each point

for (i in seq_along(rate)){
  tabla[i,3] <- prod(dgamma(Y, shape = 2, scale = 1/rate[i], log = FALSE))
  tabla[i,4] <- sum(dgamma(Y, shape = 2, scale = 1/rate[i], log = TRUE))
}
colnames(tabla) <- c("rate", "beta", "Likelihood", "Log-Likelihood")
df_tabla <- as.data.frame(tabla)

Then, the plot of the likelihood function evaluated at each point:

[Figure 1: Family of Likelihood Function (likelihood against the parameter β)]

The point at which the likelihood function reaches its maximum is our MLE from the naive approach:

mxl <- max(tabla[,3])


df_tabla %>% filter(`Likelihood` == mxl)

## rate beta Likelihood Log-Likelihood


## 1 0.507 1.972387 2.297063e-15 -33.70714

Also, the plot of the log-likelihood function evaluated at each point:

[Figure 2: Family of Log-Likelihood Function (log-likelihood against the parameter β)]

The point at which the log-likelihood function reaches its maximum, which is the same point at which the likelihood function reaches its maximum, is our MLE from the naive approach:

mxllog <- max(tabla[,4])


df_tabla %>% filter(`Log-Likelihood` == mxllog)

## rate beta Likelihood Log-Likelihood


## 1 0.507 1.972387 2.297063e-15 -33.70714

As we can see, this naive approach reaches a value close enough to the one obtained with the numerical solutions: the Newton-Raphson and Fisher-Scoring methods.
