Professional Documents
Culture Documents
Chapter 2.
2.1. Introduction
Probability and Statistics are important for:
• input modeling
• generation of random samples
• validation of simulation models
• performance of statistical analyses of outputs
• design of experiments
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:3
2.2. Probability
• Experiment:
Any activity with uncertain outcome.
• Sample Space:
The set of all possible outcomes of an experiment.
• Discrete Sample Space:
It has finitely many or a countable infinity of elements.
• Continuous Sample Space:
Its elements (points) constitute a continuum.
• Event:
Any subset of a sample space is called an event.
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:4
• Impossible Event:
The empty set ∅ is called the impossible event.
• Mutually Exclusive Events:
Two events A1 and A2 are said to be mutually exclusive, if and
only if,
A1 ∩ A2 = ∅.
Events A1 , A2, A3, . . . are mutually exclusive, if and only if,
Ai ∩ Aj = ∅ for i 6= j.
• Probability of an Event:
The probability of an event (or outcome) is the proportion of times
the event occur in a long run of repeated experiments.
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:5
• Bayes’ Theorem:
If B1, B2, . . . , Bn are mutually exclusive events of which one must
occur, then for any event A in S such that P (A) 6= 0,
P (A ∩ Br ) P (Br ) P (A|Br )
P (Br |A) = = n for r = 1, . . . , n.
P (A) X
P (Bi) P (A|Bi)
i=1
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:8
• Properties:
◦ F (−∞) = 0
◦ F (∞) = 1
◦ If a < b, then F (a) ≤ F (b) for any real numbers a and b
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:11
• If f (x) and F (x) are, respectively, values of the PDF and the CDF
of X at x, then
P (a ≤ X ≤ b) = F (b) − F (a)
for any real constants a and b with a ≤ b, and
d F (x)
f (x) =
dx
wherever the derivative exists.
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:13
• The r-th and s-th product moment about the origin of X and Y :
µr,s′ = E(X r Y s),
for r = 0, 1, 2, . . . and s = 0, 1, 2, . . .
• µ′1,0 = E(X) = µX , and µ′0,1 = E(Y ) = µY .
• The r-th and s-th product moment about the respective means of
X and Y :
µr,s = E (X − µX )r (Y − µY )s ,
for r = 0, 1, 2, . . . and s = 0, 1, 2, . . .
2
• If µX and µY exist, then µ2,0 = σX , and µ0,2 = σY2 .
• µ1,1 = E (X − µX )(Y − µY ) is called the covariance between X
and Y , and it is denoted by σX,Y or Cov(X, Y ).
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:22
2.8. Independence
• Let X and Y be continuous/discrete random variables.
• X and Y are independent, if and only if,
fX,Y (x, y) = fX (x) fY (y)
for all (x, y) within their range.
• Independent ⇒ σX,Y = 0
Independent 6 ⇐ σX,Y = 0
• Independent ⇒ E(XY ) = E(X) E(Y )
Independent 6 ⇐ E(XY ) = E(X) E(Y )
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:25
• µX̄ ≡ E(X̄) = µ
2 σ2
• σX̄≡ Var(X̄) =
n
2
• σX̄ (the positive square root of σX̄ ) is called the standard error of
the mean.
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:28
• The statistics
2 (n − 1)S 2
χ =
σ2
has the chi-square distribution with (n − 1) degrees of freedom.
• The statistics
X̄ − µ
T = √
S/ n
has the t distribution with (n − 1) degrees of freedom.
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:31
Standard
Normal
t with ν degrees
of freedom
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:37
Expression in Arena:
f(x)
UNIForm(a, b)
Parameters:
0
• a is the minimum a
x
b
a+b (b − a)2
E(X) = and Var(X) =
2 12
Applications:
• Used when all values over a finite range are considered to be equally
likely
• Used when no information other than the range is available
• Used to transform to other random variates
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:46
Expression in Arena:
TRIAngular(a, m, b)
f(x)
Parameters:
• a is the minimum 0
a m b
x
a+m+b
E(X) =
3
a + m2 + b2 − am − ab − mb
2
Var(X) =
18
Applications:
• Used when the exact form of the distribution is not known, but
estimates for the minimum, maximum, and most likely values are
available
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:49
f(x)
0.8
0.6
Expression in Arena: 0.4
NORMal(µ, σ) 0.2
0
−3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5 3
Parameters: x
Applications:
• Used in situations in which the Central Limit Theorem applies
• Used when the distribution appears symmetric
• Use to modeling equity returns, asset returns, errors in observations
of real phenomena
• Used to model the changes in many financial quantities
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:52
lognormal(µ, σ 2) or LN (µ, σ 2) 1
f(x)
0.8
0.4
LOGNormal(µl , σl )
0.2
0
0 0.5 1 1.5 2 2.5 3
Parameters: x
( 2 )
1 ln(x) − µ
1
√ exp − for x > 0
f (x) = xσ 2π 2 σ
0 otherwise
Applications:
• Used when the quantity is the product of a large number of random
quantities
• Used to represent task times that have a distribution skewed to the
right
• Used as a rough model in the absence of data
• Used to model equity prices
Note:
X ∼ LN (µ, σ 2) ⇐⇒ ln(X) ∼ N (µ, σ 2)
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:55
Expression in Arena:
f(x)
EXPOnential(β) 0.4
0.2
Parameter:
β > 0 is the scale parameter 0
0 1 2 3 4 5 6
x
exponential(β)
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:56
1 exp(−x/β)
for x > 0
f (x) = β
0 otherwise
Applications:
• Used to model inter-event times in random arrival
• Used to model the interval of time between the calls
Notes:
• The exponential distribution is the only continuous distribution
with the memoryless property, i.e.,
P (X > s + t | X ≥ s) = P (X > t) for all s ≥ 0 and t ≥ 0
Common Notations:
0.3
k-Erlang(β) or Erlang-k(β)
f(x)
0.2
Expression in Arena:
ERLAng(β, k) 0.1
0
0 2 4 6 8 10 12 14 16
Parameters: x
Applications:
• Used in simulation in which an activity occurs in successive phases
and each phase has an exponential distribution
• Used to represent the time required to complete a task
Note:
• If X1, . . . , Xk are IID random variables with Xi ∼ exponential(β),
then X = X1 + · · · + Xk ∼ Erlang(β, k).
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:61
gamma(α, β)
f(x)
0.6
GAMMa(β, α) 0.2
0
Parameters: 0 1 2 3
x
4 5 6 7
where Z ∞
Γ(α) = y α−1 e−y dy for α > 0
0
Note that
• Γ(y + 1) = y Γ(y) for y > 0
• Γ(y + 1) = y! for any nonnegative integer y
√
• Γ(y + 21 ) = π · 1 · 3 · 5 · · · (2y − 1)/2y for any positive integer y
√
• Γ( 21 ) = π
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:63
Applications:
• Used to represent the time required to complete some task
• Used to modeling financial losses
Notes:
• For a positive integer α, the gamma(α, β) distribution is called the
α-Erlang(β) distribution.
• If X1, . . . , Xm are independent random variables with Xi ∼
gamma(αi, β), then X = X1 +· · ·+Xm ∼ gamma(α1 +· · ·+αm , β).
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:65
Expression in Arena:
BETA(β, α), where β = α1 and α = α2
Parameters:
α1 > 0 and α2 > 0 are shape parameters.
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:66
where
Z 1
B(α1, α2 ) = tα1−1(1 − t)α2 −1 dt for α1 > 0 and α2 > 0
0
Note that
• B(α1, α2) = B(α2, α1)
Γ(α1) Γ(α2)
• B(α1, α2) =
Γ(α1 + α2)
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:67
α1 α1α2
E(X) = and Var(X) =
α1 + α2 (α1 + α2)2(α1 + α2 + 1)
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:68
Applications:
• Used as a rough model in the absence of data
• Used to represent random proportions
• Used to represent many input quantities that can be assumed to
have a range bounded on both ends
• Use to model random recovery rates when assessing credit risks
Notes:
• If X1 and X2 are independent random variables with Xi ∼
gamma(αi, β), then
X1
∼ beta(α1, α2).
X1 + X2
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:69
3 3
α1=1.5, α2=5.0 α1=1, α2=1
α =1.5, α =3.0 α =2, α =2
1 2 1 2
α =3.0, α =1.5 α =3, α =3
1 2 1 2
α1=5.0, α2=1.5 α1=5, α2=5
2 2
f(x)
f(x)
1 1
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
x x
3 3
α =0.5, α =2.0 α =0.8, α =0.2
1 2 1 2
α =1.0, α =2.0 α =0.5, α =0.5
1 2 1 2
α1=2.0, α2=1.0 α1=0.2, α2=0.8
α =2.0, α =0.5 α =0.8, α =0.8
1 2 1 2
2 2
f(x)
f(x)
1 1
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
x x
beta(α1, α2 )
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:70
Weibull(α, β)
f(x)
0.6
WEIBull(β, α) 0.2
0
0 0.5 1 1.5 2 2.5 3 3.5 4
Parameters: x
Applications:
• Used in reliability models to represent the lifetime of a device
• Used to represent nonnegative task times
• Used as a rough model in the absence of data
• Used to model extreme values from a bounded distribution
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:73
Common Notation:
0.6
χ2(ν)
f(x)
0.4
Expression in Arena:
Nil 0.2
0
0 1 2 3 4 5 6 7
x
Parameter: χ2 (ν)
ν (positive integer) is the degrees of freedom
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:74
1
ν/2−1 −x/2
x e for x > 0
f (x) = 2ν/2 · Γ(ν/2)
0 otherwise
Applications:
• Used in inferences
• The chi-square distribution with one degree of freedom is the
distribution of the hedging error from an option that is hedged
only discretely.
Notes:
• If Z ∼ N (0, 1), then Z 2 ∼ χ2(1).
• If X1, . . . , Xm are independent random variables with Xi ∼ χ2(νi ),
then X = X1 + · · · + Xm ∼ χ2(ν1 + · · · + νm ).
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:76
2.12.11. F Distribution
2
ν =1, ν =1
1 2
ν1=10, ν2=10
1.8
ν1=10, ν2=90
ν1=90, ν2=10
1.6
ν1=90, ν2=90
1.2
F (ν1, ν2)
f(x)
1
0.8
Nil 0.4
0.2
0
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
x
ν + ν ν ν1 /2
1 2 1
xν1 /2−1
Γ
2 ν2
for x > 0
(ν1 +ν2 )/2
f (x) = ν ν
1 2 ν1
Γ Γ x+1
2 2 ν
2
0 otherwise
Applications:
• Used in inferences
• Used to evaluate the extent of potential losses
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:79
f(x)
0.2
Expression in Arena:
Nil 0.1
0
−4 −3 −2 −1 0 1 2 3 4
x
Parameter: t(ν)
ν (positive integer) is the degrees of freedom
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:80
Poisson(λ) or Po(λ)
f(x)
0.2
Parameter:
λ>0
0.1
Expression in Arena:
POISson(λ) 0
0 2 4 6 8 10 12 14 16
x
Poisson(λ)
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:83
Applications:
• Used to model the number of random events occurring in a fixed
interval of time
• Used to model random batch size
• Used to model the number of claims in insurance
• Used to model financial shocks
Note:
• If X1, . . . , Xm are independent random variables with Xi ∼
Poisson(λi), then X = X1 + · · · + Xm ∼ Poisson(λ1 + · · · + λm).
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:85
Bernoulli(p) 0.5
0.4
f(x)
Parameter:
0.3
p ∈ [0, 1] is the probability
0.2
of success
0.1
Expression in Arena: 0
0 1
DISCrete(1 − p, 0, 1, 1) x
Bernoulli(0.7)
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:86
or B(n, p)
0.15
f(x)
Parameters: 0.1
Expression in Arena:
DISCrete F (0), 0, F (1), 1, . . . , F (n − 1), n − 1, 1, n
Applications:
• The probability of getting x successes in n independent Bernoulli
trials with probability p of success on each trial
• Used to model the number of defective items in a batch of size n
• Used in option pricing and option evaluations
Note:
• If X1, . . . , Xm are independent random variables and Xi ∼
binomial(ni, p), then
X = X1 + · · · + Xm ∼ binomial(n1 + · · · + nm , p).
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:90
geometric(p) or geom(p)
f(x)
0.15
Parameter: 0.1
of success
0
0 2 4 6 8 10 12 14 16 18 20
x
Notes:
• The geometric distribution is the only discrete distribution with the
memoryless property, i.e.,
P (X > s + t | X ≥ s) = P (X > t) for all s ≥ 1 and t ≥ 1.
NegativeBinomial(k, p) or 0.04
negbin(k, p) or NB(k, p)
f(x)
0.03
Parameters: 0.02
• p is the probability of 0
0 10 20 30 40 50 60
success x
N B(k, p)
Expression in Arena:
DISCrete F (x1), x1, . . . , F (xn), xn
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:94
x−1 k
p (1 − p)x−k for x ∈ {k, k + 1, k + 2, . . . }
f (x) = k−1
0 otherwise
Applications:
• The probability of getting the k-th success on the x-th trial
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:95
Note:
• Some books refer the probability of getting x failures before the
k-th success in a sequence of independent Bernoulli trials with
probability p of success on each trial as the negative binomial
distribution:
−
k + x 1
pk (1 − p)x for x ∈ {0, 1, 2, . . . }
f (x) = x
0 otherwise
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:96
0.25
0.2
Common Notation:
f(x)
0.15
Nil
0.1
0.05
0
0 2 4 6 8 10 12 14 16 18 20
x
Parameters:
• N (positive integer) is the population size
• n ∈ {1, . . . , N } is the number of draws
• m ∈ {0, 1, . . . , N } is the initial number of items with the desired
characteristics in the population
m m N − m N − n
E(X) = n and Var(X) = n
N N N N −1
Applications:
• The probability of getting x successes in n draws (without
replacement) from a finite population of N in which m of them
are looked upon as successes
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:99
Common Notations:
DiscreteUniform(a, b) or DU (a, b)
f(x)
Parameters:
• a (integer) is the minimum
• b (≥ a, integer) is the maximum 0
7 8 9 10 11 12 13 14 15 16
x
DU (7, 16)
Expression in Arena:
1 2
DISCrete b−a+1 , a, b−a+1 , a + 1, . . . , 1, b
Chi-Kong Ng, SEEM3410, Dept. of SEEM, CUHK 2:100
a+b (b − a + 1)2 − 1
E(X) = and Var(X) =
2 12
Applications:
• Used when all values over a finite range are considered to be equally
likely
• Used when no information other than the range is available