Professional Documents
Culture Documents
Input Modeling
System
Input
Raw Data Performance O t t
Output
D
Data
Simulation
30
20
10
0
4 8 12 16 20
40
35
30
25
7 14 20
an intersection
i t ti b
between
t 7 am 1
2
10
19
and 7:05 am was monitored 3 17
20
15
10
0
0 1 2 3 4 5 6 7 8 9 10 11
• BALI Characteristics
• San Francisco Bay Area
• Trace length: 24 hour
• Number of cells: 90
• Persons per cell: 1100
• Persons at all: 99.000 • Question: How to transform the
• Active persons: 66.550 BALI information
i f ti so that
th t it is
i
• Move events: 243.951 usable with a network simulator,
• Call events: 1.570.807 e.g., ns-2?
• Node number as well as connection
number is too high for ns-2
People
1000
• User group
800
600
400
• Between 2 local minima 200
50
• Communication characteristic
0 40
30 0
Ca
10
lls
10 15 ents
ve m
• A user represents a group 0
20 Mo
• Intra-
I and
d inter
i group
Number of People
15000
communication
• Interesting characteristic 10000
N
• Number of people with odd 5000
number movements is 0
negligible! -1 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
N b off M
Number Movements
t
35
60 35
30
50 30
25
5
25
20 40
20
15 30
15
10 20
10
5 10 5
0 0 0
0 10 20 30 40 0 10 20 30 40 0 10 20 30 40
30 35
30
25
25
20
20
15
15
10
10
5 5
0 0
0 5 10 15 20 25 30 35 0 5 10 15 20 25 30 35
⎛ j − 0 .5 ⎞
y j is approximat
pp y F −1 ⎜
ely ⎟
⎝ n ⎠
F-1()
yj
100,6
100,4
100,2
Straight
g line,, 100
supporting the 99,8
hypothesis of a 99,6
normal distribution 99,4
99,2
99 2
99,2 99,4 99,6 99,8 100 100,2 100,4 100,6 100,8
0,35
0,3
0,25
0,2
Superimposed
density function of 0,15
distribution 0,05
0
99,4 99,6 99,8 100 100,2 100,4 100,6
• Q-Q
Q Q plot
l t can also
l beb used
d to
t check
h k homogeneity
h it
• It can be used to check whether a single distribution can represent
two sample sets
• Given
Gi two random
d variables
i bl
• X and x1, x2, …, xn
• Z and z1, z2, …, zn
• Plotting
Pl tti th
the ordered
d d values
l off X and
d Z against
i t eachh other
th revealsl
approximately a straight line if X and Z are well represented by the
same distribution
⎛ n 2⎞
⎜ ∑ X i ⎟ − nX 2
∑
n
Xi
X= i =1
S 2 = ⎝ i =1 ⎠
n n −1
• If the data are discrete and have been grouped in a frequency
distribution:
X=
∑ j =1 f j X j
n
S 2
=
(∑
n
j =1
f j X)2
j − nX 2
n n −1
X=
∑ j =1 f j m j
c
S 2
=
(∑
n
j =1
f j m)2
j − n X 2
n n −1
• The sample
p mean and variance are 25
20
364
X= = 3.64
100 15
Frequency
2080 − 100 ⋅ (3.64) 2
S =
2 10
99
5
= 7.63
0
0 1 2 3 4 5 6 7 8 9 10 11
Number of Arrivals per Period
Di t ib ti
Distribution P
Parameter
t E ti
Estimator
t
Poisson α α̂ = X
Exponential λ λˆ =
1
X
Gamma β θ
β, θˆ =
1
X
Normal μ, σ2 μˆ = X , σˆ 2 = S 2
Lognormal μ, σ2 μˆ = X , σˆ 2 = S 2
After taking ln
of data.
χ 02 = ∑
i =1
Ei
⎧ χ 02 ≤ χα2 ,k − s −1 Accept H 0
Test result ⎨ 2
χ
⎩ 0 > χ 2
α , k − s −1 Reject H 0
pi = p ( xi ) = P ( X = xi )
• where ai-1 and ai are the endpoints of the i-th class interval
• f(x) is the assumed PDF, F(x) is the assumed CDF
• Recommended number of class intervals (k):
• n = Number
N b off observation
b ti periods
i d
• Δt = Time interval length
• Cij = Number of arrivals during the i-th time interval on the j-th
observation
b ti period
i d
• Example: Divide a 10-hour business day [8am,6pm] into equal
intervals k = 20 whose length Δt = ½, and observe over n=3 days
Number of Arrivals Estimated Arrival
Time Period Day 1 Day 2 Day 3 Rate (arrivals/hr) For instance,
8:00 - 8:30 12 14 10 24 1/3(0.5)*(23+26+32)
= 54 arrivals/hour
8:30 - 9:00 23 26 32 54
9:00 - 9:30 27 18 32 52
9:30 - 10:00 20 13 12 30
0,80
0,00
1000 <= X <= 2000 2000 < X <=2500
2500 2500 < X <= 4500 4500 < X <= 5000
• Co
Covariance
a a ce between
bet ee X1 a and d X2:
cov( X 1 , X 2 ) = E[( X 1 − μ1 )( X 2 − μ 2 )] = E ( X 1 X 2 ) − μ1μ 2
⎧= 0 ⎧= 0
⎪ ⎪
• where corr ( X 1 , X 2 )⎨< 0 ⇒ β ⎨< 0
⎪> 0 ⎪> 0
⎩ ⎩
• The closer ρ is to -1
1 or 1,
1 the stronger the linear relationship is
between X1 and X2.
• Notice
• autocorrelation measures the dependence between random
variables that are separated by h-1 others in the time series
1 n
ôv( X 1,,X 2 ) =
co ∑
n − 1 j =1
( X 1 j − X 1 )( X 2 j − X 2 )
• The sample
p correlation is
côv( X 1 , X 2 )
ρˆ =
σˆ1σˆ 2 Sample deviation
• To estimate φ,
φ μ, σε2 :
ˆ côv( X t , X t +1 )
μˆ = X , σˆ ε = σˆ (1 − φˆ 2 ) ,
2 2
φ=
σˆ 2
where côv( X t , X t +1 ) is the lag-1 autocovariance