Model Calibration: John Hourdakis Center For Transportation Studies, U of MN Hourd001@tc - Umn.edu

Chapter 9
Model Calibration
John Hourdakis
Center for Transportation Studies, U of Mn
Hourd001@tc.umn.edu
Why Calibrate?
• Computers Cannot Magically Replicate Reality!

• Simulation Models Are Designed to be General
• Driver Behavior and Road Characteristics Depend on
Location i.e. Minnesota vs California
Vehicle Characteristics (Horsepower, Size, etc.)
Weather Conditions (Dry, Wet, Ice, etc.)
Day or Night
• Microscopic Simulators Can Adapt and Replicate

Almost Any Condition if the Model Parameters Are
Properly Adjusted
• What is Realistic and What is Not?
2
Example of a Disaster
• Half of an Ordinary Double Cloverleaf

• Freeway Speed 65 mph, Ramps 45 mph
• Turning Warnings More Than 2000 Feet Away
• Lot’s of Traffic: 6500 veh/hour on Mainline
• Three Vehicle Types: Car, Truck, and Semi-Trailer3
Simulation Results From 5 Scenarios
• Same Demands, Speed Limits, Turnings, Most of
Model Parameters.
• Changed Car and Semi-Trailer :
• Acceleration
• Normal Deceleration
• Maximum Deceleration (Emergency Stop)
Average Average Flow Total Travel Time Av. Delay LOS
Speed
+2f/s2 45.5 mph 6831 vph 257 hours 9 sec/veh D
+1f/s2 29.1 mph 6200 vph 484 hours 41 sec/veh F
BASE 46.7 mph 6396 vph 234 hours 9 sec/veh D
-1f/s2 35.3 mph 6152 vph 393 hours 30 sec/veh E

4
-2f/s2 27.4 mph 5500 vph 536 hours 57 sec/veh F
Calibration Procedure
H is t o r ic a l
H is t o r ic a l s y s te m
s y s te m in p u t
in p u t d a ta
d a ta
A c tu a l S im u la tio n
sy ste m M odel
C om pare
S y stem M o d el o u tp u t
o u tp u t d ata d a ta
No
V a lid ?
C a lib r a te
5
REALITY
Current Traffic Measurements, etc.
• Need to KNOW the Network to be Replicated

• Depending on the Scope of the Project, Two
or More of the Following Are Needed :
• Mainline Volumes (Every X Feet)
• Mainline Speeds (Every X Feet)
• Travel Times (Link or Between O/D Pairs)
• Bottleneck Capacity (Measured, Not Theoretical)
• Entrance Ramp Queues
• Intersection Queues and Queue Discharge Rates
6
Modeling Parameters
• Global Parameters
• Vehicle Parameters
• Size and Power
• Driver Behavior
• Car-Following Model Parameters
• Reaction Time
• Speed Distibution Among Lanes (Overtaking
Manuevers)
• Local Section Parameters
• Curvature and Grade
• Speed Limit or Free Flow Speed
• Lane Changing Distances
• Headway/Hesitation Factor
7
Model Parameter Issues
• Driver/Vehicle Characteristics
• Literature Does Not Provide Adequate Information
• Not Common to All Simulators
• No Info on Effects of Weather/Pavement/Ambient
Conditions
• Car-Following Parameters
• Depend on the Simulator’s Car-Following Model
Employed.
• Most Simulators Do Not Adequately Describe the
Modeling Process (if at all).
• User Has No Sense of Model Parameter Effects on Results.
8
Calibration Issues
• Very Important For Model Accuracy and

Robustness
• Accuracy Depends on Measurement Granularity
• Averages Over Several Days is a Bad Choice
• Might Need Additional Information to be Collected in
Turbulent Sections (Bottlenecks, Weaving Areas, etc.)
• Simulation Objective Affects Calibration

• When Adaptive Control Strategies Are Simulated,
Stricter Validation is Needed
• Modeling of an Isolated Interchange in Rural
9
Minnesota Will be Restrictive
Calibration Issues
• VERY TIME CONSUMMING PROCESS

• Currently Simulators Do Not Provide a Methodology or
Tools to Assist in Calibration
• Often Users End Up in Endless Trial-and-Error Cycles
• Sporadic Attempts Made in Literature to Streamline the
Process But:
• Focused on a Particular Simulator
• Too Complex or too Naive to be Effectively Used in
Practice
• No Widely Accepted Methods/Standards Currently
Available
10
Before Calibration!
• Check Geometry For Correctness

• Disjoined Sections
• Stuck Vehicles (Sizes of Accel/Decel Lanes)
• Verify Location of Detectors
• Check Input For Accuracy
• Entrance Volume Comparison (Perfect Match)
• Exit Volume Comparison (Match Sum Over All
Hours)
• Volume Totals on Mainline Stations Should Match
11
Practical Calibration Methodology
(Employed on Twin Cities Freeways)
• Need Simultaneous Boundary and Mainline Station Measurements.
• Technique:
upstream mainline detector stations
• Objective is to Match the Simulated and Actual Mainline Traffic

Measurements
• Traffic Measurements Used: Volume and Speed
• Occupancy Affected by Detector Sensitivity (Unknown)
• Perform Calibration in Stages:
• First 2 Stages Based on Volume and Speed in That Order
• Further Improvements in Optional 3rd Stage: 12
Depending on Objective i.e. For Ramp Control -> Queue Length
Goodness-of-Fit Test Measures
• Recommended Goodness-of-Fit Measures:
2
1 n  xi − yi 
1. RMS Percent Error = ∑ 
n i =1  yi 

(Measures Overall % Error)
1 n ( xi − x)( y i − y )
2. Correlation Coefficient = CORL = ∑ σσ
n − 1 i =1 x y
(Measures Linear Association)
Where xi is the Simulated Traffic Measurement at Time i

x’ is the Mean of the Simulated Traffic Measurements
yi is the Actual Traffic Measurement at Time i
y’ is the Mean of the Actual Traffic Measurements
σx is the Standard Deviation of the Simulated Traffic Measurement
σy is the Standard Deviation of the Actual Traffic Measurements
n is the Number of Traffic Measurement Observations
13
Goodness-of-Fit Measures (Cont.)
3. Theil’s U (Considers the Disproportionate Weight of
Large Errors) 1
n
∑(y − x )
n
i i
2
i =1
• 3 Components of Theil’s U 1
∑y + ∑x
1 n
2
i
n
2
i
n n
¾ Us i =1 i =1
(Measure of Variance Proportion, Close n(σ y − σ x ) 2

Us = n
to 1 Satisfactory) ∑(y i − xi ) 2
i =1
¾ Uc
2(1 − r )nσ yσ x
(Measure of Covariance Proportion Uc = n
or Unsystematic Error, Close to 0 Satisfactory) ∑(y

i =1
i − xi ) 2
¾ Um n( y − x ) 2
Um =
(Measure of Bias Proportion or n
∑(y i − xi ) 2
Systematic Error, Close to 0 Satisfactory) i =1
Where σy and σx are the Standard Deviations of the Actual and Simulated
Series
r is the Correlation Coefficient Between the Two Series
14
Examples
200 200
vo lu m e (v eh ic le s/5- m in)
150 150
ac tual volume actual v olume

simulated volume s imulated volume
100 100
50 50
25
31
55
61
13
19
37
55
1
7
13
19
37
43
49
67
25
31
43
49
61
67
tim e in 5-m in incr em en ts fro m m idnigh t tim e in 5-m in increm e nts fr om midn ig ht
Figure 1(a): Illustration of unsatisfactory Um Figure 1(b): Illustration of unsatisfactory Uc

200 2 00
150 1 50
actual v olume s imulate d volu me

s imulated volume a ctua l v olume
100 1 00
50 50
19
25
49
55
13
19
37
55
13
31
37
43
61
67
1
25
31
43
49
61
67
tim e in 5-m in increm e nts fr om midn ig ht t im e in 5 -m in in cr e m e nt s f rom m idnigh t
15
Figure 1(c): Illustration of unsatisfactory Uc Figure 1(d): Illustration of unsatisfactory Us
Stage 1: Volume-Based Calibration
• Objective is to Match Simulated and Actual Mainline

Station Volumes
• Simulation Model Calibrated Beginning Upstream
and Proceeding Downstream
• Global Parameters Are Calibrated First:
• Usually Accomplished in First Few Stations
• Trial & Error Iterative Process For Each Parameter
• RMSP, r and U are the Metrics Used in Each Iterations
• Local Parameters Calibrated at All Stations
• Um, Uc and Us are the Metrics Used at This Point.
16
Stage 2: Speed-based Calibration
• Objective is to Match Simulated and Actual Mainline
Station Speeds and Bottleneck Locations
• Actual Speeds Derived From Volume, Occupancy,
and Effective Vehicle Length (For Single Loop
Detectors)
• Speed Contour Graphs Used For Comparing
(Visually) Simulated and Actual Speeds
1. If Speed Contours Exhibit Significant Discrepancy, Revise
Global Parameters From Stage 1
2. Beginning Upstream and Proceeding Downstream,
Calibrate Local Parameters Until Mainline Speeds and All
Bottleneck Locations in the 2 Contour Graphs Match
17
Stage 3: Application Dependent
• For Adaptive Ramp control:
• Compare Queue Lengths
• Local Simulation Parameters Affecting Detector Output
Need Further Calibration in This Stage
• Simulated Entrance Ramp Queues Should Match Actual Ones

• Queue Measurements From 30 sec Detector Counts
• Queue = Metered Demand - Actual Demand (Upstream)
• Over-Calibration to be Avoided to Ensure Generality

• Repeat Simulation With Different Random Seeds
• Simulate Additional Days
18
Example of Calibration
• AIMSUN Microsimulator
• Mn/DOT Ramp Metering Evaluation.
• 2 Test Sites (Only One Presented)
• TH 169 Northbound in Minneapolis, MN
12 Miles Long: From I-494 to 63rd Avenue N
24 Entrance Ramps, 25 Exit Ramps
30 Detector Stations.
5-Minute Volume and Occupancy
March 21st to 23rd, 2000
14:00 to 20:00 hrs
19
Test site 1: TH 169NB
A A B B
A A B B
20
Stage 1 Results (Volume)
Goodness-of- Values
Fit Measure
r 0.960
RMSP 7.39%
U 0.002
Um 0.088
Us 0.031
Uc 0.881
• 500 Simulator Iterations Required, 2 Months (!)

• Irregularities in Input Data Observed Due to
Sensor Misplacement
21
Actual Speed Contour
40-50
50-60
Downstream 60-70
767
70-80
80-90
769
90-100
443
100-110
441
439
437
434
432
430
428
426
171
175
181
185
187
191
197
201
203
207
209
213
219
223
225
229
231
235
169
173
177
179
183
189
193
195
199
205
211
215
217
221
227
233
237
239
TIME Upstream
22
60-70
Stage 2 Results
70-80
767
80-90
769 90-100
(Speed) 443 100-110
441
439
437
434
Simulated Speed 432
430
After Stage 1 428
426
169
171
173
175
177
179
181
183
185
187
189
191
193
195
197
199
201
203
205
207
209
211
213
215
217
219
221
223
225
227
229
231
233
235
237
239
Sp e e d (s im u la tio n )
40- 50
50- 60
60- 70
7 67 70- 80
Simulated Speed 7 69 80- 90

4 43
90- 10 0
After Stage 2
100 -1 10
4 41
4 39
(About 200 4 37
4 34
Iterations) 4 32
4 30
4 28
4 26
16 9
17 2
17 5
17 8
18 1
18 4
1 99
2 02
2 05
2 08
2 11
2 14
2 17
22 0
22 3
22 6
22 9
23 2
23 5
23 8
187
190
193
196
40-50
50-60
60-70
767
70-80
80-90
769
90-100
443
100-110
441
Actual Speed
439
437
434
432
430
23 428
426
171
175
181
185
187
191
197
201
203
207
209
213
219
223
225
229
231
235
169
173
177
179
183
189
193
195
199
205
211
215
217
221
227
233
237
239
Stage 3 Results (Queues)
• Simulated and Actual Queues Did Not Match Before This Stage
• 3rd Stage Required About 100 Iterations
Example Ramp
80
70
60
50
real queue
40 simulation queue after

30 stage 3
simulation queue before
20
stage 3
10
-10
t ime ( 5 - min inc re me nt s f ro m mid nig ht )
Simulation vs Real Queues 24

Validation Accuracy
(Volume)
RMS% r U Um Us Uc
March 21st 10.620 0.980 0.004 0.309 0.011 0.681

(validation)
March 22nd 6.420 0.970 0.001 0.124 0.054 0.823
(validation)
March 23rd 7.390 0.960 0.002 0.088 0.031 0.881
(Calibration)
Over All Stations.
25
Results (Calibrated Parameters)
AIMSUN Microscopic Simulator
Parameter Initial After stage 1 After stage 2 After stage 3
Max. desired 105.000 110.000 110.000 110.000
speed (kmph)
Max. acc. rate (m/s2) 4.500 3.000 3.000 3.000

Normal dec. rate (m/s2) -4.500 -5.000 -5.000 -5.000
Max. dec. rate (m/s2) -5.000 -5.500 -5.500 -5.500
Reaction time (sec) 0.700 0.590 0.610 0.610

Percent overtake 0.950 0.950 0.940 0.940
Percent recover 1.000 1.000 0.990 0.990
Max. speed difference 40.000 40.000 60.000 60.000
(kmph)
Max. speed difference 50.000 50.000 70.000 70.000

on-ramp (kmph)
Av. section speed 110.000 100.000 105.000 105.000

(regular section, kmph)
Av. section speed 90.000 75.000 70.000 70.000

(weaving section, kmph)
Av. section speed (ramp 60.000 60.000 55.000 55.000

section, kmph)
26
Conclusion
• Garbage In >>> Garbage Out
• Simulation Useless/Dangerous Without

Calibration
Questions?
27

Model Calibration: John Hourdakis Center For Transportation Studies, U of MN Hourd001@tc - Umn.edu

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Model Calibration: John Hourdakis Center For Transportation Studies, U of MN Hourd001@tc - Umn.edu

Uploaded by

Copyright:

Available Formats

Chapter 9

• Computers Cannot Magically Replicate Reality!

• Microscopic Simulators Can Adapt and Replicate

• Half of an Ordinary Double Cloverleaf

+1f/s2 29.1 mph 6200 vph 484 hours 41 sec/veh F

BASE 46.7 mph 6396 vph 234 hours 9 sec/veh D

-1f/s2 35.3 mph 6152 vph 393 hours 30 sec/veh E

• Need to KNOW the Network to be Replicated

• Very Important For Model Accuracy and

• Simulation Objective Affects Calibration

• VERY TIME CONSUMMING PROCESS

• Check Geometry For Correctness

upstream mainline detector stations

• Objective is to Match the Simulated and Actual Mainline Traffic

Where xi is the Simulated Traffic Measurement at Time i

(Measure of Variance Proportion, Close n(σ y − σ x ) 2

or Unsystematic Error, Close to 0 Satisfactory) ∑(y

ac tual volume actual v olume

Figure 1(a): Illustration of unsatisfactory Um Figure 1(b): Illustration of unsatisfactory Uc

actual v olume s imulate d volu me

tim e in 5-m in increm e nts fr om midn ig ht t im e in 5 -m in in cr e m e nt s f rom m idnigh t

• Objective is to Match Simulated and Actual Mainline

• Simulated Entrance Ramp Queues Should Match Actual Ones

• Over-Calibration to be Avoided to Ensure Generality

• 500 Simulator Iterations Required, 2 Months (!)

(Speed) 443 100-110

Simulated Speed 432

After Stage 1 428

Simulated Speed 7 69 80- 90

40 simulation queue after

Simulation vs Real Queues 24

March 21st 10.620 0.980 0.004 0.309 0.011 0.681

Max. acc. rate (m/s2) 4.500 3.000 3.000 3.000

Max. dec. rate (m/s2) -5.000 -5.500 -5.500 -5.500

Reaction time (sec) 0.700 0.590 0.610 0.610

Max. speed difference 50.000 50.000 70.000 70.000

Av. section speed 110.000 100.000 105.000 105.000

Av. section speed 90.000 75.000 70.000 70.000

Av. section speed (ramp 60.000 60.000 55.000 55.000

• Simulation Useless/Dangerous Without

You might also like