Professional Documents
Culture Documents
I.1. Introduction
DATA Usage: Conclusion, Decision Data requirements : valid, representative DATA condition : - several institutions collect similar data - data are available in some institutions - different data form - reliability ???
Traffic Data
MONITORING The collection of prevailing information at any time and as they change over time. FORECASTING The use of current and previous data as one of the inputs for prediction of future condition CALIBRATION The use of traffic data to estimate the values for one or more parameters in a theoretical or simulation model VALIDATION verification of a theoretical or simulation model against information independent of that used to calibrate the model.
Preliminary planning
Sample design
PRELIMINARY PLANNING
Study objectives Survey objectives Review existing information Formation of hypotheses Definition of parameters Resource determination Survey content Selection of survey methods
PILOT STUDY
Provide assistance: Adequacy of sampling method Variability of parameters in the survey population Suitability of survey & analysis method Adequacy of survey forms Efficiency of training methods Suitability of coding and editing Cost and duration of survey and analysis Efficiency of organization
SURVEY ADMINISTRATION
Survey manager/designer survey administration to decrease field problems Check list: - Supervisor should arrive early at survey site - Adequate wet-weather and other disturbances protection - Surveyor replacement - Enough survey forms - Adequate rest and meal breaks
DATA ANALYSIS
Procedures : Data investigation data scanning, examination of data distribution, tables, diagrams Hypothesis testing to compare the distribution of two or more data sets: chi-squared, nonparametric, ANOVA, distribution fitting Relationship determination multiple linear regression, stepwise regression, time series analysis, etc. Data reduction large set of variables more manageable set
SAMPLE
Reasons of sampling: The inability to record information for the entire population restrictions: time, fund, personnel, etc. The scarcity of data, the rareness of certain events impossible to obtain a total population
Level of measurement - nominal : lowest level, identification/ classification - ordinal : ranks data in some order - interval: the distance between each category is defined in terms of actual units. 0 (zero) has a specific meaning - ratio : have all the properties of the above scales, as well as having definite zero. 0 (zero) does not have any meaning
Characteristics of the Scale A scale that measures in terms of names or designations of discrete units or categories A scale that measures in terms of such values as more or less, larger or smaller, but without specifying the size of the intervals A scale that measures in terms of equal intervals or degrees of difference but whose zero point, or point of beginning, is arbitrarily established A scale that measures in terms of equal intervals and an absolute zero point of origin
Statistical Possibilities of the Scale Can be used for determining the mode, the percentage values, or the chi square Can be used for determining the mode, percentage, chi-square, median, percentile rank, or rank correlations Can be used for determining the mode, the mean, the standard deviation, the t test, the F test, and the product moment correlation Can be used for determining the geometric mean, the harmonic mean, the percentage variation, and all other statisctical determinations
Ratio Scale
undergoing continual change, do not permit repetition of observation, e.g. traffic volume data (consisting four components: TREND, SEASONAL VARIATION, CYCLIC VARIATION, RANDOM COMPONENT)
TREND (Tr): long term change in the average quantities, e.g. traffic growth over time SEASONAL variation (Se): different levels of flow at different times of a year, e.g. level of recreational traffic on a rural highway CYCLIC variation (Cy): result from cycles in activity (such as time of day or day of week influences on working or shopping behaviour) RANDOM component (Ra): short-term variations in behaviour (e.g. influenced by weather), or special events
Logical Errors
the logical relationship between two data items for the same observed unit is not correct. Example: Truck two axles is noted as 3 axles errors identification cross tab. Checking: - as a result of recording -> the error is easily rectified - as a result of observation -> the error is examined and compared to the other item recorded to find out the inconsistent item -> correction
Missing data
There are two types : 1. completely missed - e.g. observers attention is distracted by some other event or a vehicle is obscured - correction on the basis of weighting the observed data in terms of some known characteristics of the population 2. partly missed - ignore the missing values and report - report the proportion of missing values for each variable, so that the results are based on the total number of observation made -estimate a probable value of the missing item based on the information available from the other data items for the observation