Professional Documents
Culture Documents
ScienceDirect
highlights
More flexibility is given to the model by relaxing the assumption of proportional odds and scale, or spread rate.
The results highlight a significant improvement in the model fit, compared with the standard model.
Some of factors contributing to the severity of pedestrian crashes are discussed.
Almost all identified results linked to the lack of attention and caution of the drivers.
Article history: Despite low traffic in Wyoming, pedestrian crash severity accounts for a high number of
Received 2 February 2021 fatalities in the state. Thus this study was conducted to highlights factors contributing to
Received in revised form those crashes. The results highlighted that drivers under influence, type of vehicle, loca-
20 October 2021 tion of crashes, estimated speed of vehicles, driving over the recommended speed are
Accepted 26 October 2021 some of factors contributing to the severity of crashes. In this study, we used proportional
Available online xxx odds model which assumes that the impact of each attribute is consistent or proportional
across various threshold values. However, it has been argued that this assumption might
Keywords: be unrealistic, especially at the presence of extreme values. Thus, the assumption was
Partial proportional odds model relaxed in this study by shifting the thresholds based on some explanatory attributes, or
Pedestrian crashes proportional odds effects. In addition, we accounted for the spread rate, or scale, of the
Scale heterogeneity model's latent distribution of pedestrian crashes. The results highlighted that the partial
Proportional odds factor proportional odds model through proportional odds factor and scale effects result in a
Vulnerable road users significant improvement in model fit compared with the standard proportional odds
Drivers' lack of attention model. Comparisons were also made across standard normal, simple partial ordinal model,
and partial ordinal accounting for scale heterogeneity. In addition, various potential
threshold structures such as symmetric and flexible were considered, but similar goodness
of fits were observed across all those models. Extensive discussion has been made
regarding the formulation of the implemented methodology, and its implications.
© 2022 Periodical Offices of Chang'an University. Publishing services by Elsevier B.V. on
behalf of KeAi Communications Co. Ltd. This is an open access article under the CC BY-NC-
ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
* Corresponding author. Tel: þ1 307 766 6230; fax: þ1 307 766 6784.
E-mail addresses: mrezapou@uwyo.edu (M. Rezapour), khaled@uwyo.edu (K. Ksaibati).
Peer review under responsibility of Periodical Offices of Chang'an University.
https://doi.org/10.1016/j.jtte.2021.10.006
2095-7564/© 2022 Periodical Offices of Chang'an University. Publishing services by Elsevier B.V. on behalf of KeAi Communications Co.
Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
2 J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx 3
Thus, this study was conducted by giving more flexibility to where z accommodates J1 elements of threshold of q, and b
the predefined assumptions of the ordinal model to see if consists of p-vector of regression with dimensionality of n p.
altered model would result in a better fit, and consequently On the other hand, Ak is a representation of y, n J, where the
would provide more reliable parameters' estimates. The k element of y would be filled with a value of 1, and 0 other-
analysis is especially important to study pedestrian crashes wise. ok are offset constants to adjust the threshold values at
due to mountainous topography of Wyoming with very low higher and lower points.
traffic and high traffic fatality rate. Due to the unique char- Eq. (4) could also be written in a more compact version to
acteristics of the area of Wyoming, it is expected that pedes- accommodate the values in a fewer matrices’ forms, in a
trian crashes in the state to be varied compared with other more precise way.
regions in the U.S. This manuscript is structured as follows:
the method section would extensively discuss the theory hk ¼ Bk 4 þ ok (5)
behind the employed method, and also the model's parame- where 4 accommodate the initial values of coefficients' pa-
ters estimations. The data section discusses the important rameters and thresholds to be estimated in ½zT ; bT T , and the Bk
predictors, while the results and discussion section outline accommodate vector of observed as ½Ak ; X, with a dimen-
and discuss the results. sion of n ½ðJ 1Þ þ p.
The log-likelihood of Eq. (5), then could be written as below.
2. Method X
n
LL ¼ wT logðpi Þ (6)
i¼1
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
4 J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx 5
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
6 J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx
Table 2 e Performance of considered models. Table 3 e Finalist model considering the proportional
odds and scale effects.
ID Log-likelihood DF AIC
Attribute Point Standard p-value
1 Proportional odds model 758 19 1554
estimate error
2 Model with only scale 755 20 1549
3 Model with only nominal 756 21 1553 Downhill 0.7200 0.3060 0.030
4 Model with nominal and scale 752 22 1548 Non-level grade 0.4300 0.2470 0.080
Straight-ahead 0.5400 0.1330 <0.050
maneuver
result of equidistant structured threshold in the next sub- Alcohol involvement 0.5900 0.2000 <0.050
Location of 1st harmful 0.4600 0.1900 0.020
section. The results of degree of freedom (DF), log-likelihood
event being on
and Akaike information criteria (AIC) of the considered
shoulder
models are presented in Table 2. Drug was involved in the 1.0800 0.2990 <0.050
The results of a best performed model, based on Table 2, is crash
related to a model considering both scale and proportional Pedestrian age 0.4600 0.2440 0.060
odds factors. Beside the better fit of the model, the Pedestrian gender 0.1800 0.1270 0.100
significance of the parameters of scale and proportional Lighting condition 0.5000 0.1600 <0.050
Estimated speed 0.0009 0.0004 0.030
odds are indications of violations of proportional odds
Pickup truck 0.6500 0.1570 <0.050
assumption, equal slopes assumption (Table 3). It should be Overspeed 0.3500 0.2430 0.149
highlighted that the differences between the models in Distracted driver 0.2200 0.2720 0.430
terms of parameters estimates were minor. For instance, Day of a week 0.0300 0.1670 0.870
while the point estimate of pickup truck is 0.65 for the final Posted speed, greater 1.0700 0.2170 <0.050
model, the point estimate changes to 0.70 for the standard than 60
Pickup 0.8600 0.4000 0.031
model.
truck overspeed
Table 3 presents the results of the equidistant model,
Distracted drivers day 1.0400 0.4920 0.033
including the threshold and the spacing of the adjacent of a week
thresholds. For the equidistant model, by having the Log-scale parameter
location of the first threshold and the distance, the Lighting condition 0.3200 0.1190 0.020
thresholds’ locations would be identified irrespective of the Threshold coefficients
number of response categories. Intercept 0.2900 0.2170 0.180
Intercept spacing 1.2400 0.1870 <0.050
The nominal effect of wTi b b J (Eq. (10)), was considered to
Driver age 0.0200 0.0070 0.020
extend the regression part of the model to vary across j as a
Driver age spacing 0.0010 0.0040 0.800
category. At the same time, the wTi b b J impacts the threshold
parameters by giving more flexibility to the threshold. Here
we also incorporate the scale parameter to let the scale, variable of LCi . In summary, compared with proportional
variance, of the latent variable distribution to be dependent odds model, where there would be only a numerator, for the
on some explanatory variable. The scale parameter is partial model, changes would be made to adjust both parts
significant with a value of 0.32, meaning that during night of Eq. (11).
conditions, the scale of the latent distribution is expðzðdark We also found that the structured thresholds models
_ or 27% lower compared with light condition.
lightÞÞ ¼ 0:73x, through Jacobian matrix are not needed, and the flexible
In other words, the distribution of the latent variable for light model could provide a similar goodness of fit. In simple
condition is wider compared with dark condition. words, after creating the matrices based on Eq. (11), the
The mathematical formulation of the cumulative link parameters were estimated by the Newton-Raphson algo-
model could be written as rithm through step-having or line search. The results are
Modified threshold part
highlighted in Table 3.
zfflfflfflfflfflfflfflfflfflfflffl}|fflfflfflfflfflfflfflfflfflfflffl{ Regression part
zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{
qj þ DAi b b J1 ðHTi b1 þ / þ DWi b8 Þ
logitðPðYi jÞÞ ¼ Scale part
(11) 4.2. Results of the best fit model
zfflfflfflfflfflfflffl}|fflfflfflfflfflfflffl{
expðxLCi Þ
Across all parameters in Table 3, the highest contributory
where DAi means driver's age, HTi means heavy truck, DWi impact is related to the driver being under influence of
means day of a week, LCi means lighting condition. Eq. (11) drugs, with b b Drug ¼ þ1:08. This is expected, as drugs or
highlights various parts of our model. The modified alcohol would slow coordination, judgment, and reaction
threshold part adjusts the threshold of qj by values of DAi b
b J1 , times, which is expected to impact drivers’ reaction times in
so more flexibility would be given to the threshold and it seeing a pedestrian. Although a driver under the influence of
would not be constrained to follow the proportional odds alcohol also has a contributory effect, its impacts is lower
assumption. On the other hand, we accounted for the than the impacts of drugs, b b Alcohol ¼ þ 0:59. The results are
change in the scale of the model based on the binary in line with the literature which highlighted that almost half
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx 7
(47%) of crashes that resulted in a pedestrian death were due when a distracted driver is driving on weekend, the severity of
to the involvement of alcohol for the driver and/or the pedestrian crashes is higher.
pedestrian (NHTSA, 2017). Finally, for the included results in Table 3, we relax the
Two seemingly similar predictors of downhill related and assumption of the model's parameters to have the same
non-level grade predictors were found to contribute to the impacts across all threshold parameters, or thresholds with
severity of pedestrian crashes. Although these two pre- similar slopes. The assumption of the equidistant highlight
dictors seem similar, non-level grade, compared with that the thresholds of qj are equally distanced. The threshold
downhill, include all grades that are not flat. Both factors being exposed by the transpose of Jacobian matrix could be
having contributory effects with higher impact of crashes written as (0.27, 1.59, 1.86, 2.13). The adjustment of using
occurred on downhill area. The results are intuitive that only the first threshold and the spacing is due to the use of
higher speed is expected on non-level areas, especially less parameters for the model fit.
downhill, which is in line with the previous work (Eriksson
et al., 2019).
Moving to the location of the first harmful events, the 5. Discussion
results highlighted that when the location of hitting a
pedestrian is on the shoulder, the severity of crashes would One of the highest priorities of policy makers in the state of
be increased, b b 1st ¼ 0:46 for the first harmful event. The the Wyoming is to reduce severe crashes. Pedestrian crashes
result might be due to the fact that those vehicles going off account for one of the highest proportions of severe crashes
the road, hitting pedestrian on the shoulder width, have in the state so this group has received special attention in the
already lost the control of the vehicles, and possible factors state. This study is funded to identify factors to pedestrian
of lack of control of vehicles might justify the higher crash crashes so appropriate countermeasures could be employed.
severity. The result is in line with the work in the literature In this study we employed multiple techniques to find a most
highlighting the location of crash and pedestrian are reliable method for identifying factors to crashes. The stan-
important factors for the severity of pedestrian crashes (Sze dard ordinal model has been employed extensively in the
and Wong, 2007). literature review for modeling crash severity due to the
Moving to demographic characteristics of pedestrian ordinal nature of the crash severity response. However, the
including gender and age, a cutting point of 39 years old was proportional odds assumption might be too restrictive for
considered to make a dummy variable of passenger age. That observations as it constrains all observations to use the same
divides passenger age into almost equal category. The results set of thresholds. Thus, the flexibility could be given to that
are somehow in line with the previous study that strength and assumption by letting the observations vary based on some
muscle mass loss with aging process (Keller and Engelhardt, of their attributes. The model flexibility in this study was
2014). As a result, older pedestrians are likelier to sustain given by two means of scale and proportional odds factors. In
severe crashes compared with their younger counterpart. summary, while the proportional odds factor was added up
The results are against the work in the literature review that to the threshold factors in the numerator of the model, the
crashes related to male pedestrian are less likely to be scale was considered in the denominator for impacting the
severe (Tay et al., 2011). The differences are likely to be due whole value in the numerator, and the whole equation.
to variation across observations that the non-mixed models Here we allow the threshold to vary based on attribute of
could not account for. driver age. Beside the proportional odds factor, we gave more
On the other hand, counterintuitively the increased esti- flexibility to the latent variable distribution by incorporating
mated speed of vehicles was found to be slightly negative on the scale variable of lighting condition in the equation's de-
the severity of crashes. Few points should be highlighted nominator. By scale consideration, the scale of the latent vari-
about the impact. First, the vehicle estimated speed was ables varies across the categories of being dark or light
approximated by the highway patrol based on brake mark or conditions. In other words, the distribution has a longer or
other factors. So, there might be some uncertainty regarding shorter tail.
this measure. Second it should be highlighted the very small The majority of identified results are intuitive and expected.
impact of this predictor. For instance, we found that factors such as older pedestrians
Moving to the interaction terms, we considered and are likelier to undergo severe crashes, which might be linked to
checked all pairwise interaction terms across our identified the physical structures of the elderly. Or it was found that when
parameters. Across all considered interaction terms, the the drivers were under the influence of drug or alcohol, it is
terms between pickup truck and posted speed limit and day of likelier for pedestrian crashes to be more severe, which is likely
a week and distracted drivers were found to be important. Due to be due to lack of reaction time of drivers. The same expla-
to the significance of pairwise interaction terms across pre- nation goes for the lighting condition: at night the drivers are
dictors, their main effects would not be interpreted. There having a lack of reaction time, so they are expected to react in a
would be 2 scenarios for every interaction term, where few of less optimal way to the hazard ahead.
them would be discussed. For instance, it was found that Beside the main effect, we considered and checked for all
when the vehicle is pickup truck and the vehicle is over- pairwise interaction terms. We found that interaction be-
speeding, the severity of pedestrian crashes increases. Or tween pickup truck and over speeding, for instance, is sig-
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
8 J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006
J. Traffic Transp. Eng. (Engl. Ed.) xxxx; xxx (xxx): xxx 9
National Highway Traffic Safety Administration (NHTSA), 2018. Mahdi Rezapour obtained his PhD from
Fatal Motor Vehicle Crashes Overview. NHTSA, Washington University of Wyoming. He is passionate
DC. about statistics, machine learning and psy-
Peterson, B., Harrell, F.E., 1990. Partial proportional odds models chology. He has written papers about trans-
for ordinal response variables. Applied Statistics 39 (2), 205. portation, fraud detection, health
Pucher, J., Dijkstra, L., 2003. Promoting safe walking and cycling to psychology, fear of death, and sense of
improve public health: lessons from The Netherlands and loneliness.
Germany. American Journal of Public Health 93 (9), 1509e1516.
Rezapour, M., Ksaibati, K., 2021. Semi and Nonparametric
Conditional Probability Density, a Case Study of Pedestrian
Crashes. The Open Transportation Journal 15 (1), 280e288.
Sasidharan, L., Mene ndez, M., 2014. Partial proportional odds
model-an alternate choice for analyzing pedestrian crash
injury severities. Accident Analysis & Prevention 72, 330e340. Khaled Ksaibati, PhD, P.E., obtained his BS
Sun, M., Sun, X.D., Shan, D.H., 2019. Pedestrian crash analysis degree from Wayne State University and his
with latent class clustering method. Accident Analysis & MS and PhD degrees from Purdue University.
Prevention 124, 50e57. Dr. Ksaibati worked for the Indian Depart-
Sze, N.N., Wong, S.C., 2007. Diagnostic analysis of the logistic ment of Transportation for a couple of years
model for pedestrian injury severity in traffic crashes. prior to coming to the University of Wyom-
Accident Analysis & Prevention 39 (6), 1267e1278. ing in 1990. He was promoted to an associate
Tay, R., Choi, J., Kattan, L., et al., 2011. A multinomial logit model professor in 1997 and full professor in 2002.
of pedestrian-vehicle crash severity. International Journal of Dr. Ksaibati has been the director of the
Sustainable Transportation 5 (4), 233e249. Wyoming Technology Transfer Center since
World Health Organization (WHO), 2007. Association for Safe 2003.
International Road Travel, Faces behind the Figures: Voices
of Road Traffic Victims and Their Families. WHO, Geneva.
Please cite this article as: Rezapour, M., Ksaibati, K., Partial proportional odds model for analyzing pedestrian crashes, threshold
heterogeneity by scale and proportional odds factor, Journal of Traffic and Transportation Engineering (English Edition), https://
doi.org/10.1016/j.jtte.2021.10.006