Professional Documents
Culture Documents
https://doi.org/10.1007/s13177-021-00280-3
Abstract
This study deals with the development of artificial intelligence (AI)-based bicycle level of service (BLOS) models for urban
road segments carrying heterogeneous traffic. To accomplish this, the required data sets on the road geometric, traffic and
built-environmental conditions are collected from 84 road segments located in various parts of four Indian cities. The sat-
isfaction levels of bicyclists at each site are also assessed using a Likert scale of 1–6 (excellent–worst). Subsequently, three
promising AI techniques namely, Multivariate adaptive regression splines (MARS), Genetic programming (GP) and Bayes-
ian regularization neural network (BRNN) are utilized to develop the BLOS models. All models are trained and tested with
eight significant attributes of the road segments. Among all models, the MARS-based one has shown the best prediction
performance in the present context with a coefficient of determination (R2) value of 0.92 with averaged observations. On
the other hand, GP has produced the simplest (regression-like) but reliable model, which is the most favourable for field
applications. The relative importance of input variables has concluded that the outermost lane width, traffic volume, on-street
parking activities and pavement condition index are by far the most important variables in the present context. Hence, these
attributes should be largely prioritized in the planning process to enhance the perceived BLOS effortlessly.
Keywords Street segment · Heterogeneous traffic · Bicycle level of service · Multivariate adaptive regression splines ·
Genetic programming · Bayesian regularization neural network
13
Vol.:(0123456789)
International Journal of Intelligent Transportation Systems Research
of this model. In this regard, the present study primarily Various input parameters of this model include traffic vol-
aims (1) to develop an efficient and simple BLOS model ume, number of lanes, speed limit, width of the outermost
for the assessment of road segments carrying heterogeneous lane, pavement factors and location factors. Epperson [2]
traffic, and (2) to identify the most efficient strategies for incorporated few modifications in the pavement and location
improving the present service quality (i.e., BLOS). factors of this model and renamed the resulting model as
The recent developments in artificial intelligence (AI) modified Roadway Condition Index (RCI) model. Another
have shown remarkable efficiencies in solving prediction improvised version of RSI model is known as the Bicycle
problems in various fields of science and engineering. Suitability Rating (BSR) model [5]. This model signifies
Hence, three novel AI techniques namely, Multivariate the important roles of traffic volume and traffic speed in
Adaptive Regression Splines (MARS) [13], Genetic Pro- the BLOS of road segments. The bicycle Interaction Hazard
gramming (GP) [14, 15] and Bayesian Regularization Neu- Score (IHS) model proposed by Landis [3] has also intro-
ral Network (BRNN) [16] are implemented in this study to duced two other influencing parameters namely, roadside
develop the BLOS models. Of all, MARS is a recently intro- land use pattern and on-street parking activity.
duced technique that has numerous advantages for solving The importance of curb-lane parameters (such as curb-
regression-type problems [13]. It is basically a nonlinear and lane width, curb-lane traffic volume and curb-lane traffic
non-parametric tool that does not assume or impose any par- speed) was first reflected in the Bicycle Stress Level (BSL)
ticular type of relationship among the variables beforehand. model [4]. Harkey et al. [7] proposed an improvised version
This principle has made it extremely suitable for dealing of this model and renamed it as Bicycle Compatibility Index
with high-dimensional problems. Secondly, GP is a branch (BCI) model. The authors exposed the significant roles of
of evolutionary algorithms that automatically evolves the bicycle lanes and right-turning vehicles in the BLOS assess-
computer programs to develop predictive models without ment. Further, the BLOS models proposed by Landis et al.
specifying their structures beforehand [14, 15]. Most impor- [6] and Jensen [8] concluded that the bicycle lane and pave-
tantly, unlike many other AI techniques, GP is capable of ment surface condition play significant roles in the BLOS
producing highly compacted model equations that are easy estimation. HCM-2000 [9] has considered the average traffic
to understand and implement in the field. Thirdly, BRNN is speed, average delay and hindrance to define the BLOS cri-
an advanced version of the artificial neural network (ANN) teria. However, HCM-2010 [10] has considered an extensive
technique. The conventional ANN models often exhibit array of parameters to define the same. These parameters
poor generalization abilities, particularly while evolved for include the outermost lane width, number of traffic lanes,
complex problems. This usually occurs due to the magni- traffic volume, traffic speed, percentage of heavy vehicles
tude of the ‘weights’ associated with neuron connections and road surface conditions. Kang and Lee [11] reported that
[17]. BRNN approach, however, is well-capable for dealing the BLOS is largely influenced by the portion of roadways
with such problems. In this study, the BLOS models are dedicatedly available for bicyclists.
trained and tested with the information on eight important However, none of the aforementioned BLOS models is
road attributes. These data sets are collected from 84 road transferable to the mixed traffic environment (as those are
segments located in various parts of four Indian mid-sized developed by considering homogeneous traffic conditions).
cities. Prediction precisions of the developed models are These models do not address the influences of few factors that
assessed using various statistical parameters, and the most seem to be potential in developing countries (for instance,
efficient one is reported. Subsequently, efficient BLOS roadside stoppages of intermittent public transits and density
improvement strategies are also identified (by estimating the of driveways carrying a high volume of traffic, etc.). Recently,
relative importance of input variables). Various outcomes of Beura et al. [12] have proposed a statistical BLOS model for
this study would be helpful both for augmenting the BLOS the assessment of urban road segments in mid-sized cities
of existing segments and designing new bicycle-friendly (population ranges 0.5–1.0 million). This model returns an
road segments. overall BLOS score for the segment under consideration by
using the probabilities of obtaining individual service scores
1–6 (excellent–worst bicycling environments) on the same
2 Prior BLOS Studies segment. The ordered probit modelling approach is used to
predict these individual probabilities. The overall BLOS scores
Various service prediction models (with different names) are then used to define the bicycle service classes of urban
are proposed in the recent past to estimate the BLOS of road segments varying from ‘A’ (excellent) to ‘F’ (worst).
road segments. The first-ever model was developed by Davis However, the drawback of this model is that it is very much
[1], which is popularly known as Roadway Segment Index complex in its mathematical structure, which makes it difficult
(RSI) model. This model was basically developed for the to understand and implement in the field. The sensitivity of
assessment of road segments carrying homogeneous traffic. this model is also not well-understood by non-modellers and
13
International Journal of Intelligent Transportation Systems Research
public officials. Thus, improved models are developed in this (x − 𝜏)q ;x > 𝜏
{
q]
(2b)
[
study for the better assistance of transportation planners and +(x − 𝜏)+ =
0;otherwise
engineers in mid-sized cities.
13
International Journal of Intelligent Transportation Systems Research
x1, x2, x3, … are the input parameters. in the hidden layer, wik is the weight connected between ith
input variable and kth neuron of the hidden layer, and Xi is
3.3 Modelling with BRNN Technique the normalized value of ith input in the range [0–1].
13
International Journal of Intelligent Transportation Systems Research
R2 Best-fit calculations of the observed and predicted values of the output variable (i.e., overall BLOSPerc and 1−
SSRes
E Comparison of overall BLOSPerc and BLOSPred to evaluate how good the model is in explaining the total ∑n � �2 ∑
2
i=1 OAi −OA − ni=1 (OAi −OPi )
variance in the data set ∑n � �2
i=1 OAi −OA
MAE The maximum absolute difference between the values of overall BLOSPerc and BLOSPred
( )
max ||OAi − OPi || , ∀i
AAE The average of absolute difference between the values of overall BLOSPerc and BLOSPred 1 ∑ n
× i=1 ��OAi − OPi ��
n
RMSE The square root of average of squared difference between the values of overall BLOSPerc and BLOSPred
� �2
1 ∑n �
n i=1 OAi − OPi
OR The ratio of RMSE value obtained in model testing to that in model training RMSETesting
RMSETraining
SSRes = residual sum of squares, SSTot = total sum of squares, OAi = actual outputs, OPi = predicted outputs, OA = mean value of actual outputs,
RMSETesting = RMSE in model testing, and RMSETraining = RMSE in model training
lower values of prediction errors (in terms of MAE, AAE built-environmental conditions. Road segments from inte-
and RMSE) denote better prediction efficiency. In this study, rior regions as well as outskirts of the cities are considered.
numerous models are first developed using the AI techniques The traffic flow on interior roadways represented a signifi-
mentioned earlier. Subsequently, the performances of mod- cant percentage of two-wheelers and three-wheelers traffic.
els obtained in each case are compared with each other. In contrast, the traffic flow on the roadways located at the
Finally, the three optimum models (one in each case) are outskirts represented a significant percentage of heavy vehi-
identified for further investigation and comparison. cles. Selected roadways also represented wide variations in
the widths of carriageways, outermost lanes, paved shoulders,
shared-use paths, medians, curbs and gutter pans. The traffic
4 Study Area and Database Preparation volumes on these segments varied in a range of 350–5800
PCUs/h. The roadside developments were also significantly
This section presents a detailed discussion of the site selec- diversified (residential, commercial, official and industrial
tion principles, location of study areas, data collection pro- areas). Figure 1 exemplifies different kinds of bicycling envi-
cedure and data extraction process. ronments persisting in the study areas.
India has a large and diverse transport sector with its own Discussed below, basically eight different variables have sig-
share of challenges. The policy-making alongside the infra- nificant influences on the BLOS of urban street segments in
structural growth in the country unevenly vary from one part mid-sized cities carrying heterogeneous traffic. Beura et al.
to another. Hence, to develop a generalized BLOS model for [12] have identified these variables with the help of the most
the entire country, it is highly essential to collect required data recent and promising machine learning technique namely,
sets from its various parts. Considering the time and budget ‘random forest’. Hence, these parameters are directly col-
constraints, this study has selected four Indian mid-sized cities lected from the study areas for investigation and model
for the data collection namely, Bhubaneswar (Odisha state), development purposes. The details of these variables along
Kottayam (Kerala state), Rourkela (Odisha state) and Rajah- with their collection procedures are given below.
mundry (Andhra Pradesh state). Figure 1 shows the locations
of these cities on the Indian map. These cities well-represent 1. Effective width of the outermost lane (WOLE): WOLE is
a variety of urban availabilities, on-road settings and travel the sum total width of the outermost lane, paved shoul-
behaviours of the inhabitants (that has resulted from diverse der and the paving between the outermost lane stripe and
political, economic and socio-cultural developments). From the outer edge of the pavement minus the average width
these cities, total 84 road segments (including 33 from Bhu- reduction due to encroachments, if any [6]. Required
baneswar city, 14 from Kottayam city, 23 from Rourkela city geometrical measurements were carried using a measur-
and 14 from Rajahmundry city) are considered for data collec- ing tape for determining the value of WOLE at each site.
tion. While preparing the database, essential cares are taken 2. Peak hour traffic volume per lane (PHV/L): In this study,
to consider widely diversified roadway geometric, traffic and the peak hours of traffic flow were chosen as the analy-
13
International Journal of Intelligent Transportation Systems Research
ses periods to reflect the worst operational conditions (fine or minor cracks), 2–3 (rutting, extensive patching,
encountered by bicyclists. The mid-segment traffic flow few joint fractures, faulting, or cracking), 1–2 (distress
on each segment was videotaped for 2 hours and 30 min- occurs over 50 % or more of the surface), and 0–1 (dis-
utes (either during 8:30–11:00 AM or 4:00–6:30 PM), tress occurs over 75 % or more of the surface).
and the manual traffic volume counts were carried out. 5. Roadside commercial density (RCD): The value of RCD
The videotaping was done over a trap length of 30 m to on each segment was rated on a 3-point scale of ‘1’ =
ease the measurement of a few other parameters along high, ‘0.5’ = moderate, and ‘0’ = low.
with traffic volume. The value of PHV on each segment 6. Roadside stoppages of the intermittent public tran-
was determined with the help of the ‘running average’ sits (IPTStoppage): The IPTStoppage on each segment was
method. To express the volume counts in PCUs, the also rated using a 3-point scale where, ‘1’, ‘0.5’ and ‘0’
conversion factors recommended in the IRC code of indicate high, moderate and low interruptions respec-
practice-106 [20] were used. Subsequently, PHV/L on tively. If the public transits seemed to stop frequently
each segment was estimated as the ratio of PHV and the (say, at intervals of 60 seconds or less) on the roadside
number of lanes (L) on each segment. areas, the interruption was termed to be ‘high’. Con-
3. Average traffic speed (SAvg): The average traffic speed versely, if the stopping activities were occasional (say, at
(km/h) on each segment was estimated as the ratio of the intervals of 180 seconds or more), the interruption was
average crossing time of motorists and the length of the termed to be ‘low’. In-between, the intermediate level
longitudinal trap (i.e., 30 m). of interruption (say, at intervals 60–180 seconds) was
4. Pavement condition index (PCI): The pavement condi- termed as moderate.
tions at each site was assessed through visual inspections 7. Volume of vehicular ingress-egress to the on-street park-
by following the guidelines given in HCM-2010 [10]. As ing area (VParking): Approximate VParking on each segment
per specifications, the value of PCI varies in the follow- (in veh/h/km) was estimated by observing the parking
ing ranges: 4–5 (in case of no cracks and patches), 3–4 actives recorded in the videotapes.
13
International Journal of Intelligent Transportation Systems Research
8. Frequency of driveways carrying high traffic volume Table 2 Variations in participants’ characteristics at studied segments
(DFreq): DFreq at each segment (in driveways/km) was Characteristic Distribution Variation (%)
estimated by inspecting the whole length of each seg-
ment visually. Gender Female 41–46
Male 53–59
4.3 Collection of Overall Perceived BLOS Score Age (years) ≤ 20 12–21
21–40 47–58
In this study, an extensive perception survey was conducted 51–60 9–18
to assess the overall perceived BLOS scores (BLOSPerc) of ≥ 60 3–6
on-street bicyclists at each site. The array of these scores Educational background Matriculation or less 14–22
gives the output variable for the present analysis. Although Intermediate 27–37
time-consuming, the roadside interview survey approach Graduation or above 46–55
produces precise and reliable information from the partici- Bicycling experience (years) <5 19–28
pants. Hence, around 150 bicyclists were randomly recruited 5–10 42–56
from each site for face-to-face interactions. The selection > 10 18–27
was made in such a way as to represent a good mix of gen- Daily average bicycling dis- <5 24–32
tance (km/day) 5–10 46–59
der, age and other characteristics of bicyclists. Depending
upon the bicycle volume and pattern of arrival on a par- 11–20 11–17
ticular segment, the decision was taken on how frequently > 20 1–3
to recruit the participants. For instance, while the bicycle
volume was less at a site (say, around 150 bicycles/h), almost
all bicyclists were included in the survey to fulfil the study collected from each segment, the average of which repre-
purpose. On the other hand, while the volume was high, sented the overall BLOSPerc for the concerned segment. The
every third or fourth bicyclist was recruited for the interview. BLOSPerc of bicyclists was observed to vary in a wide range
As it is known that, non-response leads to a smaller final of 1.54–5.44 with a mean value of 3.49. However, extreme
sample size and, therefore, to a loss of accuracy in the pop- values like 1.54 and 5.44 (representing very good and very
ulation estimates. Therefore, the team members aimed at bad service conditions) were rarely observed, and it was
collecting the targeted number of responses by spending obvious.
sufficient time in the field. Furthermore, it was also aimed
to minimize the non-response rate by, for instance, develop-
ing more appealing survey materials. The bicyclists were 5 Data Analysis and Results
interacted in their local languages (like Odia, Hindi and
Telugu) and were assured that any information collected In the initial stage, a detailed investigation of the database
from them will only be used for research purposes. Small was carried out to visualize its statistical properties and use-
gifts and refreshment facilities were also arranged to attract fulness for further analysis. The statistical properties (mini-
the people for participation. In doing so, the refuse rate was mum, mean and maximum) of all variables were estimated,
minimized significantly. Furthermore, the minimum age of and the results are summarized in Table 3. As observed, each
participants was kept as 14 years in order to obtain rela- parameter varies in a very wide range. For instance, WOLE
tively mature responses. The wide variations observed in the was observed to vary from as low as 2.20 m to as high as
socio-demographic and travel-related characteristics of the 7.10 m. This concluded that the data sets are collected from
participants at study sites are shown in Table 2. For instance, widely diversified roadway environments. This fulfilled the
the gender distribution of participants across all sites was basic requirement of developing a well-generalized BLOS
41–46 % females and 53–59 % males. These all variations model for the present context. The variables were also
helped to estimate the overall BLOS scores perceived by all entered into Spearman’s correlation analysis to ensure their
possible categories of bicyclists. significance in the present context. In this analysis, the road
Bicyclists who recently travelled over the full length of attributes (Sl. No. 1–8 of Table 3) were used as independent
desired segments were asked to participate in the interview. variables and BLOSPerc was used as the dependent variable.
Each participant rated the desired segment based on a simple The correlation coefficients (Spearman’s ρ) alongside
question that, “How would you like to rate the road segment their statistical significance (p-value) obtained between
based on your overall perceived satisfaction?” The BLOSPerc each independent variable and BLOSPerc are summarized in
values were collected by using a 6-point Likert scale, which Table 3. As observed, each input variable is significantly
varies from ‘1’ (highly satisfied) to ‘6’ (highly dissatisfied). correlated with BLOSPerc at p < 0.001 level. This shows
In this process, at least 150 effective perceived ratings were that the perceived BLOS of on-street bicyclists is largely
13
International Journal of Intelligent Transportation Systems Research
Table 3 Summary of the Sl. No. Variable (unit) Variable type Minimum Mean Maximum Spearman’s ρ p-value
database used in the present
investigation 1 WOLE (meters) Input 2.20 3.67 7.10 -0.532 < 0.001
2 PHV/L (PCUs/h/lane) Input 145 1070 2587 0.550 < 0.001
3 S Avg (km/h) Input 22 33.95 47 0.410 < 0.001
4 PCI (scale) Input 2.50 3.85 4.50 -0.465 < 0.001
5 RCD (scale) Input 0 0.49 1 0.520 < 0.001
6 IPT Stoppage (scale) Input 0 0.45 1 0.409 < 0.001
7 V Parking (Veh/h/km) Input 0 745.78 6000 0.530 < 0.001
8 D Freq (Driveways/km) Input 0 0.83 3 0.439 < 0.001
9 BLOS Perc Output 1.54 3.49 5.44 - -
Note: BLOSPerc represents the overall perceived BLOS score (which is now a continuous variable)
determined by these attributes. It can also be noted that, if as they require more computational efforts. Hence, a trade-
two or more independent variables are highly correlated with off was made between the model performance and its com-
each other, they explain the same variation in the output var- plexity. Subsequently, the most effective (R2 = 0.908) but
iable. This situation is called as “multi-collinearity”. In such the simplest model was obtained with eight BFs as follow:
cases, the variable having the highest correlation with model
BLOSPred, Norm = 0.562 + 0.199 × BF1 − 0.491 × BF2
output is only selected for inclusion in the model building
process. In this study, the maximum value of inter-correla- + 3.975 × BF3 − 0.375 × BF4
(8)
tions between the input variables was 0.54. This suggested + 0.273 × BF5 + 0.765 × BF6 − 0.978 × BF7
that the selected parameters are free from multi-collinearity, + 0.228 × BF8
and are able to affect BLOSPerc independently. Hence, these
all variables were entered into the modelling process. Where, BLOSPred, Norm is the normalized value of BLOSPred,
Before carrying out the AI analyses, all observations con- ( )
BF1 = max 0, VParking − 0.067 ,
tained in the database were separated into a training set of
52 observations (70% of the total) and a testing set of 22
observations (30% of the total). Both groups were made to BF2 = max(0, PCI − 0.5),
represent similar statistical properties (especially, the mean
and range) of the contained variables. All variables were ( )
BF3 = BF2 × max 0, 0.239 − WOLE ,
normalized in the range of [0–1] before feeding them into
MARS and BRNN analyses. This was required to overcome ( )
the problems that arise due to the dimensional effects of BF4 = max 0, 0.033 − DFreq ,
originally observed values. However, no normalization was
required for the GP analysis, which was a major advantage ( )
BF5 = max 0, SAvg − 0.437 ,
of using this technique. The ‘mapminmax’ function of MAT-
LAB was used in this study to normalize the variables. The ( )
following equation may also be used to manually normalize BF6 = BF4 × max 0, 0.5 − IPTStoppage ,
a variable v:
PHV
( )
v − vMin BF7 = max 0, 0.519 −
( )
× max 0, 0.5 − IPTStoppage , and
vNorm =
vMax − vMin (7) L
Where, vNorm is the normalized value of v, and vMax and BF8 = max(0, RCD − 0.5).
vMin are the maximum and minimum values of v.
Here, all input variables have their values normalized
within the range of 0–1. Therefore, Eq. 8 returns the normal-
5.1 MARS‑based BLOS Model Development ized value of BLOSPred. The de-normalized value of the same
could be estimated as follow:
With the increase in numbers of ‘Basis Functions’ (BFs), BLOSPred = BLOSPred, Norm × (BLOSMax − BLOSMin ) + BLOSMin (9)
the performance of a MARS-based model gets improvised,
but its mathematical structure becomes more complicated.
Complicated models are less preferred for field applications
13
International Journal of Intelligent Transportation Systems Research
Where, BLOSMax and BLOSMin are the maximum and mini- evolved BLOS models fall, called the ‘Pareto front.’
mum values of BLOSPerc (i.e., 5.44 and 1.54 respectively in The blue coloured rectangles on this plot are the set of
this study). non-dominated BLOS models (i.e., superior perform-
ing models), and the violet-coloured triangles are the
5.2 GP‑based BLOS Model Development set of dominated ones (i.e., inferior performing models).
Using the Pareto front, a trade-off was made between
In order to attain the optimum solutions, GP algorithm the fitness and the expressional complexity of superior
parameters were varied in wide ranges as shown in Table 4. performing models. In this process, one optimal model
A tournament selection strategy with a tournament size of (indicated on the Pareto front with an arrow mark) was
seven was adopted to select the parental genes from the selected, which is both reliable (R 2 = 0.853) and less
pool of available solutions. The instructions or functions complex. Figure 2(b) shows the weight and significance
defined in functional set were as follows: +, -, ×, ÷, (.) 2, of the gene and bias term of this model. As depicted
(.)3, sqrt, sin(.), cos(.), tan(.), tanh(.), exp(.) and ln(.). Con- in Fig. 2(b), the p-value of the gene is significantly lesser
ditions used to terminate the program execution was 1000 than the p-value of the bias term; which indicates that
generations or fitness value less than 0.0001, whichever is the gene is largely contributing to the model in com-
earlier. After making numerous trials, the best solutions parison to the other. Hence, it can be concluded that the
were attained with a population size of 1000 individuals at input variables are contributing more to the model than
400 generations and a tree depth of six. The reproduction, the constant term. This is the sign of a good empirical
crossover and mutation probabilities were 0.02, 0.84 and model.
0.14 respectively. The tree structure of the selected GP-based BLOS model
The final population of the GP run attained for the is shown in Fig. 3. By using the GP tree structure and esti-
present problem is presented in Fig. 2. Figure 2(a) shows mated coefficients, the mathematical expression of this
a curve of non-dominated solutions on which optimally model is derived as follow:
Fig. 2 Optimum GP solutions:
(a) Pareto front, (b) weights and
significance of genes and bias of
the best model
13
International Journal of Intelligent Transportation Systems Research
PHV
( )
BLOSPred =1.510 + 0.382 × ln
L
− 2 × WOLE + VParking − 2 × SAvg × DFreq Table 6 Weights and biases of the selected BRNN model (model-2)
(10) Estimates Model term Hidden Hidden Hidden
( ) ( )
+ 0.382 × cos WOLE − 0.382 × PCI + 0.382 × ln SAvg
( )0.5 4 layer-1 layer-2 layer-3
+ 0.382 × WOLE + DFreq + 0.358 × (RCD) + 0.382 × IPTStoppage
Weight (w)
wik WOLE 0.274 -1.784 -1.873
PHV/L 0.291 0.843 -0.383
5.3 BRNN‑based BLOS Model Development SAvg 0.088 0.900 0.359
PCI 0.759 0.292 0.514
In this study, the feed-forward MLP neural networks com- RCD -1.187 -0.762 -0.829
prising of only one hidden layer were developed in order IPTStoppage 0.567 0.173 -0.300
to attain simple (but efficient) models. Numerous BRNN VParking 0.599 0.349 -0.239
models with different numbers of hidden neurons varying in DFreq -1.417 -0.220 0.020
the range of 1–10 were constructed and were trained using wk BLOSPerc -1.150 1.884 -1.411
the same sets of training data. Prediction performances of Bias (b)
the evolved models were assessed in terms of R2, AAE and bhk Hidden layer 0.073 -0.616 0.562
RMSE. The prediction results of eight superior perform- b0 BLOSPerc 0.961
ing models are shown in Table 5. It can be observed that
model-2 with three hidden neurons and log-sigmoid transfer wik are the weights between input and hidden layers, wk are the
function has produced the best results with a R2-value of weights between hidden and output layers, bhk are the biases at the
hidden layer, and b0 is the bias at the output layer
0.892 in the training stage and 0.872 in the testing stage. On
13
International Journal of Intelligent Transportation Systems Research
the other hand, despite having the best performance in the exp(C) − exp(−C)
training phase, model-5 has shown comparatively poor per-
BLOSPred,Norm = (11h)
exp(C) + exp(−C)
formance in the testing phase. Thus, this model is neglected
from further considerations, and model-2 has been recog- Where, A1, A2, A3, B1, B2, B3, and C are the general terms
nized as the optimum BLOS model. of the BRNN model.
The connection weights and biases obtained for model-2 The value of BLOSPred,Norm obtained from the above
are shown in Table 6. Inputting these estimates in Eq. 6, the model could be denormalized using Eq. 9, presented earlier.
expression of the BRNN-based BLOS model is derived as
follow: 5.4 Statistical Tests and Comparison of Models
A1 = 0.073 + 0.274 × WOLE + 0.291 × PHV∕L + 0.088 × SAvg
The plots of the model predicted and overall perceived
+ 0.759 × PCI − 1.187 × RCD + 0.567 × IPT Stoppage (11a) BLOS scores (BLOS Pred versus BLOS Perc) are shown in
+ 0.599 × VParaking − 1.417 × WFreq Fig. 4, 4a for training data and 4b for testing data. It is evi-
dent that the scatterings of BLOSPred values obtained from
A2 = −0.616 − 1.784 × WOLE + 0.843 × PHV∕L + 0.900 × SAvg all models are fairly within the 80 % prediction limit from
the line of fitness. However, it is difficult to identify the
+ 0.292 × PCI − 0.762 × RCD + 0.173 × IPT Stoppage (11b)
best model by simply analysing these figures. Thus, sev-
eral statistical parameters listed in Table 1 are also applied
+ 0.349 × VParaking − 0.220 × WFreq
13
International Journal of Intelligent Transportation Systems Research
Table 7 Overall performance BLOS Model Data Best-fit calculation Prediction error Gener-
and ranking of BLOS models alization
Ability
R2 E MAE AAE RMSE OR
Table 8 Ranges of service classes (A–F) than 3.5 belongs to one of the service classes D–F. Thus, the
BLOS Class Description Ranges of BLOSPred
ranges of BLOS classes (A–F) were defined accordingly in
Table 8 at an equal interval of ‘one.’
A Excellent service < 1.5 For investigation purpose, the MARS model was applied
B Very good service 1.5 – 2.5 to estimate the BLOS class of all investigated segments. For
C Good service 2.5 – 3.5 instance, the field observations for Madhusudan Marg, Bhu-
D Fair service 3.5 – 4.5 baneswar were as follows: WOLE = 3.5 m, PHV/L = 650.5
E Poor service 4.5 – 5.5 PCUs/h/lane, SAvg = 36 km/h, PCI = 4, LU = 0.5, IPTStoppage
F Very poor service > 5.5 = 0.5, VParking = 10 veh/h/km, and DFreq = 1 driveway/km.
Using these data, Eq. 9 gave rise to a BLOSPred value of 3.38.
This suggested that the segment is offering service class ‘C’
models in terms of the best-fit calculations (R2 and E) of (as per Table 8) at its present condition. In a similar manner,
BLOS Pred and overall BLOS Perc. Although it has shown the BLOS classes of all other segments were also estimated.
comparable performance with the GP and BRNN models It was observed that only 16 % of the investigated sites are
in terms of error measuring parameters and OR, its major offering BLOS ‘A’ and ‘B’. Hence, there is a serious need
disadvantage lies in its complex mathematical structure. for timely augmenting the quality of existing segments to
These all observations concluded that the MARS model meet future demands. This might be accomplished effec-
would be the most preferred one for field applications, tively by adopting the strategies discussed in the following
while prediction accuracy is of the utmost concern. The section.
GP model could also be implemented in the field for quick
calculations keeping in mind that the prediction results 5.6 BLOS Improvement Strategies
would be relatively inferior.
For identifying the utmost important BLOS improvement
5.5 Ranges of Service Classes (A–F) and Field strategies, it was important to know the relative importance
Applications of the BLOS model inputs. Hence, the sensitivity analysis
of MARS model (the most efficient one) was carried out by
As the ultimate goal of this study was to estimate the service using Eq. 12 [21].
levels (A–F) offered to bicyclists at the street segments, the
numeric values of the model predicted BLOS scores (i.e.,
� � � �
fmax xi − fmin xi
BLOSPred) were needed to be converted into letter-graded
S i = ∑m � � � � �� × 100 (12)
f
i=1 max i
x − fmin xi
service classes A–F (where, A = excellent and F = worst).
To accomplish this, a service scale was defined in Table 8 Where, Si = sensitivity (%) of ith input, fmax(xi) = maxi-
based on a simple concept as follow. The mean value of mum value of the output variable over ith input, fmin(xi) =
BLOSPrec scores obtained from the studied segments was minimum values of the output variable over ith input, and m
around 3.5. This corresponded to the boundary between the = total number of input variables.
first and last three service classes (i.e., A–C and D–F). This The fmax(x i) and f min(x i) parameters were estimated by
means any value of BLOSPred less than 3.5 belongs to one of putting the maximum and/or minimum values of the i th
the service classes A–C, while any value of BLOSPred higher input along with mean values of the remaining inputs
13
International Journal of Intelligent Transportation Systems Research
Table 9 Ranking of modelled Parameter WOLE PHV/L SAvg PCI RCD IPTStoppage VParking DFreq
attributes as per sensitivity
analysis Si (%) 20.72 20.22 4.91 11.82 10.45 4.64 17.03 10.21
Rank 1 2 7 4 5 8 3 6
13
International Journal of Intelligent Transportation Systems Research
ing model is that it is comparatively less-complex in its BLOS models are primarily developed for mid-sized cities.
mathematical structure. Hence, these models may not provide the expected level of
• The simplest (but reliable) model has been produced accuracy in big and metropolitan cities, where road and traf-
with the GP technique. This model has shown an efficient fic conditions are perhaps more complex. Hence, this aspect
prediction performance in the present context with high could be investigated in future studies and the study efforts
R2-values of 0.853 and 0.876 in its training and testing could be duplicated at necessity. Future studies may also apply
stages respectively. The major advantage of this model is better feature selection tools (for instance, Recursive Feature
that it is highly compacted in its mathematical structure Elimination) as selected variables have significant impacts on
(regression-like) for which it is supposed to be the most the model performance.
applicable one from the point of view of simplicity and
manageability. It could be implemented in the field for
quick calculations keeping in mind that the prediction Declarations
results would be somewhat inferior to the MARS model.
• On the other hand, both BRNN and the existing models Declarations of Interest None.
are observed to be unfavourable for field applications
due to their complex structures. Transportation planners, References
engineers and non-modellers would hardly understand
the sensitivity of these models. 1. Davis, J.: Bicycle safety evaluation. Auburn University, City of
• The proposed models (MARS and GP) are also observed to Chattanooga, and Chattanooga-Hamilton County Regional Plan-
ning Commission, Chattanooga, TN (1987)
be well-generalized to the vast urban contexts of mid-sized 2. Epperson, B.: Evaluating suitability of roadways for bicycle use:
cities in India and other developing nations. This has been toward a cycling level-of-service standard. Transportation Research
achieved by using a large quantity of diversified data sets Record, No. 1438, pp. 9–16 (1994)
and promising AI techniques. 3. Landis, B.W.: Bicycle interaction hazard score: A theoretical model.
Transportation Research Record, No. 1438, 3–8 (1994)
4. Sorton, A., Walsh, T.: Bicycle stress level as a tool to evaluate
The field applications of the proposed models have shown urban and suburban bicycle compatibility. Transportation Research
that only 16 % of the study corridors are offering excellent and Record, No. 1438, pp. 17-24 (1994)
very good kinds of services (i.e., BLOS ‘A’ and ‘B’) at their 5. Davis, J.: Bicycle test route evaluation for urban road conditions.
Transportation Congress: Civil Engineers-Key to the World Infra-
present conditions. Hence, it is essential to enhance the quality structure. Proceeding of the Transportation Congress, Volumes 1
of existing segments to meet future demands. This could be and 2: Civil engineers—key to the world’s infrastructure, Ameri-
effectively accomplished by adopting the BLOS improvement can Society of Civil Engineers (ASCE), San Diego, pp. 1063–1076
strategies proposed in this study. It has been observed that (1995)
6. Landis, B.W., Vattikuti, V.R., Brannick, M.T.: Real-time human per-
WOLE closely followed by PHV/L has the largest influence ceptions: toward a bicycle level of service. Transportation Research
on user-perceived BLOS. Hence, the best ways to enhance Record, No. 1578, 119–126 (1997). https://doi.org/10.3141/1578-15
the BLOS of urban street segments are to increase the effec- 7. Harkey, D.L., Reinfurt, D.W., Knuiman, M., Stewart, J.R., Sorton,
tive width of the outside lane and minimize bicycle-vehicle A.: Development of the bicycle compatibility index: a level of ser-
vice concept. Transportation Research Record, No. 1636, 13–20
interactions. Both of these requirements could be fulfilled by (1998). https://doi.org/10.3141/1636-03
widening the outermost traffic lane, removing obstructions 8. Jensen, S.U.: Pedestrian and bicycle level of service on roadway
from the outermost lane, if any, and/or constructing facilities segments. Transportation Research Record, No. 2031, pp. 43–51
like separate bicycle lanes, paved shoulders, shared-use paths (2007) https://doi.org/10.3141/2031-06
9. Transportation Research Board (TRB): Highway Capacity Manual,
and bypass roads. VParking and PCI are observed to be the third p. 1134. Transportation Research Board, Washington, D.C. (2000)
and fourth most important parameters in the present context. 10. Transportation Research Board (TRB): Highway Capacity Manual,
Thus, improved BLOS could also be effectively achieved by p. 1650. Transportation Research Board, Washington, D.C. (2010)
minimizing on-street parking activities and enhancing the 11. Kang, K., Lee, K.: Development of a bicycle level of service model
from the user’s perspective. KSCE J. Civ. Eng. 16(6), 1032–1039
pavement surface quality. Other BLOS improvement strategies (2012). https://doi.org/10.1007/s12205-012-1146-z
with relatively lower importance included the minimization of 12. Beura, S.K., Chellapilla, H., Bhuyan, P.K.: Urban road segment level
roadside commercial density, effective management of turn- of service based on bicycle users’ perception under mixed traffic
ing-traffic on the high-volume driveways, etc. The proposed conditions. J. Mod. Transp. 25(2), 90–105 (2017). https://doi.org/
10.1007/s40534-017-0127-9
models and the mentioned BLOS improvement strategies 13. Put, R., Xu, Q., Massart, D., Heyden, Y.: Multivariate adaptive
would largely assist the transportation planners and engineers regression splines (MARS) in chromatographic quantitative struc-
in various on-going transportation planning processes. The ture-retention relationship studies. J. Chromatogr. A 1055, 11–19
study findings will also assist the road authorities in improving (2004). https://doi.org/10.1016/j.chroma.2004.07.112
14. Koza, J.R.: Genetic Programming: On the programming of comput-
the quality of existing facilities and establishing new bicycle- ers by means of natural selection. MIT Press, Cambridge (1992)
friendly facilities. Amongst the limitations of this study, the
13
International Journal of Intelligent Transportation Systems Research
15. Poli, R., Langdon, W., McPhee, N.: A field guide to genetic pro- 19. Goh, A.T.C.: Back-propagation neural networks for modeling com-
gramming. Lulu Enterprises, UK Ltd, London (2008) plex systems. Artif. Intell. Eng. 9(3), 143–151 (1995)
16. Demuth, H., Beale, M.: Neural Network Toolbox. The Math Works 20. IRC: Guidelines for capacity of urban roads in plain areas. Indian
Inc., USA (2000) Road Congress-106, New Delhi (1990)
17. Bartlett, P.L.: The sample complexity of pattern classification with 21. Gandomi, A.H., Yun, G.J., Alavi, A.H.: An evolutionary approach
neural networks; the size of the weights is more important than the for modeling of shear strength of RC deep beams. Mater. Struct. 46,
size of network. IEEE Trans. Inf. Theory 44(2), 525–536 (1998). 2109–2119 (2013). https://doi.org/10.1617/s11527-013-0039-z
https://doi.org/10.1109/18.661502
18. Friedman, J.: Multivariate adaptive regression splines. Ann.Stat. 19, Publisher’s Note Springer Nature remains neutral with regard to
1–141 (1991) jurisdictional claims in published maps and institutional affiliations.
13