Professional Documents
Culture Documents
1, 2022
Biographical notes: Annu Mor is a research scholar pursuing PhD from UIET
Panjab University, Chandigarh. She received her Master’s in Engineering
(2012) and Bachelors of Engineering (2010) in Computer Science and
Engineering. She is GATE qualified and authored many peer reviewed research
papers in international conferences and journals. Her research interests are
artificial intelligence, image processing and data mining.
1 Introduction
addressed to minimise congestion (Wang et al., 2017). The improperly designed toll
plaza causes inconvenience to travellers and reduction in mobility at various locations
(Zhu et al., 2017). Toll plaza still needs to be explored by traffic flow prediction, which
helps to design effective traffic guidance strategy (Kim et al., 2016). Design of toll plaza
including a number of toll-lanes, merging patterns and service provided in toll-lane
(Antônio et al., 2016). These characteristics help to reduce the accident rate, reduce road
construction, operational cost and enhance traffic capacity.
Recent studies have primarily focused on microscopic simulation model by using
simulated data for discrete events based on the traffic flow theory (Bartin, 2018; Abadi
et al., 2015). On the other hand, most traditional prediction models are based on a single
data type method such as volume, speed, travel time and occupancy. The information
provided is not enough for public or traffic administration for decision making especially
in context of Indian scenarios due to heterogeneous vehicle type, limited data resources.
Different machine learning methods use different approaches to learn relationship
from training data set (Christopher et al., 2016). There is no single prediction model that
gives best results on all traffic variables on non-recurrent traffic condition (traffic caused
by holiday, event and weather conditions) at particular location (Kim and Wang, 2016).
The short-term traffic forecast involves the prediction of traffic flow over the time
duration of a few seconds to few hours into the future using current and historical
measurements of traffic variables (Habtemichael and Cetin, 2016; Qi and Ishak, 2014;
Guo and Williams, 2010).The previous studies that compare ANN with grey model
(Badhrudeen et al., 2016) on day time data, but a comparison with different non-
parametric techniques with each other while predicting short-term-traffic, seems to be
missing. Even the use of heterogeneous data sources for traffic prediction, seems to be
missing. In this paper, we focus on the particular problem that is: How to decide traffic
prediction models using nonparametric approaches for multivariate data set collected
from different sources? Thus, multivariate traffic flow prediction is essential to capture
the traffic trends or to design traffic rules over a certain period in future for advanced
traveller information systems. Based on the discussion mentioned above, the
contributions of this study are the following:
A real-time data based approach has been proposed for traffic flow prediction with
the influence of exogenous factors such as weather conditions and holiday.
Traffic similarity patterns are analysed for day and night duration to analyse the
phenomenon known as heteroskedasity (Fosgerau and Fukuda, 2012) including
holiday for Indian scenarios.
Selection of an appropriate time-series interval for short-term traffic prediction
approach with respect to geometry.
Different non-parametric techniques are compared, which helps in making suitable
choice for prediction approach.
An overall accuracy of 90% is achieved for multivariate short-term traffic flow
prediction using random forest regression technique.
The remainder of this paper is organised as follows: Firstly, the related work on short-
term traffic flow prediction, followed by framework proposed and data set used after that
the results and performance assessment metrics. Finally, conclusions are discussed.
Table 1
Variables
Input data Prediction
References Techniques Model type Traffic parameter Data set Duration Context
duration (min) step
Wu et al. (2018) DNN Non-parametric traffic flow PeMS 14 month Inter-state 15 1
Zhao et al. (2017) Deep Learning Non-parametric Traffic volume, lane Cameras, induction coils 6 month city 15 1
(LSTM) occupancy average velocity radars
velocity
Polson and CNN Non-parametric vehicle type Cameradata 300 images Single lane 10 1
Sokolov (2017)
Zhang et al. Deep learning Non-parametric Traffic volume, weather, TaxiBJ BikeNYC 3 year and city 15 multi
(2017) (ST-ResNet) event 5 month
Smith and Grey level Parametric Traffic speed, travel time Loop data set (data 24 days Highway 10 multi
Demetsky (1997) aggregation to 1-min)
Kim and Bayesian Non- Time-of- Loop 608 Highway 5 multi
Wang (2016)
Network Parametric Day, Incident, weather, Detector with 3-mint day
traffic state duration
Badhrudeen et al, ANN grey model Both Traffic volume, traffic Detector 9 days Highway 10 1
(2006 speed
Habtemichael KNN SAIMA, Both Traffic volume Detector/sensors 12 data 12 month and Highway 15 1
Multivariate short-term traffic flow prediction
2 Related work
Traffic flow prediction is a very demanding area in the Intelligent Transportation System
(ITS) for the last few decades. The short-term traffic forecast involves the prediction of
traffic flow over the time duration of a few seconds to a few hours into the future using
current and historical measurements of traffic variables. In general, traffic flow-casting
models fall into two categories: parametric and non-parametric. The summary of the
different prediction models of the approaches as mentioned above is described in Table 1.
Previous studies have concluded that the prediction model in the intelligent
transportation system is switching towards data-intensive machine learning models.
Different machine learning models use different strategies to find relationship from
training data set with different performance in different scenarios. Based on the above
discussion, regression techniques are suitable approach for traffic prediction due to
simple interpretable structure as well as less data demanding. Based on aforementioned
discussions, this study applied regression techniques on toll plaza data with external
factors affecting the traffic volume such as weather conditions and holiday.
3 Proposed framework
The time series are decomposed of n sampling data points per 24 hour duration. Suppose
the sampled traffic flow data in N consecutive days here, Sample time interval is: i=15
min, then n=15*24*14, can be written as a series of 1-D vectors matrix.
Di1 Di11 , Di1 2, ,..., Di1 n (1)
DiN DiN 1 , DiN 2 ,..., DiN n (2)
where N=no of days, i= number of time interval (15-min), Di1 = Traffic volume during
the current time interval during day-one, Di11 = Traffic volume during the next time
interval, DiN = Total number of days.
Multivariate short-term traffic flow prediction 37
Tf t i 1 X L t
m
(3)
Ti n n 1 2 (4)
where Ti ( n ) is total number of attribute and n is the number of data points in the set.
2) Data normalisation: Real-time toll plaza traffic volume combined with exogenous
variables (temperature, humidity, visibility, barometer and holidays) collected from
the Internet. Then, whole Data is normalised in form [0, +1]. The scaling of data is
done to avoid attributes in wider numeric ranges dominating those in smaller
numeric ranges.
Wi i
Pi i (5)
W
n
i i
where n= total \attributes ,wi=weight for one particular attribute, i one particular attribute.
Traffic flow trends are affected with holiday (6 July, Eid-ul-Fitar) have higher traffic
volume as compared to a normal day as shown in Figure 1.
Furthermore, some other traffic trends can be inferred. The traffic volume changes
with similarity pattern due to regular daily traveling; well-known road traffic exhibits
strong cyclic patterns for morning and evening peak-hours. However, traffic conditions
in the peak periods are more dynamic than those in the non-peak periods, a phenomenon
known as heteroskedasity (Fosgerau and Fukuda, 2012). Traffic flow on weekends is
much higher than that of weekdays. Hence, it has been caused heavy traffic bottleneck
due to heavy vehicles during night hours. As there is no separate lane for heavy vehicles
as well as for emergency vehicles such as ambulance, defense vehicles. Different traffic
similarity patterns are analysed for day and night duration for two-week data set as
shown in Figure 2.
38 A. Mor and M. Kumar
Figure 2 Two-week traffic flow pattern similarity for 15-min time intervals for 24 hour
calculating any increment in an update, and alpha is a learning rate that is configured
(Raza and Zhong, 2018).
a n
Pi Pi h
n i 1
(6)
where Pi =Predicted value for particular data samples, n = Total number of data sets,
a =learning rate h = function gradient.
Correlation distance and weighted Euclidean distance, the Euclidean distance used
mostly to measure the distance between data points as shown in equation. A small
number of neighbours are enough to give the good results (Myung et al., 2011). KNN is
not more suitable for the sparse data set which contains mostly feature having 0 values,
even though the numbers of features are much larger (In hundred or more) till
performance is very low. Basic parameters used for KNN are: historical values, number
of neighbours and prediction function (Jiber et al., 2018).
D j t 1
1 K Dist j
D t 1
k j 1 1
(7)
j 1 Dist
K
Random forest: Random Forest was developed by Breiman Leo, based on ensemble
learning methods (Breiman, 2001). Random Forest uses bagging technique for
training models. This technique is also known as bootstrap aggregating (Stearrns
et al., 2017). Random Forest algorithm is a collection of the different decision tree,
which slightly different from each other. In Random Forest overfitting of the training
Multivariate short-term traffic flow prediction 41
data set can be reduced, by selecting variables to branch each decision tree and node
which changes the way of tree construction (Guo et al., 2018; Verikas et al., 2011).
The learning process for random forest can be performed as follows:
while j M
BAG_IN =M’ /M’ Random Features
BAG_OUT =M–X //Remaining
Random Features Samples
BS=Best_Feature (BAG_IN)
return Best_Feature (BAG_IN)
while P M // (which feature gives the best split is used to split the node
iteratively)
Best_Split =Best_F(Mq)
Current_Node=Best_Split
BS= Current_Node
U= mini pruning .ADD (Tk Fm, Fn
//New Training Set of k times
End
End
Return (GENERATE_TREE(BS))
End
3 GENERATE_TREE(BS)
while i C // Total No of Decision_tree generated
Training_Set
Generation (M, N)
Bagging_Method()
[ Go to Step 2]
The study shows that, the feature importance also provided by Random Forest by
aggregating the feature importance among the decision trees is more reliable. In random
forest regression, there is no need for data normalisation or scaling the data set.
Adaboost regression: Adaptive boosting is an Ensemble method that combines
multiple trees (weak learners) for prediction. It is primarily used for classification,
based on binary classification (Leshem et al., 2007). It is meta-estimator and
sensitive to noisy data and outliers, less suitable for over fitting problem. Each
learner may be weak, but the performance of all combined is better than random
guessing, which can have exponential loss (Giot, 2014; Freund and Schapire, 1997),
as shown in equation:
H x Sign n
i 1
i hi x (11)
where H is hidden layers, Wij weight matrix, B is activation functions and adjustment
parameters
Support vector machines (SVM): SVM based on the statistical learning method,
perform well on scaled data, by mapping the input-output relationship for non-linear
regression problems (Wu et al., 2004). SVM work with different kernel functions
such as linear, polynomial and Radial Basis Function (RBF), RBF found more
suitable for traffic forecasting under different scenarios (Cortes and Vapnik, 1995).
Hence, in this study kernel function RBF is used. SVM works well as classification,
not as a repression. But it shows good results to minimise error by setting an epsilon
value for error tolerance, that would find out from problem domain by optimising
parameters (Mingheng et al., 2013).
M a, b a , b (13)
For traffic
where Ai represents input vector and Bi output vector using RBF kernel function.
Multivariate short-term traffic flow prediction 43
1 n Oi Pi
MSE
n i 1 Oi
(15)
i
n Oi Pi
AMSE
n i 1 Oi
(16)
P O
n 2
i i
R2 1 i 1
(17)
O Ó
n 2
i 1 i i
O Ó P P
n
i 1 i i i
r (18)
2
Pi Pi
n n
i 1
(Oi Ói ) 2
i 1
where Oi is observed traffic variable, Pi predicted traffic variable, n is the total sample
data size, Ói average of all observed values, Pi average of all predicted values.
44 A. Mor and M. Kumar
The whole data is again normalised for multi-layer perceptron neural network, as the
variance must be 1 and mean values should be 0.
Performance Metric
multivariate
1
0.8
0.6
0.4
0.2
0
The Random Forest regression shows accuracy up to 90% for short-term traffic flow
prediction among all applied techniques while decision tree regression, Gradient descent
regression, MLP regression shows r2_score up to 89%, 87 % and 85%, respectively as
shown in Figure 5 gave better results as compared to others.
R2 are calculated from equation (17) for more accurate performing methods and values
are mentioned in Table 4. Random Forest estimated the maximum R2-values for short-
term traffic flow prediction.
The results show that Random Forest regression technique performed well for limited
data as well as against overfitting issue. Regression techniques learn exogenous variable
importance without explicitly assigning weight to variables.
These analysis results show that performance can be increased on multivariate data
set as compared to single-variable data. Figures 6 and 7 show weekdays and weekends
analysis for traffic pattern using random forest regression technique.
46 A. Mor and M. Kumar
Table 4 Sample data set of regression techniques for observed vs. predicted
Actual Random forest Decision tree Gradient decent MLP predicted R2-
Vol. predicted R2-error predicted R2-error predicted R2-error error
0.211 0.190 .021 0.187 .024 0.183 .027 0.179 .032
0.422 0.410 .012 0.375 .047 0.367 .055 0.358 .064
0.851 0.765 .086 0.757 .094 0.740 .111 0.723 .128
0.939 0.845 .064 0.835 .104 0.816 .123 0.798 .141
Note: *Bold numbers are highest R -two values in each column
Figure 6 Comparison of observed and predicted traffic volume using random forest during
weekend (one unit=15 mint)
Figure 7 Comparison of observed and predicted traffic volume using random forest during
weekday (one unit=15 mint)
Multivariate short-term traffic flow prediction 47
5 Conclusions
Short-term traffic prediction approaches play a vital role in providing useful information
for Intelligent Transportation Systems and smart cities. Traffic patterns are affected by
non-recurrent factors such as holiday, event and bad weather conditions. These factors
cause uncertainties for multivariate short term traffic prediction. In this study, different
non-parametric techniques are applied to reduce these uncertainties and enhance the
accuracy for prediction. The proposed approach is validated using 14 days traffic volume
data aggregated with 15-min interval from Chandigarh Toll plaza Expressway. The first
10 days data is used as training and remainder was utilised for testing purpose. Five-fold
cross validation is performed to evaluate the accuracy of the applied techniques.
The experimental results demonstrated that there are significant differences in
predictions of different non-parametric techniques. The results revealed that Random
Forest method excels over all the other applied techniques, especially using multivariate
real-time data set. The limitation of the proposed methodology is lack of multi-step
prediction between predicted and observed traffic flow. This feature is highly desirable
for traffic flow prediction problems, as future traffic state commonly dependent to
previous state. The spatial features of traffic parameters were not taken into
consideration.
Future work can be extended by applying semantic knowledge base for inference
mechanism. The predicted results can be used to design the queuing model to provide
probability density function for the queue at toll-lanes. More data should be collected
from multiple toll lanes from inter-urban expressway to obtain better results.
The non-parametric techniques presented in this study have the potential to forecast
the future scenarios using the present data. The Random Forest technique is robust to
handle multivariate attributes and implicit variable selection make the algorithm more
suitable for prediction at the intersection, link-based as well as path-based road network.
It can provide important information to toll plaza authorities and town planners for
decision making, policy designing as well as infrastructure network related problems.
Some or all data, models, or code generated or used during the study are proprietary or
confidential in nature and may only be provided with restrictions. Toll plaza data list
items include (such as vehicle type, mode of payment, exempted vehicles, entry time of
vehicle) and restrictions.
References
Abadi, A., Rajabioun, T. and Ioannou, P.A. (2015) ‘Traffic flow prediction for road transportation
networks with limited traffic data’, IEEE Transactions on Intelligent Transportation Systems,
Vol. 16, pp. 653–662.
Antônio, M., Caldas, D.F. and Sacramento, K.T. (2016) ‘Simulation model of discrete events
applied to the planning and operation of a toll plaza’, Journal of Transport Literature, Vol. 10,
pp.40–44.
Arnold, A. and Abe, N. (2007) ‘Temporal causal modeling with graphical granger methods’,
Knowledge Discovery in Databases, Vol. 10, pp.66–75.
48 A. Mor and M. Kumar
Badhrudeen, M., Raj, J. and Vanajakshi, L.D. (2016) ‘Short-term prediction of traffic parameters –
performance comparison of a data-driven and less-data-required approaches’, Journal of
Advanced Transportation, Vol. 50, pp.647–666.
Bartin, B. (2018) ‘The use of learning classifier systems in microscopic toll plaza simulation
models’, Transportation Research Annual Meeting, Vol. 97, pp.1–6.
Bezuglov, A. and Comert, G. (2016) ‘Short-term freeway traffic parameter prediction: application
of grey system theory models’, Expert Systems with Applications, Vol. 62, pp.284–292.
Bharti, M., Saxena, S. and Kumar, R. (2017) ‘Intelligent resource inquisition framework on
internet-of-things’, Computer Electric Engineering, Vol. 58, pp.265–281.
Breiman, L.E.O. (2001) ‘Random forests’, Machine Learning, Vol. 45, pp.5–32.
Chakroborty, P., Gill, R. and Chakraborty, P. (2016) ‘Analyzing queuing at toll plazas using a
coupled, multiple-queue, queuing system model: application to toll plaza design’,
Transportation Planning and Technology, Vol. 39, pp.675–692.
Chang, K. and Bong, S. (1996) ‘An intelligent approach to time series identification by a neural
network driven decision tree classifier’, Decision. Support Systems, Vol. 17, pp.183–197.
Christopher, I., Scofield, L. and Us, W.A. et al. (2016) Predicting Expected Road Traffic
Conditions Based on Historical and Current Data, US 9,257,041B2.
Cortes, C. and Vapnik, V. (1995) ‘Support-vector networks’, Machine Learning, Vol. 20,
pp.273–297.
Dombi, J. and Zsiros, A. (2005) ‘Learning multicriteria classification models from examples:
decision rules in continuous space’, European Journal of Operational Research, Vol. 160,
pp.663–675.
Fosgerau, M. and Fukuda, D. (2012) ‘Valuing travel time variability: characteristics of the travel
time distribution on an urban road’, Transportation Research Part C: Emerging Technologies,
Vol. 24, pp.83–101.
Freund, Y. and Schapire, R.E. (1997) ‘A decision-theoretic generalization of on- line learning and
an application to boosting*’, Journal of Computer and System Sciences, Vol. 55, pp.119–139.
Giot, R. (2014) ‘Predicting bikeshare system usage up to one day ahead’, in Proceedings of the
IEEE Symposium on Computational Intelligence in Vehicles and Transportation Systems
(CIVTS), pp.1–8.
Guo, F., Krishnan, R. and Polak, J. (2017) ‘The influence of alternative data smoothing prediction
techniques on the performance of a two-stage short-term urban travel time prediction
framework’, Journal of Intelligent Transportation Systems, Vol. 21, pp.214–226.
Guo, F., Polak, J.W. and Krishnan, R. (2018) ‘Predictor fusion for short-term traffic forecasting’,
Transportation Research Part C: Emerging Technologies, Vol. 92, pp.90–100.
Guo, J. and Williams, B.M. (2010) ‘Real-time short-term traffic speed level forecasting and
uncertainty quantification using layered Kalman filters’, Transportation Research Record:
Journal of the Transportation Research Board, Vol. 75, pp.28–37.
Habtemichael, F.G. and Cetin, M. (2016) ‘Short-term traffic flow rate forecasting based on
identifying similar traffic patterns’, Transportation Research Part C: Emerging Technologies,
Vol. 66, pp.61–78.
Habtemichael, F.G. and Cetin, M. (2016) ‘Short-term traffic flow rate forecasting based on
identifying similar traffic patterns’, Transportation Research Part C: Emerging Technologies,
Vol. 66, pp.61–78.
Haworth, J., Shawe-taylor, J., Cheng, T. and Wang, J. (2014) ‘Local online kernel ridge regression
for forecasting of urban travel times’, Transportation Research Part C: Emerging
Technologies, Vol. 46, pp.151–178.
Hoerl, A.E. and Kennard, R.W. (1970) ‘Ridge regression biased estimation for non-orthogonal
problems’, Technometrics, Vol. 12, pp.55–67.
Huang, H. and Abdel-aty, M. (2014) ‘Multilevel data and Bayesian analysis in traffic safety’,
Accident, Analysis and Prevention, Vol. 42, pp.1556–1565.
Multivariate short-term traffic flow prediction 49
Jiber, M., Lamouik, I., Ali, Y. and Sabri, M.A. (2018) ‘Traffic flow prediction using neural
network’, Proceedings of the International Conference on Intelligent Systems and Computer
Vision (ISCV), pp.1–4.
Jordan, M., Bishop, M., Kleinberg, J. and Schölkopf, B. (2006) ‘Pattern recognition and machine
learning’, in Jordan, M., Kleinberg, J., Schölkopf, B. (Eds): Information Science and
Statistics, 1st ed., Springer, pp.144–194.
Kamarianakis, Y., Shen, W. and Wynter, L. (2012) ‘Real-time road traffic forecasting using
regime-switching space-time models and’, Applied Stochastic Models in Business and
Industry, Vol. 28, pp.297–315.
Kim, C., Kim, D., Kho, S., Kang, S. and Chung, K. (2016) ‘Dynamically determining the toll plaza
capacity by monitoring approaching traffic conditions in real-time’, Journal of Applied
Sciences, Vol. 6, pp. 87–90.
Kim, J. and Wang, G. (2016) ‘Diagnosis and prediction of traffic congestion on urban road network
using Bayesian networks’, Transportation Research Record: Journal of the Transportation
Research Board, Vol. 95, pp.108–118.
Kumar, S.V. and Vanajakshi, L. (2015) ‘Short-term traffic flow prediction using seasonal ARIMA
model with limited input data’, European Transport Research Review, Vol. 7, pp.21–25.
Lécué, F., Tallevi-diotallevi, S. and Hayes, J. et al. (2014) ‘Web semantics: science, services and
agents on the world wide web smart traffic analytics in the semantic web with STAR-CITY:
scenarios, system and lessons learned in Dublin City’, Web Semantic: Science, Services and
Agents on the World Wide Web, Vols. 27/28, pp.26–33.
Leshem, G., Ritov, Y. and Adaboost, A. (2007) ‘Traffic flow prediction using adaboost algorithm
with random forests as a weak learner’, Proceeding World Academy of Science, Engineering
and Technology, Vol. 21, pp.193–198.
Li, L., Chen, X. and Zhang, L. (2014) ‘Multi-model ensemble for freeway traffic state estimations’,
IEEE Transactions on Intelligent Transportation Systems, Vol. 15, pp.1323–1336.
Li, L., Su, X., Wang, Y., Lin, Y., Li, Z. and Li, Y. (2015) ‘Robust causal dependence mining in big
data network and its application to traffic flow predictions’, Transportation Research Part C:
Emerging Technologies, Vol. 58, pp. 292–307.
Lippi, M., Bertini, M. and Frasconi, P. (2013) ‘Short-term traffic flow forecasting: an experimental
comparison of time-series analysis and supervised learning’, IEEE Transactions on Intelligent
Transportation Systems, Vol. 14, pp.871–882.
Liu, X., Chien, S.I. and Chen, M. (2014) ‘An adaptive model for highway travel time prediction’,
Journal of Advanced Transportation, Vol. 48, pp. 642–654.
Lu, X. and Shladover, S.E. (2014) ‘Review of variable speed limits and advisories’, Transportation
Research Record, Vol. 24, pp.15–23.
Ma, X., Tao, Z., Wang, Y., Yu, H. and Wang, Y. (2015) ‘Long short-term memory neural network
for traffic speed prediction using remote microwave sensor data’, Transportation Research
Part C: Emerging Technologies, Vol. 54, pp.187–197.
Mingheng, Z., Yaobao, Z., Ganglong, H. and Gang, C. (2013) ‘Accurate multistep traffic flow
prediction based on SVM’, Mathematical Problems in Engineering, Vol. 20, pp.1–8.
Müller, A.C. and Guido, S. (2017) Introduction to Machine Learning with Python, O‘Reilly Media.
Myung, J., Kim, D., Kho, S. and Park, C. (2011) ‘Travel time prediction using k nearest neighbor
method with combined data from vehicle detector system and automatic toll collection
system’, Transportation Research Record: Journal of the Transportation Research Board,
Vol. 56, pp.51–59.
Polson, N.G. and Sokolov, V.O. (2017) ‘Deep learning for short-term traffic flow prediction’,
Transportation Research Part C: Emerging Technologies, Vol. 79, pp.1–17.
Qi, Y. and Ishak, S. (2014) ‘A Hidden Markov Model for short term prediction of traffic conditions
on freeways’, Transportation Research Part C: Emerging Technologies, Vol. 43, pp.95–111.
Quinlan, J.R. (1986) ‘Induction of decision trees’, Machine Learning, Vol. 1, pp.81–106.
50 A. Mor and M. Kumar
Raza, A. and Zhong, M. (2018) ‘Lane-based: short-term urban traffic parameters forecasting using
multivariate artificial neural network and locally weighted regression models: a genetic
approach’, The Canadian Journal of Civil Engineering, Vol. 12, pp.32–38.
Smith, B.L. and Demetsky, M.J. (1997) ‘Traffic flow forecasting comparison of modeling
approaches’, Journal of Transportation Engineering, Vol. 123, pp.261–266.
Stearns, B., Rangel, F. and Rangel, F. (2017) ‘Scholar performance prediction using boosted
regression trees techniques’, European Symposium on Artificial Neural Network,
Computational Intelligence and Machine Learning, Vol. 5, pp.26–28.
Su, S.S.F.C., Tsai, H.N.P. and Cheng, C. (2018) ‘Using machine learning and big data approaches
to predict travel time based on historical and real-time data from Taiwan electronic toll
collection’, Soft Computing, Vol. 22, pp.5707–5718.
Traffic, I., Using, P. and Models, T. (2018) ‘Intersection traffic prediction using decision tree
models’, Symmetry (Basel), Vol. 10, pp.80–87.
Verikas, A., Gelzinis, A. and Bacauskiene, M. (2011) ‘Mining data with random forests: a survey
and results of new tests’, Pattern Recognition, Vol. 44, pp.330–349.
Vlahogianni, E.I., Karlaftis, M.G. and Golias, J.C. (2014) ‘Short-term traffic forecasting: where
we are and where we are going’, Transportation Research Part C: Emerging Technologies,
Vol. 43, pp.3–19.
Wang, Y., Ning, J. and Li, S. (2017) ‘Research on merging pattern after toll based on simulation’,
Journal of Advances in Social Science, Education and Humanities Research, pp.509–515.
Wu, C., Ho, J. and Lee, D.T. (2004) ‘Travel-time prediction with support vector regression’, IEEE
Trans. Intell. Transp. Syst., Vol. 5, pp.276–281.
Wu, Y., Tan, H., Qin, L., Ran, B. and Jiang, Z. (2018) ‘A hybrid deep learning based traffic flow
prediction method and its understanding’, Transportation Research Part C: Emerging
Technologies, Vol. 90, pp.166–180.
Xu, Y., Kong, Q. and Liu, Y. (2014b) ‘Short-term traffic volume prediction using classification and
regression trees short-term traffic volume prediction using classification and regression trees’,
IEEE Intelligent Vehicle Symposium, Vol. 4, pp.493–498.
Xu, Y., Kong, Q., Klette, R. and Liu, Y. (2014a) ‘Accurate and interpretable Bayesian MARS for
traffic flow prediction’, IEEE Transactions on Intelligent Transportation Systems, Vol. 15,
pp.2457–2469.
Zhan, H., Gomes, G., Li, X.S., Madduri, K., Sim, A. and Member, S. (2018) ‘Consensus ensemble
system for traffic flow prediction’, IEEE Transactions on Intelligent Transportation Systems,
Vol. 19, pp.3903–3914.
Zhang, G., Patuwo, B.E. and Hu, M.Y. (1998) ‘Forecasting with artificial neural networks: the state
of the art’, International Journal of Forecasting, Vol. 14, pp.35–62.
Zhang, J., Zheng, Y. and Qi, D. (2017) ‘Deep spatio-temporal residual networks for citywide
crowd flows prediction’, AAAI Conference on Artificial Intelligence, Vol. 31, pp.1655–1661.
Zhao, Z., Chen, W., Wu, X., Chen, P.C.Y. and Liu, J. (2017) ‘LSTM network: a deep
learning approach for short-term traffic forecast’, IET Intelligent Transport Systems, Vol. 11,
pp. 68–75.
Zhu, D., Dong, F., Shi, W. and Zheng, S. (2017) ‘The model of toll station planning’, Journal of
Advances in Intelligent System Research, pp.390–394.