You are on page 1of 3

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/339385539

Bitcoin Price Forecasting: A Comparative Study Between Statistical and


Machine Learning Methods

Conference Paper · February 2020


DOI: 10.1109/ICCIS49240.2020.9257664

CITATIONS READS

2 473

4 authors:

Waddah Saeed Habib Shah


De Montfort University King Khalid University
26 PUBLICATIONS   201 CITATIONS    72 PUBLICATIONS   778 CITATIONS   

SEE PROFILE SEE PROFILE

Mohammed Jabreel Domenec Puig


Universitat Rovira i Virgili Universitat Rovira i Virgili
37 PUBLICATIONS   525 CITATIONS    320 PUBLICATIONS   4,552 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Pectoral Muscle Segmentation in Tomosynthesis Images Using Geometry Information and Grey Wolf Optimizer View project

Supervised learning with neural networks View project

All content following this page was uploaded by Waddah Saeed on 20 February 2020.

The user has requested enhancement of the downloaded file.


Bitcoin Price Forecasting: A Comparative Study
Between Statistical and Machine Learning Methods
Waddah Waheeb Habib Shah Mohammed Jabreel Domenec Puig
School of Computing Department of Computer Science IRCV - Intelligent Robotics and IRCV - Intelligent Robotics and
Asia Pacific University of College of Computer Science Computer Vision Group Computer Vision Group
Technology & Innovation King Khalid University Universitat Rovira i Virgili Universitat Rovira i Virgili
57000 Bukit Jalil, Kuala Lumpur, 62529 Abha, Saudi Arabia 43007, Tarragona, Spain 43007, Tarragona, Spain
Malaysia habibshah.uthm@gmail.com mohammed.jabreel@urv.cat domenec.puig@urv.cat
waddah.waheeb@gmail.com

Abstract— This paper presents a comparative study between statistical and machine learning methods in forecasting Bitcoin's
closing prices. Thirteen forecasting methods namely average, naive, drift, auto-regressive integrated moving-average, simple
exponential smoothing (SES), Holt, and damped exponential smoothing, the average of SES, Holt and damped methods,
exponential smoothing (ETS), bagged ETS, Theta, multilayer perceptron, and extreme learning machines (ELM) were used to
forecast the closing prices for the next 14 days. The findings of this study are three folds. First, there are seven forecasting methods
outperformed the naive method namely MLP, ELM, damped exponential smoothing, simple exponential smoothing, Theta, ETS,
and ARIMA. Second, MLP and ELM showed better forecasting accuracy on both validation and out-of-sample data among the
forecasting methods used in this study. Third, the size of the training data is essential factor that should be considered when
training forecasting methods.

Keywords— Bitcoin, Cryptocurrency, Statistical methods, Machine learning methods

I. INTRODUCTION
Bitcoin is the most popular cryptocurrency that allows doing online payment transactions without requiring a third-party
involvement [1]. There is considerable attention about Bitcoin since the significant increase in December 2017, which was
accompanied by a tremendous increase both in the number of Bitcoins in circulation and the Bitcoin market capitalization [2].
Various methods were used to forecast Bitcoin price or direction such as auto-regressive (AR), moving-average (MA),
autoregressive integrated moving average (ARIMA), neural networks, neuro-fuzzy technique, support vector machines (SVM),
and deep learning [2] – [6]. However, and according to [2], since the Bitcoin market is a memoryless system, it is impossible to
forecast Bitcoin’s prices on a daily basis. However, in this paper, we are interested in investigating the following research
questions:
 Are there forecasting methods that can outperform the naive method for forecasting the next 14 days for Bitcoin
closing prices?
 Which forecasting method can produce better forecasts among some conventional statistical and machine learning
methods?
 Does the size of training data can affect the forecasting accuracy for the forecasting methods?
The importance of this research is to determine if there is a difference in the forecasting accuracy between the naive method
and some conventional statistical and machine learning methods? This could help to suggest using some forecasting method to
produce more accurate forecasts compared to the naive method.
To achieve that, Bitcoin's closing price historical data between Nov. 24, 2013 and Nov. 24, 2019 was collected to train and
test thirteen forecasting methods. These methods are: average, naive, drift, auto-regressive integrated moving-average, simple
exponential smoothing (SES), Holt and damped exponential smoothing, the average of SES, Holt, and damped methods,
exponential smoothing (ETS), Bagged ETS, Theta, multilayer perceptron, and extreme learning machines methods.
The rest of this research work is organized as follows. The related works are described in Section II. The experimental
design is given in Section III. Section IV presents and discusses the obtained results. Finally, the conclusion is given in Section
V.
References [11] N. Kourentzes, nnfor: Time Series Forecasting with Neural Networks.
R package version 0.9.6, https://cran.r-
project.org/web/packages/nnfor/index.html.
[1] S. Nakamoto, Bitcoin: A peer-to-peer electronic cash system, 2008.
[12] G. Box, G. M. Jenkins, Time series analysis: forecasting and control.
[2] G.S. Atsalakis, I.G. Atsalaki, F. Pasiouras, C. Zopounidis, ``Bitcoin San Franscisco: Holden Day, 1970.
price forecasting with neuro-fuzzy techniques,'' European Journal of
Operational Research, 276(2), pp.770-780, 2019. [13] R.J. Hyndman, Y. Khandakar, ``Automatic time series forecasting:
The forecast package for R,'' Journal of Statistical Software, 26(3),
[3] S. Roy, S. Nanjiba, A. Chakrabarty, ``Bitcoin Price Forecasting Using 2008.
Time Series Analysis,'' In 2018 21st International Conference of
Computer and Information Technology (ICCIT), pp. 1-5, IEEE, 2018. [14] E.S. Gardner, ``Exponential smoothing: the state of the art,'' Journal
of Forecasting, 4(1), pp. 1-28, 1985.
[4] D.U. Sutiksno, A.S. Ahmar, N. Kurniasih, E. Susanto, A.
Leiwakabessy, ``Forecasting Historical Data of Bitcoin using ARIMA [15] E.S. Gardner, ``Exponential smoothing: The state of the art-Part II,''
and α-Sutte Indicator,'' In Journal of Physics: Conference Series (Vol. International Journal of Forecasting, 22(4), pp. 637-666, 2006.
1028, No. 1, p. 012194), IOP Publishing, 2018. [16] R.J. Hyndman, A.B. Koehler, R.D. Snyder , S. Grose, ``A state space
[5] N. Hitam, A. Ismail, ``Comparative performance of machine learning framework for automatic forecasting using exponential smoothing
algorithms for cryptocurrency forecasting,'' Ind. J. Electr. Eng. methods,'' International Journal of Forecasting, 18(3), pp. 439-454,
Comput. Sci, 11(3), pp. 1121-1128, 2018. 2002.
[6] C.C. Tun, N. Majid, ``Comparison between artificial neural network [17] C. Bergmeir, R. J. Hyndman, J. M. Benítez, ``Bagging exponential
and autoregressive integrated moving average model in bitcoin price smoothing methods using STL decomposition and Box-Cox
forecasting,'' Journal of Quality Measurement and Analysis JQMA, transformation,'' International Journal of Forecasting, 32(2), pp. 303–
14(2), pp.45-53, 2018. 312, 2016.
[7] Bitcoin historical data. [18] V. Assimakopoulos, K. Nikolopoulos, ``The theta model: a
https://coinmarketcap.com/currencies/bitcoin/historical-data/. Last decomposition approach to forecasting,'' International Journal of
Accessed: 2019-12-07. Forecasting, 16(4), pp. 521-530, 2000.
[8] Team RC, R: A language and environment for statistical computing. [19] S. Makridakis, M. Hibon, ``The M3-Competition: results, conclusions
and implications,'' International journal of forecasting, 16(4), pp.451-
[9] R. Hyndman and G. Athanasopoulos, Forecasting: principles and 476, 2000.
practice. OTexts, 2018.
[20] G.B. Huang, Q.Y. Zhu, C.K. Siew, ``Extreme learning machine:
[10] R. Hyndman , G. Athanasopoulos, C. Bergmeir, G. Caceres, L. theory and applications,'' Neurocomputing, 70, pp. 489–501, 2006.
Chhay, M. O'Hara-Wild, F. Petropoulos, S. Razbash, E. Wang, F.
Yasmeen, forecast: Forecasting functions for time series and linear [21] S. Makridakis, ``Accuracy measures: theoretical and practical
models. R package version 8.10, concerns,'' International Journal of Forecasting, 9(4), pp. 527-529,
http://pkg.robjhyndman.com/forecast. 1993.
[22] S. Hochreiter, J. Schmidhuber, ``Long short-term memory,'' Neural
computation, 9(8), pp. 1735-1780, 1997.

View publication stats

You might also like