You are on page 1of 3

STOCK PRICE PREDICTION

I. INTRODUCTION Network (ANN) including Single Layer


Stock market is basically nonlinear in na- Perceptron (SLP), Multi-layer Perceptron
ture and the research on stock market is one (MLP), Radial Basis Function (RBF) and
of the most important issues in recent years. Deep Belief Network (DBN). Other tech-
People invest in stock market based on niques include Support Vector Machine
some prediction. To predict, the stock mar- (SVM), Decision Tree and Naï ve Bayes.
ket prices people search such methods and All these techniques were compared to find
tools which will increase their profits, the best predicting model. The results
while minimize their risks. Prediction plays showed that MLP performed best and pre-
a very important role in stock market busi- dicted the market with accuracy of 77%.
ness which is very complicated and chal- Each factor was studied independently to
lenging process. Employing traditional find out its association with market perfor-
methods like fundamental and technical mance. The change in Petrol prices showed
analysis may not ensure the reliability of the strongest association with market per-
the prediction. To make predictions regres- formance. The results suggested that behav-
sion analysis is used mostly. We survey a iour of market can be predicted using ma-
well-known efficient regression approach chine learning techniques.
to predict the stock market price from stock
market data based. In future the results of III. METHODOLOGY
multiple regression approach could be im- The aim of our project is to help the stock
proved using more number of variables. brokers and investors for investing money
II. ABSTRACT in the stock market. The prediction plays a
very important role in stock market busi-
One decision in Stock Market can make ness which is very complicated and chal-
huge impact on an investor's life. The stock lenging process due to dynamic nature of
market is a complex system and often cov- the stock market.
ered in mystery, it is therefore, very diffi-
cult to analyze all the impacting factors be- Machine learning algorithms for stock
fore making a decision. In this research, we market prediction:
have tried to design a stock market predic- To test our main hypothesis we used two
tion model which is based on different fac- machine learning algorithms which allow
tors. The model was built to predict perfor- us to classify days by appearance of events
mance of KSE-100 index. The prediction and use created model for prediction. They
model predicts market as positive or nega- are Neural Networks and Support Vector
tive with the help of different attributes. Machine. In order to answer the question:
These factors include price fluctuation of “Do sentiment analysis of tweets provide
fuel, commodity, foreign exchange, interest additional information?”, and “will the pre-
rate, general public sentiment, related dicted stock be accurate or approximately
NEWS and Auto-Regressive Integrated equal?” we use learning algorithms on three
Moving Average (ARIMA) and Simple sets of data. The first set of data were the
Moving Average (SMA) predicted values characteristics of stock market in previous
with help of historical data of the market. days, we call it basic set (Basic). The sec-
The techniques used for prediction include ond set was created by adding a normalized
four different versions of Artificial Neural number of tweets with words “Worry”,
“Hope”, “Fear” to the basic set other factors. First, it could be that infor-
(Basic&WHF). The third set was created mation about application of Twitter for
by adding a normalized number of tweets DJIA become available to trading society in
from each of 8 categories of the following 2010 and now this analysis technique could
emotions: “happy”, “loving”, “calm”, “en- not consistently beat the market as some of
ergetic”, “fearful”, “angry”, “tired”, “sad” traders already used it. Partially this could
(Basic&8EMO). We expect that the com- confirm efficient market hypothesis. Sec-
parison between accuracy of predictions ond, probably we need to extend training
based on our three learning sets will be dif- period from 60 days to several month like
ferent. Bollen and his colleagues did. Third, we
could not compare performance directly be-
III. ANALYSIS cause proprietary nature their algorithm and
Prediction of the growth of stock market further improvement of our sentiment ana-
We started by generating data sets. First, lyzer needed. However, we found out that
we filtered tweets only from business days, Support Vector Machine provide a little
and wrote a Java-script to generate the data better prediction accuracy of S&P500 indi-
sets. Each data set had 7 sub tables for lag cator (62.03%), than 51.88% demonstrated
in time from one to seven days. To apply by Ding et Al. [3]. We found that our Twit-
Support Vector Machine, and Neural Net- ter analyzer could provide significantly
works algorithms we divided the days into higher accuracy of prediction and could not
two groups by adding a variable growth confirm our hypothesis, as we found no sig-
(0,1), (1 when the opening price was lower nificant differences in average accuracy of
than price at close, 0 when the opening predictions based on all three data sets. Our
price was higher than or equal to the price research provides a new argument about
at close). We divided the analyzed period in potential possibility to improve predication
to data sets contained 61 days. We used the of stock market indicators using human
first 60 days as a training sample and 1 day sentiments analysis. Although, we think
as test sample. Analyzed period allow us to that at this point it is too early to suggest
conduct more than 70 prediction experi- that Twitter sentiment analysis could not to
ments. improve forecasts and more experiments
are needed. Also it can be seen that further
IV. DISCUSSION experiments will require more effort as it
The application of Twitter data for stock Twitter is growing rapidly: in 2008,
market prediction looks like an attempt to 9,853,498 tweets could represent the period
use a magic crystal ball or unrelated data. from February 28 to December 19th, 2008,
However, it may not be as far-fetched as it and in 2013 for representing period from 13
appears at first sight. Based on work by February until 29 September 2013 we have
Bollen and his colleagues we wanted to to download 755’000 101 tweets. Taking
replicate and expand their results in a wide into account different length eleven months
time frame. Application of sentiment analy- in research of Bollen et al. and eight
sis data for machine learning algorithms al- months in ours, we could estimate that to
lows us to receive maximum accuracy of make a whole year analysis we have to
stock market predictions for DJIA – download and analyze more than one bil-
64.10%. For DJIA our accuracy is below lion of tweets.
87.6% that reported by Bollen and co-au- V. CONCLUSION
thors. This could lead to a conclusion that
probably higher prediction rate demon- The aim of our project is to help the stock
strated by Bollen and co-athours was re- brokers and investors for investing money
lated to a small test period (only 19 days). in the stock market. The prediction plays a
These results could also be explained by
very important role in stock market busi-
ness which is very complicated and chal-
lenging process due to dynamic nature of
the stock market.

VI. REFERENCES
[1] Stock market prediction, Wikipedia
https://en.wikipedia.org/wiki/Stock_mar-
ket_prediction
[2] Deep learning model for machine learn-
ing https://medium.com/mlreview/a-sim-
ple-deep-learning-model-for-stock-price-
prediction-using-tensorflow-30505541d877
[3] Ding, T., Fang, V., & Zuo, D. (2013).
Stock Market Prediction based on Time Se-
ries Data and Market Sentiment. Retrieved
from http://murphy.wot.eecs.northwest-
ern.edu/~pzu918/EECS349/final_dZ
uo_tDing_vFang.pdf [4] Jurafsky, D., &
James, H. (2000). Speech and Language
[4] Processing An Introduction to Natural
Language Processing, Computational Lin-
guistics, and Speech. Retrieved from
http://repository.unikom.ac.id/repo/sec-
tor/buku/view/3/key/12110/Sp eech-and-
Language-Processing-An-Introduction-to-
NaturalLanguage-Processing-Computa-
tional-Linguistics-and-SpeechRecogni-
tion.html

You might also like