Professional Documents
Culture Documents
The Research on Stock Price Forecast Model Based on Data Mining of BP Neural
Networks
Wu Ming-tao,Yang Yong
Economic and Management College Northeast Petroleum University, Daqing, Heilongjiang, 163318, China
dqwumingtao@126.com
Abstract—The model of stock price forecast based on data hidden layer and input layer has effect on the input layer
mining of BP neural networks is put forward in this article. On which from the upper layer, and bring corresponding output.
the basis of an integrated data mining process of selection of
data samples, data conversion, network modeling, network
simulation, and evaluation of results, the prediction about the
trend of SSE(Shanghai Stock Exchange) Composite Index
provides a higher accuracy. It indicates that the use of data
mining of BP neural network in the forecast of non-linear
system has advantages, so will it provide a new idea for the
forecast of non-linear system.
Keywords-Neural networks; BP algorithm; Data mining; Figure 1. The topological structure of BP neural network
Stock price forecast
B. Process of Data Mining
I. INTRODUCTION We ensure the general process, based on the data mining
Stock market forecast is a branch of economy forecast. It technology of BP neural network, by referring to the 5A
is the forecast of prospect of stock market’s future model of corporation SPSS and SEMMA model of research
development, based on accurate investigation of statistical institution SAS, and combining with of data mining
data and stock market information, starting with stock technology of neural network to its own features. See in
market history, status quo and regularity, and applying figure 2.
scientific methods. At present, the most frequently used
forecast method is developed through some linear methods,
like moving average and regression analysis. Although
researchers have done much work in the application of
neural networks in stock price forecast, the present research
is mainly focused on the construction and optimizing of
neural network model and the settlement of technical
aspects, but not the discussion on the solution of problems in
this kind.
In this article, based on the process of data mining and
regarding the neural network model as key technology, the
trend of SSE(Shanghai Stock Exchange) Composite Index is Figure 2. The data mining process based on BP neural network
quantitatively forecasted by empirical analysis.
II. DATA MINING PROCESS BASED ON NEURAL III. SAMPLE DATA
NETWORK
A. Selection of Sample Data
A. BP Neural Networks Model In terms of stock market, the selection of sample data has
Artificial neural network is a newly emerging Inter- two principles to follow: the first is to choose samples that
discipline subject, which is developed as the non-linear accord with trade rules and have distinct trade characteristics;
information processing system based on the imitation of the the second is to take the performance of neural network
intelligent structure and function of human brains. The model into account. So this test selects continuous
topological structure of BP neural network is shown in SSE(Shanghai Stock Exchange) composite index of 100
figure 1. There are connections among neurons from input days, from Nov 30th,2009 to Apr 29th ,2010 as sample data
layer to hidden layer, and hidden layer to output layer, and to be mined and divide them into training sample and test
the strength of the connections is determined by sample according to the actual need.
corresponding weight. The activation function of neurons of
1527
4) the activation function is Sigmoid().
VI. NETWORK SIMULATION
By using the optimized BP network model, 90
continuous business days are chosen from selected data
samples as training samples, from Nov 30th,2009 to Apr
15th,2010 and then to predict the closing price of ten
business day from 18th Apr,2010 to 29th Apr.
A. Network Training
By using the optimized BP network model, training data
Figure 3. Time series chart of closing price index
samples are imported, then to train the network and keep the
trained network structure and training results. The results of
analysis of network training are as follows: maximum B. Limitation
absolute error is 17.01; minimum absolute error is 0.03; In order to analyze the limitation of the model,
maximum proportional error is 1.366%; minimum histogram of prediction error is made as figure 4. It can be
proportional error is 0.005%. From what has mentioned clearly seen in the chart that as time goes on, the absolute
above, the training effect of network is preferable and the error and proportional error between prediction value and
trained network structure can be used to carry on true value present a tendency of rise overall, which shows
forecasting of next step. that the model doesn’t suit the prediction of medium-and
B. Network Testing long-term tendency. Combining the features of BP model
with the major factors that influence the tendency of stock
The closing price index can be forecasted by importing price, the reasons that the technology of data mining of
trained network and test samples into neural network neural network is of noneffective to medium-and long-term
simulation tool. In order to analyze the prediction accuracy prediction can be concluded as follows:
of BP model, the prediction value and true value of network 1) In this trial, some critical factors that have influences
model are imported into SPSS software to make time series on stock market are not considered into the selection of
chart. See in figure 3. input vectors, such as economic sentiment index, external
From figure 3, it can be known that the prediction value market environment and emergency incidents. Obviously it
of network model is same as the actual value of closing will to some extent impact the prediction accuracy of the
price, which indicates that the technology based on data model.
mining of neural networks is feasible and practical in stock 2) The samples may contain some random samples that
price prediction. don’t fit with the laws of economic. It will make the
VII. RESULTS EVALUATION network model collect some particular features and will do
nothing to efficient promotion to general situations.
At the last phase of data mining, knowledge of 3) Although preferable learning accuracy can be
professional field should be applied to the analysis and obtained when the network model converge to its extreme
evaluation to the results of data mining, so as to make it value, it will cause the overfitting of sample space and lead
meet the need of practical use. to a bad effect of the promotion of the prediction samples.
A. Predictive Effect
It can be known from figure 3 that the prediction value
of model keeps in accordance with the change of actual
value, which indicates that in short time the model
established in this test can imitate the short-time tendency
of stock market. By further analysis and prediction of
accuracy, the short-time prediction accuracy of the model
can reach a rate of over 94%. Therefore, under the
circumstance of no significant change of macro-economic,
this prediction model will provide the investors with Figure 4. Histogram of prediction error
preferable basis for decision making.
In addition, the fierce fluctuation of China’s stock
market and the combination of various kinds of factors
make the change of long-term stock index uncertainly. In
terms of improving the prediction accuracy, the further
optimization of BP network model can be considered and
1528
then higher practicability of it can be achieved through the
combination with other technology. This is also the major
research direction at present.
VIII. CONCLUSIONS
This article applies the data mining technology of neural
network to stock price forecast and receives a preferable
result, which will provide the research of the stock market
development a new thought.
1529