Professional Documents
Culture Documents
56 (2016),
ISIJ International,
No. 12 Vol. 56 (2016), No. 12, pp. 2113–2117
Xiaoxian HUANG, Xiaohui FAN,* Xuling CHEN, Guiming YANG and Min GAN
School of Minerals Processing and Bioengineering, Central South University, Changsha, Hunan, 410083 China.
(Received on April 1, 2016; accepted on August 4, 2016)
The bed permeability state prediction model of sintering process based on data mining technology was
proposed in this study. Firstly, the sintering production data were analyzed by fuzzy clustering algorithm,
to make a comprehensive evaluation of the bed permeability state. Then the prediction model of bed
permeability state was established via support vector machine, based on the sample data that obtained
in the cluster analysis. The bed permeability prediction model has a good learning and generalization abil-
ity, its prediction hit rate reached 87.5%. The practical application showed that: the sintering process could
be stabilize effectively, since the operation parameters was adjusted according to the prediction results of
bed permeability state; the standard deviation of burn through temperature and burn through point was
decreased by 47% and 34% respectively.
KEY WORDS: iron ores sintering; bed permeability; prediction model; data mining.
tent and coke breeze ratio of mixture, the way of granulation the sintering production, xk(k = 1,2,…,n) is a 3-dimensional
and distributing. It means that the permeability state can be vector, each feature value represents the burn through point,
predicted by the variables such as mixture size, moisture burn through temperature and air quantity respectively.
of mixture, coke breeze ratio, mixture temperature, bed The data set X is divided into 5 clusters, V = {v1,v2,…, v5},
height, negative pressure of ignition furnace and No.1 wind and vi(i = 1,2,…,5) is the center of each cluster. The fuzzy
box. The relationship between bed permeability state and membership function µik ∈( 0,1) is used to describe the
other sintering parameters is shown in Fig. 1. Where, the membership degree between sample data xk and cluster cen-
mixture size is the average particle size of mixture, and it ter vi. Fuzzy C-means clustering algorithm (FCM) is using
is calculation results according to the size composition and iterative optimization method that based on the least square
proportion of raw materials; the moisture of mixture refers principle to divide the data set. The expression of objective
to the water content of mixture after mixing and granulating; function Jm is as follows:
the coke breeze ratio refers to the proportion of coke breeze 5 n
in the mixture; the mixture temperature is the temperature of J m (U ,V ) = ∑ ∑ µikm dik2 ....................... (1)
mixture after steam preheating; the bed height is the height i =1 k =1
of mixture bed on the sintering pallet before ignition; the Where, dik = xk − vi is represent the Euclidean distance of
negative pressure of ignition furnace and No.1 wind box sample data xk and cluster center vi; m(m > 1) is a weighted
refers to the pressure values that detected in the ignition index, which is used to adjust the degree of membership
furnace and No.1 wind box. degree in different fuzzy cluster. The sum of all the mem-
Clustering and classification are both commonly used bership degree of any one sample data for every cluster must
data mining methods. Clustering is to divide a data set into be 100%, as follows
different classes or clusters according to a particular stan- 5
dard, and fuzzy clustering is to determine the degree of each ∑ µik = 1, ∀k = 1, 2, n ....................... (2)
i =1
data belonging to each cluster via membership function. Using Lagrange multiplication to solve the optimization
Classification is a supervised learning method, which is to problem of minimizing the objective function Jm under
find the classification rules by training the data sets that has above constrains. The fuzzy membership function μik and
known classes, and to predict the class of new data. Support cluster center vi can be written as
vector machine (SVM) is a classification algorithm, and a 1
new kind of machine learning method which is based on the µik = 2
, ∀i = 1, 2,, 5; ∀k = 1, 2,, n
5
dik m −1 .... (3)
structure risk minimization principle of statistical learning
theory. SVM can solve the problem of high dimensional and ∑ d jk
j =1
local optimal, and has good learning ability and generaliza-
n
tion ability. In this study, the fuzzy clustering was used to
analyze the variables that can reflect the bed permeability ∑ µikm xk
state in sintering process, to realize the comprehensive vi = k =1
n
, ∀i = 1, 2,, 5 .................... (4)
evaluation of permeability state, and the SVM was used ∑µ m
ik
k =1
to establish the prediction model of bed permeability state.
The optimal fuzzy classification matrix and cluster
centers are obtained through iterative algorithm according
3. Modeling
to formula (3) and formula (4), and finally complete the
3.1. Evaluation of Bed Permeability State fuzzy clustering division. In addition, considering the FCM
For the sample data set X = {x1, x2,…, xn} that extracted from algorithm is a local search algorithm essentially. It is very
Fig. 1. The relationship between bed permeability state and other sintering parameters.
sensitive to the initial value of cluster center, and easy to fall onto the 2-dimensional plane, and the different clusters were
into local minimum point. Therefore, the genetic algorithm, marked in different colors and shapes. It can be seen that
which has the strong global search ability, was introduced the fuzzy clustering algorithm can divide the sintering data
to search the whole state space in this study, which made set into several cluster effectively. Fuzzy clustering model is
the individual get rid of the local optimal solution. The an effective method to evaluate the bed permeability state,
algorithm flow as illustrated in Fig. 2. In this algorithm, the and it can provide important data for establish the predic-
genetic algorithm is used to optimize the cluster center. So tion model.
in the population initialization process, the cluster centers
are randomly initialized according to the population size. 3.2. Prediction of Bed Permeability State
The algorithm evaluates the individual by the objective The sample data set {(xi,yi), i = 1,2,…,N} can be set up
function of fuzzy C-means clustering, that the fitness func- by combining the fuzzy clustering results of bed perme-
tion can be expressed by formula (1). The iterative process ability and sintering production data, where xi is the input
is to do the genetic operations included selection, crossover parameters, and yi is the output parameter. xi is a 7-dimen-
and mutation, to find the best cluster center and optimized sional vector, each feature value represents the mixture
the objective function of fuzzy C-means clustering. size, moisture of mixture, coke breeze ratio, bed height,
The sample data of clustering were selected from the his- mixture temperature, negative pressure of ignition furnace
torical data of sintering production of Steel Corp. in China, and No.1 wind box; yi represents the bed permeability state;
where the air quantity is represented by the frequency of N represents the total number of samples. For the nonlinear
exhaust fan. Accordingly, three feature parameters that
reflect the bed permeability state are frequency of exhaust
fan, burn through point and burn through temperature. For Table 1. The clustering centers of bed permeability state.
the fuzzy C-means clustering, the weighted index m in Bed permeability Frequency of Burn through Burn through
formula (1) is 3, and the number of iterations is 20. For state exhaust fan (Hz) point (m) temperature (K)
the genetic algorithm, the population size, which means Very good 42.84 39.26 625.15
the group number of cluster centers, is 20, the maximum
Good 43.17 39.78 597.87
generation, which means the highest number of iterations,
is 100, the crossover probability is 0.7, and the mutation Normal 43.71 38.87 647.64
probability is 0.05. After clustering analysis via above algo- Bad 44.96 39.02 626.09
rithm, the objective function Jm = 89.43, and the results of Very bad 45.77 39.45 591.24
multiple runs were very close, which means the algorithm
can converge to the global optimal point. Table 1 shows
the clustering centers, and the corresponding relationship
between each cluster and bed permeability state can be con-
firmed according to expert experience. The clustering results
as shown in Fig. 3, the 3-dimensional data set was projected
separable cases, the input vectors are mapped to a high Only small parts of the solution ai of formula (9) are not
dimensional feature space using the nonlinear function ϕ, zero, and the corresponding samples xi are the support vec-
and then one optimal hyper-plane is constructed in the high tor. Thus, the optimal classification function is as follows:
dimensional feature space:
l
f ( x ) = sgn ∑ ai yi K ( xi , x ) + b .............. (10)
ω T ⋅ φ ( x ) + b = 0 ............................ (5) i =1
Taking into account that some samples cannot be cor- Where sgn is the symbolic function, l is the number of
rectly separated in the case of linear non-separable, the support vectors, and b is the classification threshold. The
slack variables ξi ≥ 0, i = 1,2,…, N were introduced. Finally, model structure is shown in Fig. 4.
the classification problem of original sample space can be The prediction model of bed permeability state was
expressed as: established based on the above SVM classification algo-
rithm. According to the sintering production data and the
yi [ω T ⋅ φ ( x ) + b] ≥ 1 − ξi , i = 1, 2,, N ............. (6) clustering result of bed permeability state, 700 sets of data
According to the principle of construction risk minimiza- samples were constructed, among which 500 sets were used
tion, the minimum risk bounds of the classification problem as training sample, and the other 200 sets were used as test-
can be obtained from the following optimization problem: ing sample. The kernel function of support vector machine
N is RBF kernel, and the model parameters are optimized by
1
min J (ω , ξ ) = ω T ω + c ∑ ξi .................. (7) cross validation method. The prediction results of the model
2 i =1 are shown in Fig. 5. Where, the abscissa represents sample
Through a series of optimization methods, such as intro- number, which was optionally given, and the ordinate rep-
ducing the Lagrange function, the optimization problem is resents bed permeability state. Each sample has an actual
transformed into solving the maximum value of following state and a predict state, expressed in different colors and
function, under the constraint conditions of 0 ≤ ai ≤ c and shapes in this figure. When the predict state and actual state
n
matches, it indicates that the prediction result is correct,
∑ ai yi = 0 . Where ai is a Lagrange multiplier. otherwise it is not correct.
i =1
1 N N
Q a, φ ( xi ) = − ∑ yi y j φ ( xi ) φ ( x j ) ai a j + ∑ ai ...... (8)
T
2 i , j =1 i =1
max Q ( a , K ( xi , x j ) ) = − ∑
2 i , j =1
yi y j K ( xi , x j ) ai a j + ∑ ai
i =1
N
s.t. ∑ ai yi = 0 , i = 1, 2,, N
i =1
0 ≤ ai ≤ c
.......................................... (9)
Fig. 5. The prediction results of bed permeability state.