Professional Documents
Culture Documents
To cite this article: G J Tsekouras, I K Hatzilau & J M Prousalidis (2009) A new pattern recognition
methodology for classification of load profiles for ships electric consumers, Journal of Marine
Engineering & Technology, 8:2, 45-58, DOI: 10.1080/20464177.2009.11020222
In this paper a new pattern recognition methodology is presented for the classification
of the daily chronological load curves of ship electric consumers (equipment) and the
determination of the respective typical load curves of each one of them. It is based on
pattern recognition methods, such as k-means, adaptive vector quantisation, fuzzy k-
means, self-organising maps and hierarchical clustering, which are theoretically described
and properly adapted. The parameters of each clustering method are properly selected
by an optimisation process, which is separately applied for each one of six adequacy
measures: the error function, the mean index adequacy, the clustering dispersion
indicator, the similarity matrix indicator, the Davies-Bouldin indicator and the ratio of
within cluster sum of squares to between cluster variation. As a study case, this method-
ology is applied to a set of consumers of Hellenic Navy MEKO type frigates.
Finally, the results of the application of the developed kWh and kVArh) for each period in steps of 1min,
methodology are thoroughly presented for the case study of 15min, etc. The chronological load curves for each
six electric consumers of Hellenic Navy MEKO type fri- individual consumer are determined for each study
gates, the chronological load curves of which have been period (week, month).
measured within the framework of a research project.1 2. Consumers’ clustering using a priori indices: Consu-
mers can be characterised by their voltage level (high,
medium, low), installed power, power factor, load fac-
PROPOSED PATTERN RECOGNITION tor, criticality according to ship’s operating mode etc.
These indices are not necessarily related to the load
METHODOLOGY FOR LOAD CURVES curves. They can be used however for the pre-classifi-
CLASSIFICATION OF SHIPS ELECTRIC cation of consumers. It is noted that the load curves of
CONSUMERS each consumer are normalised using the respective
The classification of daily chronological load curves of minimum and maximum loads of the period under
each ship electric consumer is achieved by applying the study.
pattern recognition methodology shown in Fig 2. The main 3. Data preprocessing: The load curves of each consumer
steps are: are examined for normality, in order to modify or
delete the values that are obviously wrong (noise sup-
1. Data and features selection: Using electronic meters or pression). If necessary, a preliminary execution of a
ship energy management system for main consumers, pattern recognition algorithm is carried out, in order to
the active and reactive power values are recorded (in trace erroneous measurements or networks faults,
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
which, if uncorrected, will reduce the number of the u d
u1 X
useful typical time intervals for a constant number of d ð~ x‘2 Þ ¼ t
x‘1 , ~ ð x‘ i x‘2 i Þ
2
(1)
clusters. d i¼1 1
4. Typical load curves clustering for each consumer: For
2. the distance between the representative vector ~ w j of
each consumer, a number of clustering algorithms (k-
j-th cluster and the subset j , calculated as the geo-
means, adaptive vector quantisation, fuzzy k-means,
metric mean of the Euclidean distances between ~ w j and
self-organising maps and hierarchical methods) is ap-
each member of j :
plied. Each algorithm is trained for the set of load sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
diagrams and evaluated according to six adequacy
1 X 2
d ~ w j, j ¼ d ~ w j, ~ x‘ (2)
measures. The parameters of the algorithms are opti- N j ~x 2
‘ j
mised, if necessary. The developed methodology uses
the clustering methods providing the most satisfactory 3. the infra-set mean distance of a set, defined as the
results. This process is repeated for the total set of geometric mean of the inter-distances between the
consumers under study. Special consumers, such as members of the set, ie, for the subset j :
seasonal or emergency ones (eg, machine tools, fire- sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1 X 2
pumps, etc) are identified. These results can be com- d ð j Þ ¼
^ d ~ x‘ , j (3)
bined with the ships’ operating mode. At this stage, the 2N j ~x 2
‘ j
centre. The corresponding set is the classes set, which is where N (jt)
is the population of the respective set (jt)
defined by W ¼ f~ w k , k ¼ 1, . . . Mg. The subset of input during epoch t.
vectors ~ x‘ , which belongs to the j-th cluster, is j and the 4. Next, the number of epochs is increased by one. This
respective population of load diagrams is N j . For the study process is repeated (return to step b) until the maxi-
and evaluation of classification algorithms the following mum number of epochs is used or weights do not
distance forms are defined: w(jt) ~
significantly change, ie, (j~ w(jtþ1) j , , where is
the upper limit of weight change between sequential
1. the Euclidean distance between ‘1 , ‘2 input vectors of iterations). The algorithm’s main purpose is to mini-
the set X: mise the error function J:
The main difference with the classic model is that the The algorithm core is executed for a specific number of
process is repeated for various pairs of (a,b). The best neurons and the respective parameters 0 , min and T0 are
results for each adequacy measure are recorded for optimised for each adequacy measure separately. This pro-
various pairs (a,b). cess is repeated from M1 to M2 neurons.
X q
duced by the Kohonen winner-take-all learning ð Þ
u‘ tþ1 ~
x‘
rule.13,14 j
ð Þ ‘¼1
3. Next, the number of epochs is increased by one. This w j tþ1 ¼
~ N
q (15)
X ð Þ
process is repeated (return to step b) until either the u‘ tþ1
j
maximum number of epochs is reached or the weights ‘¼1
converge or the error function J does not improve,
where q is the amount of fuzziness in the range (1, 1)
which means:
which increases as fuzziness reduces.
J ð tÞ J ð tþ1Þ 4. Next, the number of epochs is increased by one. This
, 9 for t > T in (11) process is repeated (return to step b) until the maxi-
J ð tÞ
mum number of epochs is used or weights do not
where 9 is the upper limit of error function change significantly change.
This process is repeated for different pairs of (a,b) and for during the fine tuning phase the respective values are
different amounts of fuzziness. The best results for each f , T 0 . The h i9 j (t) is the neighbourhood symmetrical
adequacy measure are recorded for different pairs (a,b) function, that will activate the j neurons that are topolo-
and q. gically close to the winning neuron i9, according to
their geometrical distance, who will learn from the
same ~ x‘ (collaboration stage). In this case the Gauss
Self-organising maps (SOM) function is proposed:
" #
The Kohonen SOM 13-16 is a topologically unsupervised
d 2i9 j
neural network that projects a d-dimensional input data set h i9 j ð tÞ ¼ exp (17)
into a reduced dimension space (usually a mono-dimen- 2 2 ð tÞ
sional or bi-dimensional map). It is composed of a prede- where d i9 j ¼ k~
ri9 ~
r j k is the respective distance be-
fined grid containing M 3 1 or M 1 3 M 2 d-dimensional tween i9 and j neurons, ~ r j ¼ (x j , y j ) are the respective
neurons ~w k for mono-dimensional or bi-dimensional map co-ordinates in the grid, (t) ¼ 0 exp (t=T 0 ) is
respectively, which are calculated by a competitive learning the decreasing neighbourhood radius function where 0
algorithm updating not only the weights of the winning and T 0 are the respective initial value and time para-
neuron, but also the weights of its neighbour units in in- meter of the radius respectively.
verse proportion of their distance. The neighbourhood size 3. Next, the number of the epochs is increased by one.
of each neuron shrinks progressively during the training This process is repeated (return to step b) until either
process, starting with nearly the whole map and ending with the maximum number of epochs is reached or the index
the single neuron. The training of SOM is divided into two Is gets the minimum value:10
phases:
I s (t) ¼ J (t) þ ADM ð tÞ þ TE(t) (18)
• rough ordering, with high initial learning rate, large where the quality measures of the optimum SOM are
radius and small number of epochs, so that neurons are based on the quantization error J – given by equation
arranged into a structure which approximately displays (7), the topographic error TE and the average distortion
the inherent characteristics of the input data, measure ADM. The topographic error measures the
• fine tuning, with small initial learning rate, small radius distortion of the map as the percentage of input vectors
and higher number of training epochs, in order to tune for which the first i91 and second i92 winning neuron are
the final structure of the SOM. not neighbouring map units:
The transition of the rough ordering phase to fine tuning X
N
one takes place after T s0 epochs. TE ¼ neighbð i91 , i92 Þ=N (19)
It is mentioned that, in the case of the bi-dimensional ‘¼1
map, the immediate exploitation of the respective clusters is where, for each input vector, neighb(i91 , i92 ) equals
not a simple problem. The map can be exploited either by either 1, if i91 and i92 neurons are not neighbours, or 0.
inspection or applying a second simple clustering method, The average distortion measure is given for the t epoch
such as the simple k-means method.16 Here, only the case by:
of the mono-dimensional map is examined. N X
X M
More specifically, the respective steps of the mono- ADM ð tÞ ¼ h i9!~x‘ , j ð tÞ d 2 ~x‘ , ~
w j =N (20)
dimensional SOM algorithm are the following: ‘¼1 j¼1
a new row and a new column containing the distances 1
d ð1Þ ð C q , C s Þ ¼ d ð1Þ ð C i , C s Þ þ d ð1Þ ð C j , C s Þ
between the newly formed cluster and the old ones. The 2
distance between the newly formed cluster C q (the result of 1
d ð1Þ ð C i , C j Þ
merging C i and C j ) and an old cluster C s is determined as: 4
(29)
d ðC q , C s Þ ¼ ai d ðC i , C s Þ þ a j d ðC j , C s Þ
• the Ward or minimum variance algorithm (WARD):
þ b d ð C i , C j Þ þ c d ð C i , C s Þ d ð C j , C s Þ
d ð2Þ ð C q , C s Þ ¼
(23)
ð Þ ð Þ
where a i , a j , b and c correspond to different choices of the ð ni þ ns Þ d 2 ðC i , C s Þ þ ð n j þ ns Þ d 2 ðC j , C s Þ
dissimilarity measure. It is noted that in each level t the n s d ð2Þ ð C i , C j Þ
respective representative vectors are calculated by (6). ð ni þ n j þ ns Þ
The basic algorithms, which are going to be used in our (30)
case, are:
where:
• the single link algorithm (SL) – it is obtained from (23)
d ð2Þ ð C i , C j Þ ¼
ni n j
d ð1Þ ð C i , C j Þ (31)
for a i ¼ a j ¼ 0:5, b ¼ 0 and c ¼ 0:5: ni þ n j
d ð C q , C s Þ ¼ min d ð C i , C s Þ, d ð C j , C s Þ It is noted that in each level t the respective representa-
tive vectors are calculated by equation (6).
1 1
¼ d ðC i , C s Þ þ d ðC j , C s Þ (24)
2 2
1 The respective steps of the respective algorithms are the
d ð C i , C s Þ d ð C j , C s Þ following:
2
• the complete link algorithm (CL) – it is obtained from 1. Initialisation: The set of the remaining patterns R0 for
(23) for a i ¼ a j ¼ 0:5, b ¼ 0 and c ¼ 0:5: zero level (t ¼ 0 ) is the set of the input vectors X. The
d ð C q , C s Þ ¼ max d ð C i , C s Þ, d ð C j , C s Þ similarity matrix P0 ¼ P(X ) is determined. Afterwards
t increases by one (t ¼ t + 1).
1 1 2. During level t clusters C i and C j are found, for
d ðC i , C s Þ þ d ðC j , C s Þ
¼ (25)
2 2 which the minimisation criterion d(C i , C j ) ¼
1 min r,s¼1,..., N , r6¼ s d(C r , C s ) is satisfied.
þ d ð C i , C s Þ d ð C j , C s Þ 3. Then clusters C i and C j are merged into a single
2
cluster C q and the set of the remaining patterns R t is
• the unweighted pair group method average algorithm formed as: R t ¼ (R t1 fC i , C j g) [ fC q g.
(UPGMA): 4. The construction of the dissimilarity matrix Pt from
ni d ðC i , C s Þ þ n j d ðC j , C s Þ Pt1 is realised by applying equation (23).
d ðC q , C s Þ ¼ (26) 5. Next, the number of the levels is increased by one. This
ni þ n j
process is repeated (return to step b) until the remain-
where n i and n j - are the respective members’ popula- ing patterns R N 1 is formed and all input vectors are
tions of clusters C i and C j . in the same and unique cluster.
• the weighted pair group method average algorithm
(WPGMA): It is mentioned that the number of iterations is determined
1 from the beginning and it equals to the number of input
d ðC q , C s Þ ¼ d ðC i , C s Þ þ d ðC j , C s Þ (27) vectors decreased by 1 (N-1).
2
• the unweighted pair group method centroid algorithm
(UPGMC):
Adequacy measures
ð1Þ ð1Þ
n i d ð C i , C s Þ þ n j d C , C
ð j sÞ In order to evaluate the performance of the clustering algo-
d ð1Þ ð C q , C s Þ ¼
n
ð i þ n jÞ rithms and to compare them with each other, six different
adequacy measures are applied. Their purpose is to obtain
d ð1Þ ð C i , C j Þ well-separated and compact clusters, in order to make the
ni n j load curves self explanatory. The definitions of these meas-
ð n i þ n j Þ2
ures are the following:
(28)
(1) 2
where d (C q , C s ) ¼ k~
wq ~
w s k and ~
w q is the repre- 1. Mean square error or error function (J) given by equa-
sentative centre of the q-th cluster according to the tion (7).
equation (6). 2. Mean index adequacy (MIA)2-3 , which is defined as the
• the weighted pair group method centroid algorithm average of the distances between each input vector
(WPGMC): assigned to the cluster and its centre:
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
u tem, the maximum peak load of which ranges between
u1 X M
MIA ¼ t d2 ~ w j, j (32) 3.5kW and 60kW. The respective data are the 1 minute ON/
M j¼1
OFF normalised load values for each individual consumer
for a period of eleven days during November 1997 and
3. Clustering dispersion indicator (CDI),2-4 which de-
January 1998.1 The respective consumers are:
pends on the mean infra-set distance between the input
vectors in the same cluster and inversely on the infra- • chiller,
set distance between the class representative load • HP compressor FWD/AFT,
curves: • refrigeration plant,
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi •
u LP compressor FWD/AFT,
u1 X M
CDI ¼ t d^ 2 ð k Þ= d^ ð W Þ (33) • sanitary plant,
M k¼1 • boiler.
4. Similarity matrix indicator (SMI),3 which is defined as Initially, the analysis of the ‘chiller’ is presented in detail,
the maximum off-diagonal element of the symmetrical while additional consumers can be analysed in a similar
similarity matrix, whose terms are calculated by using way. It is supposed that the time interval is a day. The
a logarithmic function of the Euclidean distance be- representative consumer’s typical day has been chosen to be
tween any kind of class representative load curves: the most populated one. The respective set of the daily
n
1 o chronological curves has 11 members. No curves are re-
SMI ¼ max 1 1=ln d ~ w p, ~
wq : jected through data pre-processing.
piq (34)
p, q ¼ 1, . . . , M
5. Davies-Bouldin indicator (DBI)3,5 , which represents the Application of k-means
system-wide average of the similarity measures of each The modified model of the k-means method is executed for
cluster with its most similar cluster: different pairs (a,b) from 2 to 10 clusters, where a ¼ {0.1,
( ) 0.11 ,. . . , 0.45} and a + b ¼ {0.54, 0.55, . . . , 0.9}. For
1 X M
d^ ð p Þ þ d^ ð q Þ each cluster, 1332 different pairs (a, b) are examined. The
DBI ¼ max
:
M k¼1 p6¼ q d ~w p, ~
wq (35) best results for the six adequacy measures do not refer to
p, q ¼ 1, . . . , M the same pair (a, b). From the results of Table 1, it is
obvious that the developed k-means is superior to the clas-
6. Ratio of within cluster sum of squares to between sical one with the random choice of the input vectors during
cluster variation (WCBCR)3 , which depends on the the centres’ initialisation. For the classical k-means model
sum of the distance square between each input vector 100 executions are carried out and the best results for each
and its cluster representative vector, as well as the adequacy measure are recorded. The superiority of the
similarity of the clusters centres: developed model applies in all cases of neurons. A second
X M X X
M advantage is the convergence to the same results for the
2
WCBCR ¼ d ð~ x‘ Þ
wk, ~ d2 ~ w p, ~
wq respective pairs (a,b), which can not be achieved using the
x‘ 2 k
k¼1 ~ 1<q, p classical model.
(36)
Fig 5: Adequacy measures of the 7 hierarchical clustering algorithms for 2 to 10 clusters for the set of 11 training patterns of the
chiller
Fig 6: Adequacy measures and dead clusters of each clustering method for 2 to 10 clusters for the set of 11 training patterns of
the chiller
of the adequacy measure is gradually stabilised. It can also chiller are presented using the AVQ method with the best
be estimated graphically, using the ‘knee-rule’, of which WCBCR measure for 4 clusters.
gives values between 3 to 4 clusters (see Fig 7). In Fig 8 The training time for the methods under study has the
the typical daily chronological ON/OFF load curves for the ratio 0.05:1:22:24:36 (hierarchical: proposed k-means: opti-
mised adaptive vector quantisation: mono-dimensional self-
organising map: fuzzy k-means for q ¼ 6). Therefore, the
k-means, AVQ and hierarchical models have been selected.
cluster is the most populated one. In Fig 9, the respective the electrical consumer’s load behaviour, the design, the
representative typical load diagrams are presented. operation and the reliability of the ship’s power system and
the improvement of the operation of the automatic battle
management system for a battleship.
CONCLUSIONS From the respective application for six consumers of
This paper presents an efficient pattern recognition method- HN MEKO type frigates electric system for a small dataset
ology for the study of the load demand behaviour of elec- (only 11 training sets), it is concluded that three to four
trical consumers of ships. Unsupervised clustering methods clusters suffice for the satisfactory description of the daily
can be applied, such as the k-means, adaptive vector quanti- load curves of each consumer. It is also concluded that the
sation (AVQ), fuzzy k-means, mono-dimensional self-orga- optimal clustering technique is the modified k-means, the
nising maps (SOM) and hierarchical methods. The AVQ and the hierarchical clustering techniques, while the
performance of these methods is evaluated by six adequacy optimal adequacy measure is the ratio of within cluster sum
measures: mean square error, mean index adequacy, cluster- of squares to between cluster variation.
ing dispersion indicator, similarity matrix indicator, Davies- These results surely depend on the population of the
Bouldin indicator, the ratio of within cluster sum of squares data set, the ship operating mode (‘anchor’, ‘shore’, ‘at
to between cluster variation. Finally, the representative daily sea’, ‘general quarter’) and the load curve’s time interval
load diagrams along with the respective populations per (in this case study, it was a day). In the future, it should be
each consumer are calculated. This information is valuable investigated in larger datasets for longer study periods with
for ship engineers, because it facilitates the formalisation of different time intervals (from few hours to days).
Fig 9: The representative (most populated one) chronological ON/OFF load curves for each one of the six consumers