Professional Documents
Culture Documents
article info a b s t r a c t
Article history: This study proposes a new fault diagnosis algorithm for a photovoltaic (PV) solar system implemented
Received 19 June 2020 on the bases of diode distribution and two classifiers based on ensemble bagged trees and neural
Received in revised form 12 August 2020 pattern recognition techniques. The proposed approach is designed by adding two diodes connected
Accepted 7 September 2020
to the upper and lower terminals of each PV string. The two diodes are connected to prevent the
Available online 12 September 2020
reverse direction of faulted string currents, thereby preventing high injecting fault currents. Given
Keywords: that the string fault current is prevented owing to diode interaction, this dynamic interaction provides
PV solar system new states during the fault periods considered to detect string faults. Owing to the distributed diodes,
Ensemble bagged trees system voltage is not reduced during fault disturbances and voltage is not considered an input to
Neural pattern recognition the proposed detection techniques. Monitoring currents at the upper and lower ends of each string is
MATLAB/Simulink required as input for the proposed approaches to detect string fault types. The proposed approaches are
used to detect string fault types, such as cell-to-string negative terminal, cell-to-cell in the same string,
and string-to-string faults. The proposed model is applied for 400 kW, and the PV system consists of
4 arrays interconnected with 1200 V AC grid built-in MATLAB/Simulink. Experimental results validate
the newly suggested approaches.
© 2020 Elsevier Ltd. All rights reserved.
https://doi.org/10.1016/j.segan.2020.100389
2352-4677/© 2020 Elsevier Ltd. All rights reserved.
I.B.M. Taha, N.I. Elkalashy, M. Alsharef et al. Sustainable Energy, Grids and Networks 23 (2020) 100389
such as string faults, inverter faults, PV ageing, and maximum lower sides of each string to implement the proposed techniques.
power tracking errors (MPPT). In [12], a fault type detection was The DC common busbar is used to collect power delivered to the
diagnosed based on the analysis of array voltage and current. grid through the DC/AC three-phase inverter type of the voltage
The effects of partial shading on PV array voltage, current, and source converter (VSC) operating at the unity power factor. The
PV system losses were also studied and discussed. In [15], the inverter is connected to the grid using 400 kVA and 0.260/25
grey wolf optimizer with genetic algorithm and Tabu search kV transformer. A line of 14 km and 25 kV is used to deliver
algorithms were used to diagnose PV system fault types and their the power to the grid of 120 kV (source behind impedance rep-
locations. In [24], the random forest ensemble learning technique resentation). These power elements are involved in the AC grid
was applied to detect the PV system fault location based on string system block shown in Fig. 1. Additional details of the PV farm,
faults and currents. The model diagnosed different PV system converters, and grid representation are presented in [26] and
fault types, such as short and open circuit degradations partial implemented using MATLAB/Simulink.
shading. However, they depended on monitoring the system volt-
ages and distributed currents to detect the fault types. In general,
2.2. Data preparation
the published fault detection techniques in the literature have
failed to contribute to prevent reversing current through faulted
The system shown in Fig. 1 is subjected to different operation
strings during the fault periods.
cases, such as steady-state and electric faults concerning cell-to-
This study presents a novel approach based on inserting two
string negative terminal, cell-to-cell, and string-to-string faults,
diodes in each PV string, in which their aims prevent the high
as depicted in Fig. 2. The currents at the upper and lower ends of
string current during fault periods. Two artificial intelligence
each string are measured, and the total array current is recorded
(AI) approaches are used to detect string faults, and based on
the bagged ensemble trees and neural pattern recognition tech- at different fault cases. To generalize the proposed algorithm
niques. The main contribution of the proposed approach is sug- concerning the PV strings shown in Fig. 2, string currents are
gested based on adding two diodes for each PV string; one diode normalized as follows:
is connected to the upper terminal of the PV string, while the ia1 = Ia1 × N/It , (1)
other is connected to the lower end. The two diodes are con-
nected to prevent the reverse direction of string currents during ia2 = Ia2 × N/It , (2)
the fault intervals, and facilitate the fault detection based only where ia1 and ia2 are the normalized currents of the upper and
on the upper and lower currents of each string without ap- lower end measured currents Ia1 and Ia2 , respectively, of string a;
plying string voltage to the proposed classifier approach. Given It is the measured total array current; and N is the total string
the diode interaction with the faulted PV system, the proposed number. For the system under study, N is 68 strings. In the pre-
approach is used to determine string faults, such as cell-to-string
ceding equations, factor N is utilized in the normalization because
negative terminal, cell-to-cell in the same string, and string-to-
it is the total number of strings. Accordingly, the normalized com-
string faults, based only on string current without requirements
puted current is close to one during the steady state. For training
of string voltage. The proposed model is applied for 400 kW, and a
purposes, the fault cases over the same string are collected for
PV system [26] consisting of four arrays is connected to a 120-kV
distributed fault points through the string, in which these points
AC grid. The PV system and two suggested fault detection algo-
are selected and called cell 1, 2, 3, and 4 points. Thereafter, the
rithms are executed using MATLAB/Simulink. The experimental
state of the cell-to-cell faults is represented as cell 1 to cell 2
results validate the proposed fault detection approaches using a
faults, cell 1 to cell 3, cell 1 to cell 4, cell 1 to cell 5, cell 2 to cell
5-kW PV system.
3, and cell 2 to cell 4, among others. All other fault options are
The remainder of this paper is organized as follows. Section 2
considered, and the measurements of all fault cases are collected,
illustrates the effects of utilizing two diodes in each PV string
normalized, and inserted into the classification model. A total
at the upper and lower terminals. Section 3 introduces the two
proposed AI methods and discusses their performance. Section 4 of 276 fault cases are measured and recorded. The measured
presents the accuracy evaluation of the two proposed methods. variables are the currents at the upper and lower ends of each
Section 5 provides the experimental verification. Lastly, Section 6 string and array line current, and are normalized using (1) and
provides the conclusions. (2). The string fault detector requires only the upper and lower
current normalized values of the different operation states and
2. System description corresponding fault states. The upper and lower fault currents of
strings a, b, c, and d are introduced in Fig. 2. The different three
2.1. Simulated system detected fault types are shown in Fig. 2. The studied fault types
are as follows: (a) cell-to-cell fault (between cells 1 and 3), (b)
The complete PV solar system consists of 4 PV subsystems of cell-to-string negative terminal fault type (cell 4 to ground), and
100 kW and 400 A ratings at irradiance of 1000 W/m2 (see Fig. 1). (c) string-to-string fault (between cell 2 of string c and cell 3 of
As shown in Fig. 1, each PV subsystem is connected to a DC/DC string d), as shown in Fig. 2.
boost converter to attain maximum power point tracking. The The fault states are denoted as 0, 1, 2, 3, 4, 5, and 6 for steady-
maximum power point tracking algorithm is implemented based state, high and low cell-to-negative terminal faults, high and low
on the well-known perturb and observe technique, as reported cell-to-cell faults, and high and low string-to-string faults, re-
in [27]. The advantages of this method are its simplicity and spectively, as indicated in Table 1. The term ‘‘high’’ is considered
minimal measuring signals [28]. The control principles depend on for low fault resistances, while ‘‘low’’ is used for the high fault
voltage perturbation to constantly keep the power towards in- resistances. For the distributed diodes considerations, the ‘‘high’’
creasing each step with respect to the previous one. Accordingly, and ‘‘low’’ faults are correspondingly associated with the diode
the control provides operation oscillatory around the peak power status during the fault period. For the high fault stress where the
point. fault resistance is low, the diode status is changed to the off-
In Fig. 1, each PV subsystem has 68 parallel strings, in which state during the fault period, and vice versa. Accordingly, the fault
each string contains 5 series SunPower SPR-315E modules. More- resistance limit for the ‘‘high’’ and ‘‘low’’ faults differ from fault
over, Fig. 1 shows that two diodes are connected at the upper and type to the other.
2
I.B.M. Taha, N.I. Elkalashy, M. Alsharef et al. Sustainable Energy, Grids and Networks 23 (2020) 100389
Fig. 1. Construction block diagram of the PV connected system using the MATLAB/Simulink toolbox.
Fig. 5. Faults with 3 between the first cell of the second string and the third
cell of the third string with and without utilizing diodes.
Table 2
Bagged tree controlling parameters.
Controlling parameters Description
Results Accuracy: 96.38%
Training time: 2.8768 s
Predicted speed: 1400 obs/s
Model type Preset: Bag trees
Ensemble method: Bag
Learner type: Decision tree
Maximum number of splits: 275
Number of learners: 30
Table 3
Sample fault cases for bagged trees check.
Fig. 6. Bagged ensemble trees method of the training data with 96.38% accuracy
to fault classification.
five folds is selected to validate the system model. From training Table 4
the measured steady-state and fault cases, the performance eval- Confusion matrix of the bagged trees model.
uation indicated that the ensemble bagged trees method had the Predicted class Accuracy
best performance. State 0 1 2 3 4 5 6
The training results illustrated that the ensemble bagged trees 0 11 0 0 0 0 0 0 100%
method had good training accuracy compared with that of the 1 0 26 0 0 0 1 0 96.3%
other methods introduced in the classification learner MATLAB 2 0 0 15 0 0 0 3 83.3%
True class 3 0 0 0 66 1 0 0 98.5%
toolbox [30]. These techniques are described, and their imple-
4 0 0 0 2 42 0 0 95.5%
mentation process is defined in the MATLAB classification learner 5 0 0 0 0 0 55 0 100%
toolbox [30]. 6 0 0 2 0 1 0 51 94.4%
The data sets are divided into 5 folds using the cross/fold Overall accuracy 96.38%
validation approach. The maximum number of splits is 275, while
the number of learners is 30 for the proposed bagged trees model.
Table 2 shows that the bagged tree has the controlling parame- Table 4 illustrates the confusion matrix of the bagged en-
ters. The overall performance accuracy of the bagged ensemble semble trees method with the 276 case studies, in which the
trees method is 96.38%. Table 1 shows that this performance is confusion matrix describes the relationship between the true and
for the fault classification. predicted classes detected by the classifier method. The 276 test
The tree plot of the bagged trees method is shown in Fig. 6, case studies include 11, 27, 18, 67, 44, 55, and 54 for fault states
which confirms the online implementation applicability of the and 0, 1, 2, 3, 4, 5, and 6, respectively. Its diagonal elements
detection method. The bagged tree shown in Fig. 6 has the ad- represent the numbers of predicted samples corresponding to
vantage of simplicity, which is due to the proposed approach of each class, while the off-diagonal elements represent the number
adding diodes at the upper and lower ends of each string, as of false detecting samples against each class. From the confusion
depicted in the system under study in Fig. 2. This figure also matrix, the proposed model succeeds in detecting all steady-state
illustrates the sequence from the root node to the final leaf node cases (11 cases), while it fails only in detecting 4 cases from 45
corresponding to each operation state. For example, the operating cases of cell-to-negative terminal faults. It succeeds in 108 case
state 3 can be obtained as follows: ia2 should be under 0.19085 studies (i.e., 66/67 high cell-to-cell fault + 42/44 low cell-to-cell
p.u. and ia1 is below 0.5533, while the operating state 1 can be fault) from the 111 cell-to-cell fault cases. Moreover, it succeeds
obtained when ia2 is above or equal to 1.0522 p.u., as shown in to detect 106 (55/55 high cell-to-cell fault + 51/54 low cell-to-
Fig. 6. The classifier representation depicted in Fig. 6 confirms cell fault) cases from 109 string-to-string fault cases. The overall
the simplicity of the classifier implementation either by using performance accuracy of the ensemble bagged trees model to
Simulink or a digital signal processing board with the expectedly detect the different fault cases is 96.38%, which is attained by
low execution time of the classification algorithm. considering the diode interaction. However, the accuracy is 100%
Table 3 presents seven fault state test cases to check the when the decision is only for fault detection and not for fault
bagged trees presented in Fig. 6. The seven test cases consist of classification. In addition, the algorithm accuracy is extremely
one case study for each 7 fault states (0, 1, 2, 3, 4, 5 and 6). reduced to 71% using the training data of the system without
The highlighted cases represented three cases to illustrate how diode implementation.
the tree predicts the fault state. The first highlighted case has
fault state of 2 and the tree takes the path ‘‘ABCEFHJ2’’ which 3.2. Using neural pattern recognition method
means that the tree correctly predicts the fault state. The second
highlighted case has fault state 4 and the tree takes the path The neural pattern recognition approach is also used to de-
‘‘ABCEG4’’ to reach the fault state 4 with correctly detecting the tect the PV system operating state. The 276 training data sets
fault state. The third highlighted sample with fault state 6 will be are used to build the neural pattern recognition-MATLAB model.
detected by the tree flow path ‘‘ABC6’’ through the tree. The neural pattern recognition model was designed to include
5
I.B.M. Taha, N.I. Elkalashy, M. Alsharef et al. Sustainable Energy, Grids and Networks 23 (2020) 100389
Table 5
Confusion matrix of using the neural pattern recognition model.
Predicted class Accuracy
States 0 1 2 3 4 5 6
0 11 0 0 0 0 0 0 100%
1 0 27 0 0 0 0 0 100%
2 0 0 16 0 0 0 2 88.9%
True class 3 0 0 0 66 1 0 0 98.5%
4 0 0 0 0 44 0 0 100%
5 0 0 0 0 0 55 0 100%
6 0 0 2 0 0 0 52 96.3%
Overall accuracy 98.20%
Fig. 7. ROC plotting of the training, validation, testing, and overall data sets.
Fig. 9. Predicting performance of the two proposed models with 106 testing
1 output, 2 input, and 25 hidden layers. The neural pattern cases.
recognition approach is a binary output system, thereby enabling
the operating states to be converted to binary codes as follows.
The codes [1 0 0 0 0 0 0], [0 1 0 0 0 0 0], [0 0 1 0 0 0 0], [0 recognition model is better than that of bagged tree, thereby
0 0 1 0 0 0], [0 0 0 0 1 0 0], [0 0 0 0 0 1 0], and [0 0 0 0 0 providing good detection for the majority of the new cases but
0 1] are for 0, 1, 2, 3, 4, 5 and 6 condition states, respectively. continue to be less than the neural pattern recognition model.
The training data sets are divided into 70%, 15%, and 15% for the
training, validation, and testing stages, respectively. The built-in 4. Performance evaluation
neural pattern recognition-MATLAB toolbox is used to generate
the proposed neural pattern recognition model. As summarized 4.1. Detection accuracy
in Table 5 illustrating the confusion matrix, the performance
accuracies of the designed neural pattern recognition model are The proposed two prediction models are tested with the other
97.9%, 100%, 97.6%, and 98.2% for the training, validation, testing, 106 test cases. The test cases consist of 3, 11, 10, 20, 20, 21, and 21
and overall data sets, respectively. for fault state indicators 0, 1, 2, 3, 4, 5, and 6, respectively. Fig. 9
Fig. 7 shows the receiver operating characteristic (ROC), which presents the detection results of the bagged trees (see Fig. 9a)
is a metric used to check the quality of classifiers for the training, and neural pattern recognition (see Fig. 9b) models compared
validation, testing, and overall data sets. The results ensure that with the actual fault states. The results show that the bagged
the ROC plot of the training data set is extremely close to 1, vali- trees method has 100% accuracy for all fault states, except state
dation data set is equal to 1, and testing data set is near 1. Lastly, 2 (‘‘low’’ cell-to-negative terminal fault), in which the accuracy
the overall data set is extremely close to 1, which means high- is reduced to 40%. The fault detecting of state 2 is detected as
quality classification process. Fig. 8 presents the cross-entropy state 1, which is a similar fault category (‘‘high’’ cell-to-negative
performance of the training, validation, and testing data sets. The terminal fault). The overall accuracy of the bagged trees model
best performance of the validation data set is obtained after 38 with 106 testing cases is 94.34%. For the neural pattern recogni-
epochs with a value of 0.000899. tion model, the detecting accuracy with the 106 test cases is 100%
From the evaluation of the results presented in this section, for all fault states. Thus, it is used for predicting the fault states
the distributed diode effect on the fault current distributions aims in the next section.
to detect the faults in the PV solar system. The diode performance From the performance evaluation in this subsection, the rea-
is either on or off-state, which simplifies the fault detection sons for detecting string faults based only on string currents
issue. The training accuracy of the bagged ensemble trees with are due to utilizing two diodes with each string, in which the
the 276 cases obtained from the classification learner is 96.38%, distributed diodes prevent the reversing current through strings
while the neural pattern recognition approach presents a training during fault periods. Accordingly, the system voltage is unaffected
accuracy of 98.2%. Therefore, the prediction of the neural pattern by the fault disturbance and excluded within the fault detection
6
I.B.M. Taha, N.I. Elkalashy, M. Alsharef et al. Sustainable Energy, Grids and Networks 23 (2020) 100389
Fig. 10. Comparisons between bagged ensemble trees and neural pattern
recognition methods with three fault cases.
Fig. 11. Class detection neural pattern recognition model, cell-to-cell fault of
the third string between cells 1 and 3 through 60 fault resistance.
Fig. 13. Fault between cell 1 of string 2 and cell 3 of string 3 with 15 fault
resistances.
Fig. 15. Fault between cells 2 and 4 with 15 fault resistances of string 2 f
(rom 0.2 s to 0.3 s) and irradiance reduction of cell one by 20% (from 0.1 s to
0.3 s).
Table 6
PV panel data.
Performance at standard test conditions
Maximum power (PMPP ) 280 W
Short Circuit Current (ISC ) 9.41 A
Open circuit voltage 38.97 V
Current at maximum power 8.84 A
Voltage at maximum power 31.67 V
5. Experimental validation
Table 7
Experimental measurements and classifier performance.
Fault conditions String 1 String 2 Suggested classifier outputs
ia1 (A) ia2 (A) ib1 (A) ib2 (A) Bagged tree Neural Net.
S1FC S2FC S1FC S2FC
Fault-free 7.29 7.42 7.49 7.48 0 0 0 0
Case 1 0 0 7.22 7.26 3 0 3 0
Cell-to-cell Case 2 0 0 7.4 7.4 3 0 3 0
Case 3 0 0 7.2 7.3 3 0 3 0
Case 4 0 0 7.15 7.16 3 0 3 0
Case 1 0 6.67 7 0.42 4 5 5 5
String-to-string Case 2 0 6.68 7.12 0.52 4 5 5 5
Case 3 0 7.17 7.07 0 5 5 5 5
Case 4 0 7.07 6.96 0 5 5 5 5
the decision goes to the right side at the root node. At the second
node, the current ib2 is below 1.05 p.u. Hence, the decision goes
to the left side. At the third node, the decision goes to the left
side to reach the fourth node (current is below 1.0035). At the
fourth node, the decision goes to the right side (current is above
0.9438). At the fifth node, the decision goes to the right side
(current is above 0.07718) and the decision at the sixth node (leaf
node) goes to the right side to detect the fault type as 0, which
means steady-state operation. From this performance, the bagged
tree method successfully discriminates the fault. The performance
of the neural pattern recognition shown in Table 6 indicates
the fault using the experimental measurements. In general, the
designed bagged tree and neural pattern recognition methods
using the simulation fault data accurately detects the fault made
using the laboratory measurements. This result confirms the uni-
versality of the proposed technique based on normalized string
currents in (1) and (2).
6. Conclusions
10