Professional Documents
Culture Documents
Abstract—Distance relay operation due to power swing con- the propagation of disturbances. Although there are many
ditions should be blocked in order to prevent unnecessary power system quantities that can be monitored, such as
line trips leading to the formation of unplanned islands and rate of change of power, phase angle difference across a
the propagation of outages. In this work, a machine learning
classification approach to improve the detection of symmetrical transmission line, swing-center voltage, etc., most modern
faults during power swing conditions in conventional distance distance relays provide a function for power swing blocking
relays is presented. The approach is intended to augment existing (PSB) which operates based on rate of change of positive
power swing blocking methods and trains a classifier with a sequence impedance. However, this method may still fail to
focus on accuracy and interpretability. The contribution of the block if the impedance moves too fast during the power swing
approach is the training of classifiers that maintain performance
of other methods, while providing interpretable results. Training [6]. This approach fundamentally relies on the signal, rate
and test data is generated from time-domain simulations of of change of impedance, to discriminate symmetrical faults
the IEEE 9-Bus system for 17 280 scenarios. The following from power swings, conditioned on impedance being close
classifiers are compared and evaluated: support vector machine, to protected line. While it allows for great interpretability,
random forest, gradient tree boosting, decision tree, and k-nearest e.g. it is straightforward to interpret a decision threshold
neighbors. Utilizing mutual information for feature selection
and only local measurements, our results show performance specifying swing speeds greater than 6 Hz are classified as
comparable to prior methods and provides the ability to view faults, the approach suffers from two key challenges: setting
the decision boundary for greater understanding of the classifier the appropriate threshold for rate of change of impedance
output. Trade-offs between interpretability and performance are and the detection of symmetrical faults after PSB has been
also assessed. activated.
Index Terms—power system faults, power system stability,
power system protection, power quality. During power swing conditions, the instantaneous voltage
and current signals typically exhibit distinct characteristics
from symmetrical faults. As a result, several approaches have
I. I NTRODUCTION
been proposed to augment conventional PSB [7]–[9] using
Unintentional operation of distance relays has played a additional signals, such as high-frequency components or re-
significant role in several major cascading outages [1]–[3]. For mote terminal measurements. In particular, machine learning,
example, a relay tripping incorrectly due to heavy loading and data mining, and statistical pattern recognition approaches
power swing conditions helped trigger a massive cascading have gained attention due to their ability to accurately detect
outage in 2003, now called the Northeast blackout [4]. Initia- patterns in high-dimensional data sets and ability to detect
tion of cascading outages are often characterized by stressed faults in scenarios where discrimination is not straightforward.
system conditions in which several events coincide in a short For example, [10]–[12] apply classification models to detect
amount of time, such as high load demand, changes in circuit high-impedance faults and faults on series compensated lines.
topology, equipment outages, and large power oscillations. Detection of symmetrical faults during power swings using
Therefore, it is critical that relays operate securely under such classification methods is shown in [13]–[16] with good re-
conditions. The power swing phenomenon is characterized by sults. However, in these prior works, the primary focus has
oscillations in power flow between areas of a power system been to achieve high classification rates with only a minor
due to an abrupt imbalance in mechanical and electrical emphasis on investigating the trade-off between accuracy and
power [5]. Initiating events that can result in a power swing interpretability.
include short-circuit faults, lines switching, and large changes Therefore, in this work an alternative machine learning
in generation or load – creating sudden differences in electrical based approach using an effective feature selection criteria
power, as mechanical power remains constant. During a power is proposed to augment discrimination of symmetrical faults
swing, load impedance can enter into zone 1 or zone 2 of a during power swings. The approach seeks to maintain the per-
distance relay, appearing as a symmetrical fault and causing formance of high-dimensional classifiers in the least amount
an unintentional line trip. of features and to provide intuitive understanding with visual-
Therefore, distance relay operation during power swing ization and evaluation methods. The following classifiers are
conditions should be blocked in order to prevent unnecessary compared and evaluated: support vector machine, random for-
line trips leading to the formation of unplanned islands and est, gradient tree boosting, CART decision tree, and k-nearest
8 1.0+j0.35
neighbors. The contribution of the approach is a systematic and G2 2 7 0.79-j0.01 0.22+j0.026 9 3 G3
Load
interpretable approach resulting in the training of classifiers B78 5 Swing initiating
that maintains high classification rates. The approach is evalu- 1.63+j0.07 B75
fault locations
0.85-j0.086
ated on power swing and fault scenario data from the IEEE 9- 0.84-j0.099 0.65-j0.17
Classifier Parameters
decision tree ndepth = 4, nsamples = 50, nnodes = 20
k-nearest neighbors k=3
support vector machine C = 4096, γ = 8
random forest ntrees = 50
gradient tree boosting ntrees = 500 , λ = 0.1
Voltage (kV)
500
C LASSIFICATION ACCURACY USING FIVE FEATURES WITH HIGHEST
MUTUAL INFORMATION 0
-500
Classifier 5-fold CV (%) Test Set (%) 1.8 2 2.2 2.4 2.6 2.8 3
Current (kA)
2
decision tree 92.06 93.41
0
k-nearest neighbors 93.23 94.67
support vector machine 93.67 94.71 -2
1.8 2 2.2 2.4 2.6 2.8 3
random forest 97.83 97.84
1
y predict
gradient tree boosting 98.10 98.29 0.5
0
1.8 2 2.2 2.4 2.6 2.8 3
y predict,prob
1
0.5
0
1.8 2 2.2 2.4 2.6 2.8 3
Time (sec)
(a)
Zone 1: 85%
300 Zone 2: 120%
Zone 3: 152%
power swing
200 sym. fault
X (ohms)
100
the five most important features based on (1) and Fig. 2 are -200 -100 0 100 200 300
R (ohms)
considered. Our approach first holds out a test set which is
(b)
generated by randomly sampling 20% of the total data set
without replacement; the remaining data is used as a training Fig. 4. Classifier output for one power swing scenario. Voltage and current
waveforms, predicted class, and predicted probability shown in (a). Corre-
set. The data is approximately evenly split between fault and sponding impedance plane trajectories shown in (b).
power swing classes. Five-fold cross-validation is performed
by splitting the training data into five random subsets, training
on four of the subsets, and validating on the remaining subset. An illustrative power swing scenario is held out from the
This is repeated for each of the five folds and the average is training data and is shown in Fig. 4. The time-domain voltage
taken as the cross-validation score. The classification accuracy and current waveforms of phase A show a power swing
for cross-validation and the test set are shown in Table III. occurring 1.8 to 2.7 s, after which a symmetrical fault is
Classification accuracy alone, in general, is not a sufficient observed. The impedance trajectories observed by relay at B75
measure of a predictive model’s performance. Most classifiers and corresponding output from the random forest classifier
return predicted class probabilities in addition to binary class are also shown. It can be observed that the predicted class
predictions and have a default threshold of 50%. It is therefore transitions from 1 to 0 accurately when the fault occurs. The
important to further consider the true-positive (TPR) and false- predicted probability of fault also rises sharply. The suscep-
positive rates (FPR) as the classification threshold is varied, tibility to false-positive can be further tuned by adjusting the
which is not captured in the classification accuracy score alone. classification threshold above 50% probability.
In particular, power system protection is designed for high
security and is largely averse to false-positive misoperation V. C ONCLUSION
of relays (where faults are the positive class). Typically a Supervised learning classification models can have excel-
trade-off has to be made between TPR and FPR. The receiver lent performance in finding decision boundaries in high-
operating characteristic (ROC) and area under the curve is an dimensional spaces that minimize the classification error rate.
appropriate method to summarize both types of errors [26]. However, there is an inherent trade-off between predictive
The ROC plots the FPR versus TPR as the classification model performance and interpretability of the model and
threshold is varied and is shown for the five classifiers in results. Fig. 5 further illustrates that additional relevant fea-
Fig. 3. The ROC curves for random forest (rf) and gradient tree tures can greatly improve classification rates. Including all
boosting (gbc) show a significant advantage for maintaining a 12 features can achieve accuracy over 99%. On the other
high TPR without incurring larger FPR. hand, a model utilizing only two features such as R and X
[3] D. Tziouvaras, “Relay performance during major system disturbances,”
in 60th Annual Conference for Protective Relay Engineers, 2007., March
2007, pp. 251–270.
[4] G. Andersson et al., “Causes of the 2003 major grid blackouts in
north america and europe, and recommended means to improve system
dynamic performance,” IEEE Trans. Power Syst., vol. 20, no. 4, pp.
1922–1928, Nov 2005.
[5] D. Novosel, G. Bartok, G. Henneberg, P. Mysore, D. Tziouvaras, and
S. Ward, “IEEE PSRC report on performance of relaying during wide-
area stressed conditions,” IEEE Trans. Power Del., vol. 25, no. 1, pp.
3–16, Jan 2010.
[6] S. Lotfifard, J. Faiz, and M. Kezunovic, “Detection of symmetrical
faults by distance relays during power swings,” IEEE Trans. Power Del.,
vol. 25, no. 1, pp. 81–87, Jan 2010.
[7] B. Mahamedi and J. G. Zhu, “A novel approach to detect symmetrical
faults occurring during power swings by using frequency components
of instantaneous three-phase active power,” IEEE Trans. Power Del.,
vol. 27, no. 3, pp. 1368–1376, July 2012.
Fig. 5. Classification error rate on test set as a function of the number of input [8] J. G. Rao and A. K. Pradhan, “Differential power-based symmetrical
features. Features are added sequentially in order of decreasing importance. fault detection during power swing,” IEEE Trans. Power Del., vol. 27,
no. 3, pp. 1557–1564, July 2012.
[9] ——, “Power-swing detection using moving window averaging of
current signals,” IEEE Trans. Power Del., vol. 30, no. 1, pp. 368–376,
Feb 2015.
[10] A. R. Sedighi, M. R. Haghifam, O. P. Malik, and M. H. Ghassemian,
“High impedance fault detection based on wavelet transform and statis-
tical pattern recognition,” IEEE Trans. Power Del., vol. 20, no. 4, pp.
2414–2421, Oct 2005.
[11] N. Perera and A. D. Rajapakse, “Recognition of fault transients using a
probabilistic neural-network classifier,” IEEE Trans. Power Del., vol. 26,
no. 1, pp. 410–419, Jan 2011.
[12] Z. Moravej, M. Pazoki, and M. Khederzadeh, “New pattern-recognition
method for fault analysis in transmission line with UPFC,” IEEE Trans.
Power Del., vol. 30, no. 3, pp. 1231–1242, June 2015.
[13] K. Seethalekshmi, S. N. Singh, and S. C. Srivastava, “A classification
approach using support vector machines to prevent distance relay
maloperation under power swing and voltage instability,” IEEE Trans.
Power Del., vol. 27, no. 3, pp. 1124–1133, July 2012.
[14] A. Swetapadma and A. Yadav, “Data-mining-based fault during power
Fig. 6. Predicted probability contours for random forest classifier. Data for swing identification in power transmission system,” IET Science, Mea-
faults and power swings are shown from training set. surement Technology, vol. 10, no. 2, pp. 130–139, 2016.
[15] N. G. Chothani, B. R. Bhalja, and U. B. Parikh, “New support vector
machine-based digital relaying scheme for discrimination between power
swing and fault,” IET Generation, Transmission Distribution, vol. 8,
(as in conventional distance relays) is greatly interpretable. no. 1, pp. 17–25, Jan 2014.
Even with machine learning models, often considered as black [16] T. Guo and J. V. Milanovi, “Online identification of power system
boxes, it allows for novel visualizations that makes the results dynamic signature using pmu measurements and data mining,” IEEE
Trans. Power Syst., vol. 31, no. 3, pp. 1760–1768, May 2016.
more intuitive and understandable. For example, a random [17] “Protection system response to power swings,” NERC, Tech. Rep., 2013.
forest classifier is trained only using R and X as features, [18] S. Brahma, “Distance relay with out-of-step blocking function using
with the training data for both power swings and symmetrical wavelet transform,” IEEE Trans. Power Del., vol. 22, no. 3, pp. 1360–
1366, July 2007.
faults plotted in Fig. 6. The predicted class probabilities from [19] “Application of overreaching distance relays,” IEEE Power Systems
the model are plotted as contours, with the scale shown Relaying Committee, Tech. Rep., 2009.
at right. It can be observed that the visualization provides [20] P. Kundur, N. J. Balu, and M. G. Lauby, Power system stability and
control. McGraw-Hill.
great intuition and provides confidence for predictions on [21] P. M. Anderson and A. A. Fouad, Power system control and stability.
unseen data. Even with only two features, accuracy is greater Iowa State University Press.
than 90%. To conclude, a supervised learning approach for [22] N. Fischer et al., “Tutorial on power swing blocking and out-of-step
tripping,” in 39th Annual Western Protective Relay Conference, 2012,
detecting faults during power swings and focused on both pp. 108–119.
interpretability and performance has been presented. Results [23] “Power swing and out-of-step considerations on transmission lines,”
indicate performance of classifiers on par with prior models, Power Systems Engineering Research Center, Tech. Rep., 2005.
[24] H. Peng, F. Long, and C. Ding, “Feature selection based on mu-
while providing intuitive evaluation and visualization methods. tual information criteria of max-dependency, max-relevance, and min-
redundancy,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 8, pp.
R EFERENCES 1226–1238, Aug 2005.
[1] M. V. Venkatasubramanian, M. Kezunovic, and V. Vittal, “Detection, [25] C. M. Bishop, Pattern Recognition and Machine Learning. Springer.
prevention and mitigation of cascading events,” Power Systems Engi- [26] T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical
neering Research Center, Tech. Rep., 2008. Learning. Springer.
[2] M. Vaiman, P. Hines, J. Jiang, S. Norris, M. Papic, A. Pitto, Y. Wang, [27] F. Pedregosa et al., “Scikit-learn: Machine learning in Python,” Journal
and G. Zweigle, “Mitigation and prevention of cascading outages: of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
Methodologies and practical applications,” in 2013 IEEE Power and
Energy Society General Meeting (PES), July 2013, pp. 1–5.