You are on page 1of 9

Available online at www.sciencedirect.

com
Available online at www.sciencedirect.com
ScienceDirect
ScienceDirect
Procedia
Available Computer
online Science 00 (2019) 000–000
at www.sciencedirect.com
Procedia Computer Science 00 (2019) 000–000 www.elsevier.com/locate/procedia
www.elsevier.com/locate/procedia
ScienceDirect
Procedia Computer Science 169 (2020) 597–605

Postproceedings of the 10th Annual International Conference on Biologically Inspired Cognitive


Postproceedings of the 10th
Architectures, Annual
BICA 2019International Conference
(Tenth Annual Meetingonof Biologically Inspired Cognitive
the BICA Society)
Architectures, BICA 2019 (Tenth Annual Meeting of the BICA Society)
Development of methods and algorithms for identification of a type
Development
of electric of methods
energy and algorithms
consumers for identification
using artificial intelligenceofand
a type
of electric energy consumers using artificial intelligence
machine learning models for Smart Grid Systems and
machine learning models for Smart Grida Systems
a
Dmitriy Raspopova and Pavel Belousova
Dmitriy Raspopov and Pavel Belousov
a
National Research Nuclear University of MEPhI (Moscow Engineering Physics Institute) Obninsk Institute of Atomic Energy (IATE),
a
National Research Nuclear University of MEPhI (Moscow Engineering Physics Institute) Obninsk Institute of Atomic Energy (IATE),
Studenchesky Gorodok 1, Obninsk, 249030, Russia
Studenchesky Gorodok 1, Obninsk, 249030, Russia

Abstract
Abstract
Article presents the relevance of creating Smart Grid for public power networks and industrial enterprises. The article
Article presents
describes the relevance
an experiment of creating
in which data Smart
were Grid for public
collected from power
currentnetworks and industrial
and voltage enterprises.
sensors from severalThe article
different
consumersan
describes of experiment
electric energy, using the
in which datacreated
were Smart Socket
collected from- Smart Energy
current device. sensors
and voltage The methods
from and algorithms
several of
different
consumers
intellectual of
andelectric energy,
spectral using
analysis weretheused
created Smart Socket
to analyze - Smart
and process Energy device.
experimental data.The methods
In this paper,and
we algorithms
developed an of
intellectual
algorithm forandidentifying
spectral analysis
the typewere used to analyze
of consumer and process
of electric energy inexperimental
the networkdata. In this power
of general paper, we developed
supply using thean
algorithm for identifying
machine learning the type of
model XGBoost consumer
- extreme of electric
gradient energy
boosted in the
decision network of general power supply using the
trees.
machine learning model XGBoost - extreme gradient boosted decision trees.
© 2019 The Authors. Published by Elsevier B.V.
© 2020 The Authors. Published by Elsevier B.V.
This
© is an
2019 open
The accessPublished
Authors. article under the CC BY-NC-ND
by Elsevier B.V. license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
This is an
Peer-reviewopen access
under article under
responsibility of the
the CC BY-NC-ND
scientific license
committee
Peer-review under responsibility of the scientific committee (https://creativecommons.org/licenses/by-nc-nd/4.0/)
ofofthe
the 9th Annual
10th Annual International
International Conference on Biologically
Conference on Biologically Inspired
Inspired
Peer-review
Cognitive under responsibility
Architectures.
Cognitive Architectures. of the scientific committee of the 9th Annual International Conference on Biologically Inspired
Cognitive Architectures.
Keywords: Smart Grid; XGBoost; decision trees; spectral analysis; PCA; Fourier transform
Keywords: Smart Grid; XGBoost; decision trees; spectral analysis; PCA; Fourier transform

1. Introduction
1. Introduction
The problem of ensuring the required level of quality of electric energy in electric power systems of consumers of
The problem
electric energy hasof ensuring the required
always been leveltoday
relevant and of quality of lost
has not electric energy in electric power systems of consumers of
its significance.
electric energya has
Currently, always
large beenofrelevant
number andequipment
electrical today has not lost its significance.
is connected to the power system of industrial enterprises,
Currently,
municipal andahealth
large care
number of electrical
institutions, banksequipment
and largeisoffices.
connected
Suchtoequipment
the powercansystem
becomeof industrial
a source ofenterprises,
nonlinear
municipal
distortions and health
in the careand
network institutions,
affect notbanks and large
only itself, offices.
but also Such equipment
the entire can as
energy system become a source
a whole. of nonlinear
As a result of such
distortions
an impact, in the network
fluctuations andand affect notinonly
deviations the itself, but also the
characteristics entire energy
of electric powersystem
(currentasstrength,
a whole.voltage,
As a result of such
frequency,
an impact,
phase) can fluctuations
occur in theand deviations
electric in the
network, characteristics
which can lead toofshort
electric powerand
circuits, (current strength, to
subsequently voltage, frequency,
fires and serious
phase) can occur in the electric network, which can lead to short circuits, and subsequently to fires and serious
1877-0509 © 2019 The Authors. Published by Elsevier B.V.
This is an open
1877-0509 access
© 2019 The article
Authors.under the CC by
Published BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Elsevier B.V.
Peer-review
This under
is an open responsibility
access of the
article under thescientific committee
CC BY-NC-ND of the(https://creativecommons.org/licenses/by-nc-nd/4.0/)
license 9th Annual International Conference on Biologically Inspired Cognitive
Architectures.
Peer-review under responsibility of the scientific committee of the 9th Annual International Conference on Biologically Inspired Cognitive
Architectures.

1877-0509 © 2020 The Authors. Published by Elsevier B.V.


This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Peer-review under responsibility of the scientific committee of the 10th Annual International Conference on Biologically Inspired
Cognitive Architectures.
10.1016/j.procs.2020.02.204
598 Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605
2 Raspopov D.A ./ Procedia Computer Science 00 (2019) 000–000

emergency situations connected with them.


To minimize the consequences of the negative impact of electrical equipment on the energy system of large
facilities, it is necessary to use the latest technologies, methods, tools and digital intelligent solutions of world science
and technology. One of such modern and fast-growing solutions are intelligent networks, which are called “Smart
Grid” abroad. Intelligent network management provides real-time automation, monitoring and control of the state of
the electrical network [1-2]. Intelligent networks are capable of online tasks to collect and analyze large amounts of
data [3].
Currently, the process of introducing intelligent power supply systems is rapidly developing [4-5]. In Russia, one
can already cite many examples of the implementation of intelligent networks and the associated positive effects [6].
The introduction of such systems in industrial enterprises, municipal and healthcare institutions, banks and offices can
significantly increase their overall energy efficiency.
One of the important tasks in the creation and implementation of modern intelligent systems is to identify the type
of consumer of electrical energy [7]. This feature can be used to accelerate the self-healing of the system, the
redistribution of resources and localization of the source of anomalies in the electrical network as a result of an
emergency [8].

2. Materials and methods

Smart Socket - Smart Energy Systems device is shown in Fig. 1. This device was developed at the IATE MEPhI
for a grant. Smart Socket Smart Energy Systems allows you to digitize current and voltage data from appropriate
sensors with a maximum frequency of up to 2 MHz.

Fig. 1. Smart Socket – Smart Energy Systems

As the objects of control for the experiment, 4 consumers of electric energy were selected. As a result of the
experiment, data were obtained containing the values of current strength and voltage, digitized with different
frequencies from 5 kHz to 2 MHz. During the experiment, data on the characteristics of current and voltage were
taken from the corresponding sensors for every 30 seconds and recorded in separate text files, which were then
converted into two matrices - one with values for the electric voltage signal, and the other with values for the electric
force signal current for each power consumer.

3. Results

3.1. Exploratory data analysis

Oscillograms of the time variation of the voltage and current characteristics obtained from the corresponding
sensors for four consumers of electric energy at a certain time interval are shown in Fig. 2 and Fig. 3.
Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605 599
Raspopov D.A. / Procedia Computer Science 00 (2019) 000–000 3

Fig. 2. Graphs of the dependence of the voltage characteristics on time for four consumers of electric energy

Fig. 3. Graphs of the dependence of the current characteristics on time for four consumers of electric energy

From the graphs of changes in current strength over time, you can see that for each electric consumer, the current
values are different. This is because all electrical consumers differ in both power and amperage. From the graphs of
the dependence of voltage values on time, we can conclude that its values are the same for each type of electrical
equipment.

3.2. Spectral data analysis

The traditional method of analysis and processing of electrical signals is the Fourier Transform, which consists in
representing the voltage and current of the mains as the sum of harmonic components with the coefficients of the
participation of these components. Using the discrete Fourier transform algorithm [9], spectra were constructed for
the electrical signals of current and voltage on a logarithmic scale, as shown in Fig. 4 and Fig.5.
600 Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605
4 Raspopov D.A ./ Procedia Computer Science 00 (2019) 000–000

Fig. 4. Graphs of the spectra of electrical voltage characteristics on a logarithmic scale.

Fig. 5. Graphs of the spectra of electrical current characteristics on a logarithmic scale.

The constructed spectral characteristics of the experimental data contain not only the values of higher and lower
harmonics, but also the noises that the equipment itself emits during its operation. These noises can be informative
and then they can be used to develop and implement an algorithm for identifying a consumer of electrical energy, as
shown in Fig. 4 and in Fig. 5, respectively.

3.3. Cluster analysis

Plotting graphs on a logarithmic scale is necessary to highlight the noises characteristics of the equipment
corresponding to the peak values in the graph in Fig. 6, which were used to implement the algorithm for identifying
the type of consumer of electrical energy.
Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605 601
Raspopov D.A. / Procedia Computer Science 00 (2019) 000–000 5

Fig. 6. Isolation of noises characteristics from spectrum

After extracting the peak values of informative frequencies from the current and voltage spectra for each consumer
of electric energy, its were used as signs for clustering and highlighting the structure and relationships in the data.
The traditional algorithm for reducing the dimension - the method of principal components was chosen as the main
method for visualizing the partitioning of data into clusters. Thanks to this method, it becomes possible to project all
points corresponding to measurements in the data from the multidimensional space of signs into the space of two main
components. Projections on the axis of the main components of the values of informative frequencies isolated from
the spectra of values of current, voltage and both electrical characteristics are shown together in Fig. 7.

Fig. 7. Projections on the axis of the main components of informative frequencies from the spectra of (a) current strength; (b) voltage; (c) current
and voltage
602 Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605
6 Raspopov D.A ./ Procedia Computer Science 00 (2019) 000–000

Thus, after constructing the projections of the points corresponding to the amplitudes of the informative frequencies
extracted from the spectra of current and voltage, on the axis of the main components, you can notice that the data are
well divided into 4 clusters, each of which corresponds to the type of electric energy consumer. Based on the logic,
constructed graphs and laws of electrical engineering, it can be assumed that for identifying the type of consumer of
electrical energy, only the current characteristics obtained for each electrical equipment are sufficient. But the main
problem in this case will be the impossibility of determining the consumer, provided that several identical devices
(equipment) are connected to the electric network, with the same characteristics and consuming the same amount of
current.
A key role in dividing objects into clusters according to data containing voltage values is played by noises, which
is different for any electrical equipment connected to the network. It is these noises that influence the devices both on
each other and on the entire electrical network as a whole. Therefore, it is important to take into account not only data
containing current values, but also data with voltage values and noises characteristics present in them [10].
This fact allows us to state with high accuracy that to develop an algorithm for identifying the type of consumer of
electrical energy, it is necessary to use signs from both data received from the current sensor and data received from
the voltage sensor.

3.4. Developing a predictive machine learning model

The allocated informative frequencies (noises characteristics) from the voltage and current spectra were used as
features in the consumer identification algorithm. As a predictive model, a modern machine learning algorithm was
used - gradient boosting over decisive XGBoost classification trees [11]. This choice was made based on a comparison
of the accuracy and advantages of popular machine learning and artificial intellegence models, among which were:
RandomForest, Multilayer Perceptron (MLP), K-nearest neighbors (KNN) and Gradient Boosting (XGBoost) [12].
The diagram for assessing the accuracy of models in the test set is shown in Fig. 8. The volume of the test sample was
30% of the total amount of data.

Fig. 8. Evaluation of the accuracy of machine learning algorithms on a test set

To assess the quality and accuracy of the XGBoost algorithm on the test sample, the following classification quality
metrics were selected:
Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605 603
Raspopov D.A. / Procedia Computer Science 00 (2019) 000–000 7

• Precision - accuracy, that is, the proportion of sample objects really belonging to this class relative to all objects
that the algorithm assigned to this class [13].First point
• Recall - completeness, that is, the proportion of objects found by the classifier that belong to the class with
respect to all objects of this class in the test sample.
• F1-score - is a harmonic mean between accuracy and completeness. It tends to zero if accuracy or completeness
tends to zero [14].

All the above metrics are collected in one classification report, which is presented in Fig. 9.

Fig. 9. Classification Report

After training the XGBoost algorithm on a training sample, which accounted for 70% of the total amount of data,
a decision tree was constructed, shown in Fig. 10.

Fig. 10. Visualization of a decision tree constructed as a result of learning the XGBoost algorithm

As a result of the analysis of the constructed graph of the decision tree, we can conclude that the following are the
most important for identifying the type of consumer of electric energy: 250 Hz, 1050 Hz, 150 Hz. The XGBoost
algorithm identifies these same signs as informative and having the greatest influence on the target variable, i.e. the
class number of the electric consumer, as shown in the bar graph on the right in Fig. 11.
604 Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605
8 Raspopov D.A ./ Procedia Computer Science 00 (2019) 000–000

Fig. 11. Diagram for assessing the importance of features identified using the XGBoost model

An interesting fact is that these frequencies belong to a group of informative features isolated from voltage
characteristics spectra, and features isolated from current characteristics have only a small influence of 1% and are
located at the very bottom of the graph (identifiers - 1350_cur, 1150_cur).

4. Concluding remarks

The main feature of the scientific research in this paper is the identification of the type of consumer of electric
energy, using the machine learning algorithm XGBoost, which uses the characteristics of voltage, but not current.
Based on the knowledge of the theory and laws of electrical engineering, only data with the values of the characteristics
of the current strength should be sufficient to determine the type of electric consumer, based on the fact that the more
powerful the equipment, the more current it consumes. But in practice, it very often happens that in the electrical
network there may be several of the same power and characteristics of electrical appliances. In this situation, the
recognition and identification of the type of consumer is impossible only by the values of the current strength and
therefore it is very important to apply other signs, for example, informative values of frequencies from the spectra of
voltage characteristics. Thanks to the identification of such signs, the process of identifying the type of consumer of
electric energy is possible under various conditions, and using modern machine learning methods, the speed of the
classification algorithm is increased and optimized, accompanied by high accuracy of prediction.
A further promising development of this work is to improve the accuracy of the predictive model and write software
for the Smart Socket Smart Energy Systems with the aim of further research and development of new methods and
algorithms for Smart Grid.

References

[1] L. Wenpeng, D. Sharp, S. Lancashire (2010) "Smart grid communication network capacity planning for power utilities", Proc. IEEE PES
Transmission Distrib. Conf. Expo: 1-4.
[2] B.B. Kobets, I.O. Volkova (2010) Innovative development of the electric power industry based on the SmartGrid concept. / M .: IAC Energy:
208.
[3] Clark W. Gellings (2009) “The Smart Grid: Enabling Energy Efficiency and Demand Response.” CRC Press: 250.
[4] S. J. Anders (2007) "The emerging smart grid" in Energy Policy Initiative Center, University of San Diego School of Law: 4-8.
[5] H. Farhangi (2008) "Intelligent micro grid research at BCIT", EnergyBiz Smart Grid Suppl.
[6] M. Amin, S. Wollenberg (2005) "Toward a smart grid: Power delivery for the 21st century", IEEE Power Energy Mag., vol. 3, no. 5: 34-41.
[7] M. Smith (2008) "Overview of federal R&D on microgrid technologies", Proc. Kythonos 2008 Symp. Microgrids: 2-8.
[8] N. Bressan, L. Bazzaco, N. Bui, P. Casari, L. Vangelista, M. Zorzi (2010) "The deployment of a smart monitoring system using wireless sensors
and actuators networks", Proc. IEEE Int. Conf. Smart Grid Commun. (SmartGridComm): 49-54.
[9] Pavleyno M.A. and Romadanov V.M. (2007) “Spectral transformations in MATLAB” St. Petersburg: 160
Dmitriy Raspopov et al. / Procedia Computer Science 169 (2020) 597–605 605
Raspopov D.A. / Procedia Computer Science 00 (2019) 000–000 9

[10] Ron Bekkerman , Mikhail Bilenko , John Langford (2011) “Scaling up Machine Learning: Parallel and Distributed Approaches”, Cambridge
University Press, New York, NY.
[11] Jerome H. Friedman (2002) “Stochastic gradient boosting”, Computational Statistics & Data Analysis, v.38 n.4: 367-378.
[12] Jerry Ye , Jyh-Herng Chow , Jiang Chen , Zhaohui Zheng (2009) “Stochastic gradient boosted distributed decision trees”, Proceedings of the
18th ACM conference on Information and knowledge management, Hong Kong, China.
[13] T. Chen, S. Singh, B. Taskar, and C. Guestrin (2015) “Efficient second-order gradient boosting for conditional random fields” In Proceeding
of 18th Artificial Intelligence and Statistics Conference (AISTATS'15), volume 1.
[14] T. Chen, H. Li, Q. Yang, and Y. Yu (2013) “General functional matrix factorization using gradient boosting” In Proceeding of 30th
International Conference on Machine Learning (ICML'13), volume 1, 436-444.

You might also like