Machine Learning in Photovoltaic Systems Areview

Renewable Energy 196 (2022) 298e318
Contents lists available at ScienceDirect
Renewable Energy
journal homepage: www.elsevier.com/locate/renene
Machine learning in photovoltaic systems: A review

ez , Camilo Guillen , Luis Felipe Giraldo *,
Jorge Felipe Gaviria , Gabriel Narva
Michael Bressan
D.C., 111711, Colombia
Department of Electric and Electronic Engineering, Universidad de Los Andes, Bogota
a r t i c l e i n f o a b s t r a c t
Article history: This paper presents a review of up-to-date Machine Learning (ML) techniques applied to photovoltaic
Received 27 December 2021 (PV) systems, with a special focus on deep learning. It examines the use of ML applied to control,
Received in revised form islanding detection, management, fault detection and diagnosis, forecasting irradiance and power gen-
16 June 2022
eration, sizing, and site adaptation in PV systems. The contribution of this work is three fold: first, we
Accepted 21 June 2022
Available online 1 July 2022
review more than 100 research articles, most of them from the last five years, that applied state-of-the-
art ML techniques in PV systems; second, we review resources where researchers can find open data-
sets, source code, and simulation environments that can be used to test ML algorithms; third, we pro-
Keywords:
Machine learning
vide a case study for each of one of the topics with open-source code and data to facilitate researchers
Deep learning interested in learning about these topics to introduce themselves to implementations of up-to-date ML
Photovoltaic systems techniques applied to PV systems. Also, we provide some directions, insights, and possibilities for future
Neural networks development.
Reinforcement learning © 2022 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND
Review license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 299
2. Techniques in machine learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 300
3. ML in control within PV systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301
3.1. ML for maximum power point tracking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301
3.2. ML for voltage and frequency control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302
3.2.1. Centralized voltage and frequency control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302
3.2.2. Decentralized voltage and frequency control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
4. Machine learning for management with PV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
5. ML for faults in PV systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
5.1. ML for fault detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
5.2. ML for PV diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305
5.3. ML for islanding detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306
6. ML for irradiance forecasting and PV output power estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
7. ML in sizing methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
8. ML in site-adaptation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308
9. Open resources for ML research in PV systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308
9.1. Simulation environments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308
9.2. Datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308
10. Case studies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309
10.1. Case study: maximum power point tracking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309
10.2. Case study: islanding detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309
* Corresponding author.
E-mail addresses: jf.gaviria@uniandes.edu.co (J.F. Gaviria), ge.narvaez@uniandes.
edu.co (G. Narva ez), c.guillen@uniandes.edu.co (C. Guillen), lf.giraldo404@
uniandes.edu.co (L.F. Giraldo), m.bressan@uniandes.edu.co (M. Bressan).
https://doi.org/10.1016/j.renene.2022.06.105
0960-1481/© 2022 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
ez, C. Guillen et al.
J.F. Gaviria, G. Narva Renewable Energy 196 (2022) 298e318
10.3. Case study: management . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 310

10.4. Case study: fault detection and PV diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 311
10.5. Case study: irradiance forecasting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 314
10.6. Case study: sizing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 314
10.7. Case study: site adaptation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 314
11. Conclusions and future research directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 314
CRediT authorship contribution statement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
Declaration of competing interest . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
Acknowledgement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
Supplementary data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
Nomenclature LSTM Long Short-Term Memory

MAE Mean Average Error
R2 Coefficient of Determination MPPT Maximum Power Point Tracking
ANN Artificial Neural Networks PCC Point of Common Coupling
CNN Convolutional Neural Networks PV Photovoltaic
DDPG Deep Deterministic Policy Gradient RF Random Forest
DG Distributed Generation RL Reinforcement Learning
DL Machine Learning RMSE Root Mean Square Error
DQL Deep Q-Learning RNN Recurrent Neural Networks
DQN Deep Q-Networks SAC Soft-Actor-Critic
EMS Energy Management System SVM Support Vector Machines
ESS Energy Storage System TD3 Twin-delayed Deep Deterministic Policy Gradient
GRUs Gated Recurrent Units
1. Introduction Fig. 1: control, management, faults in PV systems, irradiance fore-

casting, sizing, and site adaptation. We consider that these cate-
Among the renewable energy sources, solar generation is gories encompass most of the problems in PV systems where ML
perhaps one of the most widely used. For example, it currently have been applied to. Challenges involving cybersecurity in PV
corresponds to produce 11% of the total renewable generation in systems was not covered since it was thoroughly reviewed by
2017 in the US, and it is expected to increase to 48% by 2050 [9]. Kurukuru et al. [67]. Also, we present case studies for each of the
Moreover, the global solar photovoltaic (PV) capacity is estimated described challenges, including the source code and data that can
to increase from 593.9 GW in 2019 to 1582.9 GW in 2030, following be found in an open repository. Each portion of the open-source
capacity additions by China, India, Germany, the US, and Japan [37]. code is based, to some extent, on one representative reported
However, implementing PV systems still implies high costs and research work that proposes a solution to a challenge. Our goal is to
efficiency issues that need to be resolved. Efforts are still being allow researchers in the field to not only understand the problem
made to decrease the costs of implementing PV systems while and state-of-the-art solutions that have been proposed, but also to
increasing their efficiency, easing their implementation and quickly introduce themselves to implementations with real-world
coupling to electric grids. and simulated data. We also overview some of the most impor-
To address those challenges, machine learning (ML) algorithms tant resources that provide open datasets, source code, and simu-
have arisen as an alternative to traditional methods to provide lation environments to test ML algorithms in PV systems.
solutions that improve the performance of PV systems through the Very relevant and interesting reviews on applications of artifi-
integration of different techniques and modeling of complex dy- cial intelligence on energy systems have been done, such as the
namics. In particular, the advances of deep learning (DL) in PV ones proposed by Youssef et al. [125], Tina et al. [110], Kurukuru
systems during the last five years ago has allowed research in ML et al. [67], and Forootan et al. [43]. However, they either review a
and PV systems to move at a rapid pace since then, now involving wider range of topics in energy systems, or do not focus on state-of-
more powerful models to analyze structured information ranging the-art DL models in PV systems, or do not provide open source-
from images and video to multidimensional time series. This makes code and data. Our contribution in this paper is to complement
it imperative to review new methodologies that address problems previous review papers and to present a review of research work
in PV systems using state-of-the-art models in the field of ML. that have addressed different challenges in PV systems using
In this paper, we present a review of research work that has modern techniques in ML, in particular DL-based models, along
been reported mostly during the last five years in this topic, with a with open-source and data, to evaluate such models in the context
particular focus on applications of DL. For the literature review step, of PV systems. Due to the nature of the research topic, having a
the engines used to search the articles were: Scopus/ScienceDirect, review of the current advances along with open-source code and
IEEEXplore, ResearchGate, and GoogleScholar with keywords data is key to fuel the generation of new ideas for technology
related to machine learning, artificial intelligence, and photovoltaic development in the area.
systems. We focus on six important challenges in the imple- This paper is organized as follows: We first provide a brief
mentation and management of PV systems, as it is illustrated in introduction of the main ML techniques being applied in PV
299
Fig. 1. Problem categories in PV systems in which ML has been applied to.
Fig. 2. Some of representative techniques for supervised learning and reinforcement learning applied to PV systems in the research work reviewed.
systems (Section 2). Then, we review recently reported work in ML Reinforcement Learning (RL) is a technique that enables an
applied to PV systems according to Fig. 1 (Sections 3 to 8). After- agent to learn control policies by trial and error using rewards
ward, we provide a summary of some of the main open resources, provided by an interacting environment, which is described as a
such as source code and data, to work in ML applied to PV systems, Markov Decision Process [28]. Essential elements of an RL problem
(Section 9). We then present each of the study cases developed include the state (the current situation of the agent), the reward
(Section 10). We end this paper with some conclusions and a dis- (the feedback provided by the environment), and the policy (the
cussion about the possibilities for future development in the field of method that defines the relationship between the agent's states
ML in PV systems (Section 11). and its actions). The primary goal of the algorithm must be to find a
set of actions that maximizes the reward received by interactions of
2. Techniques in machine learning the agent with the environment. Some of the currently used
techniques to conduct the learning process in RL are Q-learning,
In this section we introduce some of the most representative SARSA, and their variants [106]. Some approaches divide the
techniques of ML that have been applied to PV systems with a continuous states and action-space into a finite number of regions,
special focus on DL algorithms. The categorization of these tech- known as Q-table. Another family of methods use DL models where
niques is shown in Fig. 2. ANNs are used to approximate a policy function. This allows for
Artificial Neural Networks (ANN) are models that are inspired in continuous state and action-spaces [41], where the Q-table is
the fact that neurons, as processing units, are interconnected in the replaced by a nonlinear continuous function modeled by an ANN
human brain [50]. A ANN defines a function that maps a n- [1]. Some of these approaches include Soft-Actor Critic (SAC), Deep
dimensional input to a r-dimensional output, and that it is con- Deterministic Policy Gradient (DDPG), Twin Delayed DDPG (TD3),
structed as the parallel and serial interconnection of basic pro- and Deep Q-Learning (DQL).
cessing units, called neurons. Each neuron computes a weighted Convolutional Neural Networks (CNN) are a type of ANN that
sum of its own inputs that is then transformed using an activation extract important features of a 1D, 2D or 3D array through filters of
function. The learning process refers to the process of finding the a certain length in a hierarchical manner.
weights for each neuron in a way that the ANN model is able to Recurrent Neural Networks (RNN) are a class of networks that
accurately predict the output for a given input [53]. can work on data sequences of arbitrary length to predict future
300
outcomes or to classify events. This is done by inferring relation- [48], the implementation of advanced communication infrastruc-
ships between the frame through its connections that point back- ture and powerful electronic devices open up the possibility to
wards [47]. Each neuron in an RNN layer receives an element of the implement advanced control techniques in the PV-based power
analyzed frame as input as well as the output of the previous system. In this section, we review different control techniques
prediction (hidden state) that was based on the last analyzed based on ML applied to the problems of maximum power point
element of the frame. This structure implies that RNNs allow pre- tracking (MPPT) with charge controllers and voltage/frequency
vious outputs to be used as inputs, while having hidden units to control for PV electric systems. Since applications on smart grids
take into account historical information through hidden states [16]. that combine PV and other renewable energies have already been
Gated Recurrent Units (GRU) aim to remember past information covered in specific reviews such as the one by Rangel-Martinez
for long periods of time [63]. This is done by assigning a set of et al. [95], these topics are not entirely covered in this review.
parameters in between edges of adjacent hidden units in RNNs.
This parameter would act as a gate, regulating the amount of in- 3.1. ML for maximum power point tracking
formation that the network can remember over time. If the value of
this parameter is close to one, more memory is retained, if it is close The total produced power by a PV system is influenced primarily
to zero, the memory of previous steps would be forgotten. by two elements: the solar irradiance and the cell temperature of
Long Short-Term Memory (LSTM) is a special kind of RNN that is the solar panel arrays. To consistently obtain the highest efficiency
capable of learning long-term dependencies [84]. It remembers in solar power production under changing conditions, the voltage
information for long periods of time due to a memory cell that of the system is controlled to keep the system at the MPP. This
stores previous information, called cell state, which is updated problem is known as MPPT control. Some traditional methods for
through 2 different parameters that are similar to the one find in MPPT control are Perturb-and-Observe and the Incremental-
GRUs. These parameters allow previous information to be Conductance methods, which are easy to implement [36]. How-
remembered or forgotten, and new information to be added to a ever, these methods usually need certain calibration levels to pre-
certain degree. The cell state is then used to update the hidden state vent high oscillations around the MPP during its search.
of the common RNN by using another parameter that lets certain Furthermore, when partial shading conditions are present, these
parts of its information flow through the hidden unit. These pa- algorithms tend to stay at local maxima, leading to lower energy
rameters are the output of gates (forget gate, input gate and output conversions. This issue is due to the nonlinear and time-varying
gate) that are simple ANNs whose input is the previous hidden characteristics of the PV array affected by changes in the atmo-
state. spheric and load conditions [40]. Fig. 3 shows the power-vs-voltage
Support Vector Machines (SVM) is a technique that finds a linear curve in a system with 3 PV modules where partial shading con-
model for classification [47]. The criterion of the SVM is based on ditions are present. Note that there are several local maxima for
the concept of maximum margin, which refers to the distance from different values of voltage.
the separating hyperplane to the closest observations in either Recently, ML techniques have been applied to solve these
class. problems, outperforming traditional methods. A review presenting
Random Forest (RF) is a model based on decision trees that are intelligent techniques applied for MPPT control in photovoltaic
constructed in parallel using random subsets of input variables. systems was published in 2014 by El Telbany et al. [40]. However,
Once the decision trees are trained, a RF model takes an example, the application of techniques such as RL and ANNs were not
passes it through all trees, and assigns the class with the most votes included in the review. Here, we review recent studies that have
[124]. proposed the implementation of more ML algorithms to address
the problem of MPPT control. A case study is introduced in Section
3. ML in control within PV systems 10.1.
Messalti et al. [79] obtained data from a standard Perturb-and-
There is a need for advanced control techniques to ensure a Observe algorithm to train a ANN that learns when to increase or
reliable integration of PV systems to the grid. As stated by Glavic decrease the voltage. The input variables are the output power
derivative and the voltage derivative. The results showed a high
tracking accuracy, high response time, and low overshoot. Similar
algorithms were proposed by Arjun and Zubin [19] and Agha et al.
[12], in which they achieved higher response times than conven-
tional methods.
Another DL algorithm that has provided excellent results for
MPPT control is RL. One of the first uses of RL in MPPT was
implemented by Kofinas et al. [62]. Arguing that RL has high
convergence stability with shorter computational time than other
meta-heuristic methods, the authors proposed a Q-Learning
tabular approach to track the MPP. The state-space was defined by
the voltage, the normalized current, and an angle relating to cur-
rent and voltage changes. The action-space was defined as a list of
steps to change the duty cycle, and the reward was based on pos-
itive or negative changes in the output power. The results obtained
showed that the algorithm designed outperformed the conven-
tional Perturb-and-Observe method in three different scenarios, in
which irradiance and temperature were changed. This research
leads to other similar tabular Q-learning methods applied to MPPT
such as one made by Aurobinda et al. [20], where their main con-
tributions were comparing the obtained RL control agent to fuzzy-
Fig. 3. PV array power curve under partial shading conditions. logic-sliding mode control and incremental conductance-sliding
301
Table 1
Comparative table of reports of ML applied to MPPT.
Articles Programming Language Experimental Results ML Techniques Used
[12,19,79] Python No ANNs

[21,22] Deep RL
[90] Matlab
[26,56,62] Tabular RL
[20,34] Yes Tabular & Deep RL
mode control, achieving better results with the RL algorithm. They power and the previous power. The action-space was defined as the
also conducted experiments on real testbeds with good results. disturbance applied to the controllable variable of the voltage. It is
However, both of these papers did not include comparisons be- important to note that the authors did not simulate a complete
tween different RL algorithms. This is why Bavarinos et al. [26] boost converter model in OpenAI Gym for the environment. The
implemented the Q-learning and SARSA algorithms for MPPT reward was defined as the current power divided by a normaliza-
control, while also comparing them. The variables used as the state- tion factor if the change of power was positive and minus one if the
space were the open-circuit voltage, the short circuit current, and change of power was negative. An epsilon-greedy exploration
the MPP under standard test conditions. The action-space used was strategy with linear decay was also used. The results obtained by
defined as discrete changes in the voltage duty-cycle. the authors showed that the maximum operating power point
Kalogerakis et al. [56] proposed a tabular Q-learning algorithm using the algorithm was less than 1% of the theoretical MPP.
for tracking the global MPP at partial shading conditions. A Boltz- In Table 1 we show some of the characteristics identified in the
mann exploration policy was used with a similar reward policy as presented papers. First, most of the papers work using Matlab as
previous mentioned papers. The state-space included current duty programming environment. One of the main reasons for this is that
cycle value, the power, and the previous value of the duty cycle. The Matlab facilitates the integration of ML algorithms into a wide va-
results achieved were compared to particle a swarm optimization riety of applications in PV systems. There is a need to develop more
algorithm and showed that the RL algorithm reduced between benchmark scenarios in free programming languages such as Py-
80.5% and 98.3% the time required for detecting the global MPP thon. Also, we identify a lack of experimental evaluation of RL al-
with respect to other algorithms based on RL. gorithms in real test-beds. We consider that more experiments
Although previous authors achieved remarkable results, none of should be conducted on real testbeds for MPPT tracking consid-
them utilized DL methods applied to RL. In the work made by Phan ering mainly deep RL methods. Deep RL methods such as DDPG,
et al. [90], a deep Q-learning (DQL) agent and a deep deterministic SAC and TD3 have not been yet tested on real experiments, and
policy gradient (DDPG) agent [68] were proposed as ways of con- considerations on implementations for real-time processing can
ducting an MPPT control, focusing primarily on partial shading potentially arise new research challenges.
conditions. This methodology allowed the agents to deal with
continuous state-space. The state-space was defined as the com- 3.2. ML for voltage and frequency control
bination of voltage, current, duty cycle, and current perturbation.
The action-space for the DQL agent was defined as a set of discrete The integration of PV in distributed networks generates voltage
steps that changed the duty cycle while the DDPG handled swings due to the rapid power fluctuations generated by PV sys-
continuous action-spaces. The reward function was defined using tems. This voltage swing generates voltage quality degradation and
the changes in power from one step to another, the MPP at standard voltage stability issues, which may breach established regulations
test conditions, and a penalty if the agent actions caused the duty [39]. Intelligent voltage control models have been recently imple-
cycle to step out of bounds. The results obtained by the models mented to maintain the voltage magnitudes of buses within a
were compared to the commonly used Perturb-and-Observe desirable defined range. This regulation is made by applying the
method, evidencing better results in the eight validation sce- control algorithms to the DC/AC inverter responsible for generating
narios. The agents tracked the global MPP in seven out of eight the three-phase AC voltage [125]. In a review made by Guo et al.
partial shading scenarios. Chou et al. [34] made comparisons be- [49], two voltage control categories are defined: centralized control
tween Q-table and Q-network algorithms for MPPT. The RL-Q- and decentralized control, which will be further analyzed in the
network MPPT achieved a higher average power than the RL-Q- following sections.
table MPPT method, while the former provided smaller oscilla-
tions. Both methods were then compared to the conventional 3.2.1. Centralized voltage and frequency control
method Perturb-and-Observe. The RL-based methods provided Centralized voltage control schemes use sophisticated
smaller ripples and faster tracking in the simulated scenarios. communication networks to regulate voltage using a system
However, in experimental results, the Perturb-and-Observe operator [113,117]. It requires highly reliable communication
method obtained a better performance than the proposed schemes using protocols such as DNP 3.0 or IEC 61850 [49]. This
methods. control is conducted using only one agent or multiple agents that
Although Matlab provides an environment for training and can regulate multiple zones in a grid with an injection of PV energy.
testing the RL agents while also being highly used, it is not an open- Li et al. [71] used a DDPG algorithm to coordinate multiple PV
source software that every researcher can access. This is why Avila smart inverters that regulate the voltage in a PV-grid system. The
et al. [21,22] constructed a DL model-free algorithm based on RL, authors described in great detail the characteristics of the agent
which was trained and tested in a custom developed OpenAI Gym and how it was trained. The state-space were the voltage magni-
environment in Python [85]. The algorithm proposed is based on tudes at each bus, real and reactive power generation/consumption,
DDPG in [21]. Moreover, the inverted-gradient and the TD3 algo- and actual values of the loads. The action-space was considered as
rithm were implemented to solve the problem in [22]. The each inverter's generated reactive power. The rewards consisted of
continuous state-space for the algorithm proposed in these papers large penalties due to breaching voltage limits and a negative
was the voltage, power, and the difference between the actual reward proportional to the total reactive power dispatched to
302
minimize the PV production reduction. The algorithm was trained believe that there is an opportunity to evaluate the use of a full-
and tested in the IEEE 37 bus system. To simulate the power flow, scale grid using only RL algorithms for both MPPT and voltage/
the solver OpenDSS [4] was used. Compared to the benchmark frequency control in uncertain scenarios.
method of Volt-Var, the DDPG was able to utilize reactive power
more efficiently with much less curtailment incurred. The DDPG
4. Machine learning for management with PV
generated a power loss of just 4.1% of the Volt-Var method losses
that resulted from reactive power generation while also avoiding
Energy management systems (EMS) have become increasingly
over or under voltages in all nodes.
helpful for consumers to reduce their electricity bills and maintain
A similar approach was proposed by Wang et al. [120] were a
efficiency. This improvement is possible due to the insertion of
multi-agent DDPG method was tested in the Illinois 200-Bus sys-
smart grid technologies such as smart meters and demand-in-
tem with PV integration considering load generation changes, N 1
response algorithms. The main objective of an EMS is to schedule,
contingencies, and a weak centralized communication environ-
in real-time, the system's energy flows by minimizing a defined
ment. Here, 3 agents were used to control certain parts of the 200-
objective function while maintaining reliable, secure, and safe op-
Bus system, which was divided into three zones. The state-space
erations of the system according to Zia et al. [126]. This objective
was defined as system-wide bus voltage magnitudes, phase an-
function is defined in a way that a desired demand/response is
gles, generations, and power flow. The action-space was a vector of
controlled through incentives or penalties. Several studies have
desired bus voltage magnitudes. The reward function took into
been conducted on the EMS optimization formulation. In this sec-
account a motivation to reduce the deviation of bus voltage mag-
tion, we will focus on some of the ML approaches proposed to
nitudes from a reference value and a penalization when a voltage
achieve this goal, as the Fig. 4 shows. We distinguish management
breach occurs in the system.
from control since the latter refers to keeping a specific process at a
set-point, while management refers to keeping the process's set-
3.2.2. Decentralized voltage and frequency control
point at a profitable point [80]. A case study is introduced in Sec-
In this type of control scheme, several agents have their own
tion 10.3.
local control policies [61]. This scheme implies that the coordina-
One of the most used ML techniques for solving this problem is
tion between agents is done in an automated manner, locally in
RL. Lee and Choi [69] used RL to manage the optimal management
each bus where it is needed, and without a centralized operator,
for residential houses that use a rooftop solar PV system, an energy
which may optimize local grid operation [98,116]. Cao et al. [31]
storage system (ESS), and smart home appliances. The authors used
used a multi-agent deep RL-based approach for a distribution
a Q-learning table method to control the energy consumption by
system with high penetration of PV. The main idea in this work was
appliances consumption such as air conditioner or washing ma-
that the agents interacted with each other by modeling each other's
chine, and the charging and discharging events in the energy
policies during training. The state-space of the agents consisted of
storage system. Also, the indoor temperature is predicted using an
the active and reactive power of load demand where the agent was
ANN to assist the Q-learning algorithm. The algorithm focuses on
connected and the active power injection of PV. Moreover, the
reducing the electricity bill using solar PV systems and the energy
action-space of each agent was the reactive power value of the
storage system while also reducing an established dissatisfaction
corresponding PV inverter to which the agent was associated.
cost. The state-space for the proposed problem was based on the
Lastly, the reward was defined as the negative sum of the absolute
energy consumption of both the washing machine and the air
value of the voltage deviation at each bus. The validation of the
conditioners, and the state of energy of ESS batteries at a given
algorithm was done using the IEEE 33 bus system [15]. The pro-
time. The action-spaces were: 1) turning on/off the washing ma-
posed algorithm was compared to the droop control method and
chine; 2) the ten different levels in which the system can set the AC
achieved shorter voltage fluctuations while also preventing over/
energy consumption of the air conditioner; 3) the ESS nine levels
under-voltage.
A similar approach was designed by Sun and Qiu [104], where
they used a multi-agent DDPG algorithm to mitigate voltage vio-
lations with centralized training and decentralized application. The
authors determined the dispatch of the on-load tap changers and
capacitor banks day-ahead, based on the hourly PV and load pre-
dictions. Then, the agents were trained offline using this informa-
tion as their input. The state-space was obtained from on-load tap
changers and capacitors banks. The action-space was defined as the
difference of PV reactive power output between two neighboring
time sections. The agents were trained in a centralized way while
its implementation was decentralized. The algorithm was tested in
multiple scenarios. It was demonstrated that it can potentially
mitigate voltage violations and reduce power loss better than other
methods. Another decentralized method was proposed by
Takayama and Ishigame [109]. The authors implemented a
multiple-agent RL algorithm with a radial basis function to deter-
mine the reactive power output of the PV inverters. The results
were compared to an algorithm that used particle swarm optimi-
zation, achieving a better performance.
Based on the reported research, there is a need to provide a
thorough comparison between centralized and decentralized
voltage control methods based on ML where practical aspects of
implementation can be assessed such as scalability, convergence,
communication topology, and real-time processing. Moreover, we Fig. 4. Diagram of the application of ML on EMS.
303
that allow the system to discharge and charge the batteries. The used included the Mamdani-type FIS, adaptive neuro-fuzzy inter-
reward evaluates the agents performance in terms of cost, the face system, support vector regression, echo state network, and
consumers satisfaction with the current operation, the electric cost, multi-layer perceptron models. The models were compared as well
and energy under-utilization. The previously mentioned ANN al- with solutions obtained through rolling-time-horizon strategies.
lows the system to accurately calculate the dissatisfaction of the Also, an approach for representing the potential microgrid energy
consumer. The algorithm was able to minimize the electricity bill flows as a single point in a planar domain is proposed to assess the
through the energy consumption scheduling while also maintain- microgrid operational configuration. A mathematical formulation
ing the consumer comfort level and appliance operation of the microgrid model is highly described with the formulation of
characteristics. the objective function. The results obtained showed that the rolling
Multi-agent-based RL techniques with Q-tables have also been time horizon strategy led to the best performance followed by the
used to address this problem as shown by Xu et al. [122]. In that neuro-fuzzy interface system, which in one scenario outperformed
paper, the authors used a multi-agent RL with Extreme Learning the former strategy. EMS management based on multi-objetive
Machine for implementing a home EMS. The objective of the agent predictive energy management strategies were also made by Shi-
was to reduce the energy electricity bill and the demand response vam et al. [101]. They used mainly CNNs to predict energy pro-
induced dissatisfaction cost. The structure of the EMS included a duction and electric load as one of the three levels of the control
refrigerator, an alarm system, air conditioner system, heating, strategy.
lighting, washing machine, dishwasher, and an electric vehicle. The Based on this review, we consider that more efforts should be
state-space corresponded to both the electricity price and the PV made to create or improve the current scenarios where researchers
output in an hour ahead slot. The action-space corresponded to the are testing their algorithms. Simulation of EMS need to be stan-
energy consumption scheduling of each home appliance and dardized in platforms such as OpenAI to make research de-
charging/discharging of the electric vehicle. The actions on an velopments comparable between each other. Although the
appliance depended on its type, which could be non-shift-able, CityLearn environment tries to standardize the tests of different ML
power-shift-able, or time-shift-able. Furthermore, the authors algorithms implemented over an EMS, it lacks of a variety of EMS
used Extreme Learning Machines for predicting the future trends of scenarios that researchers could encounter in real experiments.
the electricity price and PV generations and used this as part of the Furthermore, based on the simulated scenarios created for testing
state-space. The algorithm was able to reduce the electricity cost by the algorithms, experimental results should be conducted and re-
45% when compared to scenarios where the algorithm was not ported to test the ML algorithms in real scenarios.
being used.
A DRL agent was proposed in the work made by François-Lavet 5. ML for faults in PV systems
et al. [44], where the management problem considered efficiently
operating storage devices in a microgrid featuring panels with One of the main challenges that the PV industry faces is its
short and long-term storage capacities. The RL technique used was vulnerability to faults due to its exposure to harsh environmental
the DQL algorithm. The state-space was composed of the amount of conditions [66]. Fault detection and diagnosis is essential to opti-
energy in the batteries and the amount of energy in a hydrogen mize energy conversion efficiency and improving the life span of
tank. The action-space consisted of the amount of energy trans- the PV system. In this section, we identified three subcategories
ferred in or out from the storage system. The reward function was were ML algorithms are used: fault detection based on online
composed of the instantaneous operational revenues, which voltage and current data, fault diagnosis over PV modules, and
considered the electric power generated by the PV system. The islanding detection. It is worth to mention that in this review we
results gathered showed that the ANN representation of the value did not cover fault detection approaches based on ML for grid
generalized the policy in an efficient manner. connected PV inverters since this topic has been widely covered by
Although these papers achieved remarkable results, the lack of Malik et al. [73].
standardization in EMS makes it difficult to compare the different
proposed algorithms. This is why some studies have implemented 5.1. ML for fault detection
their solutions on open environments that allow researches to
share, replicate, and compare their designed models. An example There are different types of faults that can be classified based on
includes the work made by Kathirgamanathan et al. [58], where a what causes them. For example, incipient faults are low in
centralized Soft Actor-Critic (SAC) algorithm was implemented to magnitude and last only for a few microseconds, making them very
flatten and smooth the aggregated curve of the electrical demand of difficult to detect [97]. Abrupt faults refer to faults that occur
a district using the CityLearn environment [119]. The proposed instantaneously, often attributed to the results of line-to-line faults
multi-objective cost function consisted of the peak electricity de- or line-to-ground short circuits. Intermittent faults refer to faults
mand, the average daily electricity peak demand, ramping, the load that are cleared over time, such as partial shading [51]. Line to line
factor, and the net electricity consumption of the district over the faults occurs when an unexpected short circuit happens between
evaluation period. The state-space included hour, month, day, two lines in a PV array [123], whereas the line to ground happens
direct solar radiation, non-shift-able load, solar generation, cooling when there is a short circuit between a point of the array and
storage SOC, domestic hot water stock SOC, and the temperature ground. These are types of short circuit (SC) faults in which
outside of the buildings. The reward function was designed to abnormal connections between two points of different potentials
penalize peak consumption based on the total electricity con- occur [11]. Hot spot faults are caused by partial or complete shading
sumption. Moreover, the reward considered the charging and dis- of a PV module due to dust, which may increase the localized
charging of batteries during day and night. temperature in a cell and damage the module [38]. Open circuit
Other approach to tackle this problematic is based on using (OC) fault occurs when in a PV array, an accidental disconnection
optimization algorithms for managing the EMS that are not mainly happens [57]. Finally, arc faults are generated because of a loose
based on an ML algorithm. This is done by Leonori et al. [70] where connection or a cable insulation failure [103]. In recent years, ML
the authors made a comparison between several ML methods was techniques have been highly studied for fault classification and, in
made. Several EMS models were created using ML techniques and some cases, to identify the specific location of the fault. In this
compared using six different simulation scenarios. The algorithms section, we will study some of these papers.
304
Appiah et al. [18] used LSTM networks for fault classification and features to the CNN. The features used for training were the PV
detection. The faults considered by the authors only included line- voltage and current transformed into a 2-Dimension electrical time
to-line faults and hot spot faults. The data was collected using a series graph and used as inputs to the designed CNN.
simulated model of 5.3 kW PV array with added noise to simulate One of the main contributions made by Krizhevsky et al. [64]
real-world scenarios. The classification accuracy for the data was was the design and implementation of an experimental platform to
100% for scenarios with no faults, 100% for scenarios with hot spot verify the proposed CNN model. The researchers obtained 1400
faults, and 99.03% for the line-to-line faults. When adding the noise, graphs from the experimental platform that were divided into
the accuracy of the models was reduced to 99.23%, 98.78%, and training and testing datasets. Different sliding windows and graph
97.66%, respectively. sizes were tested to obtain the best results. The proposed algorithm
An RF classifier was used by Chen et al. [33] for the detection and was then compared to classifiers that used Inception-V3 [107] and a
diagnosis of PV array early faults. A dataset was generated including wavelet-transform using SVM. Compared to the other methods, the
line-line faults, degradation, open circuit, and partial shading in proposed method achieved higher accuracy under different con-
simulation. The features considered for training the model were the ditions. The authors achieved a maximum accuracy of 99.51% using
operating voltage and the string of current of the PV array. The the created model and a sliding window of 10 s.
results showed that the performance of a random forest out- Furthermore, Gao and Wai [45] proposed another fault PV array
performed other methods. A 99% accuracy was obtained in the identification technique based on CNN. Instead of using a 2D-CNN
classification and detection tasks in both the simulated and as in previous papers, the authors used a 1D-CNN and a fully con-
experimental results. Pahwa et al. [86] made a comparison between nected module for classification. The faulty scenarios considered for
decision trees, XGBoost, RF, and ANNs. Six different faults that this research included short-circuit faults, partial shading faults,
include partial shading, bypass diode fault, bridging fault, temper- abnormal aging faults, and hybrid faults. To verify the effectiveness
ature fault, complete shading fault, and short circuit faults were of the proposed method, the authors used the t-distributed sto-
included in the dataset with a total of 1200 readings. The results chastic neighbor embedding [114] to visualize the models distri-
presented showed that the ANNs achieved the highest validation bution effect over the features extracted. The results achieved in the
performance with an accuracy of 99.91%. A similar work was pre- testing data showed a 100% accuracy. The overall accuracy of the
sented by Haji et al. [51], where a principal component analysis was model using the data gathered through the experimental platform
used to reduce the number of features before classification. was 98.41%.
CNNs have also been used to address this problem. Aziz et al. Table 2 compares the described articles. We highlight the lack of
[24] used a 2D CNN to extract features from scalograms generated both experimental results on real testbeds, and the assessment of
from PV system data and classify faults. Five different faulty sce- new ML methods such as vision transformers applied to scalo-
narios were used to train the model. The data was collected with grams. It has been shown that vision transformers can potentially
the incorporation of an MPPT algorithm in the simulated system. outperform other models such as CNNs. Also, open datasets of real
The features transformed to a scalogram were the irradiance, the experimental results should be made available across researchers
temperature, the short circuit current, open-circuit voltage, the PV for comparison purposes.
current, the MPP and boost converter current, voltage, and power.
One of the main contributions of [24] was that the different values
of irradiance considered for the fault classification implied changes 5.2. ML for PV diagnostics
in the MPPT model to generate broader types of scenarios for the
predictions. A CNN was also used by Lu et al. [72]. In that paper, the The decrease in efficiency due to abnormal operating conditions
faulty time-series data was analyzed and then used as input fault generated because of dirt on the surface of modules, cell breakage,
delamination or hot stops raises the need for diagnostics to PV
Table 2
ML for fault detection articles.
Articles Faults Studied Experimental Results Features Used Main ML Technique
[18] Line-to-line, No Current, LSTM

Hot Spots Voltage,
Power
[33] Line-to-line, RF
Module Degradation,
OC,
Partial Shading
[86, 51] Partial shading, OC voltage, ANNs
Bypass diode fault, SC current,
Temperature fault, MPP data,
Complete shading, Temperature,
OC Irradiance,
Fill factor
[107] SC, IeV curve, 1D-CNN
Partial Shading, Irradiance,
Abnormal Aging, Temperature
Hybrid Faults
[24, 72] Partial Shading, Scalogram based on: 2D CNN
Line-to-line, - Irradiance,
OC, - Temperature,
Arc fault, - SC current,
Hybrid Faults, - OC voltage,
MPPT variations -PV, Boost & MPP current & voltage
[64] Yes 2D CNN
Inception-V3
305
modules to be done with ML techniques by analyzing images of methods tend to fail due to the need to set up a threshold value that
panels. Cipriani et al. [35] used CNN to automatically classify may omit certain islanding scenarios. Active methods need to inject
thermographic images. The aim of the model was to distinguish disturbances into the grid to detect the event that generates power
between hot spot conditions from those of dust. The dataset was quality issues in the system. Intelligent methods eliminate the need
constructed using different thermal camera technologies, resolu- to use a threshold value and to inject disturbances into the system
tion, and color scales with legacy PV systems from different char- while also presenting low detection times. In this section, we will
acteristics through the Italian territory. Several techniques were focus on presenting some of the ML techniques applied to islanding
applied for pre-processing the images, which included normaliza- detection. A case study is introduced in Section 10.2.Kumar et al.
tion, homogenization of the number of pixels, grayscaling, thresh- [65] proposed an islanding detection technique based on the
olding, box blur, and Sobel-Feldman filters. The results showed that wavelet transform for feature extraction and an ANN for classifi-
a maximum accuracy of 98% was achieved with an actuation time cation. This technique was simulated in Matlab under different
between milliseconds and about 2 min. A similar approach was scenarios considering all positive switching transients, islanding
performed by Rico Espinosa et al. [96] where they proposed a events, and faults from the grid side. The proposed algorithm
method based on semantic segmentation and the use of CNN for achieved a mean accuracy of 98%. The algorithm was also tested
failure mode classification. The authors first used semantic seg- under noisy conditions achieving an accuracy of 88.5% in the
mentation to identify the PV modules in an image, obtaining its noisiest scenario. A similar approach was conducted by Puthen-
mask. The obtained mask and the original image were then used as purakel and Subadhra [94]. Features were extracted using a wavelet
input for a classification network that used CNN. The classification transform applied to the negative sequence voltage measurement
networks considered two classification scenarios: binary classifi- at the point of common coupling (PCC) of a DG system with a PV
cation and quaternary classification. The binary classification and system and a wind-power-plant. These features, along with the
qua-ternary classification model achieved an accuracy of 75% and total harmonic distortion, were used to train an ANN. An accuracy
70%, respectively. of 95% was obtained. The main challenge in this problem was
Pierdicca et al. [91] implemented an anomaly cells detection making voltage swell (heavy load switching or capacitor load
model based on thermal infrared PV images collected by aerial switching events) predictions. Ahmadipour et al. [13] used a
vehicles. The model was based on the mask region-based CNN ar- Ridgelet Probabilistic ANN to detect islanding events. They simu-
chitecture because of its ability to make both object detection and lated two islanding scenarios and four none islanding scenarios to
instance segmentation. Furthermore, the researchers collected a extract features using the Slantlet Transform [99]. Then, the best of
publicly available dataset, which was used to train the proposed these variables were selected to train the ANN using an evolu-
model. The intersection over union and the dice coefficient metrics tionary search algorithm. The proposed method was compared
showed the effectiveness of the proposed models. Moradi Sizkouhi with other classifiers and achieved better performance.
et al. [82] proposed a deep convolutional encoder-decoder archi- Manikonda et al. have published several articles regarding
tecture for autonomous fault detection of PV plants using multi- islanding detection in the past years. The use of CNN can be found
copters. The paper focuses especially on detection of bird's drops in [75,76]. In both of them, the authors collected data based on a
over PV modules. The average accuracy reached over the testing set simulated 100 KW grid-connected PV system. The voltage phase
was 93%. A more in depth review on ML methods applied to PV signals from the PCC were measured, concatenated, and then used
modules diagnostics was recently reported by Berghout et al. [27]. to generate images using the wavelet transformation method. In
The authors discuss several ML tools for classifications including [75], the authors used a transfer learning approach with AlexNet
conventional ML and DL. They included several papers that discuss [64] to classify islanding and non-islanding events with a SVM used
the detection of faults using electroluminescence images and in the classification layer. On the other hand, in [76], the authors
thermographic images. used a custom-designed CNN. The results showed a 98.78% accu-
Based on the reported results in this topic, we identify an op- racy in both papers presented. Noise was also added to the data,
portunity to collect and make datasets available in which new ML and the models achieved the same performance showing that the
models for PV diagnostics can be tested. Also, ML methods such as proposed models were resilient to noise.
generative adversarial networks could be applied to create gener- SVM methods have also been playing a major role in islanding
ators of artificial thermal images and have a better understanding event predictions. Baghaee et al. [25] proposed a SVM algorithm
of the phenomena being studied. Furthermore, future research can that was able to classify islanding and grid fault events. This work,
involve the development of a pipeline for deploying the ML algo- unlike previous work, studied the classification problem using real-
rithms for real time PV diagnostics using performance- and power- world data from a custom-made intelligent electronic device from a
constrained devices and Tiny-ML technologies [121]. micro-grid along with data from simulations. Seven Gaussian SVM
classifiers were trained, whose inputs included frequency, active
5.3. ML for islanding detection and reactive power of loads at PCC and total harmonic distortion
values of voltage and current. The results obtained in that work
Islanding is the process through which an on-grid distributed showed high precision and effectiveness. Gayathry and Sujith [46]
generation (DG) system is disconnected from the grid while the DG proposed a SVM with a Radial-Basis-Function-Kernel for islanding
system continues to power the load. When unplanned islanding detection. The training and testing were conducted on a dataset
events occur, due to line tripping or equipment errors, it may with different faults. The accuracy of the validations was 100%, but
generate power quality problems on account of voltage/frequency it is worth noting that very few data were used for validation. A
instability and even damage electrical instruments. For this reason, SVM method was also proposed by Manikonda and Gaonkar [77].
detecting unplanned islanding conditions is critical for DG reliance. The authors extracted features from the images using a histogram-
According to standards such as the IEEE Std. 1537e2018 [8] and IEC of-oriented-gradient-features. The dataset contained 1720 images,
62116:2014 [7], an unplanned islanding detection condition must where 860 of them were from islanding scenarios, and 860 were
be detected and stopped within 2 s since the formation of the event. different grid-connected events. Tree-based classifiers have also
As described by Manikonda and Gaonkar [78], there are six types of been used to detect islanding scenarios as proposed by Khan et al.
islanding detection techniques: passive, active, hybrid, remote, [59]. The results show that the model achieved an accuracy of
signal processing methods, and intelligent methods. Passive 97.6%, detecting the fault within 0.2s.
306
Table 3 shows a comparison between the reported articles. From outperformed models such as RNN, general regression neural
this analysis, we can see that very little has been done to report network, and extreme learning machines. Another approach using
experimental results on real testbeds. There is an interesting LSTMs is presented by Pan et al. [87], where they used a short-term
pipeline proposed by Baghaee et al. [25] to conduct experimental solar generation forecasting method based on LSTM with temporal
tests and also to compare the multiple ML methods in this field of attention mechanism. Narvaez et al. [83] evaluated Encoder-
study. Decoder LSTM and GRU networks for daily and weekly time hori-
zons predictions. The LSTMs outperformed the GRU networks both
in daily and weekly horizons.
6. ML for irradiance forecasting and PV output power
Suresh et al. [105] had the goal to forecast the solar PV output
estimation
using CNN. The approach included using regular CNN, multi-
headed CNN, and a CNN-LSTM for short-term and medium-term
PV energy production is highly dependent on weather condi-
forecasting. The models were then compared to multiple linear
tions such as solar irradiance and temperature. Therefore, pro-
regression and an auto-regressive moving average models. The
duction levels of this energy source fluctuate, making it difficult for
features used for prediction included solar irradiation, wind speed,
power companies to balance the production and consumption of
ambient temperature, and the PV module temperature measured in
electricity when using PV systems. Hence, several ML algorithms
15-min windows. These features were chosen according to the IEA
have been implemented to forecast solar irradiation and the output
report in [89]. The results in that work showed that a simple CNN
power from PV systems. A case study is introduced in Section 10.5.
and a CNN along with a LSTM network performed well for 1h, 1-
Miranda et al. [81] compared multiple ML algorithms, including
day, and 1 week prediction time horizons. A similar approach was
ANN, for the prediction of half-hourly diffuse horizontal solar
conducted by Tovar et al. [112], where they used a five-layer CNN-
irradiance from only the global horizontal irradiance and a
LSTM model for PV power forecast using data from Temixo [5,111].
geographic coordinate. Data taken from the national solar radiation
The forecasting horizons ranged from 10 min to 180 min.
database (NSRDB) [100] for 6 different sites in Colombia was used.
Table 4 describes the reported papers for PV output power
The authors achieved a coefficient of determination ranging form
estimation and irradiance forecasting. This table shows that the
0.9974 to 0.9983 over the different sites. Furthermore, the article
forecasting algorithms are mainly based on recurrent units. There is
includes a github repository with the work made. Ahn and Park [14]
a need to assess more recent techniques for time-series prediction
used a RNN as a PV power short-term forecaster. This model was
such as transformers architectures, which are based solely on
trained using power data collected in real-time. The collected data
attention mechanism. Also, none of the articles described a pipeline
was the solar irradiation, the module temperature, the ambient
for the deployment of these models in real-time scenarios, which is
temperature, the wind speed, and the humidity. The forecasting
a challenging task that can define future research directions.
was conducted for 5min, 15min, 1h and 3h time horizons. The ex-
periments showed that the proposed model achieved higher pre-
diction accuracy than autoregressive integrated moving average 7. ML in sizing methods
and support vector regression with RF models for the short-term
forecast. The RNN model achieved an accuracy of normalized ML techniques have been used to define the optimal number of
mean average of 99.1% and 98.6% for 5 min and 15 min forecasting, panels, storage capacity of batteries, tilt, and azimuth angles
and 97.4% and 96.2% for the 1h and 3h time horizons. The same required in PV systems. Moreover, several strategies have been
problem was solved by Hossain and Mahmood [55] using LSTM developed to size PV systems installed by residential costumers. A
networks. However, they also used a K-nearest-neighbors algo- case study is introduced in Section 10.6.
rithm to classify historical irradiance data into different types of sky Kumar et al. [65] used ANNs to estimate the PV size, tilt, and
groups. The authors showed that the proposed model azimuth, using behind-the-meter data. The dataset used originated
Table 3
ML for islanding articles.
Articles Faults Studied Experimental Results Feature Extraction Main ML Technique
[65, 94] Positive switching No Wavelet ANN

transients, Transform
Islanding events,
Faults from grid side,
Voltage swell
[75, 76] Islanding, CNN
Power mismatch AlexNet
between DG
source generation,
Load connected to
PCC near zero,
Sudden switching of
capacitors and inductance
Loads
[59] Islanding, Decision Tree
Grid Faults
[13] Two islanding scenarios, Stantlet Ridgelet
4 none islanding scenarios Transform Probabilistic
ANN
[46, 77] Islanding, None SVM
Grid Faults,
Capacitor Switching
[25] Yes
307
Table 4
ML for irradiance forecasting and PV output power estimation.
Articles Power Forecast Algoritm Used Irradiance Forecast Algorithm Used
[81] e ANN
[14] RNN e
[55] LSTM K-nearest-neighbor
[87] LSTM-attention LSTM-attention
[83] e LSTM-GRU
[105, 112] CNN e
from the Pecan Street dataset [88], which collects the behavior of temperature, the wind speed, and the time of day. To collect sat-
more than 1300 customer loads over one year. PV generation was ellite data, the authors used the NSRDB [100]. The models tested
simulated using the system advisor model simulator [6] and the included linear regressions, ANN, RF, and AdaBoost. To compare the
PCLib toolbox. The input values were the minimum yearly day and results of the different methods, the authors computed the site-
night net load values. The models used to predict PV size were adaptation method quantile mapping [92], which is a classical
tested under different generated scenarios created by the authors, method for site-adaptation. When comparing the RMSE, R2, and
including PV estimations of 1000 test customers, estimation with MAE metrics of the models, it was clear that almost every ML model
varying tilt and azimuth considering errors in the dataset, different outperformed quantile mapping. The model with the best perfor-
net load data resolution, and mislabeled data added on purpose. mance was the random forest for all the performance indicators.
The results showed an improvement in accuracy for the neural A case study is introduced in Section 10.7. Since ML applied
networks approach compared to the linear regression model. A towards site-adaptation has not been yet further studied, other
generalized RNN was used by Khatib and Elmenreich [60], where techniques for regression and data preprocessing can be explored
the main objective was to estimate the PV array and battery sizing to improve the performance of the site-adaptation algorithms.
ratio. This sizing was computed using the loss of load probability
index, latitude, and longitude. Using a simulation of hourly solar 9. Open resources for ML research in PV systems
radiation and load demand, the model was validated, achieving a
mean absolute percentage error of 0.6%. 9.1. Simulation environments
An ML algorithm to map PV arrays in high-resolution overhead
imagery by identifying individual PV arrays, their size, and the An OpenAI gym environment was created to evaluate MPPT
power generation capacities over large geographical areas was algorithms [21,22] and it is available in [23]. The state-space in-
proposed by Malof et al. [74]. It was created using CNN for semantic cludes the current values of voltage, power, and the difference
segmentation, which provides pixel-wise labels of an input image. between the actual and previous power. The action-space consists
It was called SolarMapper [10] and is publicly available. It was of disturbances applied to the output voltage. For the simulation of
trained on the Duke California Solar Array dataset [29], which management systems, there are two main sources from which a
contains 400 km2 imagery and has 16000 hand-labeled solar ar- simulation of an environment can be found to easily compare
rays. An estimate to calculate the installed solar capacity of the designed models. One is CityLearn, created with the OpenAI Gym
panels is also proposed based on the identification realized by the environment that allows the implementation of centralized or
classifier. The model was accurate with a precision of 0.76 in the multi-agent control for building energy coordination and demand
object-based performance metric. The technique used to assess the response in cities [118,119]. Another useful environment would be
pixel-wise performance was the intersection-over-union (IOU) pymgrid [54], which is an open-source package to generate and
metric, which was 0.67. To calculate the installed solar capacity, the simulate a large number of micro-grids.
researchers first estimated the surface area of the PV array installed
based on the segmentation process. Then, the parameters of a
9.2. Datasets
simple linear regression based on the surface area to predict the PV
array capacity installed was used. The correlation coefficient ach-
Open datasets are available to test and validate ML techniques
ieved by the model was 0.91, using color imagery to estimate the
that are applied to different problems in PV systems. Some of them
parameters for each array.
include:
8. ML in site-adaptation C Duke California Solar Array Dataset [29]: Over 400 km2 of
imagery and 16000 hand-labeled solar arrays.
Site-adaptation is the process in which satellite-based mea- C Desert Knowledge Australia Center Dataset [3]: It unifies
surements are calibrated with in-situ measurements that were multiple real life data of PV technologies spanning many
collected at a specific site. Most of the common approaches to solve types, ages, models and configurations.
this problem are statistical analyses that attempt to fit satellite C ESOLMET-IER Dataset [5]: The Institute of Renewable En-
estimates to in-situ data according to Polo et al. [93]. To the authors ergies UNAM makes available solarmetric and meteorolog-
knowledge, only one article has addressed this problem using ML ical data taken from the station “ESOLMET-IER”.
techniques. Narvaez et al. [83] tested several ML regression tech- C The National Solar Radiation Data Base (NSRDB) [100]: It is a
niques to construct a model that could accurately capture the point- publicly available dataset that consists of solar radiation and
to-point relationship between meteorological station measure- meteorological data over the US and surrounding countries
ments and satellite data. The input variables used were the during the last 23 years.
satellite-based data, while the output was the estimation of in-situ C Photovoltaic Thermal Images Dataset [91]: Aerial thermal
global horizontal irradiance measurements. The input variables images of PV arrays with the presence of one or more
were the diffuse horizontal irradiance, the global normal irradi- anomaly cells and their respective masks. The images are
ance, the direct normal irradiance, the solar zenith angle, the from a 66 MW PV plant in Tomboruke.
308
Fig. 6. PV power and duty cycle at G1 ¼ 1000W/m2, G2 ¼ 1000W/m2, G3 ¼ 1000W/

m2 and T ¼ 25 deg.C by using the Deep Q network agent and the Perturb and Observe
algorithm.
Fig. 5. Episode Reward for MPPT agent implemented with DQN.
Ptþ1
C Pecan Street Dataset [88]: Collects the behavior of more than r4 ¼ (6)
PMPP;STC
1300 customer loads over one year.
where d1, d2 and d3 are small constants, PMPP,STC is the MPP at
10. Case studies standard test conditions, and PMax is the MPP achieved in a defined
interval of time, which is restarted to the current power when the
In this section we present a case study for each of the problems signal (PG) of a pulse generator is 1, which does this every 0.504s.
in Fig. 1 with a link to open source code. PMax is initialized with a value of zero. A decaying epsilon greedy
strategy is used to explore the environment. The parameters used
for training the algorithm are similar to the ones presented in [90],
10.1. Case study: maximum power point tracking however, the researchers in that paper did not use Eq. (2) nor Eq.
(3). These parameters, along with the network architecture and
For this particular case study, the objective was to replicate the further explanation regarding the use of the algorithm, can be
results obtained by Phan et al. [90] where the authors used DQL. We found in the GitHub repository created.1
used the same PV system as the one present in the paper. The al- As it can be seen in Fig. 5, the DQN agent training curve con-
gorithm was trained and tested under different conditions, verges in about 2000 episodes. Multiple environmental conditions
including partial shading. The state-space was defined as the were used for this training process, including partial shaded and
voltage and current generated by the PV, the duty cycle, and the nonpartial shading conditions, different initial temperatures, and
duty cycle perturbation, while the action-space was defined as 9 different initial duty cycle values. Some of the results of the
discrete changes in the duty cycle. The discrete changes range from designed model against the commonly used Perturb-and-Observe
very-high-positive to very-high-negative-changes to the duty cycle, algorithm in three different environmental scenarios can be seen
including a value of no change. The reward function that we pro- in Figs. 6e8. These figures show that the DQN model outperformed
posed changes from the one presented in [90] as: the commonly used Perturb-and-Observe algorithm by generating
more power in each scenario.
r ¼ r1 þ r2 þ r3 þ r4 (1)

Ptþ1 if Ptþ1 PMax d1 or PG ¼ 1 10.2. Case study: islanding detection
PMax ¼ (2)
PMax if Ptþ1 PMax < d1 and PGs1
In this case study, we replicated, to some extent, the work
8
>
<0 if Ptþ1 PMax d2 presented by Manikonda and Gaonkar [76]. We generated our own
dataset from a simulated 100 kW grid-connected PV system based
r1 ¼ Ptþ1 PMax (3)
>
:5 otherwise on [2]. The system included an active load and a capacitor bank
PMPP;STC connected to the PCC. To collect the data, the voltage at each phase
8 in the PCC was measured and concatenated in a single array. Then,
> 2 2 these time series data were converted into images using the
>
< Ptþ1 Ptþ1
if Ptþ1 Pt d3 wavelet transform with a Morlet wavelet and re-scaled into
r2 ¼ PMPP;STC PMPP;STC (4)
>
> 224 224 RGB images. The scenarios included in the dataset can be
:
otherwise seen in Table 5.
The dataset was constructed by changing the active power of a

0 if 0 D 1 1
r3 ¼ (5)
otherwise
1
MPPT Case Study: https://github.com/SmartSystems-UniAndes/PV_MPPT_
Control_Based_on_Reinforcement_Learning.
309
Fig. 7. PV power and duty cycle at G1 ¼ 900W/m2, G2 ¼ 900W/m2, G3 ¼ 300W/m2

and T ¼ 25 deg.C by using the Deep Q network agent and the Perturb and Observe
algorithm. Fig. 9. Scalogram of the grid connected with a capacitor and load at 90 kW (scalogram
of class 0).
Fig. 8. PV power and duty cycle at G1 ¼ 1000W/m2, G2 ¼ 500W/m2, G3 ¼ 300W/m2

and T ¼ 25 deg.C by using the Deep Q network agent and the Perturb and Observe
algorithm.
Fig. 10. Scalogram of an islanding event with the capacitor connected and load at
90 kW (scalogram of class 4).
Table 5
Dataset composition for Islanding detection.
note that every image collected in the dataset was generated from
- On Grid On Grid Switch Switch Island. Island. LL LN
C Load C Load ON C OFF C C Load C Load Fault Fault measurements that took place exactly at the beginning of each
ON OFF Load Load ON OFF event (0.2se0.35s). Therefore, we encourage the reader to generate
Events 100 100 100 100 100 100 100 100
a more realistic dataset based on the system that we provide,
Class 0 1 2 3 4 5 6 7 adding noise to the data and collecting data in different time
windows.
load connected to the PCC from values between 90 kW and 110 kW 10.3. Case study: management
for each one of the events. An example of the scalograms generated
can be seen in Fig. 9 and Fig. 10. In this case study, we used the CityLearn environment and
The results obtained over the validation set are shown in the replicated the methodology presented by Kathirgamanathan et al.
confusion matrix presented in Fig. 11, which shows a 100% accuracy. [58]. However, instead of using a centralized SAC agent, we
The dataset, along with the code to generate it, and the code implemented a centralized DDPG agent. The environment involved
implemented for the CNN model are publicly available.2 one building, which had a solar power capacity of 40 kW, an annual
Although the model achieved a 100% accuracy, it is important to cooling demand of 280, 239 kWh, and an annual non-shift-able
electrical demand of 36, 978 kWh. Moreover, we used the climate
zone type 1. The action-space is composed of changes that increase/
2
Case Study Islanding Detection: https://github.com/SmartSystems-UniAndes/ decrease the cooling energy stored in the storage device of the
Islanding_Detection. building. The state-space variables used for our centralized agent
310
were day, month, year, the outside dry-bulb temperature, the direct Table 6
solar radiation, the non-shift-able electricity load of the building, Hyperparameters for the DDPG agent training.
and the state of charge of the buildings cooling storage. The Symbol Description Value
implementation of the DDPG agent was primarily based on a Replay buffer size 1e5
e
GitHub repository created by Tabor [108]. The hyperparameters Minibatch size 1024
e
used for training the DDPG algorithm can be seen in Table 6. L Discount factor 0.99
The reward function was designed based on the peak con- h1 Actor learning rate 0.001
sumption penalization as presented in [58], with incentive charging h2 Critic learning rate 0.002
during the nights. This reward function is defined based on Eq 7 Τ Target smoothing coefficient 0.0003
Hidden layers sizes 800
and Eq 8. e
ðe2demand Þ*b þ rnight

r¼ (7)
5000
8 5000 if 3am hour 4am and csSOC < 0:9and 0:1 < a < 0:3
<
rnight ¼ (8)
: 5000 if 3am hour 4am and csSOC < 0:9and a < 00 otherwise
where b is a weighting coefficient, edemand is the electricity demand,

csSOC is the state of charge of the cooling storage, and a is the action
chosen by the agent. The training process of the centralized DQN
agent can be seen in Fig. 12.
When the trained agent is tested, a reduction of around 16% over
the energy consumption occurs compared to an environment with
no agent. The building electricity demand profile using the imple-
mented DDPG agent and without using it throughout a year can be
seen in Fig. 13. Furthermore, in Fig. 14, a smaller interval of the
building electricity profile is presented to show the performance of
the created model.
The multi-objective cost function scores are presented in Table 7.
When compared to the results obtained in [58], the only metric in
Fig. 12. Episode Reward for Management agent implemented with DDPG in CityLearn.
which our model surpassed the SAC was in the net electricity
consumption. Also, the results obtained in the former paper used
more buildings than this case study. Therefore, we encourage the
reader to modify the DDPG algorithm implemented to be tested
with more buildings while also checking if, by changing hyper-
parameters of the model, it is possible to surpass the average score
obtained in [58]. Adjustments to the reward function are also
encouraged since the ramping score show that the implemented
model produces too many fluctuations over the building energy
demand. The current version of the model suffers from randomness
when training, which sometimes leads to a local maxima that does
not allow the agent to learn. This repository is publicly available.3
10.4. Case study: fault detection and PV diagnostics
For this case study, the main goal is to replicate, to some extent,
the results achieved by Rico Espinosa et al. [96]. In their paper, four
3
Case Study Management: https://github.com/SmartSystems-UniAndes/
Fig. 11. Confusion matrix on the validation dataset for islanding detection. Management_City_Learning.
311
Fig. 13. One year of electricity demand profile of building 4 without using i) PV generations, ii) with PV generation and iii) with PV generation and the implemented agent.
Fig. 14. Two hundred hours of electricity demand profile of building 4 i) without using PV generations, ii) with PV generation and iii) with PV generation and the implemented
agent.
Table 7
Results obtained for management case study.
Climate Ramping Load Avg. Daily Peak Net. Elect. Consumption Avg. Score
Factor Peak Demand
1 1.80 1.03 1.34 1.25 0.96 1.27
Fig. 15. Architecture of the classifier based on VGG-16.
failure signatures are classified based on RGB images. The main goal reused as the starting point for a model that solves another task
is to use the augmented dataset with the masks shared in [17] to [30]. We used the VGG16 as the pre-trained network [52,102]. Here,
create a quaternary classification model. However, in contrast to we first segment the PV modules in the image, to then classify the
the one presented in that paper, our model is generated using failure signature. The different classes of images can be seen in
transfer learning techniques. Transfer learning is a technique in Fig. 16. The structure of the classifier can be seen in Fig. 15.
which a model that has been trained to solve a specific task is The results were obtained on 400 images. The confusion matrix
312
Table 8
Layer configuration of the model.
Layer Configuration Activation Function
1D Convolution filters ¼ 16, kernel ¼ 16 ReLU

LSTM Enconder Units ¼ 24 Tanh, Sigmoid
Repeat Vector repetition factor ¼ 24 e
LSTM Decoder Units ¼ 24 Tanh, Sigmoid
Time Distributed Units ¼ 1 Linear
Table 9
CNN-LSTM Encoder-Decoder performance in daily forecasting.
Dataset R2 RMSE MAE
Training 0.837 110.5 59.86

Validation 0.85 100.9 55.08
Testing 0.846 106.5 58.72
Fig. 16. Fault signature categories that are intended to be detected: (a) Glass Damage,
(b) No Faults, (c) Dust and (d) Shadow.
Fig. 18. Daily forecasting of irradiance in 200 h.
Fig. 17. Confusion matrix for failure mode detection.

Fig. 19. Site-adaptation with SVR
313
Fig. 20. Sizing case study's example of original image, label and prediction based on the model created.
Table 10 dataset are not completely well labeled. The metrics show that the
SVR performance metrics of site adaptation according to cross-validation. tendency of the model was to over-fit with more epochs. Therefore,
Model Hyperparameters R2 RMSE MAE we encourage the reader to test different models and try to obtain
SVR kernel: radial basis function 0.74 0.5 0.27
better results. The repository created is publicly available.6
10.7. Case study: site adaptation

is presented in Fig. 17. Note that the classification accuracy is
around 86.75%, surpassing the accuracy obtained in [42]. The In this case study, we intended to test a support vector regres-
implemented model along with the dataset are publicly available.4 sion model over the dataset used by Narvaez et al. [83] for site
adaptation. The repository created is publicly available.7 The input
10.5. Case study: irradiance forecasting data used for the model was the same used by the researchers. As a
pre-processing technique, the measurements were all standard-
In this case study, we intended to replicate the methods applied ized. A 4-cross-validation scheme was used to evaluate the per-
in [111] to the dataset used in [83]. This dataset was adjusted by the formance of the site adaptation technique. The results obtained can
site-adaptation models used in the [83]. With the improved data- be seen in Table 10.
set, we trained a daily forecast model on an hourly sampling time. Furthermore, an image representing the adapted site data
The dataset was divided into three sets: 70% for training, 15% for compared to the satellite and in-situ measurements is shown in
validation, and 15% for testing. We used a CNN-LSTM encoder- Fig. 19.
decoder network model with an input size of 168 h of global hor-
izontal irradiance. The layer configuration of the proposed model
can be seen in Table 8. 11. Conclusions and future research directions
The performance obtained after training the model through 17
epochs can be seen in Table 9, where R2, RMSE and MAE stand for We presented a thorough review of the recent advances in ML
coefficient of determination, root mean square error and mean applied to PV systems, where most of the papers considered were
average error respectively. In Fig. 18, the results of using the daily published within the time period from 2014 to 2021. We also pre-
forecasting model over a period of 200 h can be seen more clearly. sented resources that are available to validate ML techniques in PV
The model created along with the data are publicly available.5 systems, and provided study cases with links to open source code.
There are still a variety of challenges in the implementation and
10.6. Case study: sizing analysis of PV systems that can potentially be addressed using ML.
One of the key aspects of the success of this type of techniques is
For this case study, we intended to reproduce, to some degree, the availability of data that can provide information that is useful to
the methodology applied by Malof et al. [74] to map PV arrays in find those patterns that make it possible an automatic learning
high-resolution overhead imagery. To do this, we used transfer process. Therefore, more open datasets with real data from PV
learning techniques with DeepLabv3 network [32], which uses systems should be shared in the research community. Also, we have
deep CNN in the application of semantic segmentation. We trained identified several problems that can be considered for future
and tested the algorithm using four images of 5000 5000 reso- research in this field of study:
lution in [29], divided into images of size 500 500 and used as
input for the model. After training the model through 30 epochs, C More experimental tests on real testbeds should be con-
we managed to achieve an inspection over union performance of ducted while also making more comparisons between agent-
0.689 and 0.61 over the training and testing data, respectively. critic RL methods for MPPT. Moreover, RL techniques for
An example of the results achieved by the model is shown in MPPT and voltage/frequency control should be tested
Fig. 20. It is important to note that the images gathered in the together.
4 6
Case Study Fault Detection and PV Diagnostics: https://github.com/ Case Study Sizing: https://github.com/SmartSystems-UniAndes/Sizing_
SmartSystems-UniAndes/Failure_Detection_for_PV. DeepLabV3_Semantic_Segmentation.
5 7
Case Study Irradiance Forecasting: https://github.com/SmartSystems- Case Study Site Adaptation: https://github.com/SmartSystems-UniAndes/
UniAndes/Solar_Radiation_Forecasting. Machine_Learning_for_Site_Adaptation.
314
C In forecasting methods, new techniques such as transformers [10] Solar energy environmental mapper (solar mapper) web-based GIS appli-
cation, URL, https://solarmapper.anl.gov/.
that are based solely on attention mechanism [115] should be
[11] Types of faults in electrical power systems, URL, https://www.
tested. electronicshub.org/types-of-faults-in-electrical-power-systems/.
C Since the applications of ML techniques towards site- [12] Haider Saif Agha, Zafar-ullah Koreshi, and Mohammad Bilal Khan. Artificial
adaptation are recently being developed, more data pro- neural network based maximum power point tracking for solar photovol-
taics. In 2017 International Conference on Information and Communication
cessing techniques should be tested. Technologies (ICICT), pages 150-155. IEEE. ISBN 978-1-5386-2186-8. doi:
C For islanding detection, more physical trials should be con- 10.1109/ICICT.2017.8320180. URL http://ieeexplore.ieee.org/document/
ducted to test the new proposed models. 8320180/.
[13] Masoud Ahmadipour, Hashim Hizam, Mohammad Lutfi Othman, Mohd
C For fault detection and classification, there is a need to study Amran Mohd Radzi, Avinash Srikanta Murthy, Islanding detection technique
the behavior of RL-based MPPT algorithms in the presence of using slantlet transform and ridgelet probabilistic neural network in grid-
faults. Also, new image recognition algorithms such as vision connected photovoltaic system, 231:645e659. ISSN 03062619, https://
linkinghub.elsevier.com/retrieve/pii/S0306261918314570.
transformers can be tested. [14] Hyung Keun Ahn, Neungsoo Park, Deep RNN-based photovoltaic power
C For diagnosis and anomaly detection using thermal images, short-term forecast using power IoT sensors, 14(2):436, https://www.mdpi.
the use of generative adversarial networks can be explored to com/1996-1073/14/2/436. Number: 2 Publisher: Multidisciplinary Digital
Publishing Institute.
expand the current datasets available.
[15] Ikbal Ali, Mini Thomas, and Dr Pawan Kumar. Distributed Resource Planning
C Little has been done to propose ML algorithms for real-time for Improved Voltage Stability of Radial Distribution System. 2:95e101.
processing and decision-making. There is a need to propose [16] Afshine Amidi, Recurrent neural networks cheatsheet, URL, https://stanford.
edu/shervine/teaching/cs-230/cheatsheet-recurrent-neural-networks.
pipelines that describe the deployment of models for real-
[17] AndresFlorez-Git, AndresFlorez-git/proyecto_electronica, URL, https://
time processing using TinyML or EdgeIA. github.com/AndresFlorez-Git/Proyecto_Electronica. original-date: 2020-04-
16T19:27:14Z.
[18] A. Y. Appiah, X. Zhang, B. B. K. Ayawli, and F. Kyeremeh. Long short-term
memory networks based automatic feature extraction for photovoltaic
CRediT authorship contribution statement array fault diagnosis. 7:30089e30101. ISSN 2169-3536. doi: 10.1109/AC-
CESS.2019.2902949. Conference Name: IEEE Access.
Jorge Felipe Gaviria: Conceptualization, Methodology, Soft- [19] M. Arjun and J. B. Zubin. Artificial neural network based hybrid MPPT for
photovoltaic modules. In 2018 International CET Conference on Control,
ware, Validation, Investigation, Data curation, Writing e original Communication, and Computing (IC4), pages 140e145. doi: 10.1109/
ez: Conceptualization, Writing e
draft, Visualization. Gabriel Narva CETIC4.2018.8530922.
review & editing, Supervision. Camilo Guillen: Methodology, [20] Bag Aurobinda, Bidyadhar Subudhi, and Pravat Kumar. A combined rein-
forcement learning and sliding mode control scheme for grid integration of a
Software, Writing e review & editing. Luis Felipe Giraldo: Super- PV system. ISSN 20960042, 20960042. doi: 10.17775/CSEEJPES.2017.01000.
vision, Writing e review & editing, Project administration. Michael URL https://ieeexplore.ieee.org/stamp/stamp.jsp?tp¼&
Bressan: Supervision, Writing e review & editing. arnumber¼8928283.
[21] L. Avila, M. De Paula, I. Carlucho, and C. Sanchez Reinoso. MPPT for PV sys-
tems using deep reinforcement learning algorithms. 17(12):2020e2027, .
Declaration of competing interest ISSN 1548-0992. doi: 10.1109/TLA.2019.9011547. Conf. Name: IEEE Latin
Am. Transact..
[22] Luis Avila, Mariano De Paula, Maximiliano Trimboli, Ignacio Carlucho, Deep
The authors declare that they have no known competing reinforcement learning approach for MPPT control of partially shaded PV
financial interests or personal relationships that could have systems in smart grids, 97:106711, . ISSN 15684946, https://linkinghub.
appeared to influence the work reported in this paper. elsevier.com/retrieve/pii/S1568494620306499.
[23] Luis Avila, Mariano De Paula, Maximiliano Trimboli, Ignacio Carlucho, loa-
vila/mppt-gym. https://github.com/loavila/mppt-gym original-date: 2019-
Acknowledgement 04-10T13:39:40Z.
[24] F. Aziz, A. Ul Haq, S. Ahmad, Y. Mahmoud, M. Jalal, and U. Ali. A Novel
Convolutional Neural Network-Based Approach for Fault Classification in
This work was supported by Centro de Desarrollo Sostenible Photovoltaic Arrays. 8:41889e41904. ISSN 2169-3536. doi: 10.1109/AC-
para Ame rica Latina through Convocatoria para financiacio n de CESS.2020.2977116. Conference Name: IEEE Access.
proyectos de investigacio n relacionados con el alcance de los [25] Hamid Reza Baghaee, Dragan Mlakic, Srete Nikolovski, Tomislav Dragicevic,
Support vector machine-based islanding and grid fault detection in active
objetivos de desarrollo sostenible. distribution networks, 8(3): 2385e2403. ISSN 2168-6777, 2168-6785,
https://ieeexplore.ieee.org/document/8714012/.
Appendix A. Supplementary data [26] Kostas Bavarinos, Anastasios Dounis, Panagiotis Kofinas, Maximum power
point tracking based on reinforcement learning using evolutionary optimi-
zation algorithms, 14(2):335, https://www.mdpi.com/1996-1073/14/2/335.
Supplementary data to this article can be found online at Number: 2 Publisher: Multidisciplinary Digital Publishing Institute.
https://doi.org/10.1016/j.renene.2022.06.105. [27] Tarek Berghout, Benbouzid Mohamed, Toufik Bentrcia, Xiandong Ma,
Sinisa Djurovic, Leïla-Hayet Mouss, Machine learning-based condition
monitoring for PV systems: state of the art and future prospects, 14(19):
References 6316. ISSN 1996-1073, https://www.mdpi.com/1996-1073/14/19/6316.
[28] Shweta Bhatt, Reinforcement learning 101, URL, https://towardsdatascience.
[1] Deep deterministic policy gradient d spinning up documentation, URL, com/reinforcement-learning-101-e24b50e1d292.
https://spinningup.openai.com/en/latest/algorithms/ddpg.html. [29] Kyle Bradbury, Raghav Saboo, Jordan Malof, Timothy Johnson, Arjun
[2] Detailed model of a 100-kW grid-connected PV array - MATLAB & simulink, Devarajan, Wuming Zhang, Collins Leslie, and Richard Newell. Distributed
URL, https://www.mathworks.com/help/physmod/sps/ug/detailed-model- Solar Photovoltaic Array Location and Extent Data Set for Remote Sensing
of-a-100-kw-grid-connected-pv-array.html. Object Identification. doi: 10.6084/m9.figshare.3385780.v1. URL/articles/
[3] DKASC, Alice springs | DKA solar centre, URL, http://dkasolarcentre.com.au/ dataset/Distributed_Solar_Photovoltaic_Array_Location_and_Extent_Data_
locations/alice-springs. Set_for_Remote_Sensing_Object_Identification/3385780/1. Publisher: fig-
[4] EPRI | smart grid resource center simulation tool e OpenDSS, URL, https:// share type: dataset.
smartgrid.epri.com/SimulationTool.aspx. [30] Jason Brownlee, A gentle introduction to pooling layers for convolutional
[5] ESOLMET-IER instituto de energías renovables, URL, http://esolmet.ier.unam. neural networks. https://machinelearningmastery.com/pooling-layers-for-
mx/Tipos_consulta.php. convolutional-neural-networks/.
[6] Home - system advisor model (SAM), URL, https://sam.nrel.gov/. [31] Di Cao, Weihao Hu, Junbo Zhao, Qi Huang, Zhe Chen, Frede Blaabjerg,
[7] IEC 62116:2014 | IEC webstore | invertor, smart city, LVDC. https://webstore. A multi-agent deep reinforcement learning based voltage regulation using
iec.ch/publication/6479. coordinated PV inverters, 35(5):4120e4123. ISSN 0885-8950, 1558-0679.
[8] Ieee 1547-2018 - IEEE standard for interconnection and interoperability of https://ieeexplore.ieee.org/document/9113746/.
distributed energy resources with associated electric power systems in- [32] Liang-Chieh Chen, Papandreou George, Florian Schroff, Hartwig Adam,
terfaces, URL, https://standards.ieee.org/standard/1547-2018.html. Rethinking atrous convolution for semantic image segmentation. http://
[9] Renewable energy, URL, https://www.c2es.org/content/renewable-energy/. arxiv.org/abs/1706.05587.
315
[33] Zhicong Chen, Fuchang Han, Lijun Wu, Jinling Yu, Shuying Cheng, Peijie Lin, International Conference on Electrical Machines and Systems (ICEMS), pages
Huihuang Chen, Random forest based intelligent fault diagnosis for PV arrays 1e4.
using array voltage and string currents, 178:250e264, . ISSN 0196-8904, [58] Anjukan Kathirgamanathan, Kacper Twardowski, Eleni Mangina, Donal Finn,
https://www.sciencedirect.com/science/article/pii/S0196890418311415. A centralised soft actor critic deep reinforcement learning approach to dis-
[34] Kuan-Yu Chou, Shu-Ting Yang, Yon-Ping Chen, Maximum power point trict demand side management through CityLearn. http://arxiv.org/abs/2009.
tracking of photovoltaic system based on reinforcement learning, 19(22): 10562.
5054, https://www.mdpi.com/1424-8220/19/22/5054. Number: 22 Pub- [59] Mohammed Ali Khan, Ahteshamul Haque, and Varaha Satya Bharath. Ma-
lisher: Multidisciplinary Digital Publishing Institute. chine Learning Based Islanding Detection for Grid Connected Photovoltaic
[35] Giovanni Cipriani, Antonino D'Amico, Stefania Guarino, Donatella Manno, System. page 6..
Marzia Traverso, Vincenzo Di Dio, Convolutional neural network for dust and [60] Tamer Khatib, Wilfried Elmenreich, An Improved Method for Sizing Stand-
hotspot classification in PV modules, 13(23):6357. ISSN 1996-1073, https:// alone Photovoltaic Systems Using Generalized Regression Neural Network,
www.mdpi.com/1996-1073/13/23/6357. 2014. URL, https://www.hindawi.com/journals/ijp/2014/748142/. ISSN:
[36] M.A. Danandeh, S.M. Mousavi G, Comparative and comprehensive review of 1110-662X Pages: e748142 Publisher: Hindawi Volume.
maximum power point tracking methods for PV cells, 82:2743e2767. ISSN [61] Aristides Kiprakis and A.R. Wallace. Maximising Energy Capture from
13640321, https://linkinghub.elsevier.com/retrieve/pii/ Distributed Generators in Weak Networks. 151:611e618. doi: 10.1049/ip-
S1364032117313813. gtd:20040697.
[37] G. Data, Global solar photovoltaic (PV) market update, 2019 with historic [62] P. Kofinas, S. Doltsinis, A.I. Dounis, G.A. Vouros, A Reinforcement Learning
(2006-2018) and forecast (2019-2030), URL, https://www.businesswire.com. Approach for MPPT Control Method of Photovoltaic Sources, 108:461e473.
[38] Mahmoud Dhimish, Ghadeer Badran, Photovoltaic hot-spots fault detection ISSN 09601481, https://linkinghub.elsevier.com/retrieve/pii/
algorithm using fuzzy systems, 19(4):671e679. ISSN 1530-4388, 1558-2574, S0960148117301891.
https://ieeexplore.ieee.org/document/8854326/. [63] Simeon Kostadinov, Understanding GRU networks. https://
[39] Ruisheng Diao, Zhiwei Wang, Di Shi, Qianyun Chang, Jiajun Duan, towardsdatascience.com/understanding-gru-networks-2ef37df6c9be.
Xiaohu Zhang, Autonomous voltage control for grid operation using deep [64] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. ImageNet classifi-
reinforcement learning, URL, http://arxiv.org/abs/1904.10597. cation with deep convolutional neural networks. 60(6):84e90. ISSN 0001-
[40] Mohamed E. El Telbany, Ayman Youssef, and Abdelhalim Abdelnaby Zekry. 0782, 1557-7317. doi: 10.1145/3065386. URL https://dl.acm.org/doi/
Intelligent techniques for MPPT control in photovoltaic systems: a compre- 10.1145/3065386..
hensive review. In 2014 4th International Conference on Artificial Intelli- [65] S. Ananda Kumar, M.S.P. Subathra, Nallapaneni Manoj Kumar,
gence with Applications in Engineering and Technology, pages 17e22. IEEE. Maria Malvoni, N.J. Sairamya, S. Thomas George, Easter S. Suviseshamuthu,
ISBN 978-1-4799-7910-3. doi: 10.1109/ICAIET.2014.13. URL https:// Shauhrat S. Chopra, A novel islanding detection technique for a resilient
ieeexplore.ieee.org/document/7351807/. photovoltaic-based distributed power generation system using a tunable-q
[41] Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang, A theoretical wavelet transform and an artificial neural network, 13(16):4238. ISSN
analysis of deep q-learning, URL, http://arxiv.org/abs/1901.00137. 1996-1073, https://www.mdpi.com/1996-1073/13/16/4238.
[42] Andre s Flo
rez, Luis Felipe Giraldo, and Michael Bressan. Portable Real-Time [66] C. Kuo, J. Chen, S. Chen, C. Kao, H. Yau, and C. Lin. Photovoltaic energy
Failure Mode Classification on PV Modules Based on RGB Images. page 10. conversion system fault detection using fractional-order color relation
[43] Mohammad Mahdi Forootan, Iman Larki, Rahim Zahedi, Abolfazl Ahmadi, classifier in microdistribution systems. 8(3):1163e1172. ISSN 1949-3061.
Machine learning and deep learning in energy systems: a review, 14(8): doi: 10.1109/TSG.2015.2478855. Conf. Name: IEEE Trans. Smart Grid..
4832. ISSN 2071-1050, https://www.mdpi.com/2071-1050/14/8/4832. [67] Varaha Satra Bharath Kurukuru, Ahteshamul Haque, Mohammed Ali Khan,
Number: 8 Publisher: Multidisciplinary Digital Publishing Institute. Subham Sahoo, Azra Malik, Frede Blaabjerg, A review on artificial intelli-
[44] Vincent François-Lavet, David Taralla, Damien Ernst, and Raphael Fonteneau. gence applications for grid-connected solar photovoltaic systems, 14(15):
Deep Reinforcement Learning Solutions for Energy Microgrids Management. 4690. ISSN 1996-1073, https://www.mdpi.com/1996-1073/14/15/4690.
page 7. Number: 15 Publisher: Multidisciplinary Digital Publishing Institute.
[45] W. Gao and R.-J. Wai. A novel fault identification method for photovoltaic [68] Maxim Lapan. Deep Reinforcement Learning Hands-On: Apply Modern RL
array via convolutional neural network and residual gated recurrent unit. 8: Methods, with Deep Q-Networks, Value Iteration, Policy Gradients, TRPO,
159493e159510. ISSN 2169-3536. doi: 10.1109/ACCESS.2020.3020296. AlphaGo Zero and More. Packt Publishing Ltd. ISBN 978-1-78883-930-3.
Conference Name: IEEE Access. Google-Books-ID: xKdhDwAAQBAJ..
[46] V. Gayathry and M. Sujith. Machine learning based synchrophasor data [69] Sangyoon Lee, Dae-Hyun Choi, Reinforcement learning-based energy man-
analysis for islanding detection. In 2020 International Conference for agement of smart home with rooftop solar photovoltaic system, energy
Emerging Technology (INCET), pages 1e6. doi: 10.1109/ storage system, and home appliances, 19(18):3937. ISSN 1424-8220, https://
INCET49848.2020.9154089. www.mdpi.com/1424-8220/19/18/3937.
[47] Aurelien Geron. Hands-On Machine Learning with Scikit-Learn, Keras, and [70] Stefano Leonori, Alessio Martino, Fabio Massimo Frattale Mascioli, and
TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. Antonello Rizzi. Microgrid Energy Management Systems Design by Compu-
O'Reilly Media, second ed. edition.. tational Intelligence Techniques. 277:115524. ISSN 03062619. doi: 10.1016/
[48] Mevludin Glavic. (deep) Reinforcement Learning for Electric Power System j.apenergy.2020.115524. URL https://linkinghub.elsevier.com/retrieve/pii/
Control and Related Problems: A Short Review and Perspectives. 48:22e35. S0306261920310369.
ISSN 13675788. doi: 10.1016/j.arcontrol.2019.09.008. URL https:// [71] Changfu Li, Chenrui Jin, Ratnesh Sharma, Coordination of PV smart inverters
linkinghub.elsevier.com/retrieve/pii/S1367578819301014. using deep reinforcement learning for grid voltage regulation. http://arxiv.
[49] Qinglai Guo, Junjian Qi, Venkataramana Ajjarapu, Richard Bravo, Joe Chow, org/abs/1910.05907.
Zhengshuo Li, Rohit Moghe, Ehsan Nasr-Azadani, Ujjwol Tamrakar, G.N. [72] Xiaoyang Lu, Peijie Lin, Shuying Cheng, Yaohai Lin, Zhicong Chen, Lijun Wu,
Taranto, Reinaldo Tonkoski, Gustavo Valverde, Qiuwei Wu, and Guangya Qianying Zheng, Fault diagnosis for photovoltaic array based on convolu-
Yang. Review of Challenges and Research Opportunities for Voltage Control tional neural network and electrical time series graph, 196:950e965. ISSN
in Smart Grids. 34:2790e2801. doi: 10.1109/TPWRS.2019.2897948. 01968904, https://linkinghub.elsevier.com/retrieve/pii/
[50] Kevin Gurney. Introduction to Neural Networks. ISBN: 9780203451519 S0196890419307332.
Place: Oxford OCLC: 892785047. [73] Azra Malik, Ahteshamul Haque, V.S. Bharath Kurukuru, Mohammed Ali Khan,
[51] Mansour Hajji, Mohamed-Faouzi Harkat, Abdelmalek Kouadri, Frede Blaabjerg, Overview of fault detection approaches for grid connected
Kamaleldin Abodayeh, Majdi Mansouri, Hazem Nounou, Nounou Mohamed, photovoltaic inverters, 100035. ISSN 2772-6711, https://www.sciencedirect.
Multivariate feature extraction based supervised machine learning for fault com/science/article/pii/S2772671122000079.
detection and diagnosis in photovoltaic systems, page S0947358019304054. [74] Jordan M Malof, Boning Li, Bohao Huang, Kyle Bradbury, and Artem Stretslov.
ISSN 09473580, https://linkinghub.elsevier.com/retrieve/pii/ Mapping Solar Array Location, Size, and Capacity Using Deep Learning and
S0947358019304054. Overhead Imagery. page 6.
[52] Muneeb Hassan, VGG16 - convolutional network for classification and [75] Santhosh K G Manikonda and Dattatraya N Gaonkar. A new islanding
detection, URL, https://neurohive.io/en/popular-networks/vgg16/. detection method using transfer learning technique. In 2018 8th IEEE India
[53] Robert Hecht-Nielsen. Theory of the Backpropagation Neural Network. page International Conference on Power Electronics (IICPE), pages 1e6. IEEE, .
13. ISBN 978-1-5386-4996-1. doi: 10.1109/IICPE.2018.8709431. URL https://
[54] Gonzague Henri and Tanguy Levent. Pymgrid: an Open-Source python ieeexplore.ieee.org/document/8709431/.
Microgrid Simulator for Applied Artificial Intelligence Research. page 7.. [76] Santhosh K.G. Manikonda, Dattatraya N. Gaonkar, IDM based on image
[55] Mohammad Safayet Hossain, Hisham Mahmood, Short-term photovoltaic classification with CNN (10):7256e7262, . ISSN 2051-3305, 2051-3305,
power forecasting using an LSTM neural network and synthetic weather https://onlinelibrary.wiley.com/doi/10.1049/joe.2019.0025, 2019.
forecast, 8: 172524e172533. ISSN 2169-3536, https://ieeexplore.ieee.org/ [77] Santhosh K.G. Manikonda, Dattatraya N. Gaonkar, Islanding detection
document/9200614/. method based on image classification technique using histogram of oriented
[56] Christos Kalogerakis, Eftichis Koutroulis, Michail G. Lagoudakis, Global MPPT gradient features, 14 (14):2790e2799, . ISSN 1751-8695, 1751-8695, https://
based on machine-learning for PV arrays operating under partial shading onlinelibrary.wiley.com/doi/10.1049/iet-gtd.2019.1824.
conditions, 10(2):700, https://www.mdpi.com/2076-3417/10/2/700. Num- [78] Santhosh K.G. Manikonda and Dattatraya Narayan Gaonkar. Compreh. rev.
ber: 2 Publisher: Multidisciplinary Digital Publishing Institute. IDMs in DG sys.. 2(1):11e24, . ISSN 2515-2947, 2515-2947. doi: 10.1049/iet-
[57] R. Kase and S. Nishikawa. Fault detection of bypass circuit of PV module d stg.2018.0096. URL https://onlinelibrary.wiley.com/doi/10.1049/iet-stg.
detection technology of open circuit fault location. In 2016 19th 2018.0096.
316
[79] Sabir Messalti, Abd Ghani Harrag, and Abd Elhamid Loukriz. A new neural [97] Mehdi Salay Naderi, Gevork B. Gharehpetian, M. Abedi, and Trevor Black-
networks MPPT controller for PV systems. In IREC2015 the Sixth Interna- burn. Modeling and Detection of Transformer Internal Incipient Fault during
tional Renewable Energy Congress, pages 1e6. IEEE. ISBN 978-1-4799-7947- Impulse Test. 15: 284e291. doi: 10.1109/T-DEI.2008.4446762.
9. doi: 10.1109/IREC.2015.7110907. URL https://ieeexplore.ieee.org/ [98] T. Sansawatt, J. O'Donnell, L. F. Ochoa, and G. P. Harrison. Decentralised
document/7110907. voltage control for active distribution networks. In 2009 44th International
[80] Dan Miklovic, The difference between process control & management, URL, Universities Power Engineering Conference (UPEC), pages 1e5.
https://www.controlglobal.com/blogs/manufacturing2020/the-difference- [99] Ivan Selesnick. Slantlet transform.. 47:1304e1313. doi: 10.1109/78.757218.
between-process-control-and-management/. [100] Manajit Sengupta, Yu Xie, Anthony Lopez, Aron Habte, Galen Maclaurin,
[81] Elieser Miranda, Jorge Felipe Gaviria Fierro, Narv aez Gabriel, Luis Shelby James, The national solar radiation data base (NSRDB), 89:51e60.
Felipe Giraldo, Michael Bressan, Prediction of site-specific solar diffuse ISSN 1364-0321, https://www.sciencedirect.com/science/article/pii/
horizontal irradiance from two input variables in Colombia, 7(12):e08602. S136403211830087X.
ISSN 2405-8440, https://www.sciencedirect.com/science/article/pii/ [101] Shivam Kumar, Jong-Chyuan Tzou, Shang-Chen Wu, A multi-objective pre-
S2405844021027055. dictive energy management strategy for residential grid-connected PV-bat-
[82] Amirmohammad Moradi Sizkouhi, Mohammadreza Aghaei, Sayyed tery hybrid systems based on machine learning technique, 237:114103. ISSN
Majid Esmailifar, A deep convolutional encoder-decoder architecture for 0196-8904, https://www.sciencedirect.com/science/article/pii/
autonomous fault detection of PV plants using multi-copters, 223:217e228. S019689042100279X.
ISSN 0038-092X, https://www.sciencedirect.com/science/article/pii/ [102] Karen Simonyan, Andrew Zisserman, Very deep convolutional networks for
S0038092X21003935. large-scale image recognition, URL, http://arxiv.org/abs/1409.1556.
[83] Gabriel Narvaez, Luis Felipe Giraldo, Michael Bressan, and Andres Pantoja. [103] C. Strobl and P. Meckler. Arc faults in photovoltaic systems. In 2010 Pro-
Machine Learning for Site-Adaptation and Solar Radiation Forecasting. 167: ceedings of the 56th IEEE Holm Conference on Electrical Contacts, pages
333e342. ISSN 09601481. doi: 10.1016/j.renene.2020.11.089. URL https:// 1e7. doi: 10.1109/HOLM.2010.5619538.ISSN:2158-9992.
linkinghub.elsevier.com/retrieve/pii/S0960148120318395. [104] X. Sun and J. Qiu. Two-stage volt/var control in active distribution networks
[84] OLah, Understanding LSTM networkds. https://web.stanford.edu/class/ with multi-agent deep reinforcement learning method. pages 1e1. ISSN
cs379c/archive/2018/class_messages_listing/content/Artificial_Neural_ 1949-3061. doi: 10.1109/TSG.2021.3052998. Conference Name: IEEE Trans-
Network_Technology_Tutorials/OlahLSTM-NEURAL-NETWORK-TUTORIAL- actions on Smart Grid.
15.pdf. [105] Vishnu Suresh, Przemyslaw Janik, Jacek Rezmer, and Zbigniew Leonowicz.
[85] OpenAI. Gym, A toolkit for developing and comparing reinforcement Forecasting Solar PV Output Using Convolutional Neural Networks with a
learning algorithms, URL, https://gym.openai.com. Sliding Window Algorithm. page 15.
[86] K. Pahwa, M. Sharma, M. S. Saggu, and A. Kumar Mandpura. Performance [106] Richard S. Sutton and Andrew G. Barto. Reinforcement learning: an Intro-
evaluation of machine learning techniques for fault detection and classifi- duction. Adaptive Computation and Machine Learning Series. The MIT Press,
cation in PV array systems. In 2020 7th International Conference on Signal second ed. edition. ISBN 978-0-262-03924-6.
Processing and Integrated Networks (SPIN), pages 791e796. doi: 10.1109/ [107] Christian Szegedy, Vanhoucke Vincent, Sergey Ioffe, Jonathon Shlens,
SPIN48934.2020.9071223.ISSN:2688-769X. Zbigniew Wojna, Rethinking the inception architecture for computer vision,
[87] Cheng Pan, Jie Tan, Dandan Feng, and Yi Li. Very short-term solar generation URL, http://arxiv.org/abs/1512.00567.
forecasting based on LSTM with temporal attention mechanism. In 2019 IEEE [108] Phil Tabor, philtabor/actor-critic-methods-paper-to-code. https://github.
5th International Conference on Computer and Communications (ICCC), com/philtabor/Actor-Critic-Methods-Paper-To-Code original-date: 2020-
pages 267e271. IEEE. ISBN 978-1-72814-743-7. doi: 10.1109/ 05-26T23:40:09Z.
ICCC47050.2019.9064298. URL https://ieeexplore.ieee.org/document/ [109] Satoshi Takayama, Atsushi Ishigame, Autonomous decentralized control of
9064298/. distribution network voltage using reinforcement learning**this study was
[88] PecanStreet. PecanStreet inc. dataport load data, URL, https://www. supported by research grant from Japan power academy, 51 (28):209e214.
pecanstreet.org/dataport/. ISSN 2405-8963, https://www.sciencedirect.com/science/article/pii/
[89] Sophie Pelland, Jan Remund, Jan Kleissl, Takashi Oozeki, and Karel De Bra- S2405896318334220.
bandere. Photovoltaic and Solar Forecasting: State of the Art. ISBN 978-3- [110] Giuseppe Marco Tina, Cristina Ventura, Sergio Ferlito, and Saverio De Vito. A
906042-13-8. state-of-art-review on machine-learning based methods for PV. 11(16):
[90] Bao Chau Phan, Ying-Chih Lai, Chin E. Lin, A deep reinforcement learning- 7550. ISSN 2076-3417. doi: 10.3390/app11167550. URL https://www.mdpi.
based MPPT control for PV systems under partial shading condition, com/2076-3417/11/16/7550. Number: 16 Publisher: Multidisciplinary Digi-
20(11):3039, https://www.mdpi.com/1424-8220/20/11/3039. Number: 11 tal Publishing Institute..
Publisher: Multidisciplinary Digital Publishing Institute. [111] Mario Tovar, Esolmet, URL, https://github.com/mariotovarrosas/
[91] Roberto Pierdicca, Marina Paolanti, Andrea Felicetti, Fabio Piccinini, ESOLMET2019. original-date: 2020-09-21T22:54:55Z.
Primo Zingaretti, Automatic faults detection of photovoltaic farms: solAIr, a [112] Mario Tovar, Miguel Robles, and Felipe Rashid. PV power prediction, using
deep learning-based system for thermal images, 13(24):6496, https://www. CNN-LSTM hybrid neural network model. case of study: Temixco-morelos,
mdpi.com/1996-1073/13/24/6496. Number: 24 Publisher: Multidisciplinary mexico. 13(24):6512. ISSN 1996-1073. doi: 10.3390/en13246512. URL
Digital Publishing Institute. https://www.mdpi.com/1996-1073/13/24/6512.
[92] Jesus Polo, Carlos Fernandez-Peruchena, Vasileios Salamalikis, Luis Mazorra- [113] G. Valverde and T. Van Cutsem. Model predictive control of voltages in active
Aguiar, Mathieu Turpin, Luis Martín-Pomares, Andreas Kazantzidis, distribution networks. 4(4):2152e2161. ISSN 1949-3061. doi: 10.1109/
Philippe Blanc, Remund Jan, Benchmarking on improvement and site- TSG.2013.2246199. Conf. Name: IEEE Trans. Smart Grid..
adaptation techniques for modeled solar radiation datasets, 201:469e479, . [114] Laurens van der Maaten and Geoffrey Hinton. Viualizing Data Using T-SNE.
ISSN 0038-092X, https://www.sciencedirect.com/science/article/pii/ 9: 2579e2605.
S0038092X20302784. [115] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones,
[93] Jesus Polo, Stefan Wilbert, Jose A. Ruiz-Arias, Richard Meyer, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, Attention is all you need,
Christian Gueymard, Marcel Suri, Luis Martin, Theresa Mieslinger, URL, http://arxiv.org/abs/1706.03762.
Philippe Blanc, Ian Grant, John Boland, Pierre Ineichen, Jan Remund, [116] Jan Von Appen, Martin Braun, Thomas Stetz, Konrad Diwold, and Dominik
Rodrigo Escobar, Alberto Troccoli, Manajit Sengupta, Kristian Pagh Nielsen, Geibel. Time in the Sun: the Challenge of High PV Penetration in the German
David Renne , Norbert Geuder, Tomas Cebecauer, Preliminary survey on site- Electric Grid. 11: 55e64. doi: 10.1109/MPE.2012.2234407.
adaptation techniques for satellite-derived and reanalysis solar radiation [117] P. N. Vovos, A. E. Kiprakis, A. R. Wallace, and G. P. Harrison. Centralized and
datasets, 132:25e37, https://hal-mines-paristech.archives-ouvertes.fr/hal- distributed voltage control: impact on distributed generation penetration.
01297310. 22(1): 476e483. ISSN 1558-0679. doi: 10.1109/TPWRS.2006.888982. Conf.
[94] Sleeba Paul Puthenpurakel and P. R. Subadhra. Identification and classifica- Name: IEEE Trans. Power Syst...
tion of microgrid disturbances in a hybrid distributed generation system R. Va
[118] Jose szquez-Canteli, Je ro
^me K€ampf, G. Henze, Zolt an Nagy, intelligent-
using wavelet transform. In 2016 International Conference on Next Gener- environments-lab/CityLearn, original-date: 2019-06-30T02:41:48Z, https://
ation Intelligent Systems (ICNGIS), pages 1e5. IEEE. ISBN 978-1-5090-0870- github.com/intelligent-environments-lab/CityLearn.
4. doi: 10.1109/ICNGIS.2016.7854066. URL http://ieeexplore.ieee.org/ R V
[119] Jose azquez-Canteli, Sourav Dey, Gregor Henze, and Zoltan Nagy. Cit-
document/7854066/. yLearn: Standardizing Research in Multi-Agent Reinforcement Learning for
[95] Daniel Rangel-Martinez, K. D. P. Nigam, and Luis A. Ricardez-Sandoval. Ma- Demand Response and Urban Energy Management. page 11..
chine Learning on Sustainable Energy: A Review and Outlook on Renewable [120] Shengyi Wang, Jiajun Duan, Di Shi, Chunlei Xu, Haifeng Li, Ruisheng Diao,
Energy Systems, Catalysis, Smart Grid and Energy Storage. 174:414e441. and Zhiwei Wang. A data-driven multi-agent autonomous voltage control
ISSN 0263-8762. doi: 10.1016/j.cherd.2021.08.013. URL https://www. framework using deep reinforcement learning. 35(6):4644e4654. ISSN
sciencedirect.com/science/article/pii/S0263876221003312. 0885-8950, 1558-0679. doi: 10.1109/TPWRS.2020.2990179. URL https://
[96] Alejandro Rico Espinosa, Michael Bressan, Luis Felipe Giraldo, Failure ieeexplore.ieee.org/document/9076841/.
signature classification in solar photovoltaic plants using RGB images and [121] Pete Warden and Daniel Situnayake. TinyML: Machine Learning with Ten-
convolutional neural networks, 162:249e256. ISSN 0960-1481, https:// sorFlow Lite on Arduino and Ultra-low-power Microcontrollers. ”O'Reilly
www.sciencedirect.com/science/article/pii/S0960148120312301. Media, Inc.”. ISBN 978-1-4920-5201-2. Google-Books-ID: tn3EDwAAQBAJ..
317
[122] Xu Xu, Youwei Jia, Yan Xu, Zhao Xu, Songjian Chai, and Chun Sing Lai. A [125] Ayman Youssef, Mohammed El-Telbany, Abdelhalim Zekry, The role of
multi-agent reinforcement learning-based data-driven method for home artificial intelligence in photo-voltaic systems design and control: a review,
energy management. 11(4):3201e3211. ISSN 1949-3053, 1949-3061. doi: 78:72e79. ISSN 13640321, https://linkinghub.elsevier.com/retrieve/pii/
10.1109/TSG.2020.2971427. URL https://ieeexplore.ieee.org/document/ S1364032117305555.
8981876/.. [126] Muhammad Fahad Zia, Elhoussin Elbouchikhi, and Mohamed Benbouzid.
[123] S.-M Xue and C. Liu. Line-to-line Fault Analysis and Location in a VSC-Based Microgrids Energy Management Systems: A Critical Review on Methods,
Low-Voltage DC Distribution Network. 11. doi: 10.3390/en11030536.. Solutions, and Prospects. 222:1033e1055. doi: 10.1016/
[124] Tony Yiu. Understanding random forest. URL https://towardsdatascience. j.apenergy.2018.04.103.
com/understanding-random-forest-58381e0602d2.
318

Machine Learning in Photovoltaic Systems Areview

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Machine Learning in Photovoltaic Systems Areview

Uploaded by

Copyright:

Available Formats

Renewable Energy 196 (2022) 298e318

Contents lists available at ScienceDirect

Machine learning in photovoltaic systems: A review

10.3. Case study: management . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 310

Nomenclature LSTM Long Short-Term Memory

1. Introduction Fig. 1: control, management, faults in PV systems, irradiance fore-

Fig. 1. Problem categories in PV systems in which ML has been applied to.

Articles Programming Language Experimental Results ML Techniques Used

[12,19,79] Python No ANNs

Articles Faults Studied Experimental Results Features Used Main ML Technique

[18] Line-to-line, No Current, LSTM

Articles Faults Studied Experimental Results Feature Extraction Main ML Technique

[65, 94] Positive switching No Wavelet ANN

Articles Power Forecast Algoritm Used Irradiance Forecast Algorithm Used

Fig. 6. PV power and duty cycle at G1 ¼ 1000W/m2, G2 ¼ 1000W/m2, G3 ¼ 1000W/

Fig. 5. Episode Reward for MPPT agent implemented with DQN.

Fig. 7. PV power and duty cycle at G1 ¼ 900W/m2, G2 ¼ 900W/m2, G3 ¼ 300W/m2

Fig. 8. PV power and duty cycle at G1 ¼ 1000W/m2, G2 ¼ 500W/m2, G3 ¼ 300W/m2

ðe2demand Þ*b þ rnight

where b is a weighting coefﬁcient, edemand is the electricity demand,

10.4. Case study: fault detection and PV diagnostics

1 1.80 1.03 1.34 1.25 0.96 1.27

Fig. 15. Architecture of the classiﬁer based on VGG-16.

Layer Conﬁguration Activation Function

1D Convolution ﬁlters ¼ 16, kernel ¼ 16 ReLU

Dataset R2 RMSE MAE

Training 0.837 110.5 59.86

Fig. 18. Daily forecasting of irradiance in 200 h.

Fig. 17. Confusion matrix for failure mode detection.

10.7. Case study: site adaptation

You might also like