You are on page 1of 6

2019 8th International Conference on Modern Power Systems (MPS)

Monitoring Solar Panels using Machine Learning


Techniques
Cristian-Győző Haba
Department of Electrical Engineering
"Gheorghe Asachi" Technical University of Iaşi
Iaşi, Romania
cghaba@tuiasi.ro

Abstract— Increasing photovoltaic (PV) park installations, in Compared to other systems used for generation of
conjunction with the development of wind farms and other electricity, solar panels are designed to operate in outside
renewable energy systems, is a safe way to reduce carbon dioxide environments, thus being exposed to weather phenomena
emissions that are accused of having led to significant climate ranging from moderate to the most extreme. The result of such
change in recent years. Photovoltaic cells must operate in open exposure can be the panel damaging or in a better case, a
space in order to capture solar radiation with the disadvantage of reduction in the efficiency of the panel.
being continuously exposed to meteorological phenomena. The
effects of some of these phenomena can lead to degradation of Some of the situations that result in reduced operation or
photovoltaic panel efficiency or operation. In this paper we efficiency of the solar panels are due to interposition of matter
propose a method of monitoring photovoltaic panels for the between the solar cells and solar rays coming from sun. Fig. 1
identification of possible degradation of operation based on presents the cases where solar panels operation is affected by
machine learning (ML) techniques. The development of the sand and mud respectively.
model and its validation is based on data collected both from the
panels' photovoltaic control system and weather data collected Restoring good condition for panel operation implies a
from Internet. maintenance procedure aiming to remove the obstructing
matter. Some systems were considered to be added to the solar
Keywords—photovoltaic systems; downgraded operation panels in order to perform the cleaning operation. In [1] a high
detection; machine learning; binary classification pressure pump system is used to create air jets to remove
impurities, while [2] presents a solution using a transparent
I. INTRODUCTION electrodynamic screen.
Recent years have shown a significant increase in interest The sooner the downgraded operation is detected, the
in the installation of renewable energy systems, some of which sooner the maintenance procedures can be put in place.
aim to replace systems using older technologies and considered
Such situation can be easily identified by doing a visual
less environmentally friendly.
inspection of the solar panel. If we are talking about large solar
Systems based on photovoltaic panels are becoming more panel parks, that are located in a far and not easy reaching area,
and more interesting as technology development has succeeded this visual inspection is less easy to perform. Using the
in further increasing their conversion efficiency. information local to the photovoltaic park, or better, from each
solar panel, the identification of downgraded condition can be
done more quickly, more efficiently and automatically.
In order to implement such functionality, we propose the
development of a system that integrates information flows that
can be used both to monitor the operation of the systems within
the photovoltaic park as well as to the proper functioning and
integrity of the system.
An important factor in achieving a high efficiency of
conversion of solar energy into electrical energy is to conserve
the solar panel's optical properties over its operation period.
This takes into account preserving both the glass protective
layer characteristics and those in the space between the solar
radiation source and the surface of the panel. In [3] is presented
a study that takes into account the effects that the marine
environment can have on the optical characteristics of solar
panels. Due to air, water and impurities of marine origin, the
a) b) c)
protective glass can deteriorate, change colour or degrade,
Fig. 1. Solar panel a) clean, covered with b) sand and, c) mud. resulting in reduced solar cell conversion efficiency.

978-1-7281-0750-9/19/$31.00 ©2019 IEEE

Authorized licensed use limited to: UNIVERSIDADE DE SAO PAULO. Downloaded on October 04,2021 at 21:41:00 UTC from IEEE Xplore. Restrictions apply.
a) b) c)
Fig. 2. Solar panels covered with snow a) 100%, b) ≈50% and, c) ≈33%.
Applying machine learning techniques is widespread in A use case of machine learning to detect degraded solar
applications that target photovoltaic systems. A technique for panels due to breaks in panels, due to snow or weeds is
modelling PV panels based on feed-forward neural network in presented in [11]. The model is trained with the "tensor flow"
presented in [4]. In [5] is presented a study which is using technique using current and voltage data taken from the
Gaussian process to determine panel maximum power point measuring points of the series strings (solar panels connected in
(MPP) as a result of the variation of the environmental series) and at the junction points of several such strings.
parameters. Using neuronal network based (ANN) algorithms,
A study presented in [12] shows that the state of solar
in [6] the Daily Mean Solar Power is predicted and the method
panels (clean or dusty) in a photovoltaic park can be
is compared to the results obtained using standard regression
determined with a given confidence interval by using statistical
algorithms. The results show that the ANN perform
methods and Maximum Power data taken from a single solar
considerably better than standard regression ones. A special
panel within the park. Another study [13] tries to identify
case is described in [7] where solar panels are used to power a
defective panels by using binary classification based on texture
racing car and where machine learning techniques are used to
feature extraction from thermography images of panels. The
predict solar irradiance that will be used to select the best
method achieved high model accuracy of more than 98 % and
strategy for powering and controlling the car for winning the
proved high precision and sensitivity values for a large set of
race.
configuration. A different case is the one presented in [14]
The identification of patterns of solar energy utilization for which refers to Solar-Powered Wireless Mesh Networks which
a better planning of consumption in the distribution system can attempts to detect malfunctions and diagnose panels that make
be done as shown in [8]. This study is different from other up the mesh networks. For this purpose, the Knowledge
studies, as it is considering photovoltaic systems for domestic Discovery in Databases (KDD) methodology was used. The
use, located in isolated areas and not part of photovoltaic parks. problem being defined as a classification, several classification
The machine learning algorithms used in the method are based methods were tested such as Naive Bayes, Support Vector
on clustering, looking to identify the number and types of Machine (SVM), Decision Table, k-Nearest Neighbors (k-NN)
appliances used in house for a certain period of time. The study and C4.5, resulting that C4.5 and SVM algorithms are the best
shows that identifying the users’ profiles using these machine for predictions.
learning techniques can help distribution companies to make
probabilistic load forecast in order to increase the efficiency of II. DESCRIPTION OF PHOTOVOLTAIC SYSTEM
their activity.
The monitoring method described in the present paper is
The machine learning-based identification methods developed using the ENERED photovoltaic platform created as
presented in [9] attempt to classify the events that may occur in part of the European founded ENERED project [15]. The
the distribution networks to which photovoltaic systems are platform was installed near the Faculty of Electrical
connected and whose behaviour can affect the stability and Engineering in Iasi which was partner in the European project.
controllability of the system. Quick identification of these The platform consists of three rows of solar panels totaling a
events allows network operators to adopt strategies that are number of 9 PV panels. The rows have 3 panels each, the
most suited for this kind of situations. distinction between the panels in different rows being the type
The realization of predictions regarding the energy of sun tracking system the panel is mounted on.
converted by the solar panels, taking into account the There are three different mounting systems, the first one,
atmospheric conditions (degree of cloudiness) is presented in for the panels in the first row being a fixed system, which has
[10]. In order to do the predictions, in addition to panel no possibility of sun tracking. The second row includes solar
conversion history, sky pictures taken from above the panel are panels that are mounted on a DEGER TOPtracker® - Single-
used. The pictures are processed and analyzed to determine the axis tracking system [16] allowing a -45º ÷45º rotation angle.
various weather conditions. In order to achieve the predictor The panels in the third row are mounted on DEGERtraker
model, convolutional neural networks are used and the study 5000NT dual axis tracking systems [17] allowing a 300º
shows that both the conversion history and the information rotation angle. The elevation angle for the fixed and the single
extracted from the images are important for the short-term axis tracking system panels is in the range 15º ÷30º while for
prediction of the solar panel output.

Authorized licensed use limited to: UNIVERSIDADE DE SAO PAULO. Downloaded on October 04,2021 at 21:41:00 UTC from IEEE Xplore. Restrictions apply.
Weather
Cloud stations
Services
Internet
Weather
stations
Gateway

Weather
Stations
Local Gateway
Server

Sensors

Solar PV Inverter Power Distribution Local


Platform Network loads

Energy Electricity
storage Utility

Fig.3. Grid connected photovoltaic system with monitoring and analyzing system.

the dual axis tracking system panels, the elevation is in the displayed on a web site organized in the following sections:
range 20º ÷90º. Status, Mode, Meters, Setup, System, Firmware.
The solar panel conversion is controlled using OutBack An application running on a local server was developed in
Power inverters and charge controllers which also provide order to read data continuously from the MATE system. The
protection and continuous monitoring of the system. The frequency of reading is 1 data read per second, but this can be
electrical energy, if not sent to the grid, can be stored in a bank easily adjusted to obtain a lower frequency. More than 35
of 14 12V- batteries. parameters are read from MATE device which are saved in
files, in csv (Comma Separated Values) format. There is one
The ENERED platform includes five inverters that control file with data from each inverter, each file containing data for
energy conversion as follows: approximate one day period (1440 rows). The files are saved in
• fixed panels are connected to one conversion system; folder with a name that identifies the day when data were
recorded.
• single axis tracking system panels are connected to one
conversion system; The system architecture is given in Fig. 3. The system is
connected to sensors mounted on the photovoltaic panels or
• each dual axis tracking system panel is connected to one other elements from the photovoltaic park, to weather stations
conversion system. located in within the solar PV park or in the nearby region and
Conversion data monitoring, the display and modification to cloud services that provide data to be used in the prediction
of control and inverter settings can be achieved using a MATE algorithm, for data storing, integration, process and display.
device [18]. The communication with the conversion system is Fig. 4 shows an aerial view of the ENERED PV platform
done using a communication and system management HUB. where the three rows with different types of panels are
Different data can be monitored and displayed on the identified.
MATE device: In [15] is presented a use case where data collected from
the PV platform is sent to cloud services in order to be
• system status (generating, inverting, charging)
processed and to be displayed using a dashboard for
• battery status (charged, charging, minimum and monitoring purposes.
maximum battery voltage values and values of different
parameters from inverters, charge controllers and III. METHODOLOGY
battery
A modular ML pipeline shown in Fig. 5 was developed for
Via the communication and system management HUB, monitoring and classification of solar PV status by using data
monitoring is also accessible in a local network and can be acquired from different sources.

Authorized licensed use limited to: UNIVERSIDADE DE SAO PAULO. Downloaded on October 04,2021 at 21:41:00 UTC from IEEE Xplore. Restrictions apply.
nearer the solar PV platform, in our case, the IAII19.
Unfortunately this weather station doesn't record solar
radiation. Therefore, data for solar radiation is collected from
station IISVLADI2 which does these recordings and has the
same measurement frequency as the IAII19 station.
The third data source consists of a sunrise and sunset
calculator that can provide the sunrise and sunset times for a
specific location and for a specific day which is available at
https://www.timeanddate.com/sun/.

Fig. 4. Aerial view of the ENERED photovoltaic platform with fixed, single
axis tracking and dual axis tracking system panels.

Data Data Data


source 1 source 2 source 3

Preprocess
data

Model
training

Evaluation Fig. 6. Location of weather stations with respect to the PV platfrom.

Prediction TABLE I. CHARACTERISTICS OF WEATHER STATIONS IN THE PV


PLATFROM REGION
Weather Stations IIAI19 IIAI20 IISVLADI2
Fig. 5. ML pipeline for classification of solar PV status. Characteristics
Measurement of Temperature, Yes Yes Yes
Humidity, Dew Point,
The pipeline contains the usual blocks used in ML systems, Precipitation, Pressure
this modularity having the advantage of being easily reused if Measurement of Wind Direction, Yes Yes Yes
new data sources are used or integrated with the initial ones. Speed and Gust
Measurement of Solar Radiation No No Yes
A. Dataset Measurement period (min) 5 15 5
Distance from PV platform (km) 1.64 2.82 4.55
In this work we relied on data coming from three sources. Weather History in Graphs and Yes Yes Yes
First source consists of data recorded from the solar PV Tables
platform presented in section II. As a second source we relied
on data coming from the weather stations in the Weather B. Preprocessing
Underground network located in the region of the PV platform A preprocessing step was performed in order to reduce the
(https://www.wunderground.com/weather/ro/iasi). The three size of the data set and to improve the reliability of the
closest to the platform are IAII19, IAII20 and IISVLADI2 recorded data. The preprocessing was used to eliminate
which are depicted in Fig 6. irrelevant time frames and measurement noise.
Data provided by the weather station included temperature,
C. Binary classification for prediction
humidity, precipitation level, solar radiation and pressure. The
characteristics of the collected data differ from station to In order to perform the training step, the data set was
station, as presented in Table I. Not all needed data can be divided in two classes, namely “covered” and “not covered”.
collected from a single station therefore, data was merged from For this purpose solar PV panels were observed during
stations IAII19 and IAAII20. More precisely, data regarding December 2018 and January 2019 and their status was
precipitation level is more relevant if the weather station is recorded. Fig. 7 depicts the variation of several parameters

Authorized licensed use limited to: UNIVERSIDADE DE SAO PAULO. Downloaded on October 04,2021 at 21:41:00 UTC from IEEE Xplore. Restrictions apply.
recorded from one of the MATE device associated with the The initial data set was split in two subsets to be used for
ENERED solar PV platform. During the observation period training and validation step respectively. Different split
there were days when the panels were covered with snow and percentages were tested in order to see which one gives the best
days when the panels were clean. From the recorded data were result.
selected data points from both categories. The points were
selected from both categories so that it included 171 points for For the training, the logistic regression classifier model was
each category. selected and linear and lbfgs solvers were used.

IV. RESULTS AND DISCUSSION


A dataset of size 342 was used for training and validation
of the model that could predict reduced operation status of
solar panels due to snow coverage. The model was trained
using different percentage values for splitting the initial set into
training and test sets. The results are given in Table II.

TABLE II. VARIATION OF SCORE WITH TRANING AND TEST SET SIZE
Train Test Score
# % # % lbfgs liblinear
307 90 35 10 0.918 0.902
290 85 52 15 0.914 0.893
273 80 69 20 0.916 0.897
256 75 86 25 0.911 0.906
239 70 103 30 0.920 0.879

From the recorded parameters, we have tested different sets


to be considered as features in the ML method. In this sets we
have included only instantaneous values read from the PV
system or a set of instantaneous values and the solar radiation
read from the weather station. The results from these tests are
given in Table III.

TABLE III. VARIATION OF SCORE WITH FEATURE SET


Feature 1 Feature 2 Feature 3 Score
Fig. 7. Solar panel parameters variation for a 7 day period. Channel B Current PV Voltage Solar 0.916
Radiation
Channel A Current Channel B Current Solar 0.846
D. Feature selection Radiation
Data from MATE devices consists of 39 parameters. These Channel A Current Channel B Current PV Voltage 0.927
Panel Output Current Charged Energy Solar 0.831
can be classified in the following categories: time stamp data Radiation
(like time and date of measurement), instantaneous values Panel Output Voltage Battery Voltage Solar 0.890
(generated current, voltage, power), accumulated values Radiation
(accumulated amperage or accumulated kilowatt-hours), Inverter Input Inverter Output Solar 0.857
system status attributes (ex. Charging, Inverting, Silent). In Voltage Voltage Radiation
addition, information from cloud services such as time of PV Voltage Panel Output Current Solar 0.838
Radiation
sunrise and sunset, temperature, pressure, solar radiation,
amount of precipitation can be added to PV system data. A
good selection of parameters to be used in creating and training The analysis of the results in Table III shows the
the prediction model is an important part of the machine importance of selecting a good set of features. Good results are
learning process. obtained both when the set includes only instantaneous values
read from the PV system and when it includes both a set of
In our study we have tested different set of parameters to be instantaneous values and the solar radiation read from the
considered as features in the ML model and the classification weather station.
variable named “covered” as the target.
A candidate for a good feature would be the information
coming from the weather stations regarding the precipitation.
E. Training and Validation
Information from the considered weather stations is an
Training and validation of the model was performed using accumulated value which is reset at the end of each day.
Python and machine learning algorithms implemented in the Therefore, though the precipitation (snow) has an accumulated
sklearn library [19]. The development environment used to value of, for example, 3 cm in a day, at the beginning of the
code the ML algorithm was the Python Notebook.

Authorized licensed use limited to: UNIVERSIDADE DE SAO PAULO. Downloaded on October 04,2021 at 21:41:00 UTC from IEEE Xplore. Restrictions apply.
next day the level will be zero (though the accumulated snow International Conference on Electronics, Computers and Artificial
didn't suddenly melt away at the midnight). Intelligence (ECAI), 2016, pp. 1–6.
[5] N. N. B. Ulapane and S. G. Abeyratne, “Gaussian process for learning
The main purpose of this research was make the initial solar panel maximum power point characteristics as functions of
development of the method for monitoring PV for the environmental conditions,” in 2014 9th IEEE Conference on Industrial
identification of possible degradation of operation based on Electronics and Applications, 2014, pp. 1756–1761.
machine learning (ML) techniques. The results are affected by [6] F. Jawaid and K. NazirJunejo, “Predicting daily mean solar power using
machine learning regression techniques,” in 2016 Sixth International
the fact that for the moment the PV park is not working in an Conference on Innovative Computing Technology (INTECH), 2016, pp.
operational mode closer to a more real situation. At present, the 355–360.
load is small and connected only during the night. The PV [7] X. Shao et al., “Solar irradiance forecasting by machine learning for
park, though having the capability, is not connected to the grid. solar car races,” in 2016 IEEE International Conference on Big Data
A new set of tests are designed in order to validate the method (Big Data), 2016, pp. 2209–2216.
with the park operating closer to a real situation. [8] V. Mehra, R. Ram, and C. Vergara, “A novel application of machine
learning techniques for activity-based load disaggregation in rural off-
grid, isolated solar systems,” in 2016 IEEE Global Humanitarian
V. CONCLUSIONS AND FURTHER WORK Technology Conference (GHTC), 2016, pp. 372–378.
This work presented a proposal for a machine learning [9] Y. Tao, M. Zhang, and M. Parsons, “Deep learning in photovoltaic
method for monitoring and identifying the situation when solar penetration classification,” in 2017 IEEE Power & Energy Society
panels in a PV platform are in a reduced operation status due to General Meeting, 2017, pp. 1–5.
coverage of solar panels with snow, sand, mud or other optical [10] Y. Sun, V. Venugopal, and A. R. Brandt, “Convolutional Neural
Network for Short-term Solar Panel Output Prediction,” in 2018 IEEE
obstructing materials. Early identification of these situations 7th World Conference on Photovoltaic Energy Conversion (WCPEC) (A
can trigger rapid maintenance activities to bring the operation Joint Conference of 45th IEEE PVSC, 28th PVSEC & 34th EU PVSEC),
parameters to the normal values. 2018, pp. 2357–2361.
[11] T. Babasaki and Y. Higuchi, “Using PV string data to diagnose failure of
Though the tests and results are presented for the case when solar panels in a solar power plant,” in 2018 IEEE International
solar panels are covered with snow, the method can be Telecommunications Energy Conference (INTELEC), 2018, pp. 1–4.
extended to be used in the other cases. Additional studies will [12] M. Catelani, L. Ciani, L. Cristaldi, M. Faifer, M. Lazzaroni, and M.
seek to test other combinations of electrical and weather Rossi, “Characterization of photovoltaic panels: The effects of dust,” in
parameters to be used as features in the ML method. 2012 IEEE International Instrumentation and Measurement Technology
Conference Proceedings, 2012, pp. 1–4.
As these special conditions of reduced operation status can [13] K. Niazi, W. Akhtar, H. A. Khan, S. Sohaib, and A. K. Nasir, “Binary
occur only in certain periods of the year (ex. snowfalls are Classification of Defective Solar PV Modules Using Thermography,” in
restricted to the winter season) or in certain geographical 2018 IEEE 7th World Conference on Photovoltaic Energy Conversion
regions (sand storms specific to regions in proximity of (WCPEC) (A Joint Conference of 45th IEEE PVSC, 28th PVSEC & 34th
deserts), methods to simulate these situations will be studied in EU PVSEC), 2018, pp. 753–757.
order to improve the quality of prediction model. [14] V. C. Ferreira, R. C. Carrano, J. O. Silva, C. V. N. Albuquerque, D. C.
Muchaluat-Saade, and D. Passos, “Fault detection and diagnosis for
solar-powered Wireless Mesh Networks using machine learning,” in
REFERENCES 2017 IFIP/IEEE Symposium on Integrated Network and Service
Management (IM), 2017, pp. 456–462.
[15] C.-G. Haba, “Monitoring photovoltaic parks for damage prevention and
[1] X. Zhang et al., “Solar photovoltaic array surface cleaning device and optimal operation,” in 2017 International Conference on
control means,” in 2016 Chinese Control and Decision Conference Electromechanical and Power Systems (SIELMEN), 2017.
(CCDC), 2016, pp. 5982–5983.
[16] Deger Energie, “DEGER TOPtracker® 40NT Single Axis Tracking
[2] M. K. Mazumder et al., “Environmental degradation of the optical System, Technical Data.” 2013.
surface of PV modules and solar mirrors by soiling and high RH and
mitigation methods for minimizing energy yield losses,” in 2015 IEEE [17] Deger Energie, “DEGERtraker 5000NT Dual Axis Tracking System,
42nd Photovoltaic Specialist Conference (PVSC), 2015, pp. 1–6. Data Sheet.” 2011.
[3] C. Yuan, C. Dong, L. Zhao, and X. Yan, “Marine environmental damage [18] Outback Power and Technologies, “MATE3 System Display and
effects of solar cell panel,” in 2010 Prognostics and System Health Controller, Owner’s Manual.” 2011.
Management Conference, 2010, pp. 1–5. [19] F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” J.
[4] B. Enache, F. Bîrleanu, and M. RăduŢ, “Modeling a PV panel using the Mach. Learn. Res., vol. 12, pp. 2825–2830, 2011.
manufacturer data and a hybrid adaptive method,” in 2016 8th

Authorized licensed use limited to: UNIVERSIDADE DE SAO PAULO. Downloaded on October 04,2021 at 21:41:00 UTC from IEEE Xplore. Restrictions apply.

You might also like