You are on page 1of 5

International Conference on Electronics, Communication and Aerospace Technology

ICECA 2017

Soft Computing and Data mining Techniques


for Thunderstorms and Lightning Prediction:
A Survey
Kanchan Bala Dilip Kumar Choubey Sanchita Paul
Computer Science Engineering Computer Science & Engineering Computer Science Engineering
Birla Institute of Technology, Mesra Birla Institute of Technology, Mesra Birla Institute of Technology, Mesra
Ranchi, India Ranchi, India Ranchi, India
kanchanbala237@gmail.com dilipchoubey_1988@yahoo.in sanchita07@gmail.com

Abstract - Thunderstorms are fascinating and elegant phenomenon that causes lots of damages to life,
event, which occurred frequently all over the world. properties, animal and crops. Thunderstorms are
When sudden rumbling of sound associated with a bolt accompanied with lightning flashes, torrential
of lightning flashed across the sky, then thunderstorm is rainfall, high wind gusts, occasional hail and
said to be occurring. Lightning is associated with every
tornadoes [1]. Forecasting of severe thunderstorm
thunderstorm, which kills more people than hurricanes
and tornadoes. Heavy rain from thunderstorm causes and lightning is a challenge for both meteorologists
extensive loss of property and leads to flooding. and atmospheric scientists in the world because such
Accurate prediction of thunderstorm is a difficult task highly nonlinear and chaotic phenomena may incur
in weather forecasting due to its temporal extension and significant detrimental consequences on the huge part
spatial either physically or dynamically. Different of agricultural productivity.
technological and scientific researchers are carried on Moisture, unstable air, and lifting mechanism are
forecasting of thunderstorm and lightning in advance to playing an important role in the formation of
reduce damages. In this regard, there are many thunderstorm. There are many causes that lead to
different methodologies have been proposed such as
uplifting of warm and humid air such as air-solar
statistical, machine learning, numerical weather
prediction, weather research forecasting, image heating, two different air streams meet, vicinage of
processing etc. This paper focuses on the survey of the low pressure channel, etc. When humid air is
several research papers on the thunderstorm and lifted upward and cooled then the moisture in the air
lightning using soft computing and data mining is condensed and form clouds. Due to further lifting
techniques such as neural network, rough set, support of humid air, the cloud would be extended larger and
vector machine, fuzzy logic, genetic algorithm, k-means water droplets continue growing in the cloud and
clustering, k- nearest neighbor etc. This paper has been further freezes to form ice crystals. As soon as the
also enclosed the suggestions on the future research water droplets are become so heavy, they are falling
direction.
as hail. The hail acquires the negative charge due to
rubbing against the ice crystals in clouds. Thus, the
Keywords - Thunderstorm, Lightning, Forecast, Soft
Computing, Data Mining, Prediction, Neural Network, negative charges are collected at the base of the cloud
Fuzzy Logic, Weather. and positive charges are created at the top of the
cloud. These negative charges are attracted by some
I. INTRODUCTION other clouds, objects and earth. When the attraction
A thunderstorm is a series of sudden electrical extent to large, negative and positive charges are
discharge resulting from atmospheric conditions. either discharge or come together to form lightning.
Electric discharged result in sudden flashes of light Lightning is further heating and expanded the air,
and rumbling sound wave, commonly known as which produced the thunder. Thunderstorms are
thunder and lightning. The thunderstorm is a categorized based on their actual physical
mesoscale weather spectacular phenomenon with characteristic. Actually, there is a continuous
space scale varying from a few kilometers to a couple spectrum of thunderstorm type, but these can be
of 100 kilometers and time scale varying from less broadly classified into four types- single cell storms,
than an hour to several hours which occurs multicell cluster storms, multicell line storms and
seasonally. A severe thunderstorm is a natural supercell storms. Every thunderstorm poses lightning.
Lightning is generally classified into four broad

978-1-5090-5686-6/17/$31.00 ©2017 IEEE 42


International Conference on Electronics, Communication and Aerospace Technology
ICECA 2017

categories: Inter-cloud, cloud-to-cloud, cloud to In this regard, the paper has been organized as
ground, cloud to air. Cloud to ground is more follows: Literature review has been presented and
dangerous than the other types of lightning. In devoted to the future direction.
the last two decades, the prediction of thunderstorm II. LITERATURE REVIEW
and lightning is an active area of research. Yet it is There are several papers which are summarized
still a challenging work for researcher and forecasters on prediction of thunderstorm and lightning using
due to its spatial and temporal extension. Soft soft computing and data mining techniques.
computing and data mining have an aim to exploit the Summarization is done in form of a table which
tolerance of uncertainty, approximation and consists of paper title, used data set, techniques, tool,
imprecision to achieve the decision-making. Several advantages, issues, and their accuracy.
soft computing and data mining techniques are
applied in the prediction of thunderstorm and
lightning with the small scale and short-term forecast.

TABLE I. SUMMARY OF EXISTING WORKS

S.N Paper Data Set Techniques Used Tool Advantages Issues Accuracy
O reference Used
No.
1. [2]. Mean sea level pressure, Artificial Neural ----- ANN with LM algorithm Nowcasting of 76%.
wind speed, and relative Network (ANN) with predicts hourly sudden thunderstorm is not
humidity are collected Levenberg Marquardt falls of temperature as mentioned.
from the Indian (LM) algorithm. compare to the other
Meteorological learning algorithms.
Department (IMD).

2. [3]. Vertical moisture K-Nearest ----- Modified K-NN model is One of the issues is 82.02%
difference and dry Neighbor(K-NN), used as best classifier. that no satisfactory
adiabatic lapse rate at Modified K-NN and Satisfactory result has result can be
different geo-potential Multi Layer been obtained using only obtained for more
heights of 40 years data are Perceptron (MLP). five input variables. than five input
collected from the IMD. variables.

3. [4]. Convective Available Genetic Algorithm MATLA Automated forecasting Automated Brier skill
Potential Energy (CAPE), (GA), Fuzzy Logic B, WRF. scheme produces forecasting scheme score of
Relative Humidity (RH), and Logistic calibrated probabilities, showed not good 32%.
Bulk Richardson Number Regression. unambiguous and Brier resolution
(BRN) and low level consistent desired than human
vertical velocity are output. forecasters.
predictors.
4. [5]. Wind speed, convective Correlation based ----- Analysis is performed This paper has not --------
instability (or stability), Principal Component separately for morning revealed that how
conditional instability (or Analysis (PCA). and afternoon for much accurately
stability) vertical shear of thunderstorm days and predicts the
horizontal wind. fair-weather days. occurrence of a
thunderstorm.

5. [6]. Radiosonde data are used Back Propagation ----- BPN-PCA achieves more Only radiosonde 74.9%
as input and derived 52 Neural Network accurate prediction of data are used for using
atmospheric instability (BPN), Self- cumulonimbus clouds prediction purpose. BPN and
indices from these Organizing Map than BPN. 82.5%
radiosonde data. (SOM) and PCA. with
BPN-
PCA.

6. [7]. Satellite images are taken Saliency map, K- ----- Saliency map improved Lead time of 94%.
from the National Oceanic means clustering and the prediction accuracy. forecasting is not
and Atmospheric Haar wavelet mentioned.
Administration. transform.

978-1-5090-5686-6/17/$31.00 ©2017 IEEE 43


International Conference on Electronics, Communication and Aerospace Technology
ICECA 2017

7. [8]. Wind analysis, temperature Rough Set (RS), MATLA RS-SVM has been Applicable for only An
analysis, height analysis, Support Vector B 7.1, predicting the higher regional forecast. accuracy
cloud analysis, water Machine (SVM). GrADS resolution nowcasting of . of 0.71
analysis and derived 1.9 lightning and with 3
analysis and its derivative. thunderstorms hours lead
time

8. [9]. Wind shear is considered MLP. ----- MLP is an excellent Weather variables 98.34%
at four different geo- classifier for the obtained from With 10
potential heights which are nowcasting of severe RSRW flight, which to 12
obtained from rawindsonde thunderstorm using one is limited to a hours lead
during the period of upper air parameter. particular region. time.
18years from IMD.
9. [10]. Composite Constant Fuzzy logic based ----- Membership function has Large number of 40000
Altitude reflectivity PPI clustering. been initialized thunderstorms are storms are
(Plan Position Indicator) statistically which produced which tracked.
(CAPPI), Composite echo- enables the adaptation of required the
top altitudes with the algorithm to the local analysis laboriously
45dBZ(ET45) and climate. and manually.
20dBZ(ET20), CG flash
density map,
Radar-derived rainfall
accumulation, storm area,
storm flash density, storm
radar reflectivity.
10. [11]. Maximum rate of change Naïve Bayesian. ----- Horizontal and vertical Not much 65%.
in top of troposphere growth in convective accurately
emissivity, maximum rate clouds has an excellent predicted.
of change in ice cloud differentiation capability
fraction, rate of change in between severe and non-
area of the storm of top of severe thunderstorm.
troposphere emissivity,
rate of change in the area
of the storm of 0.65 μm
optical depth.
11. [12]. Sunshine hour (SSH), K-NN. ----- K-NN is simple and very Nothing is learned 72.16%
Cloud coverage (Nh), flexible classifier for from training and with a
Pressure at Freezing point prediction required large lead time
(FRZ) and dry adiabatic Learning cost of K-NN is storage size. of 12
lapse rates at different geo- zero. hours.
potential heights of the
atmosphere of 33 years
from IMD.
12. [13]. Satellite images. K-means clustering MATLA The model improves the Segmentation is 89.23%.
and Haar wavelet B efficiency to great extent performed using
transform. R2011a. as compare to the other only clustering
learning methods of technique.
MLP.
13. [14]. Low level clouds types RS. ----- The values of three Every object of 06 GMT
and record of occurrences factors certainty, discourse with some are more
of thunderstorms are coverage and strength are information preferable
collected at 00 GMT, 03 used for the decision rule associated with a for
GMT, 09 GMT and 12 with different condition decision rule. thundersto
GMT of ten years from for a particular decision. rm.
IMD.
14. [15]. PTU (Pressure, ANN-BP. C ANN is simpler and more Not feasible for 67%.
Temperature, and Languag biased for the lightning industry which is
Humidity) table data and e. days. sensible to lightning
lightning data are used as risk.
input.
15. [17]. Electromagnetic field. ANN-MLP feed ----- Modular ANN improves An electromagnetic The
forward with LM the prediction of field is used as relative
algorithm and MTLL locations CG lightning. input, which error is
model. Learning speed of ANN obtained from only less than
is improved using the one radar station. 5% and an
LM. absolute
error not

978-1-5090-5686-6/17/$31.00 ©2017 IEEE 44


International Conference on Electronics, Communication and Aerospace Technology
ICECA 2017

greater
than 1 km.
16. [18]. Data of four years from the Decision Tree, Naïve R- This paper helps in Optimization of 73% for
Lightning Imaging Sensor Bayesian, K-Means Program forecasting of future time complexity is small
(LIS) aboard the Tropical clustering. ming distribution of lightning. needed through datasets
Rainfall Measuring Languag parallel computing. and 63%
Mission (TRMM) are used. e. for larger
datasets.
17. [19]. Lightning data are Spatial Clustering ----- The model predicts Only predict the 75%.
collected from State Grid lightning every 5 linear-trend
Electric Power Research minutes. movement of
Institute (SGEPRI), China. lightning.
18. [20]. Wind, dew point, ANN-BP. MATLA Computational burden is Post processing is Rms error
humidity, pressure, B. reducing in achieving the needed for is 41%
temperature, cloud height converged solution. achieving perfection with R-
and moisture difference Learning speed of ANN of developed value of
data are collected from is improved using the LM network. 0.99997.
Malasysian Meteorological algorithm.
Service and Tenaga
National Research and
Department.
19 [21]. Laps data and lightning RS, SVM with RBF MATLA The RS-SVM achieves The experiment has 71%.
positioning data are kernel. B 7.1, short time now- casting been conducted on
collected from the Jiangxi GrADS and high resolution data of one year.
. Meteorological 1.9. prediction of lightning.
Administration
show the prediction of thunderstorm with lead time.
[16] and [22] also analyzed and compared the Lead time is 10-12 hour or 3-6 hour. In electronic
several existed soft computing and data mining era, the large scale forecast has been unable to meet
technique on diabetes with their advantages, issues, the requirement. So, we need the development of
technique, tool used, existed work, future work. The short term and small scale forecast research have
used technique, tool has also been discussed on the practical and important significant. In this regard,
basis of following parameter i.e., advantages, issues,
several soft computing and data mining techniques
application.
have been applied for prediction and forecast. It is
III. DISSCUSSION AND FUTURE required a good technique for the predicting
DIRECTIONS thunderstorm and lightning. So, the authors present
Now days, prediction of thunderstorm and lightning the future directions of some papers, along with
is challenging task. Different research papers are existing work which are summarized in the following
included that analyzed on data set, technique, tool table.
used, advantages and disadvantages. Some papers

TABLE II. SUMMARY OF THE FUTURE WORKS OVER THE EXISTING WORK

S.N Paper Exiting Work Future Work


o. Reference
No.

1. [2]. ANN model is used to test the impact of different learning Other soft computing techniques can be applied to further
algorithm on prediction of severe thunderstorm that occurred increase the accuracy, Work can be further extended for more
over Kolkata on 3, 11 and 15 May 2009, Model results are days.
validated by observation.
2. [3]. K-NN, modified K-NN and MLP with two parameters at Other data mining and soft computing techniques with more
different geo-potential heights are used to predict severe weather parameters can be applied to increase the accuracy of
thunderstorms. the result in future work.

3. [8]. The thunderstorm is predicted using RS-SVM with the small Regional geography and climate condition can be considered to
scale and short term on historic data with ignorance of regional improve the accuracy of the model.
geography and climate condition.
4. [9]. The thunderstorm is predicted using MLP with one weather Weather data can obtain globally instead of locally.
variable; Data are obtained from RSRW flight, which is limited
to a particular region.

978-1-5090-5686-6/17/$31.00 ©2017 IEEE 45


International Conference on Electronics, Communication and Aerospace Technology
ICECA 2017

5. [10]. Statistical approach, fuzzy logic model with human-oriented New parameters and sub model can be integrated into the
linguistic inference rules, clustering based tracking algorithm existing model to improve an accuracy of the model.
and density based spatial clustering algorithm are used to classify Classification of severity can be also improved using state-of-
the severity of the storm. the- art methods made by dual-polarization radar.
6. [13]. K-means clustering and wavelet transform techniques are used to This work can be further modified using improved
predict the thunderstorms. Segmentation is performed using K- segmentation. Image can be segmented on the basis of
means clustering technique which uses only color. orientation, motion, intensity not only on a color basis.
7. [19]. The spatial clustering method is adopted to predict the lightning Prediction of a non-linear movement of thunderstorm cloud can
motion by using historical and real time lightning monitoring be included in future work. Another characteristic of lightning
data of the lightning location system. such as density, electricity can be predicted.

8 [21]. RS, SVM and RS-SVM model are used to forecast thunderstorm Special model can be built for certain regional and amount of
with high resolution and short time nowcasting using real data of experimental data can be increased.
one year.

For the future research work, the authors suggest to [11] John L. Cintineo, Michael J. Pavolonis, Justine M. Sieglaff
and Daniel T. Lindsey, “Probabilistic nowcasting of severe
develop an expert system of prediction of
convection,” National Weather Association Annual Meeting,
thunderstorm and lightning which will provide good Madison, WI, 2012.
prediction accuracy. [12] Dyuti Chatterjee and Himadri Chakrabarty, “Application of
machine learning technique to predict severe thunderstorm
References using upper air data,” International Journal of Scientific and
[1] Sutapa Chaudhari, “A probe for consistency in CAPE and Engineering Research, vol. 6, no. 7, pp. 1527-1530, July
CINE during the prevalence of severe thunderstorms: 2015.
Statistical-Fuzzy Coupled approach,” Atmospheric and [13] Kishor Kumar Reddy C, Anisha P R, and Narasimha Prasad L
Climate Science, pp. 197-205, 2011. V, “Detection of thunderstorms using data mining and image
[2] Litta A. J, Sumam Mary Idicula, and C. Naveen Francis, processing,” pp. 226-231, 2014.
“Artificial neural network for the prediction of thunderstorms [14] Sutapa Chaudhuri, “Preferred type of cloud in the genesis of
over Kolkata,” International Journal of Computer severe thunderstorms- A soft computing approach,”
Applications, vol. 50, no. 11 pp. 50-55, July 2012. Atmospheric Research 88, pp. 149-156, 2008.
[3] H. Chakrabarty, C. A. Murthy, and A. D. Gupta, “Application [15] Leong Yeng Weng, Jamaludin Bin Omar, Yap Keem Siah,
of pattern recognition techniques to predict severe Syed Khaleel Ahmed, Izham Bin Zainal Abidin and
thunderstorms,” International Journal of Computer Theory and Noradlina Abdullah, “Lightning forecasting using ANN-BP
Engineering, vol. 5, no. 6, pp. 850-855, Dec. 2013. & radiosonde,”IEE International Conference on Intelligent
[4] Christopher J. Hanlon, George S. Young, Johannes Verlinde, Computing and Cognitive Informatics, 2010.
Arthur A. Small, and Satajit Bose, “Probabilistic forecasting [16] Dilip Kumar Choubey and Sanchita. Paul, “Classification
for isolated thunderstorms using a genetic algorithm: The DC3 techniques for diagnosis of diabetes disease: A review,”
campaign,” Journal of Geophysical Research: Atmospheres, International Journal of Biomedical Engineering and
vol. 119, pp. 65-74, 2014. Technology (IJBET), Inderscience, Vol. 21, No. 1, pp. 15-39,
[5] S. Ghosh, P.K. Sen, and U.K. De, “Identification of significant 2016.
parameters for the prediction of pre-monsoon thunderstorms at [17] S. A. Emamghoreishi, R Moini and S. H. H. Sadeghi, “A
Calcutta, India,” International Journal of Climatology, pp. 673- modular neural network approach for locating cloud-to-
681, 1999. ground lightning strokes,” IEEE, 2001.
[6] Agie Wandala Putra and Chidehanok Lursinsap, [18] Aimee Booysens and Serestina Viriri, “Detection of lightning
“Cumulonimbus prediction using artificial neural network pattern changes using machine learning algorithms,” In
back propagation with radiosonde indeces,” pp. 153-165, Proceeding of International Conference on Communications,
2014. Signal Processing and Computers, 2014.
[7] Rasika P. Gawande, and Namrata D. Ghuse, “An efficient [19] Guo Juntian, Gu Shanqiang and Feng Wanxing. “A lightning
approach towards thunderstorm detection using saliency map,” motion prediction technology based on spatial clustering
International Journal of Computer Science and Information method,” In Proceeding of International Conference on
Technologies, vol. 6(3), pp.2429-2434, 2015. Lightning, IEEE, 2011.
[8] Liu Ping, Qiu Tao-rong and Lin Yu-yuan, “The study on the [20] Dalina Johari, Titik Khawa Abdul Rahman and Ismail
model of thunderstorm forecast based on RS-SVM,” Journal of Musirin, “Artificial neural network based technique for
Convergence Information Technology, vol. 8, no. 10, pp. 66- lightning prediction,” In Proceeding of Student Conference
74, May 2013. on Research and Development (SCORED), IEEE, Dec. 2007.
[9] Himadri Chakrabarty, C. A. Murthy, Sonia Bhattacharya and [21] Taorong Qiu, Shanshan Zhang, Hou Zhou,, Xiaoming Bai,
Ashis Das Gupta, “Application of artificial neural network to and Ping Liu, “Application study of machine learning in
predict squall-thunderstorms using RAWIND data,” lightning forecasting,” Information Technology Journal,
International Journal of Scientific & Engineering Research, 12(21), 6031-6037, 2013.
vol. 4 no. 5, pp.1313-1318, May 2013. [22] Dilip Kumar Choubey, Sanchita Paul and Joy Bhattacharjee,
[10] Pekka J. Rossi, Vesa Hasu, Jarmo Koistinen, Dmitri Moisseev, “Soft computing approaches for diabetes disease diagnosis: A
Antti Makela, and Elena Saltikoff. “Analysis of a statistically survey”, International Journal of Applied Engineering
initialized fuzzy logic scheme for classifying the severity of Research (IJAER), RIP, Vol. 9, pp. 11715-11726, 2014.
convective storms in Finland,” Royal Meteorological Society,
pp. 656-674, 2014.

978-1-5090-5686-6/17/$31.00 ©2017 IEEE 46

You might also like