Professional Documents
Culture Documents
Abstract—Information about climate changes is required at is a non-profit organization, which analyzes Earth’s Surface
global, regional and basin levels for a variety of purposes, Temperature records and disseminates its scientific
including the study of impact of the greenhouse gases. The investigations to shed light on the plight of global warming, by
analyses mentioned in this research relate to the observation of addressing the concerns of the skeptics. Berkeley Earth’s
trends in the temperatures of the Indian states. The research study on the surface temperatures of Earth consists of more
begins with the exposition of the ongoing analysis methodologies than 1.5 billion temperature reports. The data on global land
prevalent in exploratory analysis and prediction modeling on and ocean-and-land temperatures consists of records which
temperature data. It further develops into the proposed work, enlist statistics for the temperature recorded on the day of
where the analysis of means of the average temperatures
measurement, such as, average temperature of land,
observed across the Indian states from 1800-2013 is summarized,
which in turn is found to reveal confounding results. The
uncertainty over average land temperature, maximum and
proposed work concludes with further focused analysis of minimum temperature, land-and-ocean average temperature
geographically similar states, namely the states lying on the Indo- and its uncertainty. This facilitates an exhaustive analysis of
Gangetic plains, which reveal encouraging results, thereby data which can aid in deciphering trends across regions or
showing an occurrence of a trend. The research concludes with group of regions.
the propounding of the future scope, which includes modeling for Temperature data has been studied in the past to find the
predicting the average temperatures which can be attained over underlying trends which could account for warming. Analyses
the next few decades, which in turn would be significant for the
have been performed over the years to determine metrics such
observation of the corollaries of global warming in India.
as the highest increase in temperatures over a span of time,
Keywords—exploratory analysis; indian states; temperature; regions contributing the most to an increase in the national
ggplot2; r; rstudio; average temperature of a county, probability distribution
(normal, beta, uniform and exponential, etc.) of average
temperatures of a nation, correlation between the solar output
I. INTRODUCTION and the Earth’s surface temperatures, association between the
density and temperature of a country over the years to study
There has been a sizeable uptick in warming and the threat climate change trends, and tendencies in the average
of Climate Change is more startling than it has ever been. temperature uncertainty over the years to check for noisy data,
Scientists have performed exhaustive analyses and developed et cetera.
prediction models to ascertain the rise in temperatures and
The data extracted from Berkeley Earth was extremely
prognosticate how the trends will stand in the decades to
exhaustive, which allowed for fascinating subsets such as
come. A huge amount of data to study climate change has
those arranged by country or city, which in turn enabled an
been collected for approximately a hundred and thirty years,
interesting study on the average temperatures observed in the
with researchers having laid down the very real hazard of
states of India from 1800-2013. RStudio IDE [2], based on R
global warming using solid facts. Examination of Climate
[3] was used for the analysis and the dataset with average
Change trends entails a long-time evaluation of data, the
temperatures, and average temperature uncertainty grouped by
preparation of which is highly tricky. Collection, cleaning,
state was used to extract the data specific to India. Trends, if
preprocessing of this data is complex, which is why survey of
any, existing in temperatures were analyzed for different
this data needs to be very thorough. Mercury thermometers
states. The trends in the temperatures across the states, over
were earlier used for collection, which in turn could give way
the years, threw up some interesting results. Many states were
to dodgy results owing to any variations which could exist in
found to account for the warming trends in different years and
visit time. There was a shift to electronic thermometers in the
with varying degrees. The analysis of the current trends can
80s, which became a more reliable source of data collection.
provide for a rigorous prediction model which in turn can
This paper is based on the exploratory analysis of data predict temperature trends for the coming years depending on
extracted from Berkeley Earth[1] to evaluate trends in the available facts, figures and past patterns.
temperatures of Indian States from 1800-2013. Berkeley Earth
548
2016 2nd International Conference on Next Generation Computing Technologies (NGCT-2016)
Dehradun, India 14-16 October 2016
549
2016 2nd International Conference on Next Generation Computing Technologies (NGCT-2016)
Dehradun, India 14-16 October 2016
The Temp variable and the years were plotted using the qplot Inconsequential results were obtained from the analysis of all
function of the ggplot2[11] package, for R Studio. The scatter the states of India that was correlated with the diversity of
and line plot, Fig 5, show a visible dip in the average India, where significant change in any particular geographical
temperatures of the Indian states under consideration. This dip region caused a change in the averaging of all the
was deemed as an oddity in analysis, mostly accounted to the geographical regions taken as unity. To eradicate the chaining
huge diversity in the states of India, which range from the sub- effect of changes in different geographic systems, the research
freezing temperatures of Jammu-Kashmir, to the extremely was focused on the analysis of a particular geographically
hot conditions prevalent in Rajasthan. Further, the root-mean- similar region, namely- the states in the Indo-Gangetic region.
square value of the deviations for the period of 1800~1850
will evidently be very high as compared to the rest of the plot. The following states were decided for further focus of the
The boxplots that were obtained for the interval of 50 years analysis- Delhi, Bihar, Uttar Pradesh, Madhya Pradesh,
i.e. for years – 1800, 1850, 1900, 1950 and 2000 also revealed Haryana, Chhattisgarh, Jharkhand, and West Bengal. Filter
similar results. Discrete values of mean temperatures of the function was used for filtering out the data of the
states for the respective year have been considered for the aforementioned states from the data of all the states of India,
box-plot, which will be further analyzed to consider mean as used in previous section.
values of a period of years. This analysis visual was also
obtained in a similar manner using the qplot function of
ggplot2 package.
550
2016 2nd International Conference on Next Generation Computing Technologies (NGCT-2016)
Dehradun, India 14-16 October 2016
The pipelining of the thus obtained data from the series of This analyses allows the scope of prediction of future
functions, as done earlier, was repeated, i.e. - the data was temperature recordings which will have wide-ranging impact
filtered, grouped by, and then summarized using the functions on the way the rise in temperatures is viewed, as a disastrous
filter, group_by, and summarise. increase will need to be avoided and if the prediction model
predicts such catastrophic temperatures, then measures would
The qplot of the data obtained in a similar manner to that done need to be adopted for thwarting the occurrence of the same.
for all the states of India revealed encouraging results and a
regular trend, as shown below. Fig 7 points in the direction of
a trend in the increase of the average temperatures of the states V. FUTURE SCOPE
which belong to the Indo-Gangetic region, which share many The analysis of the states around the Indo-Gangetic plains
geographical characteristics. The root-mean-square values of reveals a steady rise in the means of the averages of the
the scatter plot around the trend-line would also be less, as the temperatures across these states, namely – Delhi, Bihar, Uttar
average temperature readings are condensed around a uniform Pradesh, Madhya Pradesh, Haryana, Chhattisgarh, Jharkhand,
trend. and West Bengal, over the years, from 1800-2013.
This trend can be mapped onto a linear regression model,
This exploratory analysis of the data of the states around the or other apposite prediction model can be developed which
Indo-Gangetic region was further improved and depicted in a will provide us with the probable average temperatures that
plot of the boxplots of the average temperatures at an interval these regions can encounter over the next few decades. This
of 40 years, for the period of 1800-2000. The box-plots will facilitate the study of temperature trends, which may
consider single values of all the occurring mean temperatures occur in the plains of India. It will help in gauging the impact
for the states in the particular year, as depicted by the abscissa. of global warming and foment countermeasures for checking
Further analyses are proposed for the future, which will the same. Further study over the same would be done for
provide comprehensive detail by accounting for mean values provisioning the average temperature range that these states
for a time span of a few decades. The work done by David A. would face during the decade of 2040-2050.
Abdo et al [12], S. Kundu et al [13] and A.Ghosh et al [14] These analyses can also be carried out over other
inspired us in the search of these trends. geographically similar areas, which will further provide
similar or improved insights, as are being observed from the
The boxplots visible in the Fig 8 also point in the same Indo-Gangetic states in India, for studying and researching
direction, as done by the scatter-line plot of the average into the information which can be provisioned from the
temperatures of the states around the Indo-Gangetic region. available data.
These plots show the potential of the development of a
suitable prediction model, which will be capable of predicting
the average temperatures of these states, which will be in
conjunction with the rise depicted in the given plots. VI. CONCLUSION
The above research studies the methodologies prevalent
for the modeling of the temperature trends, from the wrangled
temperature data and also provides analyses of the temperature
trends which were prevalent in specific geographically similar
regions of India, during the period of 1800-2013. Different
regions, owing to different geography and different
environmental conditions show different changes in
temperature data, with some regions facing the brunt of global
warming more than others. Similar analysis of the average
temperatures of all the states of India, taken as a whole, did
not allow for reasonable analysis of trends. But, the visualized
temperature data of the Indo-Gangetic regions exhibited trends
which allow the scope for modeling, which will further the
scope of this research and allow for the development of
suitable prediction model for predicting the average
temperatures for certain regions of India. Such a model could
aid in outlining the major regions where the maximum
increase in temperature may be observed, which would in turn
yield in scientific research on the specific causes and the
consequences of the said increase. Study of trends which may
occur in the future could be used as an effective tool to further
scientific literature in the domain of climate change.
551
2016 2nd International Conference on Next Generation Computing Technologies (NGCT-2016)
Dehradun, India 14-16 October 2016
552