Professional Documents
Culture Documents
REVIEW OF LITERATURE
A review of the available literature relevant to the scope of the study has been
presented in this chapter with a view to survey the various methodologies employed
by the researchers. This chapter is divided under the following sections:
Leuthold et al. (1970) forecasted daily hog prices and daily quantities
supplied by using several alternative techniques. A distinction between econometric
and the Box Jenkins models was made. It was stated that the former identified and
measured both economic and non-economic variables affecting price and quantity,
while the latter identified the stochastic components. The models were tested using
Theils ‘U’ coefficient and the authors concluded that the econometric models yielded
slightly superior forecasts. Finally, it was concluded that although better forecasts
would be obtained by econometric models yet stochastic models were less prone to
error and were less expensive.
concluded that the simple ARMA models were relatively more robust with respect to
post sample predictions than the complex econometric models. If the mean square
error were an appropriate measure of loss, an unweighted assessment clearly
indicated that a decision maker would have been better off relying simply on the
ARIMA predictions in the post sample period.
Chatfield and Protharo (1973) observed that the Box Jenkins procedure was
not suitable for the sale forecasts with a multiplicative seasonal component. In this
analysis, monthly data on sales of a company was used. The adequacy of the model
was tested using Box-Pierce Test.
Govindan (1974) used Box Jenkins model to analyze wholesale price indices
of rice, wheat, jowar and gram. The short term forecasts were found to give good
results while the same was not true of long term forecasts. Janus quotients of the
forecasts showed that the model gave good results.
Chatfield (1977) observed that the Box Jenkins approach, being a valuable
addition in the forecast tool bag, gave a deeper understanding of time series behavior.
Even though it was found to be more expensive yet the accuracy justified the cost.
~11~
Review of Literature
Chengappa (1980) applied the Box Jenkins model to forecast poor sale and
export auction prices of coffee. The ARIMA seasonal model was applied by using
monthly datadue to the distinct seasonal variation in prices. The poor sale price
forecasts were found to be accurate when compared to forecast of export prices. This
was attributed to a possible lack of stationarity of the data. Hence adoption of
differencing procedure or a transformation to make the data stationary was found
necessary for a better estimate of export prices.
Hillmer and Tiao (1982) examined three of the ARIMA models commonly
fitted to economic time series and showed that certain restrictions must be placed on
the range of parameter values for such a decomposition to exist.
Achoth (1985) analyzed the supply, price and trade of Indian tea by fitting
ARIMA models to data on prices and production. The moving average models were
found to be most suitable. Among the price series a particular month’s price was not
related to the price of the immediate previous month but significantly related to the
price of same month in previous years. However, the production in a particular month
was related both to production of the previous month as well as to the production of
same month in previous years. The forecasts yielded reasonably good results as
judged from the tests of their efficiency. The forecasts of prices were superior when
compared to the forecasts of quantities, which was attributed to the highly structured
pattern of price behaviour.
~12~
Review of Literature
Dooley and Lenihan (2005) analyzed the ability of two time series
forecasting techniques to predict future lead and zinc prices. With regards to zinc,
~13~
Review of Literature
there was no conclusive evidence to suggest that one model was superior to the other
in its forecasting ability. In the case of lead, however, it was found that forecasting
cash prices using ARIMA modeling led to results that had superior forecasting power
in five out of eight cases and hence difficult to offer an outright conclusion regarding
which forecasting method developed in this paper produced a better result. However,
on balance one would had to acknowledge that out of a total of sixteen cases, in nine
of these ARIMA price forecasts provided a superior result to lagged forward price
models.
Batchelor et al. (2007) tested the performance of popular time series models
in predicting spot and forward rates on major seaborne freight routes investigate the
performance of alternative univariate and bivariate linear time-series models in
generating short-term forecasts of spot freight rates in the international dry bulk
shipping market they found that in predicting forward rates ARIMA and VAR models
forecasted better.
Bharathi (2009) used the Box-Jenkins ARIMA model to forecast the monthly
arrivals and prices of mulberry silk cocoons in the two markets.
~14~
Review of Literature
Valipour (2012) forecasted the inflow of Dez dam reservoir by using ARMA
and ARIMA models while increasing the number of parameters in order to increase
the forecast accuracy to four parameters and comparing them. In ARMA and ARIMA
models, the polynomial was derived respectively with four and six parameters to
forecast the inflow. By comparing root mean square error of the model, it was
determined that ARIMA model can forecast inflow to the Dez reservoir from 12
months ago with lower error than the ARMA model.
Kumari et al. (2014) studied rice yield prediction in India using Auto
regressive integrated moving average approach and out of different eleven ARIMA
model, ARIMA(1,1,1) model was found to be the best for their study.
Deluyker et. al. (1987) analyzed on modeling daily milk yield in Holstein
cows using time series analysis. Time series analysis of milk yields of cows milked 3
times daily was carried out on 513 partial or complete lactation yield records. It was
found that the exponential smoothing function was most appropriate for the modeling
of individual milking and daily yield data. Model parameters were influenced by
parity, stage of lactation, occurrence of missed milkings and treatment for diseases.
An examination of the residual variances showed that the model to forecast daily total
yield performed as well as the model to forecast individual-milking yield.
~15~
Review of Literature
Sisak (1989) worked on the principle of adaptive models of time series with
regard to short term forecasting and the possibilities for application to cost planning.
An adaptive model for the exponential smoothing of time series data was used to
determine short term forecasts in the development of production costs for a farm
forestry enterprise in Czechoslovakia. The results obtained were compared to those
derived from an extrapolation of regression estimates. It was found that it provided
better quality of forecasts for farm planning/budgeting than the regression model
forecasts.
Manurung et. al. (1991) analysed on forecasting of oil palm hectarage and
the need for seed in the second long term development plan. Forecasts of oil palm
hectarage in Indonesia over the period of the second long term development plan
(1994-2018) were made using the double exponential smoothing method. The
average forecast growth rate was 3.24 percent per year.
~16~
Review of Literature
Exponential Smoothing models in forecasting. It was also found that forecasting the
first differences of tourist arrivals performs worse than forecasting its various levels.
Gajendra and Bhogal (2006) analyzed the food security situation in the state
of Gujarat, India. The study considered the exponential smoothing and moving
averages for making projections of food grains. The estimates of food grain
production and requirement indicated that the overall cereals and pulses requirement
would continue to be in deficit in both periods.
~17~
Review of Literature
basic structural model, leading to better forecasts from the exponential smoothing
model when there was a rapidly changing seasonal pattern.
Eva and Oskar (2012) described a relatively simple and versatile technique
for forecasting time series data i.e. simple exponential smoothing. The procedure
gave heaviest weight to more recent observations and smaller weight to observations
in the more distant past. The accuracy of the SES method strongly depended on the
optimal value of the smoothing constant a. To determine the optimal a value in the
paper was used a traditional optimization method based on the lowest mean absolute
error (MAE), mean absolute percentage error (MAPE) and root mean square error
(RMSE).
~18~
Review of Literature
McCulloch and Pitts (1943) developed the first computing machines that
intend to simulate the structure of the biological nervous system and could perform
logic functions, which were used to transmit information from one neuron to another.
This eventually led to the development of binary probit model. According to this
model, the neural unit could either switch on or off depending on whether the
function was activated or not.
Werbos (1974) published the back propagation learning method. The back
propagation technique enabled the determination of parameter values for which the
error was minimized. Prior to the introduction of back-propagation method, it was
difficult to determine multiple parameter values.
~19~
Review of Literature
computing and physics. Two years later the Boltzmann machine was invented. The
neural network utilized a stochastic learning algorithm based on properties of the
Boltzmann distribution.
~20~
Review of Literature
functional MLP was statistically well defined. They finally showed on simulated and
real world data that the proposed model performed in a very satisfactory way.
Kumar and Walia (2006) carried out work on application of Artificial Neural
Networks in finance for cash forecasting. They presented two neural network models
for cash forecasting of a bank branch. One was daily model – taking the parameter
values for a day as input to forecast cash requirement for the next day and the other
was weekly model, which takes the withdrawal affecting input patterns of a week to
predict cash requirement for the next week. The system performed better than other
cash forecasting systems.
Zhang (2007) aimed at certain points that were to: 1) point out common
pitfalls and misuses in the neural network research; 2) draw attention to relevant
literature on important issues; and 3) suggest possible remedies and guidelines for
practical applications. The main message was great care must be taken in using ANNs
for research and data analysis.
Halbert (2008) reviewed concepts and analytical results from the literatures
of mathematical statistics, econometrics, systems identification, and optimization
theory relevant to the analysis of learning in artificial neural networks. He focused
primarily on learning procedures for feed forward networks and suggested some
potentially useful new training methods for artificial neural networks.
~21~
Review of Literature
module for the neural network method. The results indicated that neural networks and
conventional algorithmic methods should not be in competition but complement each
other for the development of affect recognition systems.
Sang (2009) proposed to increase the number of output nodes per each class
for performance improvement of MLPs. Also, simulations of 50 isolated-word
recognition showed the effectiveness of proposed method.
Hippert and Taylor (2010) said that Neural networks (NNs) had frequently
been proposed for short-term load forecasting (STLF), because of their capabilities
for nonlinear modeling of large multivariate datasets. The family of NN models
known as multilayer perceptrons (MLPs) were probably the most frequently used,
since they had been shown to be universal approximators of functions and could be
used to model the function that relates the electric load to its exogenous variables.
~22~
Review of Literature
reviewed relevant literature and attempt to provide a set of insights for selecting the
appropriate approach.
Sang (2012) proposed a new error function, in order to improve the error
back-propagation algorithm for the classification of imbalanced data sets. This
method was compared with the two-phase, threshold-moving, and target node
methods through simulations in a mammography data set and the proposed method
attained the best results.
~23~
Review of Literature
Kumari et al. (2013) developed a model to forecast the productivity and pod
damage by Helicoverpa armigera using artificial neural network model in pigeonpea
(Cajanus Cajan). Sigmoid and linear functions were used as activation function
hidden and output nodes respectively. By using Random Data Division Process, the
data were divided into three mutually exclusive sets (Training, Validation and
Testing) having 70% data in training, 15 % in validation and 15% in testing sets. The
performance of the proposed network when trained with Levenberg-Marquardt back
propagation algorithm was assessed by its Root Mean Squared Error (RMSE) values
along with multiple correlation coefficients (R) between observed and predicted
outputs.
Kumari et al. (2014) presented time series forecasting of losses due to pod
borer, pod fly and productivity of pigeonpea (Cajanus cajan) for North West Plain
Zone (NWPZ) by using artificial neural network (ANN). The performance of the
~24~
Review of Literature
Fisher (1924) was the first to tackle the pre-harvest forecasting problem and
assumed that the effect of change in weather variables in successive weeks would not
be an abrupt or erratic change but an orderly one that follows some mathematical law.
Using the polynomials of the fifth degree on rainfall distribution and obtained the
rainfall constants. A multiple regression equation was developed using crop yield as
dependent variable and rainfall distribution constants as independent variables. It was
found that wheat crop yield was significantly affected by rainfall.
Davis and Harrell (1942) fitted third degree polynomials to study effect of
rainfall and average maximum temperature on corn yield at various locations from the
Great Plains to the Atlantic coast. It was found that a systematic change occurs in the
pattern of weather yield relationships from one end of the region to the other.
Kokate et al. (2000) developed a statistical model for forecasting the yield of
rice in Ratnagiri district of Maharashtra. They applied correlation analysis to know
the association of plant characters and climatological parameters with the yield. They
applied step down regression analysis technique to know dominant variables in the
study. It was found that integrating plant characteristics and climatological factors
together in the model provides better estimates for forecasting rice yield.
~25~
Review of Literature
together. Time series data on weather variables and agricultural inputs were used and
Multiple Linear Regression Analysis was applied for developing models. It was
reported that reliable forecasting yield could be obtained when both the crops were 12
weeks old i.e. about 2 month before harvest.
Sarmah and Handique (2001) developed Linear Regression Models for the
pre-harvest forecasting of winter rice yield in Jorhat district of Assam, India. It was
found that 69.10 per cent of the total variability in rice yield is due to weather
variables. The deviation of forecasted yield from that of observed yield was ranging
from 0.06 to 21.36 per cent. The best time for rice yield forecasting was observed in
the 43rd standard meteorological week.
Nain et al. (2004) developed methodology for large area yield forecast using a
crop simulation model and a discrete technology trend, and was applied to the
~26~
Review of Literature
coherent wheat yield variability zones of eastern Uttar Pradesh. The regression
coefficients were generated using 10 years’ data (1984/85–1994/95) and the
reliability of the approach were tested on a data set of 5 years’ independent data
(1995-96 to 1999-2000). The results showed that this approach could capture year to
year variability in large area wheat yield with reasonable accuracy. The Root Mean
Square Error (RMSE) between observed and predicted yield was reported as 0.098
t/ha for the mean yield of 2.072 t/ha (4.72%). The pre-harvest forecasts were made
using in-season weather data up to the end of February and climatic-normal for the
rest of the wheat-growing season, which showed good agreement with observed
wheat yields.
Kumar and Bhar (2005) used Multiple Linear Regression Model for
forecasting yield of Indian mustard (Brassica juncea L.) at Hisar district of Haryana.
They developed models for each growing phase of Mustard. It was reported that the
reliable earliest forecasting could be achieved 6 weeks after sowing and the latest
forecasting could be done 4-5 weeks before harvesting.
~27~
Review of Literature
Singh and Singh (2007) developed Multivariate Regression Model for wheat
yield using soil characteristics and management practices in Udaipur district of
Rajasthan. It was reported that available water capacity strongly affected the crop
yield. It was found that in shallow sandy-loam soils six irrigations were required to
produce the potential yield, while in deep clay-loam soils five irrigations and in deep
clay soils only four irrigations were sufficient to produce the potential yield.
~28~
Review of Literature
varieties of AHH-468 and MCU-10. The rainfall at the beginning of the season was
favorable for the yield of the crop.
Yadav and Patil (2008) studied the influences of agroclimatic indices on fruit
yield of cucumber during Kharif in Dapoli, district Ratnagiri, Maharashtra. They
obtained the relationship between fruit yield of cucumber and agro climatic indices. It
was found that early sowing of cucumber i.e. immediately after onset of monsoon
produced significantly highest fruit yield over the late sowing.
Singh et al. (2008) developed Multivariate Regression Model for maize yield
based on edaphic characters in Udaipur district of Rajasthan. It was observed that
depth of soil strongly affected the crop yield. It was also found that under variability
in rainfall and availability of irrigations the yield could be predicted well in advance
in low, medium and high management conditions.
Lobel and Burke (2010) used a perfect model approach to examine the
ability of statistical models to predict yield responses to changes in mean temperature
and precipitation, as simulated by a process based crop model. The CERES-Maize
model was first used to simulate historical maize yield variability at nearly 200 sites
in Sub-Saharan Africa, as well as the impacts of hypothetical future scenarios of 2 0 C
warming and 20% precipitation reduction. Statistical models of three types (time
series, panel, and cross-sectional models) were then trained on the simulated
historical variability and used to predict the responses to the future climate changes.
Results suggested that statistical models, as compared to CERES-Maize, represented
a useful tool for projecting future yield responses, with their usefulness higher at
broader spatial scales.
~29~
Review of Literature
Garde et al. (2012) derived Multiple Linear Regression (MLR) equations for
estimating wheat productivity for the district of Ghazipur in eastern Uttar Pradesh.
Weather indices were computed using varied weather parameters for the year 1982-
83 to 2005-06. The cross-validation of the developed forecast models were tested
their accuracy using the year 2006-07.Based on a Forecast error percentage it was
found that the forecasting model produced the most accurate forecast for 15th weekof
the crop growing season. The relationship between actual and forecast wheat yield
was highly significant being R2 varied from 0.72 to 0.89 for the different weeks.
Boken (2000) studied the time series analysis (linear trend, quadratic trend,
simple exponential smoothing, double exponential smoothing, simple moving
averaging, and double moving averaging) for forecasting wheat yield in
Saskatchewan, Canada. It was found that developed model produced more accurate
forecast using deterministic measure (i.e. Mean Squared Error, MSE).
Mathur et al. (2001) developed stock market forecasting models using Neural
Network & Multiple Regression Analysis. The models were based on a company’s
stock price movement data for a complete calendar year and the data were pre-
processed to identify the hidden patterns. It was found that Neural Network model
could provide better forecasts as compared to Multiple Regression Analysis.
~30~
Review of Literature
Ho et al. (2002) investigated suitable time series models for repairable system
failure analysis. A comparative study of the Box-Jenkins autoregressive integrated
moving average (ARIMA) model and the artificial neural network model in
predicting failures were carried out. The neural network architectures were the
multilayer feed-forward network and the recurrent neural network. Simulation results
on a set of compressor failures showed that in modeling the stochastic nature of
reliability data, both the ARIMA and the recurrent neural network models outperform
the feed-forward model; in terms of lower predictive errors and higher percentage of
correct reversal detection. However, both models performed better with short term
forecasting.
~31~
Review of Literature
Results indicated ANN model provides better accuracy in forecasting river flow than
does the ARIMA model.
~32~
Review of Literature
Pal et al. (2007) made an attempt to forecast milk production using statistical
time series modeling techniques such as double exponential smoothing and Auto-
Regressive Moving Average (ARIMA) for the study period of twenty five years
(1980-81 to 2004-05). On validation of the forecast from these models, ARIMA
model performed better than the other one.
Terzi and Onal (2012) used artificial neural networks (ANN) and developed
multiple linear regression (MLR) using the same input parameters to forecast monthly
flow for Kizilirmak River in Turkey. It was found that ANN models shows better
performance when compared to MLR models.
Mishra and Singh (2013) forecasted the price of groundnut oil in Delhi
(India) by using ANN and ARIMA methodologies. They compared forecasting
capabilities of these models with the help of Root Mean Square Error (RMSE), Mean
Square Error (MSE) and Mean Absolute Percentage Error (MAPE).
~33~
Review of Literature
Marinoni (2003) showed how smoothing effects around zero value zones can
be reduced significantly by the use of a combined ordinary-indicator Kriging
approach in this paper. The focus was mainly put on the statistics of the mechanical
parameters of soil and rock masses that go directly into the safety calculations.
Kumar and Remadevi (2006) applied the spatial techniques, Kriging for the
spatial analysis of groundwater levels. With the use of measured elevations of the
water table, experimental semivariograms were constructed that characterises the
spatial variability of the measured groundwater levels.
Reza et al. (2010) interpreted and analysed the spatial variability of soil
properties, carried out by ordinary kriging and inverse distance weighting (IDW)
methods to generate continuous sample for site-specific management. A total of 535
soil samples (0-25 cm) were collected at an interval of 2 km grid in Dhalai district of
Tripma, India. The data were interpolated by ordinary kriging and IDW with power 2.
All the selected soil chemical parameters were strongly spatially dependent, but the
range of spatial dependence was found to vary within soil parameters. The study
~34~
Review of Literature
showed that prediction of spatial variability for different soil parameters (except
available nitrogen) may be better understood by ordinary Kriging than by IDW
method.
Jafari et al. (2011) assessed the spatial variation of chemical and physical soil
properties and then used this information to select an appropriate area to install a
pasture rehabilitation experiment in the Zereshk in region, Iran. They performed
conventional statistical methods and geostatistics in order to analyse soil properties
spatial dependence. Soil properties such as pH, P, SAR and Na were best fitted by
spherical semivariogram models. Kriging was performed in order to analyze spatial
variation of chemical and physical soil properties then for enhancing estimation
accuracy and comparing results co-Kriging technique was also used. They compared
the results using statistical techniques showed that Kriging technique has acceptable
accuracy in characterizing the spatial variability.
~35~
Review of Literature
Vijaykumar et al. (2011) studied the relationship between soil properties and
macro and micro-nutrients in the soil. Their study indicated that the soil properties
pH, EC, OC and OM are the main characteristics playing major role in controlling the
availability of micronutrients.
Singh et al. (2015) collected two thousand three hundred thirty four surface
soil samples from Chandauli, Mirzapur, Sant Ravidas Nagar and Varanasi districts of
eastern Uttar Pradesh during 2011-12 under GPS and GIS based soil fertility mapping
project. Analysis of these soil samples revealed the occurrence of acidic soils (pH
<5.5) in Chandauli and Mirzapur districts. A wide variation in pH (4.5-10.4)
indicated acidic to alkali nature of the soils in this region. These soils have electrical
conductivity (EC) less than 1 dS m-1 and organic carbon (OC) content ranged from
~36~
Review of Literature
0.6 to 13.3 g kg-1. High OC content was noticed in some Vindhyan low land soils of
Chandauli and Mirzapur district. Available sulphur (S) content ranged from 0.43 to
165 mg kg-1 with a mean value of 16.10, 9.63, 13.05 and 12.36 mg kg-1 in
Chandauli, Mirzapur, Sant Ravidas Nagar and Varanasi districts, respectively. The
corresponding deficiency of Sin soils of these districts was 39, 63, 45 and 56 per cent.
Nutrient index (NI) indicated S fertility level of low to medium. Soils were also found
to be highly deficient in available boron (B), with mean contents of 0.55, 0.49, 0.66
and 0.62 mg kg-1 in Chandauli, Mirzapur, Sant Ravidas Nagar and Varanasi districts,
respectively showing deficiency in 55, 61, 30 and 37 per cent soil samples. High
magnitude of B deficiency was noticed in soils of Vikas Khand Nawgarh (94%)
followed by Rajgarh (85%) and Marihan (80%), representing the area of low pH
Vindhyan soils.
Oliver and Gregory (2015) discussed thedirect and indirect effects of soil or
its constituents on human health are through its ingestion, inhalation or absorption.
Theyfocussed on four trace elements (iodine, iron, selenium and zinc) whose
deficiencies have substantial effects on human health. They reviewed as the world's
population increases issues of food security become more pressing, as does the need
to sustain soil fertility and minimize its degradation. Lack of adequate food and food
of poor nutritional quality lead to differing degrees of under-nutrition, which in turn
causes ill health. Soil and land are finite resources and agricultural land is under
severe competition from other uses. Relationships between soil and health are often
difficult to extricate because of the many confounding factors present. Nevertheless,
recent scientific understanding of soil processes and factors that affect human health
are enabling greater insight into the effects of soil on our health. Multidisciplinary
research that includes soil science, agronomy, agricultural sustainability, toxicology,
epidemiology and the medical sciences will facilitate the discovery of new antibiotics,
a greater understanding of how materials added to soil used for food production affect
health and deciphering of the complex relationships between soil and human health.
~37~