You are on page 1of 6

International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol: 12 No: 01 36

Selecting Optimum Locations of Rainfall


Stations Using Kriging and Entropy
Ayman G. Awadallah
Associate Professor, Civil Engineering Department, Faculty of Engineering,
Fayoum University, Fayoum, Egypt
aawadallah@darcairo.com

Abstract—Availability of rainfall data in real time The actual density of rainfall network, in most
is needed for management of important water watersheds, is significantly poorer than the values
resources like that of Zamzam well in Makkah recommended by the World Meteorological
watershed. However, rainfall network, in most Organization [1]. There has been much research
watersheds, is significantly poorer than the values done on the subject of minimum densities of
recommended by the World Meteorological rainfall networks. WMO recommends certain
Organization. A methodology based on the densities of rain gauge stations to be followed for
sequential use of kriging and entropy principles is different types of catchments. For small
used to determine the spatial distribution of
mountainous regions with irregular precipitation,
potential rainfall gauging stations. Kriging, which
25 km2 per station is recommended. According to
performs linear averaging to reconstruct the
rainfall over the watershed on the basis of observed the same WMO guidelines, the density falls to 10
rainfall data, is used to compute the spatial – 20 km2 per station in urban areas [2]. As such,
variations of rainfall in the locations of candidate for the 40 km2 Wadi Ibrahim watershed, 2 to 4
stations. The information entropy reveals the stations are required. The full WMO minimum
rainfall information of each of the candidate network requirements are listed in Table 1.
locations. By calculating the joint entropy, the
Several publications have applied statistical
candidate stations are prioritized. The
theory for rainfall network design. Shih [2]
methodology is applied on Makkah watershed
using geostatistical analyst in ArcGIS software for
introduced various steps based on a covariance
kriging and R software for information entropy. factor among rain gauge stations to design the
The candidate locations are validated on ground rainfall network. Shannon’s entropy was used to
and final stations locations are selected. decide the addition or removal of rain gauges ([3]
and [4]). Kriging method (proposed by Matheron
Index Terms—Rainfall network design, Kriging, [5] for spatial interpolation) has also been applied
Entropy, Makkah, Saudi Arabia for network design ([6] and [7]). Chen et al. [8]
proposed a method composed of kriging and
I. INTRODUCTION entropy that can determine the spatial distribution
of rain gauge stations in a catchment. The
Accurate rainfall data is indispensable in approach proposed by Chen et al. [8] was
planning and operation of water projects. followed to select the locations of the proposed 2
Availability of rainfall data in real time is needed to 4 rainfall stations in Wadi Ibrahim watershed.
for management of water resources and especially
in arid zones where water scarcity is prevailing.
The purpose of this research is to present the II. AVAILABLE DATA
methodology undertaken to select the optimum The nearest stations available with extent till
number and locations of the proposed rainfall 2009 are four stations (Azeezeah, Sharaaea,
stations in Wadi Ibrahim, Makkah watershed, Makkah and Laith) with coordinates as shown in
Saudi Arabia (Figure 1). Wadi Ibrahim watershed Table 2. The rainfall data for these stations are
is the source of Zamzam well, which is the available for the period from 2002 to 2009. The
drinking water for millions of pilgrims coming to stations are ordered in the table in descending
Makkah. The withdrawal from Zamzam well is order from the point of view of the distance away
determined based on the availability of rainfall from Wadi Ibrahim. Figure 2 illustrates the
and groundwater levels in the aquifer. locations of these rainfall stations

126301-7575 IJCEE-IJENS © February 2012 IJENS


IJENS
International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol: 12 No: 01 37

TABLE I. RECOMMENDED MINIMUM DENSITIES OF PRECIPITATION STATIONS


Minimum Density per Station
Physiographic Unit (Area in Km2 per station)
Non-Recording Recording
Coastal 900 9000
Mountainous 250 2500
Inter plains 575 5750
Hilly / Undulating 575 5750
Small Islands 25 250
Urban Areas 10-20
Polar / arid 10000 100000

TABLE II. AVAILABLE RAINFALL STATIONS IN THE VICINITY OF THE STUDY ZONE
Coordinates
Station Location Available Data Range
Lat.(N) Long.(E)
Laith 21° 20.506 39° 43.682 2002-2009
Sharaaea 21° 31.451 39° 55.115 2002-2009
Makkah 21° 26.267 39° 46.133 1985-2009
Azeezeah 21° 24.411 39° 51.888 2002-2009

Optimal bin intervals were selected using the


III. METHODOLOGY AND RESULTS Jenks' natural breaks classification scheme. This
As previously stated, kriging and entropy are scheme determines the best arrangement of
used sequentially for selecting station locations. values into classes by iteratively comparing sums
Kriging, as an interpolator, performs linear of the squared difference between observed
averaging to reconstruct the rainfall over the values within each class and class means. The
catchment on the basis of the observed rainfall. It "optimum" classification identifies breaks in the
is used to compute the spatial variations of ordered distribution of values that minimizes
rainfall for each of the available years on a grid within-class sum of squared differences, using the
covering 20 by 10 km2, with a cell size of 2.5 by total amount of data at the 32 locations. After the
2.5 km2. Thus, the rainfall data at 32 candidate classes or bins are determined, the frequency of
locations of the rain gauge stations are total annual rainfall in each of the classes is
reconstructed. Five intervals are adopted to calculated for each of the 32 candidate locations.
construct the empirical histogram of temporal The locations of the candidate stations are shown
rainfall distribution at the 32 candidate locations. by Figure 3.

Figure 1. Location of Wadi Ibrahim Watershed, Makkah, Saudi Arabia

126301-7575 IJCEE-IJENS © February 2012 IJENS


IJENS
International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol: 12 No: 01 38

Figure 2. Location of Exisitng Rainfall Stations

A1 A2

B1

B2

Figure 3. Location of Candidate Rainfall Stations on which kriging is applied

126301-7575 IJCEE-IJENS © February 2012 IJENS


IJENS
International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol: 12 No: 01 39

The information entropy, which is then applied to the problem of entropy estimator. All
applied, reveals the rainfall information of each three methods were implemented on R software
rain gauge station in the catchment. First of all, [12] using the entropy package developed by
the entropy of each candidate rain gauge station is Hausser and Strimmer [13]. All three estimators
calculated after all rainfall data are reconstructed. gave different results; however, the regions with
The most important rain gauge station is the one maximum entropy were 2 regions formed by
with the highest entropy H(x). If p denotes the locations (2, 3, 10, 11, 17, and 18) and by
probability mass function of X and I(X) is the locations (7 and 8). The two regions were chosen
information content or self-information of X, then as the two first potential regions. They are shown
the entropy can explicitly be written as in Figure 3 as A1 and A2 potential regions.
To choose the next two potential regions, it
n n was assumed that the information of the 2 first
H ( X ) = ∑ p( xi ) I ( xi ) = −∑ p( xi ) logb p( xi ) (1)
potential regions was introduced. With the
i =1 i =1
introduction of this information, the relocated rain
gauge station with the minimum mutual
where b is the base of the logarithm used. information, min{H(x1) – H(x1|x2)}, is selected
Common values of b are 2, Euler's number e, and next. The rain gauge station providing the
10. maximum conditional entropy H(x1|x2) is selected
In the case of pi = 0 for some i, the value of as the second important rain gauge station in the
the corresponding summand 0 logb 0 is taken to rain gauge network. Since 2 regions– instead of
be 0, which is consistent with the limit one – are selected in the first entropy round, two
regions are selected in the second round based on
the mutual information concept. The second set
lim ( p ⋅ log p ) = 0 of potential regions is denoted B1 and B2 on
p → 0+ Figure 3. Thus, the optimum spatial distribution
(2)
of the 4 rain gauge stations in the network can be
Various estimators are used to estimate determined. They are shown in the centre of the
entropy, such as the empirical entropy, the Miller previously mentioned regions and within the
bias corrected empirical estimator [9] and the non boundaries of the watershed, as illustrated by
parametric estimator of Chao and Shen [10] Figure 4.
which is a Horvitz-Thompson [11] estimator

TABLE III. CRITERIA FOR SELECTION OF RAINFALL STATIONS

Parameter Description
Collector Orientation Wet Bucket 45° of Magnetic West
Distance between dry-deposition collector and rain gauge ≤ 5 m distance < 30 m
Vertical distance between collector orifice and rain gauge orifice Distance ≤ 0.3 m
Vegetation height Height ≤ 0.6 m within 5 m of instrument base
Cultivated agricultural fields Distance > 20 m from collector
Pasture land Distance > 20 m from collector
Vertical objects (Including towers, wires, fences, etc.), angle of projection from
Projection angle ≤ 45° from top of instrument
instrumentation
Trees, angle of projection from instrumentation Projection angle ≤ 45° from top of instrument
Buildings, angle of projection from instrumentation Projection angle ≤ 30° from top of instrument
Objects, > 1 m tall, > 5 cm in width or depth Distance ≥ 5 m from instrument
> 20% of annual precipitation is frozen Wind shield present on rain gauge
Wind shield, pivot axis Same height as rain gauge orifice
Rooftop installation Urban sites only
Rooftop installation, equipment separation from potential emission sources (sewer vents,
Maximize separation
HVAC systems)
Rooftop installation, objects, angle of projection Projection angle ≤ 30° from top of instrument

126301-7575 IJCEE-IJENS © February 2012 IJENS


IJENS
International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol: 12 No: 01 40

Figure 4. Locations of candidate regions within the boundary of the watershed

Potential ground
stations

Figure 5. Locations of actual selected station locations

126301-7575 IJCEE-IJENS © February 2012 IJENS


IJENS
International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol: 12 No: 01 41

The 4 selected regions were tested on the ground with the REFERENCES
criteria hereafter stated in Table 3. It was found that 3 sites [1] World Meteorological Organization, Guide to Hydrological
are suited for rain gauge tipping bucket type and one site Practices, WMO-164. WMO: Geneva, 1994.
even suitable for a complete weather station, among 21 [2] S. F. Shih, “Rainfall variation analysis and optimization of gaging
candidate sites tested for suitability. The sites are shown in systems”, Water Resources Research, 18, pp: 1269-1277, 1982
Figure 5 where sites, B, C and E, are the tipping bucket sites [3] P. F. Krstanovic and V. P. Singh, “Evaluation of rainfall networks
and site I is for the weather station site. The site selection using entropy: II. Application”. Water Resources Management, 6, pp.
criteria are thus confirming the statistical approach based on 295–314, 1992.
entropy / kriging methodology and the 4 locations selected [4] M. Al-Zahrani and T. Husain, “An algorithm for designing a
precipitation network in the south-eastern region of Saudi Arabia.”
are located within the boundaries of the regions optimized by Journal of Hydrology, 205, pp. 205–216, 1998.
the statistical approach. [5] G. Matheron, Traité de Géostatistique Appliquée, Tome II. Le
The kriging / entropy methodology proved to be well suited krigeage, Mémoires du Bureau de Recherches Géologiques et
Minières No. 24. Editions B.R.G.M.: Paris, 1962.
for selection of rainfall gauge locations in arid regions using
[6] G. Bastin, B. Lorent, C. Duque and M. Gevers, “Optimal estimation
limited information from the existing rainfall network. of the average rainfall and optimal selection of rain gauge locations.”
Water Resources Research, 20(4), pp. 463–470, 1984.
[7] A. St-Hilaire. T.B.M.J. Ouarda, M. Lachance, B. Bobée, J. Gaudet
IV. CONCLUSIONS and C. Gignac, “Assessment of the impact of meteorological network
density on the estimation of basin precipitation and runoff: a case
Kriging and entropy are used sequentially to determine study.” Hydrological Processes, 17, pp. 651–3580, 2003.
the locations of candidate rainfall stations in the mountainous [8] Y.-C. Chen, C. Wei and H. C. Yeh, “Rainfall network design using
watershed of Wadi Ibrahim, Makka, Saudi Arabia; which is kriging and entropy.” Hydrological Processes, 22: pp. 340–346,
the source of Zamzam well, drinking water of millions of 2008.
pilgrims. Using the available rainfall data, 4 station locations [9] G. Miller, “Note on the bias of information estimates.” In H. Quastler,
are selected to comply with WMO guidelines and to manage ed., Information Theory in Psychology II-B, pp. 95–100, Free Press,
the water resource efficiently and in real time. The locations Glencoe, IL. 1955.
selected by geostatistical approach are validated on ground. [10] A. Chao and T. -J. Shen, “Nonparametric estimation of Shannon’s
index of diversity when there are unseen species.” Environmental and
Further research is needed to apply the methodology on a Ecological Statistics, 10, pp. 429–443, 2003.
larger scale and to confirm that the proposed rainfall gauges [11] D. G. Horvitz and D. J. Thompson, “A generalization of sampling
add information to the spatial distribution of rainfall in the without replacement from a finite universe.” Journal of the American
region. Furthermore, a comparison between entropy methods Statistical Association, 47, pp. 663-685, 1952.
is needed to decide on the best suited method in selection of [12] R Development Core Team. R: A Language and Environment for
Statistical Computing. R Foundation for Statistical Computing,
rainfall gauge locations. Vienna, Austria, 2011. Available at: http://www.R-project.org.
[13] J. Hausser and K. Strimmer, “Entropy inference and the James-Stein
estimator, with application to nonlinear gene association networks.”
Journal of Machine Learning Research, 10, pp. 1469-1484, 2009.

126301-7575 IJCEE-IJENS © February 2012 IJENS


IJENS

You might also like