You are on page 1of 17

Water Resour Manage (2011) 25:41–57

DOI 10.1007/s11269-010-9686-9

Wavelet Transform Method for Synthetic Generation


of Daily Streamflow

Wensheng Wang · Shixiong Hu · Yueqing Li

Received: 27 June 2009 / Accepted: 11 June 2010 /


Published online: 30 June 2010
© Springer Science+Business Media B.V. 2010

Abstract Synthetic generation of daily streamflow sequences is one of the most


critical issues in stochastic hydrology. In this study, a new wavelet transform method
is developed for synthetic generation of daily streamflow sequences. Firstly, daily
streamflow sequences with different frequency components are decomposed into
the series of wavelet coefficients W1 (t), W2 (t),...,W P (t) and scale coefficients (the
residual) C P (t) at a resolution level P using wavelet decomposition algorithm.
Secondly, the series of W1 (t), W2 (t),...,W P (t) and C P (t) are divided into a number
of sub-series based on a yearly period. Thirdly, random sampling is performed from
sub-series of W1 (t), W2 (t),...,W P (t) and C P (t), respectively. Based on these sampled
sub-series, a large number of synthetic daily streamflow sequences are obtained using
wavelet reconstruction algorithm. The advantages of this newly developed method
include: (1) it is a nonparametric approach; (2) it is able to avoid assumptions of
probability distribution types (Normal or Pearson Type III) and of dependence
structure (linear or nonlinear); (3) it is not sensitive to the original data length and
suitable for any hydrological sequences; and (4) the generated sequences from this
method could capture the dependence structure and statistical properties presented
in the data. Finally, a case study in Jinsha River, China, indicates that the new method
is valid and efficient in generating daily streamflow sequences based on historical
data.

W. Wang (B) · S. Hu
State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University,
Chengdu 610065, China
e-mail: wangws70@sina.com

W. Wang · Y. Li
Chengdu Institute of Plateau Meteorology, CMA, Chengdu 610072, China

S. Hu
Department of Geography, East Stroudsburg University of PA, 200 Prospect Street,
East Stroudsburg, PA 18301-2999, USA
42 W. Wang et al.

Keywords Daily streamflow · Synthetic generation · Wavelet transform ·


Decomposition and reconstruction

1 Introduction

One of major goals in stochastic hydrology is to generate synthetic streamflow


sequences that are statistically similar to observed streamflow sequences. The
synthetic streamflow sequences can be applied to planning, design, operation and
management of hydrology and water resources system. The key step in synthetic
generation is to construct an appropriate stochastic hydrological model. Currently,
great progress has been made in applying stochastic hydrological models to the
field of hydrology and water resources extensively (Yevjevich 1972; Kottegota 1980;
Lall and Sharma 1996; Sharma et al. 1997; Aksoy and Bayazit 2000; Koutsoyiannis
2000; Srinivas and Srinivasan 2005; Wang et al. 2007; Wang and Ding 2007). The
conventional regressive models and disaggregation models (Yevjevich 1972; Ding
and Deng 1988) are parameter-driven by an assumed probability density function
that is estimated based on the first two or three moments of historical data. The
advantages of parameter-driven models include simple structure and convenient for
the application, and capability of capture main statistical characteristics and the
linearity of hydrological processes. However, these parameter-driven models have
difficulties in reflecting the form of real probability distribution and the complex
nonlinearity of hydrological processes.
Over the past decades, some nonlinear stochastic models have been developed
(Tong 1990; Raman et al 1995; Wang 1999; Wang et al. 2001, 2007) and used in
hydrology and water resources. Artificial neural network (ANN) is a nonlinear math-
ematical structure which is capable of representing complex nonlinear hydrological
sequences. Raman et al (1995) proposed strategies for the multivariate modeling
of water resources time series using ANN. Wang (1999) developed a hybrid ANN
model that combines the Pearson Type III stochastic model with ANN for simulating
single variable and multivariate daily discharge sequences. Yuan and Wang (1996)
examined the performance of bilinear (BL) time series model in generating daily
discharge of flood period from 1 May to 31 October in southwestern China. Jin
and Ding (2002) developed a new method for estimating the parameters of BL
model based on the genetic algorithm in forecasting monthly ground water level
and drought-flood time series. Wang et al. (2001) first introduced the threshold
auto-regression model (TAR) into stochastic hydrology and applied the model to
the generation of daily streamflow sequences. The results of this study showed that
TAR is an effective model in generation of runoff. Jin and Ding (2002) established a
genetic algorithm to estimate the parameters in TAR.
In recent 15 years, many nonparametric stochastic models have been established
(Lall and Sharma 1996; Sharma et al. 1997; Srinivas and Srinivasan 2005; Wang
et al. 2007; Wang and Ding 2007). Lall and Sharma (1996) proposed a nearest
neighbor bootstrapping model and applied it to the generation of monthly runoff
sequences. Based on kernel density theory, Sharma et al. (1997) proposed the
first-order nonparametric model for the generation of monthly streamflow in the
Beaver River, Utah. Wang and Ding (2007) suggested a multivariate nonparametric
model for the generation of daily streamflow sequences at two hydrological gauging
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 43

stations. On the basis of kernel density estimation, Tarboton et al. (1998) developed
a nonparametric disaggregation (NPD) model for stochastic hydrology. Wang et al.
(1999) examined the effectiveness of NPD model in simulating monthly discharge
time series. The advantage of nonparametric models is that the models could avoid
prior assumptions as to the form of probability distribution (Normal or Pearson III)
and the form of dependence (linear or non-linear).
Wavelet analysis is a recently developed method for signal processing, image
compress and encoding, tongue encoding, and mode identification. The important
feature of wavelet analysis exists in its multi-resolution characteristics for processing
both stationary and non-stationary time series. The wavelet analysis has been widely
utilized in hydrology (Kumar and Foufoula-Georgiou 1993; Smith et al. 1998; Li
et al. 1999; Labat et al. 2000; Bayazit and Aksoy 2001; Wang and Ding 2003; Unal
et al. 2004; Anctil and Coulibaly 2004; Wang et al. 2005, 2009). Smith et al. (1998)
applied wavelet analysis to study stream flow features. Labat et al. (2000) separated
sub-processes of pumping and naturally intermittent runoff processes. Using wavelet
analysis several researchers have made efforts to explain inter-annual and inter-
decadal variability and detect quasi-periodicities of precipitation and runoff time
series (Wang et al. 2005; Kang and Henry 2007). Wavelet analysis has also been
applied to hydrological forecasting and prediction. Coulibaly et al (2000) employed
recurrent neural networks and wavelet analysis to establish the climate index of
low-frequency, and predict the annual runoff. Li et al. (1999) proposed a hybrid
model with wavelet and ANN for predicting annual runoff. Wang and Ding (2003)
presented a wavelet neural network hybrid model for predicting groundwater level
and daily discharge. Chou and Wang (2004) proposed a multi-model method using a
wavelet-based Kalman filter. Wang et al. (2009) developed a wavelet network model
to predict the inflow of Three Gorges dam in Yangtze River, China.
Several studies have been performed in applying wavelet analysis to generate
streamflow sequences. The method based on Haar wavelet, normal distribution
data and randomly combination (called BA method thereafter) was first presented
by Bayazit and Aksoy (2001). They simulated annual and monthly streamflow
series generation by using BA method. The method was later applied to generate
streamflow data (Bayazit et al. 2001) and compared to the AR2 (2-order auto-
regressive model) processes to see the performance of BA method in water resources
design and operation. Aksoy (2001) extended its use to river reservoir storage
simulation. In another study performed by Unal et al. (2004), BA method was
proposed as a tool to alternate the classical data generation schemes for simulation of
annual and monthly rainfall data. This conclusion was achieved after an application
of the method was made on a 64-year rainfall data and a comparison was performed
to five more models including the modified method of fragments of Srikanthan and
McMahon (1980). BA method was obviously the best in preserving the mean value
of the sequence and the correlation structure between months of the series (Aksoy
and Unal 2007). Heng et al (2002) presented a hybrid model that combines wavelet
transform and autoregressive model for generating annual runoff series. There is no
more reported study yet using wavelet analysis to generate the hydrological time
series.
There are only a few wavelet methods available for simulating streamflow, and
most of them have their disadvantages. For example, BA method is a successful one
in simulating streamflows, but has certain limitations: (1) BA method is sensitive to
44 W. Wang et al.

the data length. The wavelet method requires the data length to be a power of 2. This
means that the data length must be 16, 32, 64, 128, etc., and therefore, up to half of
the data may not be used by BA method; (2) The wavelet function of BA algorithm is
only Haar wavelet, which is not suitable for other wavelet functions; (3) It is difficult
for BA method to generate sequences longer than the historical record (Srikanthan
et al. 2007); and (4) BA method is only suitable for a normal distribution hydrological
series. Therefore, it is very necessary to develop new methods to avoid the above-
mentioned limitations based on wavelet analysis for simulating streamflows.
The objective of this study is to propose a wavelet transform method for gener-
ating daily streamflow sequences. The proposed method does not need to estimate
parameters, and it is able to avoid prior assumptions as to the form of probability
function and the form of dependence structure. The organization of the study is
as follows. Section 2 gives a brief description of wavelet transform and wavelet de-
composition and reconstruction algorithm. Section 3 discusses the wavelet transform
method for daily streamflow. A case study of Jinsha River watershed in China is
used to illustrate the procedures in Section 4. Finally, the conclusions are drawn in
Section 5.

2 Wavelet Analysis

2.1 Wavelet Transform

Wavelet analysis is developed from the Fourier transforms. It could investigate


detailed temporal patterns from both frequency and time domains through adjusting
time and frequency signals. The successive wavelet transform is defined by (Chui
1995; Rao and Bopardikar 1998)
  
t−b
W f (a, b ) = |a|− 2
1
f (t)ψ̄ dt (1)
R a

where a is scale factor (frequency); b is time shift; R is the domain of real number; t is
time; f (t) is successive processes; ψ̄(t) is the complex conjugate of wavelet function
ψ(t). Wavelet function ψ(t) is called mother wavelet that has shock properties and
can reduce to zero rapidly.
In real application, the input data f (t) is generally discretely sampled, such
as annual runoff time series, monthly precipitation time series and water quality
time series (nitrate, chloride, and sodium concentrations). The successive wavelet
j j
transform is often discrete. Let a = a0 , b = kb 0 a0 , a0 > 1, b 0 ∈ R, k, j are integer
number, then, discrete wavelet transform of f (t) could be written as

− j/2  −j 
W f ( j, k) = a0 f (t)ψ̄ a0 t − kb 0 dt (2)
R

When a0 = 2, b 0 = 1, Eq. 2 becomes binary wavelet transform



W f ( j, k) = 2− j/2 f (t)ψ̄(2− jt − k)dt (3)
R
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 45

Wavelet transform is the decomposition of f (t) under different resolution scales (also
called level) a. This means that wavelet analysis is a multi-resolution analysis (MRA)
method. Using MRA the observed time series could be decomposed into various
time series at respective resolution levels.

2.2 Wavelet Decomposition and Wavelet Reconstruction Algorithm

The decomposition of time series can be performed efficiently using Mallat algorithm
(Mallat 1989) and Trous algorithm (Shensa 1992; Aussum et al. 1997). Chou (2007)
discussed the merits of the Trous algorithm. In this paper, the Trous algorithm was
adopted to decompose and reconstruct the observed hydrological time series.
The Trous algorithm is a redundant wavelet transform and is stable. Let us have a
hydrological time series C0 (t). The Trous decomposition algorithm works as follows:

k
Ci (t) = h(l)Ci−1 (t + 2i l) i = 1, 2, · · ·, P (4)
l=−k

Wi (t) = Ci−1 (t) − Ci (t) i = 1, 2, · · · , P (5)


Where h(l) is the discrete low-pass filter; Ci (t), Wi (t)(i = 1, 2,...,P) are scale
coefficients (approximation signal) and wavelet coefficients (detail signal) at the
resolution level i respectively; P denotes the maximum resolution level. The
approximation terms C P (t) represents the background information, and W1 (t),
W2 (t),...,W P (t) provide detailed characteristics in the original time series C0 (t).
{W1 (t), W2 (t),...,W P (t), C P (t)} are called discrete wavelet transform at the resolution
level P. In using Eq. 4, handling of boundaries may be carried out in different ways.
We take an intuitively acceptable approach by considering Ci (n + k) = Ci (n − k).
It is possible to reconstruct the original hydrological time series from discrete
wavelet transform {W1 (t), W2 (t),...,W P (t), C P (t)}. The appealing reconstruction for-
mula for original time series is given by


P
C0 (t) = C P (t) + Wi (t) (6)
i=1

Equation 6 is Trous reconstruction algorithm.


The Trous algorithm for decomposition and reconstruction of C0 (t) is simple and
rapid. The key is to select a feasible discrete low-pass filter h(l). There are several
discrete
√ √low-pass√ filters h(l) available for selecting, such as h(l) = (h−1 , h0 , h1 ) =
( 2/4, 2/2, 2/4), h(l) = (h−2 , h −1 , h0 , h1 , h2 ) = (1/16, 1/4, 3/8, 1/4, 1/16), h(l) =
(h−3 , h−2 , h−1 , h0 , h1 , h2 , h3 ) = √12 − 16
1
, 0, 16
9
, 1, 16
9
, 0, − 16
1
.

3 Synthetic Generation of Daily Streamflow with Wavelet Transform Method

Most methods and models generate daily streamflow time series with the original
data alone. In general, each daily streamflow time series includes several frequency
components. Using the component at only one resolution level to simulate daily
streamflow sequences makes the internal mechanism difficult to understand. There-
fore, a new wavelet transform method for generating daily streamflow sequences is
46 W. Wang et al.

presented here based on wavelet coefficients and approximation scale coefficients.


The major steps of the suggested method are as follows: (a) Decompose the
daily streamflow time series into the wavelet coefficients W1 (t), W2 (t),...,W P (t) at
the resolution level P and the scale coefficients C P (t) with Trous decomposition
algorithm; (b) Divide the W1 (t), W2 (t),...,W P (t) and C P (t) into a number of sub-
series based on the yearly period, and the length of each sub-series is 365 days
(the last day in February in a leap year is omitted for the convenience, and a year
is considered to consist of 365 days in this study); and (c) Sample randomly from
the sub-series of W1 (t), W2 (t),...,W P (t) and C P (t) respectively and generate many
synthetic daily streamflow sequences with the Trous reconstruction algorithm. The
detailed procedures are illustrated as follows.

3.1 Decomposition of Daily Streamflow Sequences

The low-pass filter h(l), which is a B3 spline, defined as h(l) = (h−2 , h−1 , h0 , h1 , h2 ) =
(1/16, 1/4, 3/8, 1/4, 1/16), is used in this study as an example. B3 spline is of
compact support and point-symmetric. First, the maximum resolution level P must
be determined. In general there are int[lgn] resolution level number, where n is the
length of daily streamflow sequences and lg denotes the logarithm to base 10. The
P may be selected from the range of 2 and int[lgn], that is, 2 ≤ P ≤ int[lgn]. The
historical daily streamflow sequences are standardized as

xi, j − x j
zi, j = (7)
σ̂ j

where xi, j is the historical daily streamflow data for j day of i year; zi, j is the
standardized daily streamflow data for j day of i year; x j and σ̂ j are mean and
standard deviation of calendar day j, respectively; zi, j is written as the long time
series C0 (t) for the convenience.
The standardized daily streamflow sequences C0 (t) can be decomposed with the
Trous decomposition algorithm. The wavelet coefficients and scale coefficients of the
standardized daily steamflow sequences at Pingshan gauging station of Jinsha River
in upper stream of Yangtze River, China are shown in Fig. 1 (only listed 5 years
of 1940–1944 as an example, and here 2 is selected for P). In Fig. 1, W1 (t) and
W2 (t) denote wavelet coefficients at the resolution level 1 (Fig. 1a) and 2 (Fig. 1b),
respectively. C2 (t) denotes scale coefficients at the resolution level 2 (Fig. 1c).

3.2 Reconstruction of Daily Streamflow Sequences

In order to verify the accuracy of the Trous algorithm, the reconstruction of daily
streamflow sequences is needed. The reconstruction of daily streamflow sequences
may be acquired using Eq. 6 with wavelet transform W1 (t), W2 (t),..., W P (t) and C P (t),
which have no change in time order. The reconstructed data of daily streamflow
sequences at Pingshan station z are plotted in Fig. 2b based on W1 (t), W2 (t) and
C2 (t). It can be seen from Fig. 2 that there is no difference between observed data
of standardized daily streamflow sequences z (Fig. 2a) and reconstructed ones z
(Fig. 2b). Thus, it can be concluded that the Trous algorithm does not loss any
information.
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 47

(a)
0.6
0.4
0.2
W1(t)

0
-0.2
-0.4
-0.6
1 147 293 439 585 731 877 1023 1169 1315 1461 1607 1753
t (day)
(b)
0.8
0.6
0.4
W2(t)

0.2
0
-0.2
-0.4
-0.6
1 147 293 439 585 731 877 1023 1169 1315 1461 1607 1753
t (day)
(c)
3
2
1
C2(t)

0
-1
-2
1 147 293 439 585 731 877 1023 1169 1315 1461 1607 1753
t (day)

Fig. 1 Wavelet decomposition of standardized daily streamflow sequences (1940–1944) at Pingshan


station of Jinsha River in Yangtze River, China. a Wavelet coefficient W1 (t), b wavelet coefficient
W2 (t), and c scale coefficient C2 (t)

3.3 Synthetic Generation of Daily Streamflow Sequences

Based on the wavelet transform W1 (t), W2 (t),..., W P (t), C P (t), a large number of
synthetic daily streamflow time series have been generated. These synthetic daily
streamflow time series are similar to the historical data in the aspect of the statistical
properties and stochastic variation rules. The major procedures in this study could
be explained as follows.
1. The historical daily streamflow time series are decomposed into the wavelet
coefficients W1 (t), W2 (t),...,W P (t) at the resolution level P and the scale
coefficients (the residual) C P (t);
2. According to the yearly period T of the observed data, wavelet transform W1 (t),
W2 (t),...,W P (t), C P (t) is divided into m sub-series (short series), respectively,
where m = n/T, T = 365, and m is the number of years, n is the number of days;
3. Randomly select single sub-series of wavelet coefficients and scale coefficients
at each resolution level (see Fig. 3), and assure that not all the sub-series are
48 W. Wang et al.

(a)
7
5
3
z
1
-1
-3
01-01-40 12-30-44 12-29-49 12-28-54 12-27-59 12-25-64 12-24-69 12-23-74 12-22-79 12-20-84 12-19-89
date
(b)
7
5
3
z’
1
-1
-3
01-01-40 12-30-44 12-29-49 12-28-54 12-27-59 12-25-64 12-24-69 12-23-74 12-22-79 12-20-84 12-19-89
date

Fig. 2 Comparison between observed and reconstructed data of standardized daily streamflow
sequences at Pingshan station. a Observed sequences z; b reconstructed sequences z

from same column simultaneously in Fig. 3. Then, reconstruct the synthetic


daily streamflow time series with a period of T using Eq. 6. A large number
of synthetic time series can be generated by repeating this process.
Figure 3 illustrates the stochastic simulating procedures with an example. Assume
P = 2, W1 (t), W2 (t) and C2 (t) are divided into m sub-series at first, then sample k-
th sub-series from C2 (t) randomly and keeps it unchangeable. Select i-th sub-series
from W1 (t) randomly and j-th sub-series from W2 (t), respectively. The values of i, j,

Fig. 3 Sketch of wavelet transform method for generating daily streamflow sequences (if i, j, k are
equal, then the reconstructed series is same as original series. If i, j, k are not equal simultaneously,
then the reconstructed series is a newly simulating series)
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 49

k (i, j, k = 1, 2, ..., m) can not be equal at the same time. Lastly, reconstruct a large
number of synthetic daily streamflow time series of period T with Eq. 6.
This is a new wavelet transform method for synthetic generation of daily
streamflow based on unconditional stochastic combination.
The suggested method must satisfy that W1 (t), W2 (t),..., W P (t) and C P (t) are
independent of each other for different lag time. This is a basic premise for synthetic
generation with wavelet transform method. In general, this premise is valid. For the
Pingshan station, the lag k cross-correlation coefficients of wavelet coefficients W1 (t),
W2 (t) and scale coefficients C2 (t) are calculated each other and plotted in Fig. 4.
Under the confidence level 1-α = 95%, it indicates that W1 (t), W2 (t) and C2 (t) are
independent of each other. The conclusion mentioned is correct for any resolution
level P.
The essences of the proposed method are: (1) the observed daily streamflow
sequences are decomposed into wavelet transform time series W1 (t), W2 (t),..., W P (t)
and C P (t), which have different properties of time and frequency; (2) these wavelet
transform time series W1 (t), W2 (t),..., W P (t) and C P (t) are independent of each other,
therefore, through unconditional stochastic combination of the wavelet coefficients
sub-series and the scale coefficients sub-series, the synthetic daily streamflow se-
quences are able to reflect various characteristics of historical data. This uncon-
ditional stochastic combination could generate many kinds of daily streamflow
sequences which are different from historical sequences.
The proposed wavelet transform method avoids prior assumptions as to the form
of probability distribution (Normal or Pearson III) and the form of dependence
(linear or non-linear), so it is a non-parametric approach. It also can extrapolate the
observed data to a much large range. The proposed new method is relatively simple
for generating daily streamflow.

3.4 Similarities and Dissimilarities Between BA Algorithm and Proposed Algorithm

The similarities between the proposed method in this paper and BA method are:
(1) the procedures of these two method are same, which is that the observed time
series is decomposed into components at various frequencies and then randomly

0.25
ra
cross-correlation coefficient

0.2
rb
0.15
rc
0.1
0.05
0
-0.05
-0.1 lag k
-0.15
-0.2
-0.25
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31

Fig. 4 Cross-correlation coefficient of W1 (t), W2 (t) and C2 (t) ra is the lag k cross-correlation
coefficients of W1 (t) and W2 (t), rb is the lag k cross-correlation coefficients of W1 (t) and C2 (t), rc
is the lag k cross-correlation coefficients of W2 (t) and C2 (t)
50 W. Wang et al.

reconstructed a new synthetic series; (2) both methods are based on wavelet analysis;
and (3) both methods are relatively simple for simulating a hydrological time series.
The dissimilarities between the proposed method in this paper and BA method
are as follows: (1) BA algorithm is sensitive to the data length. The wavelet method
requires the data length to be a power of 2. This means that the data length must
be 16, 32, 64, 128, etc., and therefore, up to half of the observed data may not
be used by BA method. The proposed algorithm in this study is not sensitive to
the data length, which means that all the measured data could be included in the
analysis. (2) The formulas and ideas about on how to decompose and reconstruct
the time series in these two algorithms are different. (3) The wavelet function of BA
algorithm is only Haar wavelet, which is not suitable for other wavelet functions. The
wavelet function of our proposed algorithm is a kind of discrete low-pass filter, which
provides more choices. (4) It is difficult for BA method to generate sequences longer
than the historical record. It is easy for our algorithm to generate longer synthetic
sequences than the historical record in extrapolation. (5) BA method is only suitable
for a normal distribution hydrological series, while our method is suitable for any
hydrological sequences.

4 Case Study

4.1 Study Area and Datasets

The case study is to demonstrate that the wavelet transform method is applicable
to synthetic generation of daily streamflow by selecting the Jinsha River in China
as case study area. The Jinsha River is located between 90◦ ∼105◦ E longitude and
24◦ ∼ 36◦ N latitude, and it is the upstream of Yangtze River in southwestern China
with a drainage area of 500,000 km2 . The length of mainstream is 3,481 km. There is a
discharge gauging station, Pingshan, at the outlet of the Jinsha Watershed. The mean
annual runoff is 1,440 billions m3 . The daily average discharges data-sets from 1940
to 1992 at Pingshan are collected. These datasets are considered consistent because
there is no significant change in local climate and human activities in general. This
can be seen from Fig. 5, which shows the daily streamflow time series xi, j during
1940–1992 at Pingshan station.

30000

20000
x(m3/s)

10000

0
01-01-4 12-30-44 12-29-49 12-28-54 12-27-59 12-25-64 12-24-69 12-23-74 12-22-79 12-20-84 12-19-89
date

Fig. 5 The observed daily streamflow time series during 1940–1992 at Pingshan station in Jinsha
River, China
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 51

4.2 Synthetic Generation of Daily Streamflow Sequences at Pingshan Station

In this study there is 53-years record of daily streamflow. Then, the length of the
daily streamflow sequences xt is 19,345 days (i.e., n = 48 × 365). Here, P = 2 and
C0 (t) = x(t). In fact, 2, 3 and 4 would be selected for P. A higher value for P can
generate more daily streamflow hydrogragh with more variation in modes. Based on
the proposed wavelet transform method many synthetic daily streamflow sequences
have been generated.

4.3 Validation and Verification of the Developed Method

If a number of significant statistical characteristics of historical sequences could be


reproduced by the sequences generated based on the developed method in the study,
it would be reasonable to say that this newly developed method is valid and efficient.
Based on the developed method 100 samples have been generated, the length of each
generated sample is the same as the length of the historical sample. Validation and
verification were performed as follows.

1. Test for statistical parameters of daily streamflow for each day

The mean value, standard deviation (S), coefficient of variation (Cv), coefficient
of skew (Cs), lag-1 autocorrelation coefficient r1 , lag-2 autocorrelation coefficient
r2 , maximum and minimum of daily streamflow for each day are major statistical
parameters concerned by most hydrologists. Table 1 shows the ratios of the days
of each observed statistical parameter, which falls inside its corresponding interval
to all days (that is 365) in a year. Table 1 shows that the generated sequences
could preserves the main characteristics of statistical parameters mentioned above.
The interval for each statistical parameter is two standard deviations away from
the mean of each generated parameter. The mean and standard deviation of each
generated parameter could be calculated based on the 100 generated samples. For
example, considering that Lag-1 autocorrelation coefficient r1 ( j), j = 1, 2, ..., 100 for
the generated sample j, the mean and standard deviation are respectively

1 
100
r̄1 = r1 ( j) (8)
100 j=1

and
⎛ ⎞1/2
1 
100

S=⎝ r1 ( j) − r1 ⎠
2
(9)
99 j=1

So the interval of Lag-1 autocorrelation coefficient r1 is [r̄1 − 2S, r̄1 + 2S].

Table 1 Ratios falling inside the corresponding interval of statistical parameters/%


Mean value S Cv Cs R1 r2 Maximum Minimum
100 100 100 75.3 85.2 88.5 91.2 86
52 W. Wang et al.

Table 2 Statistical parameters of observed and generated flood volume


Parameter Mean value (109 m3 ) Cv Cs
Flood volume Observed Generated Observed Generated Observed Generated
Mean S Mean S Mean S
W1 1.48 1.51 0.05 0.24 0.22 0.02 0.79 0.72 0.34
W3 4.29 4.40 0.13 0.23 0.20 0.02 0.78 0.72 0.35
W7 9.34 9.62 0.30 0.23 0.21 0.02 0.89 0.89 0.71
W15 18.2 18.6 0.60 0.22 0.21 0.02 0.87 0.72 0.34
W30 32.1 32.6 1.10 0.22 0.21 0.02 0.79 0.55 0.25

2. Test for statistical parameters of flood volume

The maximum 1-, 3-, 7-, 15-, and 30-day flood volume (W1 , W3 , W7 , W15 , W30 ) are
concerned statistics, too. The statistical parameters (such as mean value, Cv and Cs
of various flood volumes) are obtained from the generated sequences and observed
ones (see Table 2). From Table 2, it can be seen that the observed parameters of all
kinds of flood volume fall inside their corresponding intervals, hence, the observed
characteristics of flow sequences can be preserved well in the newly generated
sequences.. The interval for each statistical parameter is one standard deviation (S in
Table 2) away from the mean of each generated parameter. The mean and standard
deviation of each generated parameter can be calculated by Eqs. 8 and 9.

3. Test for statistical parameters of monthly runoff

Statistical parameters of monthly runoff in January–December obtained from gener-


ated and observed sequences are listed in Table 3. Table 3 shows that all parameters
are well reproduced under the one S band from the mean of each generated
parameter. Here, S and mean can be acquired as before.

Table 3 Statistical characteristics of observed and generated monthly runoff


Parameter Mean value (m3 /s) Cv Cs
Month Observed Generated Observed Generated Observed Generated
Mean S Mean S Mean S
January 1,628 1,628 25 0.12 0.11 0.01 0.68 0.68 0.28
February 1,392 1,395 19 0.10 0.10 0.01 0.81 0.72 0.25
March 1,312 1,311 17 0.10 0.09 0.01 0.90 0.95 0.29
April 1,487 1,488 27 0.14 0.13 0.03 1.97 1.46 0.92
May 2,218 2,216 67 0.21 0.21 0.02 0.32 0.30 0.31
June 4,901 4,885 218 0.26 0.25 0.02 0.53 0.57 0.27
July 9,410 9,434 434 0.26 0.26 0.02 0.36 0.49 0.26
August 10,168 10,213 489 0.29 0.29 0.02 0.85 0.73 0.23
September 9,942 9,947 361 0.24 0.23 0.02 0.47 0.32 0.25
October 6,541 6,593 253 0.24 0.25 0.02 0.58 0.53 0.25
November 3,447 3,460 92 0.18 0.18 0.02 0.99 1.0 0.41
December 2,172 2,177 48 0.14 0.14 0.01 0.77 0.72 0.30
Table 4 Occurrence ratio of maximum daily streamflow in each month of a year (%)
Month January February March April May June July August September October November December
Observed 0.0 0.0 0.0 0.0 0.0 2.6 26.3 39.5 26.7 4.8 0.0 0.0
Generated 0.0 0.0 0.0 0.0 0.0 1.4 31.4 42.8 23.5 0.9 0.0 0.0
Wavelet Transform Method for Synthetic Generation of Daily Streamflow
53
54 W. Wang et al.

Fig. 6 Comparison of observed and simulated daily streamflow sequences (Vertical coordinate is
flow in m3 /s; horizon coordinate is time in day)

4. Test for seasonal variation of occurrences of maximum daily streamflow in a year

In order to verify the seasonal variation of maximum daily streamflow in a year,


the ratios of the number of occurrence of the yearly maximum daily streamflow in
each calendar month to the total years (here is 100 × 53 = 5300) are obtained from
generated and observed sequences, respectively (see Table 4).
If the ratios of generated sequences are close to the ratios of observed ones
for each calendar month, the results would be acceptable. Table 4 indicate s that
the results derived from this method is acceptable. It also shows that the seasonal
variations of maximum daily streamflow in a year of generated sequences are similar
to observed ones. Therefore, the developed method is capable of reflecting efficiently
the seasonal variation of occurrence of maximum daily streamflow in a year.
The hydrograghs of generated daily streamflow sequences and the observed ones
are relatively consistent. This can be seen from the high similarity of two observed
and generated daily streamflow sequences in Fig. 6.
In summary, it has been shown that wavelet transform method developed in
this study is capable and compelling in generating daily streamflow sequences with
similar statistical parameters of observed streamflow.

5 Conclusion

A wavelet transform method for synthetic generation of daily streamflow sequences


has been presented in this study. This method is innovative idea for generation
of stochastic hydrology. The essence of the method is unconditional stochastic
combination of the obtained sampling sub-series from the wavelet transform time
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 55

series. The advantages of proposed wavelet transform method in this study are
summarized as follows.

1. The proposed method could produce a large number of synthetic daily


streamflow sequences, which are different from the historical sequences.
2. The generated sequences are statistically similar to the historical sequences.
3. The proposed method is a nonparametric approach.
4. The generated sequences could capture the dependence structure and statistical
properties presented in the data without imposing arbitrary distributional and
dependence assumptions.
5. The proposed method is not sensitive to the original data length and suitable
for any hydrological sequences, such as stationary hydrological series, non-
stationary hydrological series, normal distribution hydrological series, and skew
distribution hydrological series.
6. The method is relatively simple in procedures.

The number of generated sequences of the proposed method depends on the size of
the historical data and the reasonable choice of the resolution level P, so adequate
observed data must be available for this method.
Validation and verification of synthetic generation of daily streamflow sequences
at Pingshan station have shown that the proposed method is functional and feasible.
The proposed method could be widely applied to other hydrological time series in
the future. Future studies will be opened up about the choice of the filters suitable
for hydrological time series and the resolution level P.

Acknowledgements The work was partly supported by the National Natural Science Foundation
of China (No: 50739002), Ministry of Technology & Science of China (grant No:2009IM020100),
Program for New Century Excellent Talents in University of China and Open Research Fund
Program of State key Laboratory of Hydraulics and River Engineering, Sichuan University, China
(No. 0703). The authors also wish to thank the anonymous reviewers for their valuable comments.

References

Aksoy H (2001) Storage capacity for river reservoirs by wavelet-based generation of sequent-peak
algorithm. Water Resour Manag 15:423–437
Aksoy H, Bayazit M (2000) A model for daily flows of intermittent streams. Hydrol Process 14:1725–
1744
Aksoy H, Unal NE (2007) Discussion of “Comparison of two nonparametric alternatives for stochas-
tic generation of monthly rainfall” by R Srikanthan, A Sharma, and TA McMahon. J Hydrol Eng
©ASCE 12(6):699–701
Anctil F, Coulibaly P (2004) Wavelet analysis of the inter-annual variability in southern Quebec
stream flow. J Climate 17:163–173
Aussum A, Campbell J, Murfagh F (1997) Wavelet-based feature extraction and decomposition
strategies for financial forecasting. J Comput Intell Finance 6:5–12
Bayazit M, Aksoy H (2001) Using wavelets for data generation. J Appl Stat 28(2):157–166
Bayazit M, Onoz B, Aksoy H (2001) Nonparametric streamflow simulation by wavelet or Fourier
analysis. Hydrol Sci J 46(4):623–634
Chou CM (2007) Applying multi-resolution analysis to differential hydrological grey models with
dual series. J Hydrol 332:174–186
Chou CM, Wang RY (2004) Application of wavelet-based multi-model Kalman filters to real-time
flood forecasting. Hydrol Process 18:987–1008
56 W. Wang et al.

Chui CK (eds) (1995) Wavelet—a tutorial in theory and applications. Xi An Communication Uni-
versity Press, Xi An (in chinese)
Coulibaly P, Anctil F, Rasmussen P et al (2000) A recurrent neural networks approach using indices
of low frequency climatic variability to forecast regional annual runoff. Hydrol Process 14:2755–
2777
Ding J, Deng YR (eds) (1988) Stochastic hydrology. Chengdu University of Science and Technology
Press (in Chinese)
Heng T, Wang WS, Li LD et al (2002) Combing stochastic models base on wavelet transform and
its application to generation of runoff time series. Water Resources and Power 20(1):15–17 (in
Chinese)
Jin JL, Ding J (2002) Water resource systems engineering. Sichuan Science and Technology Press (in
Chinese)
Kang SJ, Henry L (2007) Wavelet analysis of hydrological and water quality signals in an agricultural
watershed. J Hydrol 338(1):1–14
Kottegota NT (1980) Stochastic water resources technology. The Macmillan Press LTD
Koutsoyiannis D (2000) A generalized mathematical framework for stochastic simulation and fore-
cast of hydrologic time series. Water Resour Res 36(6):1519–1533
Kumar P, Foufoula-Georgiou E (1993) A multicomponent decomposition of spatial rainfall fields
1. Segregution of large- and small-scale features using wavelet transforms. Water Resour Res
29(8):2515–2532
Labat D, Ababou R, Mangin A (2000) Rainfall-runoff relations for karstic springs Part II: continuous
wavelet and discrete orthogonal multi-resolution analyses. J Hydrol 238:149–187
Lall U, Sharma A (1996) A nearest neighbor bootstrap for resampling hydrological time series.
Water Resour Res 32(3):679–693
Li XB, Ding J, Li HQ (1999) Combing neural network models based on wavelet transform. J Hydr
Eng 2:1–4 (in Chinese)
Mallat SG (1989) A theory for multiresolution signal decomposition: the wavelet representation.
IEEE Trans PAMI 11(7):674–693
Raman, H et al (1995) Multivariate modeling of water resources time series using artificial neural
network. Hydrol Sci J 40(2):145–163
Rao RM, Bopardikar AJ (1998) Wavelet transforms, introduction to theory and applications.
Addison-Wesley, Reading
Sharma A, Tarboton DG, Lall U (1997) Stream simulation: a nonparametric approach. Water
Resour Res 33(2):291–308
Shensa MJ (1992) Discrete wavelet transform: wedding the A Trous and Mallat algorithm. IEEE
Trans Signal Process 40:2464–2482
Smith LC, Turcotte DL, Sacke BI (1998) Stream flow characterization and feature detection using a
discrete wavelet transform. Hydrol Process 12:233–249
Srikanthan R, McMahon, TA (1980) Stochastic generation of monthly flows for ephemeral streams.
J Hydrol 47(2):19–40
Srikanthan R, Sharma A, McMahon TA (2007) Closure to “Comparison of two nonparametric
alternatives for stochastic generation of monthly rainfall”. J Hydrol Eng © ASCE 12(6):701–702
Srinivas VV, Srinivasan K (2005) Matched block bootstrap for resampling multiseason hydrologic
time series. Hydrol Process 19:3659–3682
Tarboton DG, Sharma A, Lall U (1998) Disaggregation procedures for stochastic hydrology based
on nonparametric density estimation. Water Resour Res 34(1):107–119
Tong H (eds) (1990) Non-linear time series: a dynamical system approach. Clarendon Press, Oxford
Unal NE, Aksoy H, Akar T (2004) Annual and monthly rainfall data generation schemes. Stoch
Environ Res Risk Assess 18:245–257
Wang WS (1999) Nonparametric stochastic simulation and its application. Sichuan University, Ph.D.
Dissertation (in Chinese)
Wang WS, Ding J (2003) Wavelet network model and its application to the prediction of hydrology.
Nature and Science 1(1):63–67
Wang WS, Ding J (2007) A multivariate nonparametric model for synthetic generation of daily
streamflow. Hydrol Process 21:1764–1771
Wang WS, Ding J, Yuan P (1999) Nonparametric disagregation model and its application in hydro-
logical stochastic simulation. Sichuan Water Power 18(1):60–62 (in Chinese)
Wang WS, Yuan P, Ding J (2001) Threshold auto-regression model and its application to generation
in hydrology. Sichuan Water Power 20(suppl.):47–51 (in Chinese)
Wavelet Transform Method for Synthetic Generation of Daily Streamflow 57

Wang WS, Ding J, Li YQ (eds) (2005) Wavelet analysis in hydrology. Chemical Industry Press (in
Chinese)
Wang WS, Jin JL, Li YQ et al (eds) (2007) Stochastic simulation technology for Hydrology and
Water Resources. Sichuan University Press (in Chinese)
Wang WS, Jin JL, Li YQ (2009) Prediction of inflow at Three Gorges Dam in Yangtze River with
wavelet network model. Water Resour Manag 23:2791–2803. doi:10.1007/s11269-009-9409-2
Yevjevich V (1972) Stochastic process in hydrology. Water Resources Publications, Colorado
Yuan P, Wang ZY (1996) A new model for flood generation—bilinear time series model. Journal of
Sichuan Union University (Engineering Edition) 6:91–96 (in Chinese)

You might also like