You are on page 1of 16

128: Rainfall-runoff modeling: Transfer Function

Models
PETER C YOUNG
Centre for Research on Environmental Systems & Statistics (CRES), Lancaster University,
Lancaster, UK (Also, Centre for Resource & Environmental Studies, Institute of Advanced Studies,
Australian National University, Canberra ACT 2020, Australia)

This article discusses continuous and discrete-time transfer function (TF) models within the context of the Data-
Based Mechanistic (DBM) modeling of hydrological systems. Although, at a superficial level, TF models simply
provide a convenient and concise way of presenting differential or difference equations in an input-output form,
they are much more than this. In particular, they allow for the consideration of linear dynamic systems in simple
algebraic terms and for their analysis within a dynamic systems and control context. This is useful in various
ways, particularly where a higher order TF model is identified and estimated directly from hydrological data.
Often, this data-based model can then be decomposed into serial, parallel, and feedback connections of first-
order systems that can be interpreted in hydrologically meaningful terms, thereby facilitating the model’s use in
hydrological systems analysis. The article also shows that TF models can be considered in time-variable and
state-dependent parameter (SDP) form, so allowing for them to describe nonstationary and nonlinear systems.
And, since it is straightforward to consider all TF models in stochastic terms, they provide a powerful vehicle
for uncertainty, forecasting, and risk analysis. The practical utility and power of DBM TF models is illustrated
by two real hydrological examples.

INTRODUCTION (RF) processes. This is followed by a tutorial section that


discusses the formulation of linear, constant parameter TF
From a conceptual standpoint, most mathematical models models in the derivative operator s = d/dt, which are
of hydrological systems are formulated on the basis of simply the CT, TF form of ordinary differential equations.
natural laws, such as dynamic conservation equations, It then proceeds to develop and discuss the discrete-time
often expressed in terms of continuous-time (CT) linear (DT) equivalents of these CT models, namely, TFs in the
or nonlinear differential equations. The objective of Data- backward shift operator z−1 (see later text), before outlining
Based Mechanistic (DBM) modeling (e.g. Young and Lees, methods for the statistical identification and estimation of
1993; Young, 1998 and the prior references therein) is to both CT and DT models from noisy time series data. In this
infer the nature and structure of such models directly from manner, the models take on a natural stochastic form that is
hydrological data, using powerful methods of statistical appropriate to their use in applications such as hydrological
identification and estimation, and to then interpret the forecasting, uncertainty, and risk analysis. Two practical
model equations in physically meaningful terms. One examples help demonstrate the practical utility of the TF
important generic model class that facilitates such DBM approach in hydrology, one concerned with RF modeling
modeling studies is the Transfer Function (TF) family of and the other with the dynamics of solute transport and
models, which is the subject of this article. The article dispersion in rivers. Finally, the article also shows that TF
starts by reviewing briefly some of the background to the models can be extended to incorporate time-variable and
use of TF models in hydrology, concentrating on prior “state-dependent” parameters, so allowing them to describe
publications concerned with the modeling of rainfall-flow nonstationary and nonlinear stochastic systems.

Encyclopedia of Hydrological Sciences. Edited by M G Anderson.


 2005 John Wiley & Sons, Ltd.
2 RAINFALL-RUNOFF MODELING

TF MODELS IN HYDROLOGY very useful tool in HCM modeling because their outward
The most common application of TF models in hydrology “black-box” nature obscures an underlying ability to explain
is connected with the modeling of RF processes, where the more physically meaningful mechanisms that under-
the objective is to characterize the dynamic relationship lie this input-output description; mechanisms that can be
between rainfall and total measured flow in the river. Note revealed using the “system theoretic” procedures of TF
that the term “rainfall-flow” is used here, with an emphasis decomposition. In one important sense, TF models pro-
on the total flow in the river rather than the ubiquitous term vide a rather natural model form for hydrologists because
Rainfall-Runoff (RR). As we shall see later, this is because the impulse response of a continuous-time TF is equiva-
the most recent work on the application of data-based TF lent to the Instantaneous Unit Hydrograph (IUH); while the
models in this context has shown how they are able to pulse response of the discrete-time TF is equivalent to the
quantify other components of the flow; most importantly, discrete-time Unit Hydrograph (UH) (or TUH): see later
those arising from groundwater processes that dominate the Section “The unit hydrograph and finite impulse response
base flow in the river. TF models”.
The characterization of the nonlinear dynamic relation- System theoretic modeling ideas have had quite a strong
ship between rainfall and river flow is one of the most influence in hydrology over many years: to name but a
interesting and challenging modeling problems in hydrol- few contributions, for example, see Amorocho and Hart
ogy. It has received considerable attention over the past (1964); Salas et al. (1980); Wood (1980); O’Connell and
40 years, with mathematical and computer-based models Clarke (1981); Sorooshian (1983); Gupta (1984); Bras and
ranging from simple black-box representations to complex, Rodriguez-Iturbe (1985) and Singh (1988). However, such
physically based catchment models. Other articles in this models do not often attempt to represent the nonlinear
Encyclopedia will deal more comprehensively with this dynamics inherent in the transformation of rainfall to runoff
topic and there are many books that consider it, either in and, therefore, may not always perform well over long peri-
whole, or in part: for example, Anderson and Burt (1985: ods of time when the catchment storage and soil wetness is
see particularly, the chapter by Wood and O’Connell); Shaw changing over a wide range. Sometimes, piecewise linearity
(1994), Singh (1995) and Beven (2001). But, in attempting has been assumed to handle some aspects of the nonlinear
to consider the historical context of this topic, we should behavior (e.g. Natale and Todini, 1976a,b); while, in other
recall Beven’s comments in the latter book: cases, the model parameters have been allowed to vary with
“It is now virtually impossible for any one person to be aware
time (Young, 1974; Whitehead and Young, 1975; White-
of all the models that are reported in the literature, let alone head et al., 1976; Young and Wallis, 1985; Young, 1986).
know something of the historical framework of the different The latter Time-variable Parameter (TVP) approaches to
initiatives.” TF estimation have led on to a more overt treatment of
On this basis, it is clear that any treatment of such extensive nonlinearity, sometimes by the use of nonlinear black-box
and quite often controversial literature will tend to be models (e.g. the early example of Hsu et al., 1993), and
somewhat subjective. sometimes by the extension of TF models to allow for non-
Wheater et al. (1993) have categorized RF models into linearity, as considered in this article. As we shall see, this
four, broad types: conceptual models, physics-based mod- latter approach has the advantage of allowing for the easier
els, metric models and Hybrid Metric-Conceptual (HMC) inclusion of conceptual ideas and the retention of phys-
models. Conceptual and physics-based models tend to be ical meaning within the estimated model structure. This
driven by deterministic, reductionist thinking. As a result, desire to associate a physical meaning with TF models
they are often very large and suffer from problems of over- has a rather natural appeal to hydrologists and was proba-
parameterization that make rigorous statistical identification bly first adumbrated by Young (1974) and Whitehead and
and estimation from the available data difficult or even Young (1975). The idea of a Physically Realizable Trans-
impossible. At the other extreme, metric models, such as fer Function (PRTF: see Han, 1991) tackled this problem
the neural network (e.g. Hsu et al., 1993) and neuro-fuzzy by ensuring that the estimated TF model maintained mass
types (e.g. Jang et al., 1997), are the epitome of “black-box” conservation and did not have any nonphysical, oscillatory
modeling, revealing very little of their internal structure that modes of dynamic behavior. It was also the basis for the
has any physical meaning. work of Jakeman et al. (1990) and provided the philosophi-
The HMC models are an attempt to combine the ability cal and methodological underpinning of the DBM approach
of metric models to efficiently characterize the observa- to modeling RF processes (Young, 1993; Young and Beven,
tional data in statistical terms (the “principle of parsimony” 1994) that is discussed further in the following sections of
(Box and Jenkins, 1970); or the “Occam’s Razor” of antiq- this article.
uity), with the advantages of simple conceptual models that Finally, it must be emphasized that TF models are not
have a prescribed physical interpretation within the cur- restricted to RF processes: since they provide a generic
rent scientific paradigm. Transfer functions have become a tool for modeling linear and nonlinear, stochastic dynamic
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 3

systems, they can be applied to data-based modeling T , that defines the Residence Time of the model. As we
problems in many areas of science, engineering, and the shall see later, combinations of two or more such first-
social sciences (see the wide variety of examples in Young, order models exhibit a typical UH form (e.g. Dooge, 1959;
1998). In the hydrological context, they have already been Beven, 2001).
used for flow routing down rivers (see Section “Continuous- By introducing the derivative operator s, that is, s =
time TF models”) so that, together with the rainfall-flow d/dt, and collecting like-terms together, it is easy to see
TF models, they allow for characterization of a whole river that equation (2) can be written as,
catchment. As we shall see in a later example, TF models
are also fundamental to the Aggregated Dead Zone (ADZ) (1 + T s)Qo (t) = GQi (t − τ )
modeling of pollution transport and dispersion in rivers and
soils. Given this flexibility, it seems likely that they will be so that, dividing throughout by 1 + T s, we obtain the
used even more widely in the future as a basic tool for following continuous-time TF form of equation (2),
modeling in hydrological and environmental sciences.
G
Qo (t) = Qi (t − τ ) (3)
1+Ts
CONTINUOUS-TIME TF MODELS where,
G
In order to introduce TF models, let us consider first a H (s) =
1+Ts
conceptual catchment storage equation in the form of a
continuous-time, linear storage (tank or reservoir) model: represents the TF in terms of the derivative operator s. Note
see, for example, the review papers by O’Donnell, Dooge that the same letter s (or sometimes p) is used to represent
and Young in Kraijenhoff and Moll (1986); or more the related Laplace transform operator. Considered in these
comprehensive treatments, such as the recent books by Laplace transform terms, it is possible to utilize Laplace
Beven (2001) and Dooge and O’Kane (2003). Here, the transform methods to handle initial conditions on the
rate of change of storage in the channel is defined in terms variables in the model and analytically compute its response
of water volume entering the linear storage element (e.g. (here Qo (t)) to variations in input variable (here Qi (t)).
river reach) in unit time, minus the volume leaving in the However, this is not essential in the present context,
same time interval, namely, although the interested reader should find the article a good
primer for the study of Laplace transform methods (e.g.
dS(t) Schwarzenbach and Gill, 1979).
= GQi (t − τ ) − Qo (t) (1)
dt
Physically Interpretable Parameters
where Qi (t − τ ) represents the input flow rate delayed by
a pure time or “transport” delay of τ time units to allow The TF model 3 is characterized by three parameters: G,
for pure advection; and G is a gain parameter inserted T , and τ . However, there are five, physically interpretable
to represent gain (or loss) in the system. Making the model parameters associated with the model that are worth
reasonable and fairly common assumption that the outflow discussing. The Steady State Gain (SSG), denoted by G,
is proportional to the storage at any time, that is, is obtained by setting the s operator in the TF to zero
(i.e. d/dt = 0 in a steady state). It shows the relationship
Qo (t) = T S(t) between the equilibrium output and input values when a
steady input is applied. For this reason, if the input and
and substituting into (1), we obtain, output have similar units, G is ideal for indicating the
physical losses or gains occurring in the system. In the case
dQo (t) of a flow-routing model, for example, it indicates whether
T = GQi (t − τ ) − Qo (t) (2)
dt water has been added (G > 1) or lost (G < 1) between the
upstream and downstream boundaries; and the percentage
This equation is a first-order, linear differential equation of water lost or gained can be defined by Loss Efficiency
model whose response, from an initial steady flow condi- LE = 100(1 − G), which will be negative if G > 1.0. As
imp
tion, to a unit impulsive change Qi of the input flow at pointed out above, the Residence Time or Time Constant T
time t = t0 , is given by is the time required for the storage element output to decay
to e−1 or 0.3679 of its maximum value in response to an
Qo (t) = Qe + Qi e−(t−t0 )/T
imp
impulsive input. Finally, the pure Advective Time Delay τ
indicates the time it takes for a flow increase upstream to
where Qe is the initial steady flow level. This has a typical be first detected downstream: and Tt = T + τ defines the
hydrograph recession shape, with a decay Time Constant, Travel Time of the system. These five parameters typify the
4 RAINFALL-RUNOFF MODELING

equilibrium and dynamic characteristics of the TF model Rainfall-flow


and provide a physical interpretation of the TF model in subsystem
terms of its mass transfer and dispersive characteristics. G1 Flow routing
subsystem
1 + T1s
TF Manipulation and Block Diagrams u (t ) G3 G3 y(t)
1 + T3s 1 + T3s
The first-order TF model (3) is often written in the form, G2
g0 g0 1 + T2s
Qo (t) = Qi (t − τ ) i.e. H (s) = (4)
s + f1 s + f1
where, Figure 1 Block diagram of a hypothetical catchment
G 1 model consisting of a parallel pathway, rainfall-flow
g0 = ; f1 =
T T subsystem in series with a two reach, flow-routing
subsystem
because this is the form in which the model is normally
estimated (see later). Typically, a Channel or Flow-routing
model for a river catchment will contain a number of again of the form (3) with τ = 0, but this time connected
elemental models, such as 3 or 4, connected in a manner in series as a “Nash Cascade”. The upstream input to
that relates to the structure of the catchment. For instance, a this complete system is an effective rainfall measure,
serial connection of n such elements constitutes the lag-and- represented by u(t), that is generated from the gauged
route model of a single river channel (Meijer, 1941; Dooge, rainfall r(t) (not shown) by a nonlinear relationship N (s)
1986) and, with τ = 0 and all elements identical, it becomes involving a measure of the catchment wetness (see later
the well known “Nash Cascade” model (Nash, 1959). More example). The downstream output of the complete system
complex river systems can be represented by a main channel is denoted by y(t).
of this type, with tributaries modeled in a similar manner. One advantage of the TF formulation of the model shown
Also, a typical TF model between effective rainfall and in Figure 1 is that it allows for the computation, using Block
flow often contains a parallel connection of two or more Diagram Algebra, of a single, multiorder TF that represents
such storage elements (see Young, 1992; 2001b,c, 2002a, the total system. Here, TFs connected in parallel are
2003). Related serial and parallel arrangements characterize additive; while those connected in series are multiplicative.
the mass conservation equation of the ADZ model for the Consequently, in this case, the two submodels can be
transport and dispersion of a solute in a river channel (e.g. represented by the following composite TFs:
Wallis et al., 1989 and the prior references therein). And
ADZ-type models can lead to more complex interactive Hrf (s) = H1 (s) + H2 (s); Hff (s) = H3 (s).H3 (s)
water quality models, including chemical and biological
interaction, such as dissolved oxygen-biological oxygen So that, with the above definitions of H1 (s), H2 (s) and
demand (DO-BOD) models (e.g. Beck and Young, 1975; H3 (s),
Whitehead and Young, 1975; Young, 1999b). Practical
examples of both RF and ADZ modeling are described G1 G2
later in Section “STATISTICAL IDENTIFICATION AND Hrf (s) = +
1 + T1 s 1 + T2 s
ESTIMATION OF TF MODELS”.
(G1 + G2 ) + (G1 T2 + G2 T1 )s
The TF formulation allows for the visual representation =
of a total system model in the form of a Systems Block (1 + T1 s)(1 + T2 s)
Diagram. Figure 1 is a typical example of such diagram
that represents a catchment model consisting of a effective and
(G3 )2
RF submodel involving two different first-order TFs of the Hff (s) =
form (3) with τ = 0, (1 + T3 s)(1 + T3 s)

G1 G2 Now, since Hrf (s) and Hff (s) are connected in series, the
H1 (s) = ; H2 (s) =
1 + T1 s 1 + T2 s TF of the total system H (s) is obtained as the multiplication
of these two composite TFs: that is,
that are connected in parallel. The flow output of this
submodel forms the input to a flow-routing submodel (G1 + G2 ) + (G1 T2 + G2 T1 )s
composed of two identical, first order TFs, H (s) = Hrf (s).Hff (s) =
(1 + T1 s)(1 + T2 s)
G3 (G3 )2
H3 (s) = ×
1 + T3 s (1 + T3 s)(1 + T3 s)
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 5

Multiplying out these expressions, we see that the complete where η(t) = F (s)ξ(t) is a modified noise signal generated
H (s) is a 4th order TF that can be manipulated to the form: by the manipulation of the equation. The structure of this
model, in either form (6) or (7), is defined by the triad
g0 + g1 s [p q τ ].
H (s) = (5)
s + f1 s + f2 s 2 + f3 s + f4
4 3
If the noise ξ(t) can be assumed to have specific
statistical characteristics, then this can help in obtaining low
where we will leave the definition of the parameters fi , i = or minimum variance estimates of the TF model parameters.
1, 2, . . . , 4 and gj , j = 0, 1 in 5, as an exercise for the But, as we see later when discussing the identification
reader (hint: carry the above analysis with the TF represen- and estimation of TF models, such assumptions are not
tation 4 rather than 3: then it will be clear why the former an essential requirement of statistical estimation and good,
representation is better for analysis, although it lacks the low variance parameter estimates can be obtained without
direct physical interpretation of the latter, which provides invoking them. Also, depending on the objectives of
a better form for the block diagram representation). the modeling study, it may be necessary, in a complete
system consisting of many sub-elements such as (3) or
The General, Multi-Order TF Model (4), to consider noise inputs within the system, associated
It is clear from the above example that, in general, serial, with collections of subelements that have distinct physical
parallel (or even feedback) connections of elemental first meaning: for example, stochastic lateral inflows.
order TF models, such as 3 or 4, lead to a multi-order
TF model. The block diagram algebra for a feedback Discrete-time, Sampled Data TF Models
connection is a little more complicated but, since it is not
To date, the most popular form of TF modeling has been
particularly relevant in the current context, the interested
carried using the DT equivalents of the models (4) and (6).
reader should consult a standard text on the subject (e.g.
In the case of equation (4), this discrete-time TF model
Schwarzenbach and Gill, 1979) to find the details. In the
takes the form:
present context, the general multi-order TF form is as
follows: b0
Qo,k = Qi,k−δ (8)
G(s) 1 + a1 z−1
x(t) = u(t − τ ) y(t) = x(t) + ξ(t) (6)
F (s) Here, Qo,k is the downstream flow measured at the kth
sampling instant, that is at time kt, where t is the
where F (s) and G(s) are polynomials in s of the follow-
sampling interval in time units. Qi,k−δ is the input flow
ing form:
at time (k − δ)t time units previously, where δ is the
F (s) = s p + f1 s p−1 + f2 s p−2 + · · · + fp advective time delay, normally defined as the nearest inte-
ger value of τ/t (thus incurring a possible approximation
G(s) = g0 s q + g1 s q−1 + g2 s q−2 + · · · + gq error); and z−1 is the backward shift operator, that is,
z−r Qo,k = Qo,k−r . Of course, this model can be written
in which p and q can take on any positive integer val- in its “difference equation” form, namely:
ues. Here u(t) and x(t) denote the deterministic input and
output signals of the system at its upstream and down- Qo,k = −a1 Qo,k−1 + b0 Qi,k−δ (9)
stream boundaries, respectively; τ is a pure advective time
(transport) delay affecting the input signal u(t); and y(t) which is obtained by simple cross multiplication and
is the observed output, which is assumed to be contami- application of the z−1 operator. This reveals that the flow
nated by a noise or stochastic disturbance signal ξ(t). This Qo,k at the kth sampling instant is a proportion −a1 (note
noise is assumed to be independent of the input signal that, in the present context, a1 will be a negative number
and it represents the aggregate effect, at the downstream less than unity, so that this is a positive proportion) of its
boundary, of all the stochastic inputs to the system, includ- value Qo,k−1 at the previous (k − 1)th sampling instant, plus
ing distributed unmeasured inputs, measurement errors, and a proportion b0 of the delayed upstream flow input Qi,k−δ
modeling error. Multiplying throughout equation (6) by measured δ sampling instants previously.
F (s) and converting the resultant equation to alternative The values of the parameters a1 and b0 in equations (8)
ordinary differential equation form, we obtain: and (9) can be related to the parameters of the model
(4) in various ways depending upon how the input flow
d p y(t) d p−1 y(t) Qi (t) is assumed to change over the sampling interval
+ f1 + · · · + fp y(t)
dt p dt p−1 between the measurement of Qi,k−1 and Qi,k (since it is
d q u(t − τ ) not measured over this interval). The simplest and most
= g0 + · · · + gq u(t − τ ) + η(t) (7) common assumption is that it remains constant over this
dt q
6 RAINFALL-RUNOFF MODELING

interval (the so-called zero-order hold, ZOH, assumption), Despite this disadvantage, some hydrologists have used the
in which case the relationships are as follows: equation directly in RF modeling by considering it in a
Finite Impulse Response (FIR) form, in which the upper
g0 limit of the summation is set to some value r < ∞ where
a1 = − exp(−f1 t) b1 = {1 − exp(−f1 t)} (10)
f1 it is considered that the ordinates of the impulse response
have become small enough to be ignored.
Note that, because these relationships are functions of the Unfortunately, it can be shown that the number of FIR
sampling interval t, for every unique CT model such as model parameters (or “weights”), gi , i = 1, 2, . . . , r, is
(4), there are infinitely many DT equivalents (8), depending nearly always much larger than the number of parameters
on the choice of t, all with different parameter values in the rational TF form (11). As a result, the statistical esti-
defined in equation (10). Following from the definition of mates of these parameters will normally have unacceptably
this first-order DT model at the chosen t, the general high variance and have to be constrained in some man-
multiorder equivalent of the general CT model (6) is defined ner. For instance, Natale and Todini (1976) use quadratic
as follows: programming to compute the constrained least squares esti-
B(z−1 ) mates of the FIR parameters in order to ensure that they are
xk = uk−δ yk = xk + ξk (11) all positive and, if necessary for conservation purposes, they
A(z−1 )
all sum to unity (i.e. the model has unity steady state gain).
where, Even with this approach, however, it is difficult to recom-
mend the FIR model because the rational TF model (11),
A(z−1 ) = 1 + a1 z−1 + a2 z−2 + · · · + an z−n normally with far fewer parameters, is not only entirely
equivalent to the infinite dimensional IR model, without
B(z−1 ) = b0 + b1 z−1 + b2 z−2 + · · · + bm z−m any approximation, it is also easier to estimate from RF
data using the methodology discussed in the next Section
Normally n = p but m may be equal or greater than q. The “STATISTICAL IDENTIFICATION AND ESTIMATION
structure of this DT model is defined by the triad [n m δ]. OF TF MODELS”.
Note finally that the TFs in both (11) and (3) are composed
of ratios of polynomials (in z−1 and s respectively), so they
are sometimes referred to as Rational Transfer Functions. STATISTICAL IDENTIFICATION AND
ESTIMATION OF TF MODELS
The Unit Hydrograph and Finite Impulse Response In statistical terms, the modeling of a system using a TF
TF Models model is a problem of time-series analysis (e.g. Box and
The reader can verify that the division of the numer- Jenkins, 1970; Young, 1984) that involves identification
ator polynomial B(z−1 ) by the denominator polynomial and estimation of the model from time-series data. Here,
A(z−1 ) in the TF model (2) normally results in a infinite Identification means the definition of the most appropriate
dimensional polynomial G(z−1 ) = g0 + g1 z−1 + g2 z−2 + model structure: in particular, the values of the elements
· · · + g∞ z−∞ , so that the equation can also be written in in the triads [p q τ ] and [n m δ]. Estimation is then
the alternative form: the estimation of the parameters that characterize this
structure (fi , i = 1, 2, . . . , p; gj , j = 0, 1, . . . , q; and τ in

 the CT case; ai , i = 1, 2, . . . , n; bj , j = 0, 1, . . . , m; and
yk = gi uk−i + ξk ; gj , j = 0, 1, . . . , δ − 1 = 0 (12) δ in the DT case). Much has been written about the
i=δ
identification and estimation of the DT model (11) but
This will be recognized as the discrete-time form of the much less on the CT model 6. This is almost certainly
convolution integral equation associated with the solution for two reasons: first, it is easier to analyze the DT
of differential equations (here with a pure time delay of δ model in statistical terms; second, in these days of digital
sampling intervals or δt time units) and, once again, we instrumentation and computers, the DT model provides a
see that the TF model is simply a discrete-time equivalent more convenient model form. For these reasons, attention
of a continuous-time differential equation model. will be concentrated initially on the DT model (11) and we
Equation (12) has important connotations in hydrology will return to the more physically transparent CT modeling
because, if the noise ξk = 0 for all k, then its unit impulse later.
response (that is the response of the equation to a unit
Discrete-time TF Model Identification and
impulse input: uk = 1.0 for k = 1; uk = 0 for all k > 0),
Estimation
is equivalent to the hydrological UH. Sometimes, indeed,
the equation (12) is referred to as a TF model, although Much has been written on the identification and estimation
its infinite dimensional nature is an obvious restriction. of discrete-time TF models and the reader is advised to
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 7

consult one or more of the many texts on the subject (e.g. Despite the popularity of this ARMAX model in the
Ljung and Söderstrom, 1983; Young, 1984; Norton, 1986). systems literature, there are good theoretical and practical
Here, we will not attempt to review this literature but simply reasons for preferring the model (13), as discussed below.
refer briefly to a number of topics that are particularly
pertinent to the use of TF models in hydrology. Refined Instrumental Variable (RIV) Estimation
The Box– Jenkins Model One approach to TF model estimation that is simpler and
The best known work on the identification and estimation more robust, because it does not rely on the assumption
of the DT model (11) is by Box and Jenkins (1970: that the noise has rational spectral density, is the Refined
see Chapter 10), to which the reader is directed for Instrumental Variable (RIV) method, which has proven
a comprehensive description of how the model can be particularly useful and robust in hydrological applications.
estimated using the Maximum Likelihood (ML) approach, The RIV approach (see Young and Jakeman, 1979; Young,
under the assumption that the noise ξk has rational spectral 1984) is based on the TF models (11) and (13) and
density. This assumption leads to the formulation of the was initially developed with environmental applications
following Box –Jenkins model: in mind, where robustness to the kinds on nonstandard
conditions met in the environment, including hydrology,
B(z−1 ) D(z−1 ) is very important. The RIV method has three primary
yk = uk−δ + ξk ξk = ek ek = N (0, σ 2 ) advantages that are pertinent to the discussion here.
A(z−1 ) C(z−1 )
or, 1. In the case of the full Box–Jenkins model (13) and
following from an important theorem by Pierce (1972),
B(z−1 ) D(z−1 ) the estimates of the “system” parameters in A(z−1 )
yk = −1
uk−δ + −1
ek ek = N (0, σ 2 ) (13)
A(z ) C(z ) and B(z−1 ) are asymptotically independent of those
for the “noise” parameters in C(z−1 ) and D(z−1 ). This
where, has important consequences on parameter estimation
C(z−1 ) = 1 + c1 z−1 + c2 z−2 + · · · + cs z−s (Jakeman and Young, 1981, 1983) that justify the
RIV estimation of the TF model (13) rather than the
D(z−1 ) = 1 + d1 z−1 + d2 z−2 + · · · + dr z−r ARMAX model (14). They also expose deficiencies in
the ARMAX model estimates if the data are generated
and ek is defined as normally distributed “white noise”, by the model (13).
defined in statistical terms as N (0, σ 2 ): that is a zero mean, 2. Like all instrumental variable methods (see e.g. Young,
serially uncorrelated sequence of normally distributed ran- 1984), the RIV approach does not necessarily require
dom variables with variance (an additional unknown param- the concurrent estimation of a noise model and, indeed,
eter) σ 2 . We see, therefore, that the “colored” noise ξk is remains statistically efficient (i.e. the parameter esti-
assumed to be generated from the white noise ek by passing mates are optimal in the sense that amongst all esti-
it through a “filter” that has a similar TF form to the sys- mates, they have the minimum variance) if the noise
tem model, but with both polynomials in z−1 having leading ξk is itself white (i.e. C(z−1 ) = D(z−1 ) = 1.0). Used in
unity elements (monic polynomials). This particular type of this manner, the RIV algorithm is, for obvious reasons,
filter is called a AutoRegressive Moving Average (ARMA) called the Simplified RIV (SRIV) algorithm (Young,
model (see Box and Jenkins, 1970). With the noise process 1985).
defined in this manner, it is clearly necessary to identify the 3. As pointed out by Wellstead (1978) and considered fur-
order of the noise model polynomials C(z−1 ) and D(z−1 ) ther by the present author and his colleagues (Young
and estimate the associated parameters ci , i = 1, 2, . . . , s et al., 1980; Young, 1989), model structure identifica-
and di , i = 1, 2, . . . , r and σ 2 . tion (definition of the triad [n m δ]) is greatly facilitated
The ARMAX Model by the special properties of the Instrumental Product
In the systems and control literature, the most widely Matrix (IPM) that is a fundamental component of IV
used approach to DT model estimation is probably the estimation and has optimal statistical properties in the
Prediction Error Minimization (PEM) method (e.g. Ljung case of RIV/SRIV estimation. This has been exploited
and Söderstrom, 1983), which is the preferred method in the in the development of the Young Information Criterion
Matlab Identification Toolbox. PEM estimation is most (YIC) order identification criterion mentioned later.
often applied to the alternative AutoRegressive Moving 4. The RIV/SRIV algorithms for DT models utilize a
Average eXogenous (ARMAX) TF model form: that is, special form of adaptive prefiltering. This is further
exploited in the RIV method for continuous-time
A(z−1 )yk = B(z−1 )uk−δ + C(z−1 )ek ; ek = N (0, σ 2 ) systems (the RIVC algorithm: Young and Jakeman,
(14) 1980; Young, 2002b) to bypass the need for direct
8 RAINFALL-RUNOFF MODELING

differentiation of the input and output signals. This variance of the output data explained by the model). Note
algorithm is discussed further in the following section. that RT2 should not be confused with its well known rel-
ative R 2 , as used in classical regression and time-series
Continuous-time TF Model Identification and analysis, which is defined in terms of the one-step-ahead
Estimation prediction errors. In general, RT2 is a more discerning mea-
sure of model adequacy in a dynamic systems context than
Following on the point 4. in the previous subsection, the R 2 since it quantifies the ability of the model to explain
statistical estimation of the CT model (6) is straightforward the whole of the output data from the input data, without
if completely continuous-time data are available. However, any reference to the output data. In contrast, R 2 only mea-
the hydrologist is normally confronted by discrete-time, sures the ability of the model to predict one-step-ahead on
sampled data and the problem of modeling continuous-time the basis of the latest input and output data. The second
models such as this, based on discrete-time, sampled data statistic is the YIC, which is a heuristic measure of model
at sampling interval t, is not so obvious. This problem identifiability and is based on the properties of the IPM
can be approached in two main ways. matrix (see the previous discussion). These statistics are
• The Indirect Approach: Here, the identification and discussed in more detail in Young (1989) and Appendix 3
estimation steps are first applied to the DT model (11) of Young (2001b).
or (13). This estimated model is then converted to the
CT model (6), again using some assumption about the TWO PRACTICAL EXAMPLES
nature of the input signal u(t) over the sampling interval
t. In the first-order, ZOH case, this conversion is given TF models can be applied in many areas of hydrology
by the relationships in equation (10) but, in more general and provide a good basis for DBM modeling. Two typical
terms, the multiorder conversion must be carried out in examples are outlined in this section. The first is a TF
a computer, using a conversion routine such as the d2cm model of the relationship between effective rainfall and
algorithm in the Matlab Control Toolbox. flow in an Australian river catchment. This DBM model
could provide a direct basis for flow forecasting at the flow
• The Direct Approach: Here, the RIVC algorithm, measurement station or it could be extended to incorporate
together with an appropriate order identification criterion flow-routing TF elements down the river in order to allow
(see the following text), is used to identify the most appro- for forecasting over the whole catchment area (that may
priate, identifiable CT model structure defined by the triad involve other RF modules). The second example is an
[p q τ ]; and then estimate the TF parameters fi , i = ADZ model for the transport of a conservative dye tracer
1, 2, . . . n, gj , j = 0, 1, . . . m and τ that characterize this in a Florida wetland area. This DBM model describes the
structure. Of course, some approximation will be incurred in physical aspects of solute advection and dispersion in the
this estimation procedure because the inter-sample behavior wetland and so could provide the physical basis for more
of input signal u(t) is not known and it must be interpo- complex, nonconservative water quality modeling involving
lated over this interval in some manner (see the preceding biological and chemical mechanisms.
text).
Rainfall-flow Modeling
Model Structure Identification Figure 2 shows a portion of effective rainfall u(t) and flow
Finally, as mentioned previously, an important aspect of TF y(t) data from the River Canning, an ephemeral River in
modeling, in both continuous and discrete-time, is Model Western Australia. A longer set of these data has been
Structure Identification (the definition of the model struc- analyzed comprehensively in Young et al. (1997), using DT
ture triads). This can be approached in various ways. For modeling. The effective rainfall series plotted in part (b) of
example, Box and Jenkins (1970) discuss the topic at length Figure 2 is generated from the effective rainfall nonlinearity
and Akaike (1974) suggested an alternative approach that identified in this earlier study, with the effective rainfall
has since spawned several other, related methods. Within generated by the equation
the context of the instrumental variable methods discussed u(t) = r(t)y(t)γ (15)
previously, however, model structure identification is based
conveniently on two statistical criteria. First, the Coeffi- where r(t) is the measured rainfall and γ is a power law
cient of Determination, RT2 , a normalized measure based parameter. Here, the flow y(t) is acting as a surrogate
on the variance of the error between the sampled output measure of the catchment storage, rather than attempting to
data yk and the simulated (CT or DT) model output at the model this storage separately in some conceptual manner
same sampling instants (this is normally equivalent to the (see e.g. Jakeman et al., 1990; Wheater et al., 1993; Young
well known Nash-Sutcliffe Efficiency measure (Nash and 2001b, 2003). In the following, we obtain direct and
Sutcliffe, 1970); and 100 × RT2 is the percentage of the indirectly estimated, linear CT models between the effective
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 9

4 occurs in the very short term (within one sampling interval;


here one day); (ii) a quick pathway, accounting for 54.1%
Flow (cumecs)

3
of the flow, that represents a linear store with a residence
2 time of about 2.53 days, probably the result of surface and
shallow, subsurface processes in the catchment; and (iii) a
1
slow or base-flow pathway, accounting for 38.5% of the
0 flow, that passes through a much longer 15.9 day residence
(a) 1977.2 1977.3 1977.4 1977.5 1977.6 1977.7 1977.8
time, linear store, probably representing the effects of
80 deeper groundwater processes and the displacement of old
Effective rainfall (mm)

60
water.

40 Indirect CT Identification and Estimation


20
The first step of indirect CT modeling involves DT identifi-
cation using the RIV algorithm (see earlier). This identifies
0 a similar [2 3 0] structure TF with the following parameter
1977.2 1977.3 1977.4 1977.5 1977.6 1977.7 1977.8
(b) Time (h)
estimates:

â1 = −1.6034 (0.008) â2 = 0.6244 (0.007)


Figure 2 Daily effective rainfall (b) and flow (a) data for
the River Canning in Western Australia. The dotted line b̂0 = 0.0140 (0.001) b̂1 = 0.0151 (0.002)
appears to me to have come out as a grey line (a) is the b̂2 = 0.0252 (0.0013)
simulated output of the DBM model (see text)
However, since this is a discrete-time model, it has to
be converted to continuous-time form. The d2cm algo-
rainfall u(t), as defined in the above manner, and the
rithm in Matlab (see Section “CONTINUOUS-TIME TF
flow y(t). Of course, the DT model is useful in its
MODELS”.) accomplishes this conversion, using a ZOH
own right and provides an excellent basis for applications
approximation (i.e. the input effective rainfall is assumed
such as flow forecasting (see e.g. Lees et al., 1994; Lees,
to be constant over the sampling interval). The resulting
2000a,b; Young, 2001c, 2002a). However, the transparency
CT model parameter estimates are:
of the CT model in revealing, more obviously, the physical
interpretation of the model makes it more useful as a vehicle fˆ1 = 0.4711 fˆ2 = 0.0264
in DBM modeling and the use of the model in wider ĝ0 = 0.0140 ĝ1 = 0.0514 ĝ2 = 0.0049
catchment studies.
Direct CT Identification and Estimation In this case, the model is quite similar to the directly
estimated CT model in the previous subsection (i). But note
Let us consider first identification and estimation based that there are no standard errors quoted here: this is because
on the measured daily data shown in Figure 2. The RIVC these are not available after the conversion and must be
algorithm, in combination with the RT2 and YIC statistics, computed separately in some manner, usually by the less
identifies a [2 3 0], second order model with the parameter convenient and lengthy Monte Carlo simulation analysis.
estimates: This lack of directly available estimation statistics is
fˆ1 = 0.457 (0.032) fˆ2 = 0.0248 (0.0045) not the only disadvantage of the indirect approach. Unlike
ĝ0 = 0.0138 (0.001) ĝ1 = 0.0505 (0.002) the CT model parameters, which are independent of the
ĝ2 = 0.0046 (0.0008) sampling interval t, the estimated DT model parameters
are a function of t, so that different numerical estimates
where, here and elsewhere in the paper, the figures in of the parameters are obtained for different values of
parentheses are the estimated standard errors obtained from t. Moreover, the quality of these estimates is also a
the RIVC algorithm. This model output (shown as the function of t: in particular, if the sampling interval is too
dotted line in (a) of Figure 2) explains the data very small, then the estimates are poorly defined statistically; the
well with RT2 = 0.980 (98% of the output flow variance resulting CT model parameter estimates will be impaired;
explained by the simulated CT model output) and a residual and the whole estimation process is less robust (see Young,
variance of 0.00723 (standard deviation 0.085 cumecs, 2004a, for discussion on the reasons for this). Conveniently,
where the maximum flow is 3.86 cumecs). The model it is under these same, rapid sampling (small t) conditions
can also be decomposed by standard TF decomposition that the direct CT approach works best. So, in practical
(see earlier text) into a parallel pathway form. The three terms, indirect CT model estimation works very well
pathways are: (i) an instantaneous effect, accounting for when the data are not sampled too rapidly but direct CT
7.4% of the flow, which is a measure of rapid flow that estimation is preferable at rapid sampling intervals.
10 RAINFALL-RUNOFF MODELING

Aggregated Dead Zone (ADZ) Modeling of Tracer to be “not persistently exciting”), the SRIV algorithm has
Data no difficulty identifying and estimating a low order model.
The best identified TF, based on the RT2 and YIC criteria, is
Tracer experiments can provide important information on either 3rd or 4th order, but subsequent analysis, described
how solutes are transported and dispersed in hydrological later, suggests that the latter is superior from a physical
systems. This example is concerned with TF modeling of standpoint. The estimated [4 2 22] (4th order denominator,
the input-output data shown in Figure 3 (TF modeling is 2nd order numerator and a 22 sampling interval pure time
also applied to a different set of tracer data in the arti- delay) takes the form:
cle Chapter 134, Downward Approach to Hydrological
Model Development, Volume 3). These tracer data were
B̂(z−1 )
obtained from a bromide tracer experiment carried out in a yk = uk−22 + ξk (16)
Florida wetland area receiving treated domestic wastewater Â(z−1 )
for further nutrient removal. The experiment was part of
a study carried out by Chris Martinez and Dr. William R. where,
Wise of the Environmental Engineering Sciences Depart-
ment, University of Florida for the City of Orlando. The Â(z−1 ) = 1 − 3.67z−1 + 5.06z−2 − 3.11z−3 + 0.72z−4
bromide tracer was injected 765 meters upstream of a weir, B̂(z−1 ) = 0.00103 − 0.00101z−1
at which samples were taken with a sampling interval t of
2 h. As we shall see, TF models are not only able to explain Here the “hat” denotes the estimated value; yk is the
these tracer data well, but following the requirements of observed tracer concentration at the weir and uk is the
DBM modeling, they can also be interpreted in ADZ model impulsive input of tracer (186.33 mg l−1 ), both measured at
terms that have physical meaning (see Wallis et al., 1989). the kth sampling instant. Note that, although the standard
Here, we will consider the discrete-time TF model and errors on the parameters are available they are not cited
utilize the SRIV algorithm (see earlier text) to identify specifically. However, the uncertainties on the physically
the model order and estimate the parameters. However, meaningful, derived ADZ model parameters are discussed
continuous-time TF estimation using the continuous-time later. Note also that the large advective time delay of 22
SRIV algorithm yields similar (albeit not identical) results sampling intervals (44 h) is the time taken for the solute to
and the discrete-time analysis is used here simply for illus- first reach the weir. The noise ξk is small and so the model
tration. explains the data very well with RT2 = 0.997 (i.e. 99.7% of
Although the impulsive (“gulp”) input does not perturb the output variance is explained by the model).
the system continually (in the systems literature, it is said Unfortunately, despite its ability to describe the data
very well, the model (16) is not immediately acceptable
Wetland tracer experiment data (Martinez, 2001) from a DBM standpoint, primarily because the eigenvalues

are {0.988, 0.964, 0.860 ± 0.132j }, where j = (−1) is
4.5 the complex number. Here, the pair of complex roots are
4 associated with oscillatory response and this is difficult to
justify in ADZ modeling terms. In particular, the elemental,
Tracer concentration (mg L−1)

3.5 single reach ADZ model is a first-order differential equation


3 based on mass conservation (Wallis et al., 1989) and so,
other than in exceptional circumstances, multiple reach
2.5
ADZ models must be characterized by real eigenvalues
2 when considered in TF terms.
1.5
In the present circumstances, the most obvious approach
is to re-estimate the model in a form where the eigenvalues
1 are constrained to be real. This was carried out by means of
0.5 constrained, nonlinear least squares optimization using the
leastsq optimization procedure in Matlab. To ensure that the
0 most parametrically efficient model was obtained, both [3 2
100 200 300 400 500 600 700 800 900
Time (h) 22] and [4 2 22] models were considered in this analysis but
the latter yielded much the best constrained model, which
Figure 3 Wetland tracer experiment data: the input uk has the following form:
is an impulsive or ‘‘gulp’’ application of bromide tracer
(dashed line) and the output yk is the concentration of
bromide measured every 2 h at a downstream weir (full B̂(z−1 )
yk = uk−22 + ξk (17)
line) Â(z−1 )
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 11

where, obtained by partial fraction expansion of the TF in equation


(17). This decomposition is shown in the block diagram of
Â(z−1 ) = (1 − 0.980z−1 )(1 − 0.855z−1 )3 Figure 5 which, it will be noted, has exactly the same form
as Figure 1, except for the addition of the pure advective
B̂(z−1 ) = 0.00127 − 0.00121z−1
time delay, τ , demonstrating the widely applicable, generic
nature of TF models such as these.
This model is well defined statistically and it explains
The block diagram in Figure 5 shows that the solute is
99.7% of the experimental data (RT2 = 0.997), the same as
partitioned initially into a parallel pathway form with a
the unconstrained model (1). Figure 4 compares the model
“quick-flow” pathway, consisting of a single ADZ reach
output (full line) with the measured output yk (circular
with a residence time of 12.8 h, and a “slow-flow” pathway
points).
with a much longer residence time of 99 h. The two
Unlike the TF model (16), the model (17) not only has pathways then add together and the solute passes further
four real eigenvalues, as required, but three of these are downstream, through two more ADZ reaches, each with
repeated, so defining three identical ADZ reaches. These a residence time of 12.8 h. The total travel time for this
eigenvalues define ADZ residence times (time constants) complete system is 181.5 h (the sum of the 44 h advective
of 99 h and 12.8 h (×3), giving a total estimated residence time delay and the cumulative overall time constant of
time for the wetland cell of 137.4 h (99 + 3 × 12.8). 137.4 h). This means that the dispersive fraction (see Wallis
One particular, physically meaningful, decomposition and et al., 1989; Young and Wallis, 1994; Young, 1999a) is
interpretation of the model defined in this manner is again 0.76 (i.e. 137.4 ÷ 181.5): in other words, 76% of the water
appears to be effective in dispersing the solute. This is a
4.5 very high proportion, reflecting the nature of the system
in this case, with a much higher potential for dispersion
4 of tracer than in normal, faster moving streams, where
the dispersive fraction is normally in the range 0.3 to 0.4.
Tracer concentration (mg L−1)

3.5
The inferred responses of the two pathways are plotted in
3 Figure 4: the dotted line shows the estimated concentration
changes in the quick-flow pathway (effectively three 12.8 h
2.5
residence time ADZs in series), which accounts mainly for
2 the initial response measured at the weir; while the dashed
line shows the estimated changes in the slow-flow pathway
1.5
(effectively a 99 h residence time ADZ in series with two
1 12.8 h ADZs), and these are mainly responsible for the
raised tail of the measured response.
0.5
It is possible to compute estimates of other physical
0 attributes associated with the model. First, the steady state
0 100 200 300 400 500 600 700 800 900 gains associated with the two parallel pathways define the
Time (h)
partitioning of the flow, with 33% of flow associated with
the quick pathway and 67% with the slow pathway. And
Figure 4 Comparison of the DBM model output (full line)
and tracer experiment data (circular points). Also shown since the flow rate is known in this example, the Active
are the inferred slow-flow component (dashed line) and Mixing Volumes (AMVs: Young and Lees, 1993), based
quick-flow component (dotted line) on the estimated partitioned flow, are 361 m3 in the quick

Quick flow
pathway

G1 x(t)
1 + T1s
Advective y(t)
u(t) u(t − t) G1 G1
time + 1 + T1s 1 + T1s
+
delay
G2
1 + T2s

Slow flow
pathway

Figure 5 Block diagram of a transfer function decomposition for the ADZ model that can be interpreted in physical terms
as a parallel-serial decomposition of ADZ reaches. T1 = 12.8 h; T2 = 99 h; τ = 22 h
12 RAINFALL-RUNOFF MODELING

Slow path residence time Quick path residence time

1 1

0.8 0.8

Normalized frequency

Normalized frequency
0.6 0.6

0.4 0.4

0.2 0.2

0 0
95 100 105 12.4 12.6 12.8 13
(a) Residence time (h) (b) Residence time (h)

Figure 6 MCS analysis results: normalized histograms of the slow-flow (a) and quick-flow (b) pathway residence times

pathway and 5656 m3 in the slow pathway. As a result, time ADZs the same as those in the first parallel pathway;
the total estimated AMV is 5656 + 3 × 361 = 6739 m3 , (ii) various decompositions including feedback processes.
which seems reasonable when compared with the 9749 m3 However, the latter seem less supportable in physical terms
for the total volume of the wetland, estimated by physical and are rejected according to the DBM ethos.
measurement. This suggests that about 70% of the wetland Finally, how can decompositions of ADZ reaches, such
is important in dispersing the tracer (and, therefore, the as those shown in Figure 5 be interpreted in terms of the
waste water) and compares reasonably with the dispersive wetland system? The most plausible mechanism is that the
fraction derived percentage of 76%. pure time delay allows for the flow-induced pure advection
Of course, all of the results above are statistical estimates of the solute (chemical engineers would call this “plug-
and so they are inherently uncertain. The advantage of the flow”). The quick-flow parallel pathway then represents
DBM/TF approach is that we can quantify and consider the the “main stream-flow” that is relatively unhindered by
consequences of this uncertainty (see Young, 1999a). For the vegetation; while the slow-flow pathway represents
instance, based on the covariance matrix of the parame- the solute that is captured by the heavy vegetation and
ters produced by the SRIV estimation analysis, empirical so dispersed more widely and slowly before rejoining the
probability distributions, in the form of histograms, can main flow and eventually reaching the weir. It is this
be computed for the “derived” physical parameters, such latter pathway, which we have shown above accounts for
as the residence times, partition percentages, AMVs, total some 67% of the flow, together with the large associated
AMV, and steady state gain, using Monte Carlo Simulation dispersive fraction of 76%, that is most useful in terms
(MCS) analysis. Figure 6 is a typical example of such anal- of nutrient removal, since it allows more time for the
ysis: it shows the normalized empirical distributions for the biological activity to take place.
two residence times obtained by MCS using 10 000 random
realizations (the procedure used here is discussed in Young,
1999a). NONSTATIONARY AND NONLINEAR TF
Of course, it should be noted that the parallel decom- MODELS
position of the estimated TF used here is not unique:
there are other decompositions that are just as valid and Although the RF model discussed in Section “DISCRETE-
give precisely the same input–output response. For exam- TIME TF MODEL IDENTIFICATION AND ESTIMA-
ple, two other examples are: (i) a serial connection of the TION” is characterized by an input nonlinearity, the exam-
three ADZs with residence times of 12.8 h, in parallel with ple has emphasized the constant parameter, linear TF part
another serial connection, this time of an ADZ with res- of the model (which defines the shape of the catchment
idence time of 99 h in series with two 12.8 h residence UH). However, the nonlinearity is very important because
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 13

it defines the way in which the model responds under dif- over a decade to continually update a gain parameter,
ferent catchment wetness conditions. So, in general, it is based on telemetered data from the catchment, in order to
necessary to consider extensions to the constant parameter, provide an adaptive capability. In this case, however, the
linear TF model that allow for the consideration of more adaptive gain parameter is only required to “fine-tune” an
complex hydrological processes such as this. Fortunately, it already computed effective rainfall input generated from
is possible to extend TF models to handle “nonstationary” the measured rainfall by a “state-dependent parameter”
situations described by TVP models or “nonlinear” situ- nonlinearity, as discussed in the next section.
ations characterized by State-Dependent Parameter (SDP)
models. The latter model class encompasses a wide vari- Nonlinear State-Dependent Parameter (SDP)
ety of nonlinear, stochastic, dynamic phenomena, including Models
chaotic systems.
Although such TVP and SDP models are beyond the Again, in the discrete-time case, the SDP form of the TF
scope of this article, they are introduced briefly in the fol- model (11) can be written as follows:
lowing subsections. Both types of models can be considered
in either continuous or discrete-time. However, since statis- B(wk , z−1 )
xk = uk−δ yk = xk + ξk (19)
tical estimation is more straightforward in the DT case, it A(vk , z−1 )
often provides the more practical approach. Consequently,
the brief descriptions that follow are limited to this DT where,
situation.
A(vk , z−1 ) = 1 + a1 (v1,k )z−1 + a2 (v2,k )z−2
Nonstationary Time-Variable Parameter (TVP)
+ · · · + an (vn,k )z−n
Models
B(wk , z−1 ) = b0 (w0,k ) + b1 (w1,k )z−1 + b2 (w2,k )z−2
In the DT case, the TVP form of the TF model (11) can be
written as follows: + · · · + bm (wm,k )z−m

B(k, z−1 ) Here, the possibility that the parameters may be functions
xk = uk−δ yk = xk + ξk (18)
A(k, z−1 ) of other “state” variables is investigated. In other words, it
is assumed that any parameter in the model may vary over
where, time as a function of one or more other variables (e.g. the
input uk or output yk and their past values), so introducing
A(k, z−1 ) = 1 + a1,k z−1 + a2,k z−2 + · · · + an,k z−n nonlinear behavior into the model. In the SDP model
B(k, z−1 ) = b0,k + b1,k z−1 + b2,k z−2 + · · · + bm,k z−m (19), these variables are denoted by vi,k , i = 1, 2, . . . , n
and wj,k , j = 0, 1, . . . , m and they are, respectively, the
Here, all the parameters are assumed to be functions of the elements of the vectors vk and wk .
time index k, that is it is assumed that they may vary over The first applications of SDP estimation in hydrology
time in an unknown manner that needs to be estimated from are described in Young (1993); Lees et al. (1993, 1994)
the data. and Young and Beven (1994); while the latest research on
Some of the latest research on TVP estimation is reported this topic is reported in Young (2000, 2001a) and Young
in Young (1999b, 2000, 2004b) where the reader will (2002). A simple example of SDP estimation is provided by
find a complete description of the recursive estimation the estimation of the effective rainfall nonlinearity (15) of
algorithms that allow for the estimation of the time-variable the RF model discussed in Section “DISCRETE-TIME TF
parameters. A simple example would be a model such as MODEL IDENTIFICATION AND ESTIMATION”. Here,
(8) in which the “gain” parameter b0 = b0,k is assumed SDP estimation identifies that, in order to explain the RF
to vary over time in order to reflect any changes in the data well, the numerator parameters b0 , b1 and b2 in a
dynamic behavior of the channel that may occur over a linear TF model of the RF data need to vary significantly
wide range of flow conditions. An approach similar to as functions of the flow: that is, they are identified as
this is suggested by Cluckie (1993), who refers to the SDPs b0 (yk ), b1 (yk ) and b2 (yk ). However, the inclusion of
gain as a “model scaling factor” and points out that it a power law nonlinearity such as equation (15), operating
is analogous to the “variable proportional loss method” on the input rainfall, removes the need for these variations,
of defining effective rainfall. An example of adaptive leaving the model as a serial connection of the nonlinearity
gain adjustment that has been implemented in practice is (15) and the linear, constant parameter TF, as described
the adaptive flood warning system for the River Nith at in Section “Discrete-time TF model identification and
Dumfries, in south west Scotland (Lees et al., 1993, 1994). estimation”. Other related examples have been discussed
Here, a recursive estimation algorithm has been used for recently in Young (2001b,c, 2002a, 2003).
14 RAINFALL-RUNOFF MODELING

CONCLUSIONS of the system that is identifiable from the historical data


(see Chapter 134, Downward Approach to Hydrological
At one level, TF models are simply elegant and convenient Model Development, Volume 3 in this Encyclopedia). In
representations of continuous-time, stochastic differential this way, they can alert the model builder to those aspects of
equation models, or their discrete-time equivalents. This the simulation model that are not well defined in relation to
article demonstrates, however, that they are much more than the available data and so are of a more speculative nature.
this: they also provide a powerful, generic method of mod-
eling hydrological systems. This method provides a useful Acknowledgments
block diagram interpretation of the input–output dynamics
of the system under study that can be decomposed straight- The author is grateful to Professor Tony Jakeman of the
forwardly into physically meaningful sub-systems that are Centre for Resource and Environmental Studies, Australian
in an ideal form for DBM modeling studies. The utility of National University for providing the Canning River data;
the method is further enhanced by the powerful methods of and to Chris Martinez and Dr. William R. Wise of the Envi-
statistical identification and parameter estimation that have ronmental Engineering Sciences Department, University of
been developed for such TF models over many years. These Florida, for providing the tracer experiment data.
are not only robust in practical application, but they also
provide valuable information about the stochastic nature
of the system and allow for the quantification of the uncer- FURTHER READING
tainty associated with the model parameter estimates and its
output predictions. Moreover, these estimation methods are O’Donnell T (1986) Deterministic catchment modelling. In River
normally available in a recursive form that allows for their Flow Modelling and Forecasting, Kraijenhoff D.A and Moll J.R
use in on-line, real-time applications, where they provide a (Eds.), D. Reidel: Dordrecht, pp. 11 – 37.
Young P.C, McKenna P and Bruun J (2001) Identification of
natural basis for adaptive modeling and data assimilation.
nonlinear stochastic systems by state dependent parameter
Recent developments in recursive time-variable and state- estimation. International Journal of Control, 74, 1837 – 1857.
dependent parameter estimation have enhanced the utility of Young P.C, Parkinson S.D and Lees M (1996) Simplicity out of
TF models still further, so that they are now able to charac- complexity in environmental systems: Occam’s Razor revisited.
terize both nonstationary and nonlinear stochastic systems. Journal of Applied Statistics, 23, 165 – 210.
Finally, it should be emphasized that DBM models, in Young P.C and Pedregal D (1999) Recursive and en-bloc
general, and data-based TF models, in particular, naturally approaches to signal extraction. Journal of Applied Statistics,
require real time-series data for their identification and 26, 103 – 128.
estimation. Also, their primary use is for improving our Young P.C and Tomlin C.M (2000) Data-based mechanistic
understanding of the system dynamics and/or for imple- modelling and adaptive flow forecasting. In Flood Forecasting:
What Does Current Research Offer the Practitioner? BHS
menting model-based prediction, as in flood warning and
Occasional paper No 12, Lees M.J and Walsh P (Eds.),
forecasting applications, where the main changes that are Centre for Ecology and Hydrology on behalf of the British
likely to occur in future relate to the input variables (mea- Hydrological Society: pp. 26 – 40.
sured rainfall and upstream flows). They are not particularly
appropriate, at least in their basic form, for assessing the
impacts of changes to the nature of the system (“what-if” REFERENCES
studies) unless the derived, physically meaningful param-
eters associated with the DBM model (as, for example, in Akaike H. (1974) A new look at statistical model identification.
the case of the Florida wetland example) can be associ- IEEE Transactions on Automatic Control, ACl9, 716 – 722.
ated with physical environmental characteristics, such as Anderson M.G. and Burt T.P. (Eds.) (1985) Hydrological
the topography or other physical features of a catchment Forecasting, John Wiley: Chichester.
(“regionalization studies”: see e.g. Kokkonen et al., 2003 Amorocho J. and Hart W.E. (1964) A critique of current
and the Chapter 134, Downward Approach to Hydrolog- methods of hydrologic systems investigation. Transactions of
ical Model Development, Volume 3 in this Encyclopedia). the American Geophysical Union, 45, 307 – 321.
In this regard, it is likely that the derived parameters will be Beck M.B. and Young P.C. (1975) A dynamic model for DO-
BOD relationships in a non-tidal stream. Water Research, 9,
“bulk” parameters that relate to the behavior of the system
769 – 776.
at the catchment scale, rather than the kind of “process”
Beven K.J. (2001) Rainfall-Runoff Modelling: the Primer, John
parameters used in hydrological simulation models. Wiley and Sons: Chichester.
It is clear, however, that when suitable, measured time- Box G.E.P. and Jenkins G.M. (1970) Time-series Analysis,
series data are available, DBM/TF models should always Forecasting and Control, Holden-Day: San Francisco.
be used as an adjunct to simulation model synthesis Bras R.L. and Rodriguez-Iturbe I. (1985) Random Functions and
because they define well the “dominant mode” behavior Hydrology, Addison-Wesley: Reading.
RAINFALL-RUNOFF MODELING: TRANSFER FUNCTION MODELS 15

Cluckie I.D. (1993) Real-time flood forecasting using weather Nash J.E. and Sutcliffe J.V. (1970) River flow forecasting
radar. In Concise Encyclopedia of Environmental Systems, through conceptual models: discussion of principles. Journal
Young P.C. (Ed.), Pergamon Press: Oxford, pp. 291 – 298. of Hydrology, 10, 282 – 290.
Dooge J.C.I. (1959) A general theory of the unit hydrograph. Natale L. and Todini E. (1976a) A stable estimator for
Journal of Geophysical Research, 64, 241 – 256. linear models: 1. Theoretical development and Monte-Carlo
Dooge J.C.I. (1986) Theory of flood routing. In River Flow experiments. Water Resources Research, 12(4), 667 – 671.
Modelling and Forecasting, Kraijenhoff D.A. and Moll J.R. Natale L. and Todini E. (1976b) A stable estimator for linear
(Eds.), D. Reidel: Dordrecht, pp. 39 – 65. models: 2. real world hydrologic applications. Water Resources
Dooge J.C.I. and O’Kane J.P. (2003) Deterministic Methods in Research, 12(4), 672 – 675.
Systems Hydrology, Balkema: Lisse. Natale L. and Todini E. (1976) A stable estimator for linear
Gupta V.K. (1984) The Identification of Conceptual Watershed models: 2. real world hydrologic applications. Water Resources
Models, Ph.D. Dissertation, Case Western Reserve University, Research, 12, 672 – 676.
Cleveland. Norton J.P. (1986) An Introduction to Identification, Academic
Han D. (1991) Weather Radar Information Processing and Real- Press: London.
Time Flood Forecasting, Ph. D. thesis, Department of Civil O’Connell P.E. and Clarke R.T. (1981) Adaptive hydrological
Engineering, University of Salford. forecasting – a review. Hydrological Sciences Bulletin, 26,
Hsu K., Gupta H.V. and Sorooshian S. (1993) Artificial 179 – 205.
neural network modeling of the rainfall-runoff process. Water Pierce D.A. (1972) Least squares estimation in dynamic
Resources Research, 29, 1185 – 1194. disturbance time-series models. Biometrika, 59, 73 – 78.
Jang J.S.R., Sun C.T. and Mizutani E. (1997) Neuro-Fuzzy and Salas J.D., Delleur J.W., Yevjevich V. and Lane W.L. (1980)
Soft Computing, Prentice Hall: NJ. Applied Modeling of Hydrologic Time Series, Water Resources
Jakeman A.J. and Young P.C. (1981) On the decoupling of system Publications: Littleton.
and noise model parameter estimation in time-series analysis. Schwarzenbach J. and Gill K.F. (1979) Systems Modelling and
International Journal of Control, 34, 423 – 431. Control, Edward Arnold: London.
Jakeman A.J. and Young P.C. (1983) Advanced methods of
Shaw E.M. (1994) Hydrology in Practice, Third Edition, Chapman
recursive time-series analysis. International Journal of Control,
& Hall: London.
37, 1291 – 1310.
Singh V.P. (1988) Hydrologic Systems: Rainfall-Runoff Modeling,
Jakeman A.J., Littlewood I.G. and Whitehead P.G. (1990)
Vol. 1, Prentice-Hall: Englewood Cliffs.
Computation of the instantaneous unit hydrograph and
Singh V.P. (Ed.) (1995) Computer Models of Watershed
identifiable component flows with application to two small
Hydrology, Colorado: Water Resources Publications.
upland catchments. Journal of Hydrology, 117, 275 – 300.
Sorooshian S. (1983) Surface water hydrology: on-line estimation.
Kokkonen T.S., Jakeman A.J., Young P.C. and Koivusalo
Review of Geophysics and Space Physics, 21, 706 – 721.
H.J. (2003) Predicting daily flows in ungauged catchments
Wallis S.G., Young P.C. and Beven K.J. (1989) Experimental
model regionalization from catchment descriptors at the
investigation of the aggregated dead zone model for
Coweeta Hydrologic Laboratory, North Carolina. Hydrological
Processes, 17, 2219 – 2238. longitudinal solute transport in stream channels. Proceedings
Kraijenhoff D.A. and Moll J.R. (1986) River Flow Modelling and of the Institution of Civil Engineers, Part 2, 87, 1 – 22.
Forecasting, D. Reidel: Dordrecht. Wellstead P.E. (1978) An instrumental product moment test for
Lees M.J. (2000a) Advances in transfer function based flood model order estimation. Automatica, 14, 89 – 91.
forecasting. In Flood Forecasting: What Does Current Research Wheater H.S., Jakeman A.J. and Beven K.J. (1993) Progress and
Offer the Practitioner? Lees M.J. and Walsh P. (Eds.), BHS directions in rainfall-run-off modelling. In Modelling Change in
Occasional paper No 12, Centre for Ecology and Hydrology on Environmental Systems, Jakeman A.J., Beck M.B. and McAleer
behalf of the British Hydrological Society: pp. 41 – 55. M.J. (Eds.), Wiley: Chichester, pp. 101 – 132.
Lees M.J. (2000b) Data-based mechanistic modelling and fore- Whitehead P.G. and Young P.C. (1975) A dynamic-stochastic
casting of hydrological systems. Journal of Hydroinformatics, model for water quality in part of the Bedford-Ouse River
2, 15 – 34. system. In Computer Simulation of Water Resources Sys-
Lees M., Young P.C. and Ferguson S. (1993) Adaptive flood tems, Vansteenkiste G.C. (Ed.), North-Holland: Amsterdam, pp.
warning. In Concise Encyclopedia of Environmental Systems, 417 – 438.
Young P.C. (Ed.), Pergamon Press: Oxford, pp. 234 – 236. Whitehead P.G., Young P.C. and Hornberger G.H. (1976) A
Lees M., Young P.C., Beven K.J., Ferguson S. and Burns J. systems model of stream flow and water quality in the Bedford
(1994) An adaptive flood warning system for the River Nith at Ouse River – Part l: stream flow modeling Water Research, 13,
Dumfries. In River Flood Hydraulics, White W.R. and Watts J. 1155 – 1169.
(Eds.), Institute of Hydrology: Wallingford, pp. 65 – 75. Wood E.F. (Ed.) (1980) Workshop on Real Time Forecast-
Ljung L. and Söderstrom T. (1983) Theory and Practice of ing/Control of Water Resource Systems, Pergamon Press: New
Recursive Estimation, MIT Press: Cambridge. York.
Meijer H. (1941) Simplified flood routing. Civil Engineering, 11, Young P.C. (1974) Recursive approaches to time-Series analysis.
306 – 307. Bulletin of the Maths and its Applications, 10, 209 – 224.
Nash J.E. (1959) Systematic determination of unit hydrograph Young P.C. (1984) Recursive Estimation and Time-series Analysis,
parameters. Journal of Geophysical Research, 64, 111 – 115. Springer-Verlag: Berlin.
16 RAINFALL-RUNOFF MODELING

Young P.C. (1985) The instrumental variable method: a practical Young P.C. (2002a) Advances in real-time flood forecasting.
approach to identification and system parameter estimation. Philosophical Transactions of the Royal Society: Mathematical,
In Identification and System Parameter Estimation, Barker Physical and Engineering Sciences, 360, 1433 – 1450.
H.A. and Young P.C. (Eds.), Pergamon Press: Oxford, pp. Young P.C. (2002b) Comments on the estimation of continuous-
1 – 16. time transfer functions. International Journal of Control, 75,
Young P.C. (1986) Time-series methods and recursive estimation 693 – 697.
in hydrological systems analysis. In River Flow Modelling and Young P.C. (2003) Top-down and data-based mechanistic
Forecasting, Kraijenhoff D.A. and Moll J.R. (Eds.), D. Reidel: modelling of rainfall-flow dynamics at the catchment scale.
Dordrecht, Chap. 6, pp. 129 – 180. Hydrological Processes, 17, 2195 – 2217.
Young P.C. (1989) Recursive estimation, forecasting and adaptive Young P.C. (2004a) Identification and estimation of continuous-
control. In Control and Dynamic Systems: Advances in Theory time hydrological models from discrete-time data, Proceedings,
and Applications, Vol. 30, Leondes C.T. (Ed.), Academic Press: British Hydrological Society BHS2004 Conference, Imperial
San Diego, Chap. 6, pp. 119 – 166. College, London.
Young P.C. (1992) Parallel processes in hydrology and water Young P.C. (2004b) Identification of time varying systems,
quality: a unified time series approach. Journal of the Institute Encyclopedia of Life Support Systems (EOLSS), UNESCO-
of Water and Environmental Management, 6, 598 – 612. EOLSS: (www.eolss.net).
Young P.C. (1993) Time variable and state dependent modelling Young P.C. and Beven K.J. (1994) Data-based mechanistic
of nonstationary and nonlinear time series. In Developments in modelling and the rainfall-flow nonlinearity. Environmetrics,
Time Series Analysis, Subba Rao T. (Ed.), Chapman & Hall: 5, 335 – 363.
London, pp. 374 – 413. Young P.C. and Jakeman A.J. (1979) Refined instrumental
Young P.C. (1998) Data-based mechanistic modelling of variable methods of recursive time-series analysis: Parts I,
environmental, ecological, economic and engineering systems. single-input, single-output systems. International Journal of
Environmental Modelling and Software, 13, 105 – 122. Control, 29, 1 – 30.
Young P.C. (1999a) Data-based mechanistic modelling, gen- Young P.C. and Jakeman A.J. (1980) Refined instrumental
eralised sensitivity and dominant mode analysis. Computer variable methods of recursive time-series analysis: Part III,
Physics Communications, 117, 113 – 129. extensions. International Journal of Control, 31, 741 – 764.
Young P.C. (1999b) Nonstationary time series analysis and Young P.C. and Lees M. (1993) The Active Mixing Volume
forecasting. Progress in Environmental Science, 1, 3 – 48. (AMV): a new concept in modelling environmental systems.
Young P.C. (2000) Stochastic, dynamic modelling and signal In Statistics for the Environment, Barnett V. and Turkman K.F.
processing: time variable and state dependent parameter (Eds.), John Wiley: Chichester, Chap. 1, pp. 3 – 44.
estimation. In Nonstationary and Nonlinear Signal Processing, Young P.C. and Wallis S.G. (1985) Recursive estimation: a
Fitzgerald W.J., Walden A., Smith R. and Young P.C. (Eds.), unified approach to identification, estimation and forecasting of
Cambridge University Press: Cambridge, pp. 74 – 114. hydrological systems. Applied Mathematics and Computations,
Young P.C. (2001a) The identification and estimation of nonlinear 17, 299 – 334.
stochastic systems. In Nonlinear Dynamics and Statistics, Mees Young P.C. and Wallis S.G. (1994) Solute transport and dispersion
A.I. (Ed.), Birkhauser: Boston, pp. 127 – 166. in channels. In Channel Networks, Beven K.J. and Kirby M.J.
Young P.C. (2001b) Data-based mechanistic modelling and (Eds.), John Wiley: Chichester, Chap. 6, pp. 129 – 173.
validation of rainfall-flow processes. In Model Validation: Young P.C., Jakeman A.J. and McMurtrie R. (1980) An
Perspectives in Hydrological Science, Anderson M.G. and Bates instrumental variable method for model order identification.
P.D. (Eds.), John Wiley: Chichester, pp. 117 – 161. Automatica, 16, 281 – 294.
Young P.C. (2001c) Advances in Real-time Forecasting. Centre Young P.C., Jakeman A.J. and Post D.A. (1997) Recent advances
for Research on Environmental Systems and Statistics, Report in data-based modelling and analysis of hydrological systems.
No. TR/176, Lancaster University, p. 38. Water Science and Technology, 36, 99 – 116.

You might also like