Professional Documents
Culture Documents
Hydrologic Statistics
Reading: Chapter 11 in Applied
Hydrology
Some slides by Venkatesh
Merwade
Hydrologic Models
Classification based on randomness.
Probability
A measure of how likely an event will
occur
A number expressing the ratio of
favorable outcome to the all possible
outcomes
Probability is usually represented as P(.)
P (getting a club from a deck of playing cards) = 13/52 =
0.25 = 25 %
P (getting a 3 after rolling a dice) = 1/6
Random Variable
Random variable: a quantity used to
represent probabilistic uncertainty
Incremental precipitation
Instantaneous streamflow
Wind velocity
Sampling terminology
Sample: a finite set of observations x1, x2,.., xn of
the random variable
A sample comes from a hypothetical infinite
population possessing constant statistical properties
Sample space: set of possible samples that can be
drawn from a population
Event: subset of a sample space
Example
Population: streamflow
Sample space: instantaneous streamflow,
annual maximum streamflow, daily average
streamflow
Sample: 100 observations of annual max.
streamflow
6
Types of sampling
Summary statistics
Also called descriptive statistics
If x1, x2, xn is a sample then
Mean,
Variance,
Standard
deviation,
1 n
X xi
n i 1
1 n
S
xi X
n 1 i 1
2
for continuous
data
2
for continuous
data
for continuous
data
Coeff. of
variation,
Graphical display
10
3 3
Annual
Annual
Max
Max
Flow
Flow
(10(10
cfs)
cfs)
600
600
500
500
400
400
300
300
200
200
100
100
0
01905
1900
1908
1918
1900
1927 1938
1900
1948 1958
1900
Year
Year
1968
1900
1978
1988
1900
1998
1900
Frequency Histogram
Plots of bars whose height is the number ni of
data falling into one of several intervals of
equal width
30
60
100
90
50
25
No. ofoccurences
occurences
No.
No. of
of occurences
80
Interval = 50,000
cfs
Interval
Interval
==
10,000
25,000 cfs
cfs
70
40
20
60
30
15
50
40
20
10
30
1020
5
10
0
00
0
0 50 50 100100 150
150 200
200 250
250
300
300
350 400
400 450
450 500
500
350
3 3 3cfs)
Annual
ax
flow
(10
Annual
ax
flow
Annualmm
m
ax
flow(10
(10cfs)
cfs)
Dividing the number of occurrences with the total number of points will
give Probability Mass Function 12
90
0.8
No. of
occurences
Probability
80
0.7
70
0.6
60
0.5
50
0.4
40
0.3
30
0.2
20
0.1
10
00
0
50
100 100
150
200
200 300
250
300 400350
400500450
500
600
3 3 cfs)
Annualmm
flow(10
(10
Annual
axaxflow
cfs)
15
Cumulative distribution
function
P (Q 50000) = 0.8
Probability
0.8
P (Q 25000) = 0.4
0.6
0.4
0.2
0
0
100
200
300
400
500
600
Steps
22
Plot p on x axis and Q (sorted)
on y axis
600
500
Q (1000 cfs)
400
Median flow
300
200
100
0
0
20
40
60
23
80
100
Statistical analysis
Regression analysis
Mass curve analysis
Flood frequency analysis
Many more which are beyond the
scope of this class!
24
Linear Regression
A technique to determine the relationship between
two random variables.
Relationship between discharge and velocity in a stream
Relationship between discharge and water quality
constituents
yi by
: 0
A regression model is given
1 xi i
i 1,2,..., n
(
y
x
)
Minimize i i
i
0
1 i
i 1,2,..., n
Regression applet:
26
http://www.math.csusb.edu/faculty/stanton/m262/regress/regress.html
TDS (mg/L)
1500
1200
900
600
300
0
0
500
1000
1500
2000
2500
3000
Coefficient of determination
(R2)
It is the proportion of observed y variation that
can be explained by the simple linear regression
model
SSE
R 1
SST
2