Professional Documents
Culture Documents
Hydrologic Statistics: Reading: Chapter 11 in Applied Hydrology Some Slides by Venkatesh Merwade
Hydrologic Statistics: Reading: Chapter 11 in Applied Hydrology Some Slides by Venkatesh Merwade
Hydrologic Statistics
2
Probability
• A measure of how likely an event will occur
• A number expressing the ratio of favorable
outcome to the all possible outcomes
• Probability is usually represented as P(.)
– P (getting a club from a deck of playing cards) = 13/52 = 0.25 = 25 %
– P (getting a 3 after rolling a dice) = 1/6
3
Random Variable
• Random variable: a quantity used to represent
probabilistic uncertainty
– Incremental precipitation
– Instantaneous streamflow
– Wind velocity
• Random variable (X) is described by a probability
distribution
• Probability distribution is a set of probabilities
associated with the values in a random variable’s
sample space
4
Sampling terminology
• Sample: a finite set of observations x1, x2,….., xn of the random
variable
• A sample comes from a hypothetical infinite population
possessing constant statistical properties
• Sample space: set of possible samples that can be drawn from a
population
• Event: subset of a sample space
Example
Population: streamflow
Sample space: instantaneous streamflow, annual
maximum streamflow, daily average streamflow
Sample: 100 observations of annual max. streamflow
Event: daily average streamflow > 100 cfs
6
Types of sampling
• Random sampling: the likelihood of selection of each member of the
population is equal
– Pick any streamflow value from a population
• Uniform sampling: Data are selected such that the points are uniformly far
apart in time or space
– Pick steamflow values measured on Monday midnight
7
Summary statistics
• Also called descriptive statistics
– If x1, x2, …xn is a sample then
1 n
Mean, X xi for continuous data
n i 1
xi X
1 n
Variance, S
2
for continuous data
n 1 i 1
10
Time series plot
• Plot of variable versus time (bar/line/points)
• Example. Annual maximum flow series
600
500
Annual Max Flow (10 3 cfs)
400
300
200
100
0
1905
1900 1908 1900
1918 1927
19001938 1948
1900 1958 1968
1900 1978 1900
1988 1998
1900
Year
Year
70
of occurences
No. ofoccurences
40
20
60 Interval = 25,000
30
15
Interval = 10,000 cfscfs
50
40
No. of
20
10
30
No.
1020
5
10
0
00
0 0 50 50 100100 150
150 200
200 250
250 300
300 350 400
350 400 450
450 500
500
Annual
Annualmm
Annual m ax
ax
ax flow
flow (10
flow(10
3 3 3cfs)
(10cfs)cfs)
Dividing the number of occurrences with the total number of points will give Probability
Mass Function 12
Using Excel to plot histograms
14
Probability density function
• Continuous form of probability mass function is probability
density function
0.9
100
90
0.8
80
0.7
70
occurences
0.6
Probability
60
0.5
50
0.4
40
No. of
0.3
30
0.2
20
0.1
10
00
0 0 50
100 100 150
200 200 300
250 300 400350 400500450 500
600
Annualmm 3 3 cfs)
Annual axaxflow
flow(10
(10 cfs)
1
P (Q ≤ 50000) = 0.8
0.8
P (Q ≤ 25000) = 0.4
Probability
0.6
0.4
0.2
0
0 100 200 300 400 500 600
Annual m ax flow (103 cfs)
17
Flow duration curve
600
500
400
Median flow
Q (1000 cfs)
300
200
100
0
0 20 40 60 80 100
% of tim e Q w ill be exceeded
23
Statistical analysis
• Regression analysis
• Mass curve analysis
• Flood frequency analysis
• Many more which are beyond the scope of
this class!
24
Linear Regression
• A technique to determine the relationship between two
random variables.
– Relationship between discharge and velocity in a stream
– Relationship between discharge and water quality constituents
Regression applet:
26
http://www.math.csusb.edu/faculty/stanton/m262/regress/regress.html
Linear Regression in Excel
• Steps:
– Prepare a scatter plot
– Fit a trend line
1800
1500
TDS = 0.5946(sp. Cond) - 15.709 Data are for Brazos River
R2 = 0.9903
1200
near Highbank, TX
TDS (mg/L)
900
600
300
0
0 500 1000 1500 2000 2500 3000
Specific Conductance ( S/cm)
SST
The higher the value of R2, the more successful is the model in explaining y
variation.
If R2 is small, search for an alternative model (non linear or multiple
regression model) that can more effectively
28
explain y variation