Professional Documents
Culture Documents
Conceptual models
Parameters to be calibrated:
k1, k2, k3, k4 , d1, d2’ s1 and s2
Conceptual models
Transfer component
transfer along hillslopes
transfer along river network
Parameters to be calibrated:
Water Balance Component : B, Wm, D1, D2, P1 ,P2 and K0
Transfer Component : C1,, D1, C2, and D2
Conceptual models
Modelling component
overland flow
interflow
interflow and overland flow
routing
groundwater recharge
Parameters to be calibrated:
baseflow
Umax, Lmax ,CQOF, CQIF, TIF ,
TOF, TG, CK12 and CKBF
Data-driven models
Overview of ANN
bj
X1
. W1j
.
Wij j
Xi Yj
. Wnj
.
Xn
Y j f ( Wij X i b j )
i 1
Learning in ANN
i 1 1
Sv class 1
Gain( S , A ) Entropy( S )
v valuesA S
Entropy( S v ) X1
0 1 2 3 4 5 6
Decision tree
• C4.5 algorithm
c Si Si x2 > 2
SplitInformation( S , A ) log 2 Yes No
i 1 S S x1 > 2.5 x1 < 4
Yes No Yes No
InformationGain( S , A )
GainRatio( S , A ) x2 < 3.5 Class 0 Class 0 x2 < 1
leaves are:
2
Model 4 Model 6
• constants ( regression tree) 1
tree)
• Building initial tree
1 2 3 4 5 6
Y (output)
Ti
SDR sd ( T ) sd ( Ti ) M5 model tree
i T
x2 > 2
Yes No Yes No
• Smoothing : is used to compensate for
the sharp discontinuities between adjacent Model 1 Model 2 Model 5 Model 6
linear models
Boosting techniques in ML
The boosting algorithm for regression problem: AdaboostRT
• uniform distribution of weights
• while t = T,
• calculate error rate ft(x) based on threshold value
t Dt ( i )
f ( x ) yi
i: t i
yi
• set t = t2
• update distribution
f t ( xi ) yi Zt-normalisation factor
Dt ( i ) t if
Dt 1 ( i ) x yi
Zt 1 otherwise
• set t = t + 1
• final hypothesis
1
f fin ( x ) log
t
* f t ( x )
t
Model evaluation criteria :
2
RMSE
• Coefficient of efficiency (COE) = 1
2
n yi yi
1
n
i 1
STUDY AREA
Basin characteristics :
• Contributing Area :
2900 Km2
• Elevation range :
approx.. 2700 - <100m
• Average slope :
1%
DATA COLLECTION
• Precipitation
data Daily
: data for 8 years (January 1988 to December 1995)
3 stations - (Kathmandu, Hariharpur and Daman)
Thiessen polygon - mean aerial precipitation
• Runoff data :
Daily data for 8 years (January 1988 to December 1995)
1 stations - (Pandheradobhan)
• Evapotranspiration data :
Calculated using the FAO modified Penman method
DATA ANALYSIS AND PREPARATION
Daily average aerial precipitation and the runoff at the basin outlet
Precipitation: Runoff:
Daily Average = 5.44 mm Daily Average = 149.96 Cumec
Maximum = 364.59 mm Maximum = 5030 Cumec
Rainfall-Discharge plot
6000 0
50
5000
100
Precipitation [mm]
Runoff [Cumec]
4000
150
3000 200
250
2000
300
1000
350
0 400
Jan-88
Jun-89
Aug-91
Jan-92
Apr-95
M ay-88
Sep-88
Feb-89
Oct-89
Jul-90
Nov-90
Apr-91
M ay-92
Sep-92
Feb-93
Jun-93
Oct-93
Mar-94
Jul-94
Dec-94
Aug-95
Mar-90
Time [days]
Correlation Coeff.
0.6
0.5
6000 400
350 AutoCorrelation of Discharge
5000
300 1.2
Runoff [Cumec]
4000
Rainfall [mm]
250
1
Correlation coeff.
3000 200
0.8
150
2000
100 0.6
1000
50 0.4
0 0
0.2
2020 2025 2030 2035 2040 2045 2050 2055 2060 2065 2070
0
Time [days]
0 1 2 3 4 5 6 7 8
Runoff Rainfall
Lag [days]
DATA ANALYSIS AND PREPARATION
4 6000
• Box Cox Transformation 3.5 5000
Transformed runoff
3
• Logarithmic Transformation
Original runoff
2.5 4000
2 3000
1.5
2000
1
0.5 1000
0 0
0 500 1000 1500
Time [days]
Frequency Distibution of the original discharge Frequency Distribution of the transformed discharge (Box-cox)
400
1600
350
1400
300
1200
250
1000
Frequency
Frequency
800 200
600 150
400 100
200 50
0 0
10 0 0 0 0 0 0 0 0 00 10 20 e
12 23 34 45 56 67 78 89 or
10 11 12 M
Discharge [m3/s], bins Transformed Discharge, bins
EXPERIMENTS: Conceptual models
Input variables :
Rainfall -upto 2 previous timestep (REt, REt-1 and REt-2)
Runoff -upto 1 previous timestep (Qt, and Qt-1)
5000
4000
Runoff [Cumec]
3000
2000
1000
0
23-Jun-93 8-Jul-93 23-Jul-93 7-Aug-93 22-Aug-93 6-Sep-93 21-Sep-93
Time [Days]
180
Runoff [x 103 Cumec]
150
120
90
60
30
0
20-Jun-93 17-Nov-93 16-Apr-94 13-Sep-94 10-Feb-95 10-Jul-95 7-Dec-95
Time [days]
6000
5000
4000
Runoff [Cumec]
3000
2000
1000
0
25-Jun-93 10-Jul-93 25-Jul-93 9-Aug-93 24-Aug-93 8-Sep-93 23-Sep-93
Time [Days]
Observed Computed
RESULTS: Data-driven models
MT on verification set
(No of LM = 8)
6000
5000
4000
Runoff [Cumec]
3000
2000
1000
0
25-Jun-93 10-Jul-93 25-Jul-93 9-Aug-93 24-Aug-93 8-Sep-93 23-Sep-93
Time [Days]
Observed Computed
RESULTS: Data-driven models
6000
5000
Runoff [Cumec]
4000
3000
2000
1000
0
25-Jun-93 10-Jul-93 25-Jul-93 9-Aug-93 24-Aug-93 8-Sep-93 23-Sep-93
Time [Days]
Evaluation No.of Correctly classified Incorrectly classified Correctly classified Incorrectly classified
for instances Instances Instances Instances Instances
In number In % In number In % In number In % In number In %
Training 2000 1919 95.95% 81 4.05% 1915 95.75% 85 4.25%
Verification 919 857 93.25% 62 6.75% 855 93.04% 64 6.96%
42 22
Data distribution according to the classification low high
400
350
300
Runoff [Cumec]
250
200
150
100
50
0
600 620 640 660 680 700 720 740 760 780 800
T ime [Days]
3500
3000
2500
Runoff [Cumec]
2000
1500
1000
500
0
0 25 50 75 100 125 150
T ime [Days]
3500
3000
2500
Runoff [Cumec]
2000
1500
1000
500
0
11-May-95 31-May-95 20-Jun-95 10-Jul-95 30-Jul-95 19-Aug-95 8-Sep-95 28-Sep-95
T ime [Days]
3000
2500
Runoff [Cumec]
2000
1500
1000
500
0
11-May-95 31-May-95 20-Jun-95 10-Jul-95 30-Jul-95 19-Aug-95 8-Sep-95 28-Sep-95
T ime [Days]
350
300
250
Run off [Cum ec]
200
150
100
50
0
600 620 640 660 680 700 720 740 760 780 800
T ime [Days]
3500
3000
2500
Runoff [Cum ec]
2000
1500
1000
500
0
0 25 50 75 100 125 150
T ime [Days]
Comparison of results by MT