Professional Documents
Culture Documents
by
Kirti Sehgal
Kirti Sehgal
2016
Abstract
The Ontario Ministry of Transportation (MTO) requires regional flood frequency equations to
determine peak flows of specific return periods, established using the data from gauged
locations, to design structures at the crossings of streams and rivers. This study intends to bridge
the gaps in the current estimation techniques used in Ontario and utilize the additional data to
improve its accuracy. Regional Flood Frequency Analysis (RFFA) of Ontario streams was
performed using multiple regression and the equations for the T-year flood quantile (2, 10, 25, 50
and 100 year) were developed. The results of the regression based Unified Ontario Flood Method
(UOFM) for the province reaffirms the conclusions of previous studies that peak discharge is
directly related to drainage area. Other factors such as the lake attenuation index, representative
of the area of lakes and wetlands, and climatological factors also contribute to the determination
ii
Acknowledgments
Flood Frequency and statistics were strangers I never wanted to be friends with before
September 2014. A transfer to MASc brought along with it an opportunity to overcome my fears.
I owe the success of this work to Dr. Jennifer Drake. It wouldn’t have been possible without her
motivation and constant belief in me throughout the course of this project. Her countless
suggestions for my research and constructive feedback on my reports and thesis helped me to
achieve the results and the project could be completed in a short time span.
I would also like to thank the Ontario Ministry of Transportation, in particular Dr. Hani Fargaly,
for funding the project under its Highway Infrastructure Innovation and Funding Program
(HIIFP). Hani’s constant input of the user requirements and expectations were helpful in working
towards a simple solution for engineers and designers.
My achievement would not have been possible without the constant love and support of my
friends and family. The group lunch and coffee breaks became an important part of my day and
kept me going. Thanks to my friends Vivek, Balsher, Dikshant, Divyam and all others in GB415.
I would like to thank my brother Deepak for being there when I was tensed and encouraging me
to get back and fight it out. I am deeply indebted to my uncle and aunt, Dinesh and Sangeeta
Chhura for being my motivator and facilitator. They took such good care when I was new in
Canada and couldn’t support myself. My masters would not have been possible without their
support. This journey would have been difficult without my husband Pawan who took the
endless proofreading tasks I had for him and taught me the use of excel macros to speed up the
repetitive jobs. He loved me when I was impossible and this journey could have been longer, if
he was not there. I am grateful to my mother who has been my constant support and pillar of
strength. I credit her for the passionate woman I am today. I can never be grateful enough for
supporting my education even during the hard times. Last, but most importantly, Thank you
Papa for always being there. From where I see, you haven’t gone anywhere. Your love and
belief still inspires me to give my best without worrying about the results.
iii
Table of Contents
Acknowledgments.......................................................................................................................... iii
2.2.1 Physiography........................................................................................................... 7
2.4 Review of the Current Regional Flood Frequency Analysis (RFFA) Procedures ............ 11
iv
2.6.3 Multiple Regression .............................................................................................. 30
5.4 Simulation for Peak Flow Estimation: Case of Urban Floods .......................................... 60
v
5.6 Climate Change Considerations ........................................................................................ 65
References ..................................................................................................................................... 71
vi
List of Tables
Table 1: Function Classification and Design Flows (Source: Ontario Ministry of Transportation,
2008) ............................................................................................................................................... 6
Table 2: Relationship of Watershed Class with Class Coefficient (Joy & Whiteley, 1996) ........ 12
Table 3: Ratios to Flood Quantiles of Different Return Periods to the 25-year Quantile (Joy &
Whiteley, 1994)............................................................................................................................. 13
Table 16: Analysis Results for Medium to Large Urban Watersheds .......................................... 58
Table 17: Station Parameters for Comparison with MIFM (South) ............................................. 61
vii
Table 19: Station Parameters for Comparison with MIFM (Shield) ............................................ 62
viii
List of Figures
Figure 1: Ecozones of Ontario (Source: Ecozone, 2012) ............................................................... 8
Figure 2: Isohyetal Map with Location of Environment Canada Weather Stations ..................... 10
Figure 4: Flow Data Comparison for a Typical Station (HYDAT station: 02HB021) ................. 20
Figure 11: Stations Eliminated (non-compliant with nonparametric test hypotheses) ................. 42
Figure 13: Regression Stations and Ontario Highways (Source: Highways, 2014) ..................... 46
Figure 15: Observed and Predicted Quantiles for Medium to Large Urban Watersheds ............. 59
ix
1
Chapter 1
Introduction
1.1 Background
Flooding of streams and rivers have been a concern for designers and policy makers for a long
time. The destructive nature and randomness of floods has often led researchers and engineers to
develop prediction tools for large flood events. The history of flooding in Ontario encompasses
many notable events, the most severe of which was Hurricane Hazel in 1954. Flooding caused by
Hurricane Hazel took 81 lives (mostly in the City of Toronto) while simultaneously leaving
thousands of Ontario residents homeless (TRCA, 2014). Property damages associated with this
event have been approximated at $100 million, which is about $1 billion today (TRCA, 2014).
Another notable severe flooding event occurred in May 1974 in the Grand River watershed.
Approximately, $6.7 million was assessed by Leach (1974) as residential, industrial and
municipal losses. In June 2004 a storm event in the Grand River watershed deposited 200 mm of
rain in a very short span of time causing severe flooding and excessive erosion
(Hebb and Mortsch, 2007). In 2013 both Alberta and Ontario experienced dramatic flooding of
Downtown, Calgary and the Don Valley Parkway, Toronto. The damages for these flood events
have been estimated at more than $1.72 billion and 465 million in insured losses for Southern
Alberta and Toronto, respectively (Insurance Bureau of Canada, 2015) Thus, floods have
become important and their prediction is pivotal for design of structures on our water courses.
Design floods are the peak flood discharge (or flow rates) which are critical when assessing the
risk and safety of hydraulic structures (e.g. culverts and bridge crossings), both planned and
existing. The prediction of these peak flood values during design of a hydraulic structure at water
crossing requires historical flow records at that location. These values are typically obtained at
gauging stations built on streams and rivers. However, it is common to encounter situations
where the location of interest lacks the associated historical stream flow data. For a water rich
province like Ontario it is not possible to have gauges on all water courses and, even for
situations where the stream is gauged, the point of interest may not coincide with the location of
the gauge station. Thus, when sizing bridges and culverts, engineers regularly depend on regional
analysis to estimate flood quantiles. Regional flood frequency analysis (RFFA) is performed to
develop relationships between flow estimates and relevant physiographic and climatic
2
1. Availability of approximately twenty years of additional stream flow data since the last
study of MIFM and NOHM completed in 1996 and 1994, respectively. The additional data,
when accounted for, will provide a better representation of the current watershed and stream
flow conditions.
2. Changes in catchment characteristics, which lead to gradual changes in the flow regime,
should be taken into consideration.
3. Inconsistency in area classification and the corresponding equation for prediction of flood
quantiles needs to be addressed.
3
4. Analysis procedures for urban watersheds, predominately required for the south, are non-
existent. A review of the available data to ascertain the feasibility of such a study was
requested to be performed.
Thus, the objective of the study is to develop a simple and easy to use set of equations for
regional flood frequency analysis (RFFA). This thesis discusses the analysis procedures
undertaken and presents the results of the RFFA for Ontario.
Chapter 1– Introduction
This chapter provides the background for the current research; identifies the research objectives
and provides a description of the thesis structure.
This chapter provides a detailed review of the physiography, climate and hydrologic data for
Ontario. It also presents a review of relevant statistical literature and recent RFFA studies. This
establishes the foundation for the methodology used for the thesis and outlines the rationale of all
the subsections in the methodology.
Chapter 3– Methodology
This chapter describes the research methodology undertaken and presents the results obtained at
intermediary steps during the analysis work. The results of each step are also presented in this
section because, for majority of the cases, the subsequent step is dependent on the result obtained
at the previous step. For example, station frequency analysis is performed for only those stations
which were accepted during the non-parametric testing.
This chapter presents the results of step-wise regression analysis to develop the Unified Ontario
Flood Method (UOFM). It also presents the design table developed for computation of peak
4
flows for ungauged drainage basins for two regions of Ontario. The chapter highlights the
computation of the probable range for the predicted quantiles, established from the standard
errors of stepwise regression process, and illustrates an example calculation for UOFM.
This chapter provides the results of the verification and evaluation of the UOFM equation for
predicting flood flows. A separate analysis was also performed to check the applicability of the
equations for urban watersheds. Finally, a comparison study was performed to check the
performance of the equations relative to the methods currently used in Ontario.
This chapter discusses the important conclusions of the thesis with respect to the research
objectives. Limitations of the current investigation and regression-based flood methods are
discussed. This may serve as the recommendations for additional research for improving the
peak flow estimates for Ontario.
5
Chapter 2
Literature Review
The following sections provide an overview of the relevant literature related to Regional Flood
Frequency Analysis (RFFA) and the methods adopted for the current investigation. It provides a
background highlighting the need of RFFA procedures and its relevance to the MTO. It
subsequently discusses the factors which affect peak flows for any drainage catchment. This
section also reviews the methods currently used for regional analysis and reviews recent
investigations in various provinces of Canada and the United States. Subsequently, this section
reviews the available data and procedures for the development of RFFA equations through
multiple regression analysis.
2.1 Background
Provincial Highway Directive B-237 sets forth the MTO Drainage Management Policy and
Practice (Ontario Ministry of Transportation, 2008) for the province of Ontario. Ministry of
Transportation and Communication (MTC) as was known previously, now MTO has been
publishing its drainage design manuals since 1979. It has undergone several revisions since its
original publication and outlines the existing policies and design methodologies adopted by
MTO to date. In 1989, the MTO Drainage Management Technical Guidelines was prepared
which outlines the MTO standards and was used in conjunction with the drainage manual. Since
1997, the MTO has issued a single Drainage Management Manual (DMM) to replace the two
previous manuals. DMM includes the standards of practice and design methodologies but did not
include the drainage management policies guiding the practices and design. In 2008 MTO
published its Highway Drainage Design Standards (HDDS). This document outlines the existing
drainage design standards for components of highway infrastructure that have been adopted by
MTO over the years. The HDDS focuses on the highway surface drainage, water crossings and
storm water management for different components of the highway infrastructure. The HDDS
provides recommendations for the return period of design flows to be considered for various
highway classifications based on its function. The return period for various highway
infrastructures is summarized in Table 1.
6
The MTO owns and operates approximately 2800 bridges across the province (Ontario Minstry
of Transportation, 2015). MTO requires peak flow estimates (flood quantiles) of various return
periods to size new hydraulic structures or for repair of existing structures. For design of these
structures, like brides and culverts, a return period is specified, for example T= 100 years for a
freeway bridge of more than 6 m span (Ontario Ministry of Transportation (2008),
Watt et al. (1989)). A return period of a flood may be defined as the average time between two
flood events of similar intensity. So, it may be expected that large flood events have large return
periods and vice-versa (Rao and Hamed, 2000). A T-year return period instantaneous flood has a
recurrence interval of T years or the annual probability of exceedance of p (=1/T). Thus, a 100
year flood has an annual exceedance probability of 1%. A flood quantile of T-year is the
magnitude of flood corresponding to the exceedance probability of p. Return period of the flood
to be considered for design purpose (design flood) is often decided based on the anticipated
design life of the structure in consideration among other factors such as highway class, annual
daily traffic (ADTs), importance of the structure, etc. (Watt et al. (1989), (Ontario Ministry of
Transportation, 1997)).
2.2.1 Physiography
Physiography is an important determinant of the flood response for a drainage basin. Drainage
characteristics such as area, land use, storage and slope influence the hydrologic response to
floods and vary significantly throughout the province (Moin and Shaw, 1985). One of the earliest
investigations for Southern Ontario by Karuks (1961) identified the dependence of peak flows in
a catchment on the drainage area, storage factor, slope and the stream density. Further
investigations by Moin and Shaw (1986) for the whole of Ontario found the following
physiographic parameters as important: drainage area, slope of the channel, area of lakes and
swamps and the shape factor (parameters mentioned in the order of importance). Analysis by
Joy and Whiteley (1996) and Watt (1994), MIFM and NOHM respectively, concluded drainage
area to be the most significant determinant of peak flows. Other significant factors included the
slope, area occupied by lakes and swamps and the curve number. In recent years, urban flooding
has also become a common and costly phenomenon throughout the province. The
imperviousness associated with the urban environment decreases its ability to absorb and allow
infiltration of rainfall. This change causes more peaked floods than an equivalent rural
environment.
Homogenous region classifications have the inherent assumption that watersheds within a region
exhibit similar hydrologic properties and behavior. The assimilation of information together from
gauged stations within a homogenous region provides a better estimate of the flood quantiles
when the information is transferred to ungauged points of interest during regional analysis. Thus,
delineation of homogeneous regions in a geographical area is an important step before
proceeding with regional flood frequency analysis. Various homogeneous region classifications
have been proposed for Ontario in previous studies based on different criterion (e.g. Moin and
Shaw (1985), Moin and Shaw (1986), Gingras et al. (1994)). The homogenous regions employed
in these flood regression studies are based on (1) flood characterestics (by computing of
regression residuals) up to and including the data till the 1980’s and/or 1990’s, (2) grouping the
regions based on homogeniety tests or (3) the peak flood generating mechanism. Since flood
characteristics may change over time a more general method of regional classification is
required. A classification by the National Ecological Framework identifies the ecozones in
Canada. This classification is based on dividing large geographical units with an ecosystem
perspective. These divisions, called ecozones, depict regions of broadly similar climatic and
8
geological characteristics (Wiken, 1995) and do not depend on the flood data for this
classification, unlike previous studies like Moin and Shaw (1986) and Gingras et al. (1994)).
Thus, it can also provide a common index for comparison of climate and related phenomenon
within Ontario across different research areas. According to the classification, Ontario is divided
into three ecozones shown in Figure 1. The three ecozones present in Ontario are the Hudson
Plains, Boreal Shield and Mixed Wood Plains.
The Hudson Plain ecozone, located in Northern Ontario, has large portions of its land cover in
the form of wetlands. This is a result of poor drainage creating a high degree of water retention
throughout the region (Moin and Shaw, 1985). This ecozone has sedimentary bedrock which
gradually drains to the Hudson Bay and James Bay (Wiken, 1995). The Boreal Shield ecozone is
dominated by the Canadian Shield and forest vegetation. The Shield has a thin soil cover over
9
the rocks, resulting in rapid flows in the streams (Moin and Shaw, 1985). It is also characterized
by large natural storages in the form of lakes and wetlands which attenuate flows
(Moin and Shaw, 1985). The Mixed Wood Plains represent one of the most fertile and
productive ecozones of Canada and most of the urbanization has concentrated in this region
(Wiken, 1995). The soil in this region is well drained and has marshes which provide some
storage to runoff (Moin and Shaw, 1985).
2.2.2 Climate
The climatology across Ontario varies and so do the flood causing mechanisms. Flooding occurs
due to various climatic and hydrologic factors such as snowmelt, spring rainfall, thunderstorms,
hurricanes, ice jams and/or a combination of these factors (Gingras et al. (1994),
Moin and Shaw (1985)). Spring rainfall is the most common cause of flooding across the
province (Moin and Shaw, 1985). The recent floods due to spring rainfall in Peterborough in
June 2004 caused damages in millions of dollars (City of Peterborough , 2005). The 49th parallel
storm in June 2002, also due to excessive precipitation (approximately 400 mm) caused severe
flooding in north-western Ontario and other parts of Canada and the United States. It caused
damages in excess of $31 million, impacted infrastructure and also affected the local First Nation
communities (Hebb and Mortsch, 2007). Thus, climate and specifically precipitation is a variable
factor throughout the province and its various forms determine the flood causing mechanism.
The isohyetal map presented in Figure 2 shows the variation of mean annual precipitation across
the province. This is based on the data from Climate Normals published by Environment Canada
for 151 weather stations across Ontario from 1981-2010. Average annual precipitation ranges
from less than 700 mm in the northwest part of the province to more than 1250 mm in the
“snowbelt” east of Lake Huron.
Previous studies (Karuks (1961); Moin and Shaw(1986)) have identified the precipitation over a
region as an important determinant of the observed stream flows. However, in the currently used
regional analysis methods, NOHM and MIFM, precipitation over a drainage catchment was
either not considered or found to be statistically insignificant during the investigation procedures
by Watt (1994) and Joy and Whiteley (1994). In contrast most American studies such as
Capesius and Stephens (2009), Waltemeyer (2008) and Landers and Wilson (1991) for the states
of Colarado, New Mexico and Mississippi, respectively, as well as studies for Maritime
10
provinces Newfoundland (Rollings, 1999) and New Brunswick (Aucoin et al. 2011) all consider
precipitation to be a crucial parameter for predicting peak flows.
assumption does not hold for situations where there are large storage effects (Watt et al., 1989)
which have an attenuation effect on peak flows. The second technique, called the multiple
regression involves developing a regression based relation between the peak flows of different
return periods or the mean annual flow and the physiographic and/or climate parameters using
various regression procedures. Multiple regression and its various procedures are discussed in
detail in Section 2.6.3.Watt et al. (1989) identified direct regression of quantiles of different
return periods as an improvement over index flood method. Regression based methods have a
limitation on their applicability and should only be applied to watersheds with basin
characteristics within the range of those used for the development of the regression equation
(Watt et al. 1989). Additionally, the data should represent natural flow conditions and the
equation should not be applied to watersheds with basin characteristics outside of the range of
parameters used to develop the equations. Regression based methods have also started to have
wider acceptability in various provinces of Canada and the United States. Watt et al. (1989)
mentions the studies based on direct regression of quantiles in the late 1980s. Other recent
examples include Waltemeyer (2008) for New Mexico, Capesius and Stephens (2009) and Vaill
(2000) for Colorado, Eash (2001) for Iowa; Rollings (1999) for Newfoundland,
Aucoin et al. (2011) for New Brunswick and Sandrock et al. (1992) for Saskatchewan. All of
these reports use a regression based approach for determination of flood quantiles.
The Modified Index Flood Method (MIFM), presented in Joy and Whiteley (1994) and
Joy and Whiteley (1996), is based on the basic form of the Index flood equation used for the
estimation of the mean annual flood. The procedure is modified for prediction of Q25 quantile
12
and its implementation is illustrated in the drainage management manual (Ontario Ministry of
Transportation, 1997). The MIFM requires estimates of the Curve Number (CN) to compute its
corresponding Base Class.
The channel slope and storage are then used to compute the adjustments in the base class and the
adjusted watershed class is calculated.
𝑆𝑊 0.5
𝑆𝑙𝑜𝑝𝑒 𝐴𝑑𝑗𝑢𝑠𝑡𝑚𝑒𝑛𝑡 = 1.815 ∗ [{ } − 1] Equation (2)
0.004
where SW = the slope of the drainage basin (dimensionless); SA =Storage Area (%)
A range has been established (Table 2) for the adjusted class and the associated class coefficient
and the final class coefficient is ascertained by interpolation.
Table 2: Relationship of Watershed Class with Class Coefficient (Joy & Whiteley, 1996)
Watershed Class Class Coefficient, C
1 0.15
2 0.22
3 0.31
4 0.44
5 0.63
6 0.90
7 1.29
8 1.84
9 2.62
10 3.74
11 5.34
12 7.63
13
The Modified Index Flood Method (MIFM) calculates the 25 year flood quantile from Equation
(5). The 2.33-year, 5-year, 10-year, 50-year and 100-year quantile are calculated from the
established ratios (Table 3).
where C is the class coefficeint and A is the total drainage area (km2).
Table 3: Ratios to Flood Quantiles of Different Return Periods to the 25-year Quantile (Joy
& Whiteley, 1994)
Basin Type Return Period (yrs)
2.33 5 10 25 50 100
Non Detentive type Southern
0.49 0.66 0.81 1.00 1.16 1.32
Basins
Shield and Detentive type
0.57 0.71 0.84 1.00 1.13 1.27
Southern Basins
North Shores of Lake Erie and
0.41 0.62 0.79 1.00 1.16 1.32
Ontario
The advantage of MIFM is its applicability to drainage areas in both the Shield type and
Southern drainage basins. It is however restricted in watershed size and cannot be used for
drainage areas less than 25 km2. The classification of basin types presented in Table 3 is used as
a design chart by MTO. However, the definitions of detentive and non-detentive basins are not
explicitly stated in Joy and Whiteley (1994), Joy and Whiteley (1996) and Ontario Ministry of
Transportation (1997), which creates a situation of uncertainty for engineers and designers. Also,
the computation of CN can also be challenging for rapidly urbanizing watersheds. The flow
predictions for these watersheds may not be close to the observed flows due to the urban flood
control measures within a drainage basin. An example in Joy and Whiteley (1996) predicted the
change in peak flow by approximately four times ( from 12 m3/s to 46 m3/s) when the CN is
increased by 10 units (from 60 to 70) for a medium sized watershed (60 km2) located in Southern
Ontario. The sensitivity of flow prediction to CNs necessitates tools and methodologies for its
accurate estimation in a drainage basin. These are not yet available for the province of Ontario.
Thus, CN estimates depend on the judgement of the project engineer. The MIFM prediction for a
14
Shield type basin, however, does not require CN estimate and the watershed class is calculated
from the percentage of water detention. As per the procedures for MIFM (Shield), the calculation
of percentage of water detention considers the area of lakes and wetlands only if the total
drainage area is greater than 100 km2. It does not consider the area of lakes and wetlands for
smaller watersheds.
The Northern Ontario Hydrology Method (NOHM) (Watt,1994) is a regression based estimation
of the mean annual annual flood from the distribution parameters (the mean, standard deviation
and skewness). These distribution parameters are related to the basin characteristics. A frequency
factor relation is used to determine the other quantiles of interest from the assumed regional
distribution (Watt, 1994). The calculation of peak flow is illustrated from Equation (6) to
Equation (12). A T-year maximum daily flow value is calculated from Equation (9). A peaking
factor (P) is calculated and applied which is based on the outlet point and if it is a lake outlet
Equation (11) is chosen. The peaking factor is used to compute the maximum instantaneous
peak from Equation (12).
1.06 𝐴𝑑 2.07
𝑄𝑚 = 0.170 ∗ 𝐴 ∗ (1 − ) Equation (6)
𝐴
𝐴𝑑 1.85
𝐶𝑣 = 0.502 ∗ (1 − ) Equation (7)
𝐴
𝐴𝑑
𝐶𝑆 = −2.52 + [3.73 ∗ (1 − ) Equation (8)
𝐴
where Qm is the mean annual flood (m3/s), Cv is the coefficient of variation, Cs is the coefficient
of skew, Ad is the area of lakes and swamps (km2) and A is the total drainage area (km2).
𝑄𝑇 = 𝑄𝑚 (1 + 𝐾(𝑇,𝑔) ∗ 𝐶𝑣 )
Equation (9)
𝐴𝑑
𝑃 = 1 + exp [−22 ∗ ( − 0.06)]
𝐴 Equation (10)
Or
15
𝐴𝑑
𝑃 = 1 + 6 ∗ 𝐴−0.36 ∗ exp [−22 ∗ ] Equation (11)
𝐴
Where QT is the T-year maximum daily flow, K(T,g) is the frequency factor;
P is the peaking factor and Qp,T is the peak flow with a T year return period.
The advantage of NOHM is that it is developed specifically for the Shield region where large
storages affect the rainfall-runoff response (Ontario Ministry of Transportation, 1997). However,
this method is only applicable to watersheds with drainage areas between 1-100 km2. The small
dataset used for the study (11 hydrometric stations) limits the accuracy to a regression based
method to observed flows in a larger geographic region. At the same time, the verification study
was not extended to stations which were not a part of the analysis. Also, as discussed in Section
2.2.2, precipitation is an important parameter which varies across the province and was not
considered during the development of NOHM.
16
Min. no. No of Range of Non-
Province Reference of record stations Drainage Predictors considered Regulation Parametric
years used Area Used Screening
Drainage area, basin slope upstream,
basin elevation, maximum
Waltemeyer, 0.059 mi2-
New Mexico 10 293 precipitation intensity ( storm of 24- No No
2008 12,7000 mi2
hour & recurrence interval 100
years, mean annual precipitation
Drainage area,
main channel slope, average
Soong et al., 0.03 mi2- permeability, % area of open water
Illinois 10 288 No No
2004 9554 mi2 & wetland, basin length, basin width,
main-channel length, & 2-day, 24-
hour rainfall depth
17
18
The chart (reproduced in Figure 3) illustrates that for short length of stream records and
prediction of large design floods, reliance on only station frequency analysis should not be
19
preferred. On reading the design chart it can also be inferred that while performing station
frequency analysis a record length of 10 years may be sufficient to predict a design flood of
approximately 50 years. However, to predict a 100 year design flood, the record length should be
approximately 25 years. Thus, it is important to use judgement for a balance between the
minimum acceptable record length, degree of extrapolation required and uncertainty in the
prediction of flood quantile.
The Annual Maximum series may sometimes lead to loss of information as only a single peak
flow of a given year is considered and the subsequent second or third peak of the same year are
ignored. These subsequent flows, ignored in AMS series, may be greater than the peak flows of
other years (Rao and Hamed, 1999). On the contrary, POT consists of all data / flow records
above a given threshold level that may be selected based on the number of available records and
the selected threshold. It is generally used when the record length is short. The inter event time
between POT events is also not equal. A minimum inter event time may be selected to ensure the
independence of the data series. Adamowski (2000) observed that the POT models are not useful
in the analysis of events which results from more than one flood causing mechanism like
combination of spring rainfall, snowmelt, thunderstorms etc. The condition of bimodal data may
not be new to the climatic conditions in Ontario where flooding may occur due to a combination
of different mechanisms (Moin and Shaw (1985), Gingras et al. (1994)). Thus, if sufficient data
is available, AMS series can be considered for analysis of streamflow data.
Water Survey of Canada (WSC) collects hydrometric data at its stream gauging stations across
Canada. Its central database, HYDAT, contains flow data such as daily and monthly mean flow,
water levels, sediment concentration, peak flow etc. WSC operates 2,500 gauging stations across
Canada. Achieved data of approximately 5,500 stations is also stored in HYDAT (Environment
Canada, 2011). Reference Hydrometric Basin Network (RHBN) is a subset of the national
network which is available for long term hydrological monitoring. These stations are a part of the
Global Climate Observing System (GCOS) and the long term flow records available from RHBN
stations maybe useful while dealing with pressing issues like climate change phenomenon.
The average flow recorded over a day, referred to as the average daily flow, and instantaneous
peak flows are reported at stream gauging stations throughout Ontario in the HYDAT database.
For a given stream gauge, an Annual Maximum Average Daily (AMAD) dataset reports the
20
maximum value of the average daily flows recorded for each year of the historical stream record.
A single maximum instantaneous flow value recorded over a year is also reported as the Annual
Maximum Instantaneous (AMI) flow. For RFFA, AMAD data series are generally used if AMI
data series are not available. If AMAD series is used for flood frequency studies it is
recommended to apply a peaking factor to the predicted T-year quantiles (Watt 1994). As evident
from Figure 4, which provides a comparison of AMI and AMAD values for a representative
station it is evident that the values of AMAD can vary considerably in relation to the AMI flow
data. It may be observed that AMI flow data provides a better representation of the peak flow
conditions in a drainage basin and should ideally be selected if sufficient records are available.
This view has also been supported by previous studies such as Sangal (1981).
Figure 4: Flow Data Comparison for a Typical Station (HYDAT station: 02HB021)
Historically, numerous procedures such as the Fullers Method and Langbein’s Approach
(Moin and Shaw, 1985) have been used to predict instantaneous peak flows from average daily
flows when gaps in the stream record exist. Watt et al. (1989) suggest that the missing or
incomplete flow data may be ascribed to two reasons: broken series or incomplete records. When
a broken series is not related to the magnitude of the event, such data, with gaps, may be
combined and used as a single dataset. An incomplete record on the other hand is when an
extreme event has left the gauging station un-operational. Such missing events should be
21
1. Fuller’s Method is based on the data for 24 drainage basins from Eastern United States uptill
1914. He plotted the ratio of ((Qp- Qm)/ Qm) with the drainage area on a log-log scale and
derived a curve with the following relationship.
where Qp is the peak and Qm is the mean flow and A is the drainage area (km2).
Sangal (1981), after investigation, opined that this method represents a statistically poor
relationship with an R2 of 0.48 but it continues to be used due to a lack of alternative
approaches.
2. Langbiens Approach: This method uses the data of mean flows for three consecutive days
(Q1, Q2 (or Qm) and Q3; where the mean annual flow is the flow of the second day), the peak
flow (Qp) and the time for the peak. The ratio of the Qp and Qm are described as functions of
Qm/Q1 and Qm/Q2. Thus, for similar ratios of the flows for the three consecutive days,
Langbiens approach gives similar ratios for Qp and Qm. This method therefore neglects any
effect of the size of drainage catchment (Sangal, 1981; Moin and Shaw, 1985).
3. Sangal’s Method: Sangal (1981) has put forth a procedure, which is an extension of the
Langbiens concept, for determination of peak flows from average daily flows. This method
has been developed and successfully applied for Ontario’s context in previous flood frequency
studies by Moin and Shaw (1985) and MNR (2014). Sangal (1981)’s procedure for prediction
of missing AMI flow values also uses the average daily flow for three consecutive days (Q1,
22
Q2 and Q3). The second position (Q2) is occupied by the AMAD flow value of the year. A
parameter called the base factor K is also employed in the study by Sangal (1981) which is the
base of the assumed triangular hydrograph. Equation (14) depicts the general form of the
relation between the instantaneous peak flow and average daily flow for any year. For the
station years having both AMI and AMAD flow data, Equation (14) was employed to
estimate the base factor (K) for the year. For the current study, the average of the base factor
for all the years was adopted as the base factor for the station. The station base factor was
subsequently used in Equation (15) to predict the AMI values for any given year which had
the AMAD flow data available.
Where, QP = peak flow (m3/s); QP' = predicted peak flow (m3/s), Q1, Q2 and Q3= mean daily
flow (m3/s) for 3 consecutive days where Q2 represents the AMAD value of the year.
Sangal (1981) study yielded 79% of predicted peaks within ±20% of their actual values for
Ontario demonstrating that it as an effective method for predicting AMI flows. Sangal (1981)
provides the value of the parameter K for 387 watersheds in Ontario, which can be used during
estimating the peak flows for situations when the K value cannot be predicted. However, in the
current dataset of stream flow records most station years have both AMI and AMAD data. Thus,
it was possible to estimate K values for all individual watersheds used in this study instead of
using the K values provided in Sangal (1981).
interest. The land cover information from OFAT is extracted from the land cover maps of
Ontario which combines the compilation from Provincial Land Cover Database, Far North Land
Cover and the Southern Ontario Land Resource Information System. Additionally, OFAT III
runs various Hydrology models at ungauged points of interests to calculate stream flow at these
locations. A limitation of OFAT is that it assumes natural flow condition and any regulation in
flow are not considered. A review of the data from OFAT is provided in Table 5.
Empirical Bayesian Kriging (EBK) is a geo-statistical Interpolation method which helps to make
predictions at unknown locations using values at known locations. EBK takes into account the
errors introduced by the variance of difference between two locations (Environmental Systems
Research Institute, Inc. (ESRI), 2012). Thus, EBK has an inherent advantage over other
interpolation methods, like inverse distance weighing (IDW), which tend to underestimate the
standard errors of prediction. Interpolation techniques are generally used for preparation of
isohyetal maps for region. It has also been widely used in flood frequency analysis in the United
States to prepare skew maps in various provinces like Illinois and Iowa (Soong et al. (2004),
Eash (2001)).
testing does not assume any underlying distribution and the evaluation are based on assigning
ranks to the dataset. These tests help in ensuring that a probabilistic model applies to the dataset
(Watt, 1994). The necessity of these tests has also been highlighted in Watt et al. (1989) and Rao
and Hamed (2000). The compliance with the aforementioned assumptions has also been tested in
previous studies like Watt (1994), Moin and Shaw (1985) and MNR (2014). The description of
these tests, given in Appendix A & B of Ottawa River Flood Mapping (1984) and Rollings
(1999), is reproduced in the subsequent sections. The hypotheses for all the tests are generally
accepted at either 1% or 5% significance level. Though Watt (1994) recommends these statistical
testing, rejection of the non-parametric hypothesis is not necessarily a strong evidence of
nonconformity with the statistical assumptions and, as such, rejected cases may require further
investigation. This could involve examining changes in the drainage basin for the beginning and
the end of the record period for urbanization, or changes in flow or storage.
Independent events are those where the probability of occurrence of one of the event does not the
affect the probability of occurrence of the second event (Rollings, 1999). Thus, it tests the
significance of correlation coefficient between N-1 pairs of ith and (i+1)th event. The significance
of the correlation coefficient helps in establishing the independence of the data series
(Rollings, (1999), Ottawa River Flood Mapping (1984)). Pearson coefficient has an underlying
assumption of a normal sampling distribution. Thus, for flood frequency studies where a single
distribution cannot be ascribed to the dataset with certainty, a non-parametric form based on
ranking of dataset is used Ottawa River Flood Mapping (1984). Spearman rank order serial
correlation coefficient for independence, detailed in in Ottawa River Flood Mapping (1984) and
(Rollings, 1999) is used to test the assumption of independence for the dataset of each station.
The null hypothesis for the test is that the two series are independent. The process is illustrated
below:
The data series Q1, Q2, Q3………Qn-1 is represented in chronological order and xi denoting the
ranks of Qi. Similarly, Q2, Q3………Qn is represented in chronological order and yi denoting the
ranks of Qi.
1
1 2 2 2 2 2 2
−
Equation (16)
𝑆1 = ( ∑ 𝑥𝑖 + ∑ 𝑦𝑖 − ∑ 𝑑𝑖 ) (∑ 𝑥𝑖 ∑ 𝑦𝑖 )
2
𝑚3 − 𝑚
∑ 𝑥𝑖2 = − ∑ 𝑇𝑥 Equation (17)
12
𝑚3 − 𝑚
∑ 𝑦𝑖2 = − ∑ 𝑇𝑦 Equation (18)
12
where di is difference in rank of xi and yi ; m=N-1and summation is taken for m pairs of xi and yi.
The moment of T adjusts for the tied ranks and is calculated as follows:
𝑟3 − 𝑟
𝑇𝑥 = Equation (19)
12
where r is the number of observations tied at a given rank. ∑ Tx 𝑎𝑛𝑑 ∑ Ty are then extended to
all the tied ranks.
For N less than 10, special tables are available for defining the region of rejection for S 1 at given
significance level. When N is 10 or greater, then the function t is distributed like students t and a
one tail test is used to test the significance of the hypothesis.
1/2
𝑚−2
𝑡 = 𝑆1 [ ] Equation (20)
1 − 𝑠12
The land use conditions of a watershed change with time thereby causing changes in flow data
series. If flow conditions are changing with time, a trend in the flow series may be observed.
Spearman rank order correlation coefficient is used to test the stationarity of the data set. The test
process from Ottawa River Flood Mapping (1984) is illustrated below. The null hypothesis for
the test states that the the there is no trend or serial correlation between the dataseries.
26
The data series Q1, Q2, Q3………Qn is represented in chronological order and yi denoting the
ranks of Qi . Similarly, 1, 2, ……N is represented in the sequential order and x i denoting the
ranks of Qi.
The spearman rank order correlation coefficient, illustrated in Ottawa River Flood
Mapping (1984), is computed as:
1
1 −
2
𝑟𝑠 = ( ∑ 𝑥𝑖2 + ∑ 𝑦𝑖2 − ∑ 𝑑𝑖2 ) (∑ 𝑥𝑖2 ∑ 𝑦𝑖2 ) Equation (21)
2
where Equation (17) and Equation (18) are used to compute the value of ∑ 𝑥𝑖2 𝑎𝑛𝑑 ∑ 𝑦𝑖2 .
Here di is difference in rank of xi and yi; m=N; summation is taken for m pairs of xi and yi and
∑ 𝑇𝑥 = 0 and ∑ Ty is calculated as in Equation (19).
For N less than 10, special tables are available for defining the region of rejection for 𝑟𝑠 at given
significance level. When N is 10 or greater, then the function t is distributed like students t. The
null hypothesis states that there is no trend either upward or downward so a two tail test is used
to test the significance of the hypothesis.
𝑁 − 2 1/2
𝑡 = 𝑟𝑠 [ ] Equation (22)
1 − 𝑟𝑠2
Homogeneity tests take into consideration any abrupt changes in the drainage basin, like
construction of a reservoir etc., by analyzing two sub samples from the drainage basin (Rollings,
1999). Mann-Whitney split sample test is used to ascertain the homogeneity of the sample. The
condition of non-homogeneity may be possible in hydrology due to natural as well as
anthropogenic reasons (Ottawa River Flood Mapping, 1984). The procedures for the Mann-
Whitney test, highlighted below, help in identification of non-homogeneous flood series. The
null hypothesis of the Mann Whitney U-test is that the two samples are from the same population
(homogeneous).
The sample is split into two sub-samples and ranks are assigned. The Mann-Whitney U statistic
is computed. It is defined as the smaller value of U1 and U2.
27
𝑛1 (𝑛1 + 1)
𝑈1 = 𝑛1 𝑛2 + − 𝑅1 Equation (23)
2
𝑈2 = 𝑛1 𝑛2 − 𝑈1 Equation (24)
where n1(smaller sample) and n2 are the sample size ; R1 is the sum of ranks in n1.
The significance of U is ascertained by assessing the critical values and the associated regions of
rejection which have been tabulated and published. For large sample size, a normal variate z
(0,1) as Equation (25) and the associated regions of rejection at different significance levels are
analyzed.
𝑛 𝑛
𝑈 − 12 2
𝑧= 1/2 Equation (25)
𝑛1 𝑛2 𝑁3 − 𝑁
{[ ] [( 12 ) − ∑ 𝑇]}
𝑁(𝑁 − 1)
Runs test, performed by calculating the runs below and above the median, is used to test the
randomness of the flow series for each station (Moin & Shaw, 1985). A run is a group of data
items which follow a sequence with similar adjacent elements. The runs test designates the data
into two different categories with values above and below the median (SPSS Statistics 21 Help,
2012). The number of ordered sequence of each group gives the runs in the sample. The null
hypothesis for runs test is that the sequence is random. The significance associated with the
number of runs helps decide the acceptance or rejection of the hypothesis (Mathworks, 2015).
type 3 and the Generalized Extreme Value (Watt et al., 1989). Historically, a 3-Parameter Log
normal distribution has been adopted for studies like Moin and Shaw (1985) and Joy and
Whiteley (1994) in the province of Ontario Similarly, Log-Pearson type 3 has been an accepted
probability distribution in the United States recommended by the Interagency Advisory
Committee on Water Data (1982). It has subsequently been adopted in all flood frequency
studies in the United States including those highlighted in Table 4 in Section 2.4.2. Similarly,
Generalized Extreme Value is the recommended probability distribution in the United Kingdom
(Chow and Watt, 1992). Chow and Watt (1992) argue that there may be more than one
distribution which fits the data. Thus, recommendation of a single distribution for a large
geographical region may not be desirable.
Chow and Watt (1992) recommend the Akaike Information Criterion (AIC) to select the best
probability distribution amongst a set of candidate distributions. AIC gives a model selection
criterion based on the combination of model fit, determined by the log likelihood term in
Equation (26), and the number of parameters (k) of the model determined by the second term in
Equation (26). A combination of these two terms would result in a unique value. According to
the Akaike model selection procedure the model with a minimum AIC value best describes the
sample data set and should be selected. So, choosing a distribution with more number of
parameters is not held by this selection criterion because there is an additional uncertainty
associated with parameter estimation, which increases as the number of parameters increase. The
improved fit and the number of parameters should compensate each other thereby resulting in a
minimum AIC value, as can be observed from Equation (26). Goodness-of-fit tests such as the
Kolmogorov-Smirnov or Chi-Square tests could not be adopted as these methods have a
tendency to select a distribution with more number of parameters, which essentially means a
better fit. However, the uncertainty inherent with these additional parameters is not reflected
during distribution selection by goodness-of-fit tests. This situation ultimately creates a false-
sense of certainty and confidence when applied to real-life design applications. AIC has also
been previously applied to the context of Northern Ontario in the study by Watt (1994) which
resulted in minimizing the relative standard errors of the RFFA equations when compared to the
results obtained by previous studies. Thus, considering all the above mentioned factors
distribution selection using AIC was identified as the best approach that can also be extended to
the current study.
29
Burnham and Anderson (2004) identified a limitation in existing model selection literature when
AIC value is used for small sample sizes (defined by n/k < 40; where n is the sample size and k
is the number of parameters). Using Equation (26) researchers have concluded that for small
sample sizes AIC sometimes over fits the data (Burnham and Anderson, 2004). Overfitting
implies that the statistical model has random errors and the model is not a true representation of
the underlying condition. For finite and small sample sizes a second order criterion as proposed
by Burnham and Anderson (2004). The data series generally available for flood frequency
studies are relatively small so a second order criterion, like in Equation (27) is applied.
2𝑘(𝑘 + 1)
AIC = −2 log(𝐿) + 2k + Equation (27)
𝑛−𝑘−1
For large sample sizes Equation (27) converges to Equation (26). However, for the purpose of
the current study, Equation (27) is used to estimate the AIC for each candidate distribution.
Statistical software’s can be used to estimate the value of the likelihood function and also the
associated station flood quantiles (2 year, 10 year, 25 year, 50 year and 100 year). These
software’s use different fitting methods like the method of moments, method of maximum
likelihood, L-moments etc. A distribution fitting software, Easyfit, with its excel add-in, utilizes
the least computationally intensive method for estimation of underlying distribution and the
distribution parameters. The methods used for parameter estimation for the different candidate
distributions are available in its software documentation. A comparison study for two
representative stations was also carried out in ‘R project’, an open source programming language
for statistical computation, which presented identical results of AIC values.
Alberta Transportation (2001) argues that the selection of an appropriate probability distribution
is more important than the differences caused by various fitting methods (like method of
moments, method of maximum likelihood, for example) for prediction of flood quantiles.
30
Chow and Watt (1992) recommended the used of AIC as a distribution selection criterion.
However, it may also be observed from Table 3 of Chow and Watt (1992) that for the 42 stations
used in the study, the difference in the AIC for the best and second best distribution is very
small. For instances when the difference in AIC values is small, Burnham and Anderson (2004)
recommends a range of difference in AIC values between the model with the lowest AIC and
other candidate distribution models when a particular model cannot be accepted with certainty.
For such models, Burnham and Anderson (2004) recommended calculation of Akaike weights as
in Equation (28) and averaging of the estimates from each candidate model based on these
weights.
∆𝑖
exp (− 2 )
𝑤𝑖 = 𝑁 ∆𝑖 = AIC𝑖 – AIC𝑚𝑖𝑛 Equation (28)
∆𝑟
∑ exp (− 2 )
𝑟=1
Thus, appropriate weights may be applied to the quantile estimates of the candidate distributions
to compute the station quantiles.
chooses the independent variables. It maximizes the coefficient of determination (R2) and
minimizes the number of predictors at the same time (Field, 2009). Thus, all independent
variables may not be a part of the final regression equation and the most significant variables are
chosen. The third procedure, called hierarchical regression, is generally used for testing
theoretical models of a researcher. The variable entry order is decided by the researcher and is
based on a theoretical justification (Field, 2009).
During the stepwise regression process, a variable is added and is retained at a particular
significance level (generally 5-percent significance for regression analysis). This variable may be
removed when a variable added in a subsequent step has higher significance (Field, 2009).
Stepwise linear regression procedures in SPSS also help in identifying multi-collinearity through
its collinearity diagnostics (Field, 2009). During regression analysis, the results of the
collinearity diagnostics are adopted by SPSS and the corresponding variable demonstrating
multi-collinearity is automatically dropped.
The stepwise regression procedure identifies a relation between the dependent variable (i.e. the
T-year quantile in the current study) and the predictor variables (i.e. climate and physiographic
parameters).The variables that are required for regression analysis with respect to flood
frequency studies can be classified as follows:
1. Response variables: Response variables are those which are measured and are also called the
dependent variables. Thus, the flood quantiles in flood frequency studies may be treated as
the response variables. Each of the variables should be separately tested with all the
predictor variables.
2. Predictor variables: Predictor variables are the independent variables used for development
of a regression equation with the dependent variables. These variables affect the response.
The physiographic and climatic variables in flood frequency studies may be considered as
the predictor variables.
Predictor variables used in a RFFA study should not be correlated with other predictor variables.
A condition when the predictor variables are related to each other is also referred as multi-
collinearity, which can be identified from the values of correlation coefficients. Parameters that
32
are highly correlated have the correlation coefficient close to 1 or -1 in their correlation matrix.
The highly correlated parameters need to be removed from the study unless there is an
appropriate rationale against the removal of variables. RFFA studies (Joy and Whiteley (1996),
Watt (1994), Karuks (1961)) have concluded that drainage area is the most important parameter
for determining peak flow. Thus, predictor variables correlated with the drainage area would
essentially convey the same information about peak flow and thus cannot be treated as its
independent determinant (Watt,1994). Previous studies on flood frequency analysis
(Watt (1994); Rollings (1999)) have also highlighted the importance of the effect of lakes in
decreasing the peak flows and a factor to represent this effect was adopted.
Another important assumption of multiple regression is the condition of normality for both the
response and predictor variables. It is essential to ascertain this assumption before proceeding
with the regression analysis. The assumption for normality can be tested by confirming if the
ratio of skewness and standard error (S.E.) is less than 1.96 or by visual examination of the
histograms before and after transformation. Generally, the annual maximum flow data is highly
right skewed (Chin, 2013). Thus, to perform a regression analysis the data is commonly log
transformed. Log transformations to base 10 have previously been applied to regression analysis
(GREHYS, 1996; Grover et al., 2002; Moin and Shaw, 1986). Logarithmic transformations,
when applied on all the dependent variables and predictor variables, help to linearize the
assumed power equation and to achieve equal variance (normality) required for linear regression.
Thus, a general form of the power function, Equation (29), is assumed which on log-
transformation took the form Equation (30), to be used as the linear regression model.
𝑄 = 𝐾𝑅 ∗ 𝐴𝑎 ∗ 𝐵 𝑏 ∗ 𝐶 𝑐 …. Equation (29)
where Q represents the dependent variable; A, B and C represent independent variables; KR, a, b
and c are constants.
2.7 Summary
The detailed review of the hydrology aspects of flood frequency analysis was conducted in the
above chapter with emphasis on the physiography and climate of Ontario. The chapter reviewed
33
statistical methods and the recent RFFA studies. Drainage area is the most important contributing
factor when predicting peak flows and urbanization causes changes the flow regime in
catchments. The climate in general and more specifically precipitation is the important flood
causing mechanism. Non parametric screening to assess independence, stationarity, randomness
and homogeneity of flow data has not been investigated in most of the studies from the United
States. But few Canadian studies have adopted non-parametric testing to assess the quality of
streamflow data. This is important considering the temporal changes in drainage catchments and
verifying the statistical assumptions before applying statistical methods. Different statistical
methods have been identified in this section and the availability of statistical packages has
caused a shift towards regression based RFFA equations. The chapter establishes the foundation
for the methodology used in Chapter 3.
34
Chapter 3
Methodology
The subsequent sections outline the thesis methodology. The study area for the project, i.e. the
province of Ontario, is divided into ecozones. Flow data were obtained for the gauged stations
across the province. The physiographic parameters were obtained from the Ontario Flow
Assessment Tool (OFAT) III and the precipitation data were obtained from Environment Canada
weather stations. Subsequently, a regression based regional flood frequency analysis is
performed.
Historical stream flow datasets were downloaded for all 271 HYDAT stations in Ontario that
have been operating for over 15 years (Figure 5). Flow data collected by WSC up to and
including December 31st, 2014 were considered for this analysis. All of the 271 stations represent
natural flow conditions and regulated stations were excluded in this study. Regulation affects the
natural flow data and stream flow series, if used, are not reflective of the true hydrologic
conditions of the catchment. It was noted that the actual station record length available was
occasionally shorter than the stations operational length (15 years selected as the threshold). In
all cases the AMI series were shorter than AMAD series. An example of the annual maximum
series for a representative station is provided in Appendix A. The complete dataset for 271
stations used for the analysis (AMAD, AMI and the daily flows) is not reproduced in the thesis
35
and is available from the Hydrometric data portal of WSC. A summary of the record data for all
HYDAT stations used in this study is provided in Appendix B.
Physiographic data (i.e. watershed characteristic and land cover) associated with each HYDAT
station were obtained from the Ministry of Natural Resources and Forestry (MNR)’s web-based
Ontario Flow Assessment Tool (OFAT) III utilizing latitude and longitude information collected
from the WSC portal. An example of the data obtained from OFAT is provided in Table 5.
Mean annual precipitation data was collected from the Climate Normals published by
Environment Canada for all 151 weather stations across Ontario from 1981-2010. An isohyetal
map was prepared in ARCMap 10.2 using the Empirical Bayesian Kriging (EBK). The isohyet
36
map is shown previously in Figure 2 and was used to determine the interpolated precipitation
values at the individual HYDAT stations.
02GH011
Additionally, information regarding seven stations could not be extracted as OFAT did not
recognize the latitude and longitude reported by WSC as valid points on the stream. The lack of
basin parameters required for the study necessitated the removal of these stations.
A limitation that was identified during data extraction was that OFAT only includes digital
elevation data that is within the provincial boundaries of Ontario. Parts of a watershed extending
outside of the province are not reported by OFAT. Thus, discrepancies were observed in the
drainage area reported by WSC and obtained through delineation by OFAT, for example, WSC
station 05PC018 (latitude: 48.6344, longitude:-93.9133), shown in Figure 9. OFAT delineates
the boundary through lakes and streams along the Ontario provincial boundary excluding the
contributing watershed area within the adjacent State of Minnesota. This led to gross mismatch
in the areas reported by OFAT and WSC for stations with cross-border watersheds.
For a few other stations, which were not along the provincial border, the root cause of
discrepancies between reported OFAT and WSC drainage areas could not be identified.
Discrepancies with respect to WSC data and observed data have also been highlighted in Joy and
Whiteley (1994). The stations with differences greater than 15% in reported watershed area from
WSC and delineated by OFAT were excluded from analysis. The uncertainty in ascertaining the
correct area and the associated data (e.g. channel length, slope, area of lakes and reservoirs)
forced the removal of these stations. Forty-one such stations were identified and removed.
Two stations with drainage areas less than 1 km2 were removed from the Boreal Shield ecozone
as the station quantiles (calculated in the Section 3.3.2) and the area occupied by lakes and
wetlands for these stations were approximately zero. For the current dataset, there were stations
with very large drainage area (largest drainage area = 13,559.5 km2). These stations had a
drainage area that was more than 5 times the mean of the drainage areas used in this study and
were identified as outliers. Thus, five more stations in the Boreal Shield with extremely large
drainage areas (> 5000 km2) were removed.
The Hudson Plain ecozone is a very remote region of Ontario and MTO does not currently
operate any provincial highways this far north. Only three HYDAT stations remained in this
ecozone after the initial data screening (shown in Figure 10) which was insufficient to support
regression analysis. Hence, subsequent analysis presented was limited to two regions: the Boreal
Shield and the Mixed Wood Plains.
40
Drainage boundary
Drainage area
Provincial boundary
The hypotheses for all statistical tests were accepted at 5% significance level. The summary of
the results of non-parametric tests is provided is Appendix D. For the sake of completeness,
Appendix D includes the urban watersheds and also the stations rejected in previous steps.
Fourteen stations in the Boreal Shield Region and twenty-nine stations in the Mixed Wood Plains
were eliminated due to non-compliance with the hypotheses of non-parametric tests. A map of
the stations, non-compliant with the statistical assumptions, is presented in Figure 11.
42
Akaike Information Criterion (AIC) was used for selection of the best probability distribution
amongst the set of candidate distributions. The model with a minimum AIC value best describes
the sample data set and should be selected. Equation (27) outlined in Section 2.6.2, with the
second order correction for finite datasets was used to compute the AIC value. The analysis was
performed using the excel add-in of the distribution fitting software, Easyfit. The initial results
obtained after AIC analysis indicated a small difference between the AIC values for the 1st and
2nd best fit distribution. Table 6 summarizes the results for the HYDAT data from Ontario.
For the five return periods selected for the study (Q2, Q10, Q25, Q50 and Q100) the station
quantile is estimated as below
where, QT1, QT2 and QT3 = T year quantile estimate for the three best candidate distributions
44
An example of the calculations for station frequency analysis, i.e. computation of AIC, AIC
weights and the station flood quantiles is shown in Appendix E.
𝑊𝐴
𝐿𝑎𝑘𝑒 𝐴𝑡𝑡𝑒𝑛𝑎𝑡𝑖𝑜𝑛 𝑖𝑛𝑑𝑒𝑥(𝐿𝐼) = 1 + Equation (32)
𝐴
Where
Further analysis of the histograms of the independent and dependent variables lead to the
interference that the dataset was highly right skewed. An example for Q25 quantile (dependent
variable) for the 118 stations used in the analysis is shown in Figure 12. As outlined in Section
2.6.3 and Equation (30) the response and predictor variables were log transformed and multiple
regression was subsequently performed.
Variable transformations were performed as outlined in Chapter 2 and stepwise regression was
subsequently performed. The output of the stepwise regression process for the log transformed
variables is enclosed in Appendix F and results are summarized in the subsequent chapter.
Regression analysis was carried out for forty-three stations in the Boreal Shield Region and
seventy-five stations in the Mixed Wood Plains Region. The 118 stations used for flood
frequency analysis, along with the location of Ontario Highways, are shown in Figure 13. The
stations not included in regression analysis, along with the reason for rejection, have been
summarized in Appendix G.
46
Figure 13: Regression Stations and Ontario Highways (Source: Highways, 2014)
3.4 Summary
This chapter presents the methodology undertaken for the development of new regional flood
frequency equations for the province of Ontario. All the data available up to and including Dec
2014 for Ontario, i.e. 271 HYDAT stations were selected. After extension of the data series
through established methods, like Sangal (1981), pre-screening was performed where a minimum
record length for each station was set to be 15 years. Non-parametric testing was performed for
HYDAT stations to verify compliance to statistical assumptions. 41 stations demonstrated non-
compliance and were not a part of the subsequent steps, i.e. station frequency analysis and
multiple regression. Station Frequency Analysis helped to calculate the dependent variables, that
is, the T-year Flood Quantile. Ten HYDAT stations were identified as urban watersheds which
had to be subsequently removed from the analysis due to lack of additional data (w.r.t the
47
urbanization pattern, initial land cover information at the start of the flow series, etc.). Issues
were identified during watershed delineation using OFAT where the location of HYDAT stations
from WSC was not recognized in OFAT. There were instances where the watersheds were
limited to the provincial boundary and its contributing parts in neighboring provinces are not
considered. This caused removal of such station as the data was not reliable. A correlation study
between predictors helped to identify variables which showed correlation with the drainage area.
Area of lakes and wetlands was transformed before including in the regression analysis.
Subsequently, multiple regression was performed for development of regional flood frequency
equation for Ontario. The results, referred as the Unified Ontario Flood Method, are presented in
Chapter 4.
48
Chapter 4
Results of Regional Flood Frequency Analysis
The subsequent chapter presents the results of the regression analysis and the development of the
Unified Ontario Flood Method (UOFM). Computational procedures for the UOFM are presented.
A Design table (Table 8) in conjunction with Figure 1 and Figure 2, can be used by engineers to
compute the T-year flood Quantile. Table 10 is used for the prediction of the range (i.e. the upper
and lower limit) of the T-year flood Quantile.
Where QUOFM = annual flood with a T year return period (m3/s) from the regression analysis
A = drainage area (km2)
LI = lake attenuation index (dimensionless)
P = mean annual precipitation (mm)
KR = 10x (x = value of constant obtained from the output of stepwise regression)
a,b,c = exponents (obtained from the output of regression)
Standard error
T x a b c Adjusted R2
(SE) (log units)
Mixed Wood Plains
2 -5.483 0.756 -3.061 1.837 0.824 0.147
10 -4.139 0.734 -3.780 1.491 0.790 0.165
25 -3.680 0.728 -4.017 1.372 0.769 0.177
50 -3.397 0.724 -4.162 1.299 0.752 0.186
100 -3.151 0.721 -4.287 1.236 0.736 0.195
The standard error of the estimate obtained from the output of the regression analysis is in log
units. Transformation from log units is undertaken to establish the probable range of flood
quantiles (Table 10) following log over addition/subtraction relations (Moin and Shaw, 1985).
These lower and upper limits are applied to the estimated value from Equation (33).
Where 𝑄𝑈𝑂𝐹𝑀 is the result obtained from regression and SE is the standard error of regression in
log units. Thus, the probable range for the QUOFM can thus be established by the relative standard
error:
𝑄𝑈𝑂𝐹𝑀
𝑄𝑈𝑂𝐹𝑀 −
RLL: 10𝑆𝐸 = 1 − 1 Equation (37)
𝑄𝑈𝑂𝐹𝑀 10𝑆𝐸
stream should be analyzed on a catchment level, and not using UOFM, as the flow conditions
downstream of a regulation will not follow a naturalized flow regime. The following example for
HYDAT station 02AE001 illustrates the steps to obtain the flood quantiles by UOFM. This
station, in the Boreal Shield region, is not a part of the regression study. The procedure, as
outlined should be used to predict the T year quantiles at the location of interest.
Step 2: Identify the region in which the drainage basin is located from the ecozone map of
Ontario (Figure 1)
Step 3: Obtain the physiographic parameters required in the regression equations from OFAT:
Step 4: Obtain the mean annual precipitation for the station from the IsoHyetal map shown in
Figure 2.
Step 5: Using Equation (33) and respective coefficients from Table 8 the regression T year
flood Quantiles, QUOFM, are calculated (summarized below). Use the coefficients of the region
where the station is located based the ecozone identified in Step 2.
Step 6: The standard errors of the regression equation were used to establish the lower and upper
limit of the prediction. The range (lower and upper limit) is provided in Table 10. These
percentages are applied to the quantile estimates from Step 5. The final values for T-year
quantile and the lower and upper limit are summarized below.
52
4.3 Summary
The result of the Regional Flood frequency Analysis for Ontario is outlined in Chapter 4, Section
4.1. Section 4.2 presents the methodology for designers for the application of Unified Ontario
Flood Method (UOFM). UOFM aims to provide designers with a tool to predict the T-year flood
quantile for drainage basins where the location of interest lacks stream flow data. The presented
equation and design table (Table 8) are anticipated to be adopted by MTO and replace the MIFM
and NOHM for prediction of design flows at streams and river crossings. The subsequent chapter
discusses the verification and comparison study for the UOFM.
53
Chapter 5
Verification and Evaluation
The verification and evaluation of the UOFM was undertaken and the results of the verification
analysis are illustrated in Section 5.1. As MTO may frequently encounter small sized watersheds
(less than the lower limit for the UOFM), a check was performed for a small urban watershed
(Section 5.2). Additional analysis was undertaken to validate the performance of the equation for
medium to large urbanized watersheds (Section 5.3). Subsequently, a comparison of UOFM with
other existing RFFA methods in Ontario was performed.
QUOFM (m3/s) 11 19 24 27 30
Lower limit of QUOFM (m3/s) 8 13 16 18 19
Upper limit of QUOFM (m3/s) 16 29 36 42 47
QT (m3/s) 0.08 0.23 0.32 0.39 0.46
05QD017
QUOFM (m3/s) 6 10 12 14 16
Lower limit of QUOFM (m3/s) 4 7 8 9 10
Upper limit of QUOFM (m3/s) 8 15 19 22 25
QT (m3/s) 7 11 13 14 16
02HK008
QUOFM (m3/s) 13 20 23 26 28
Lower limit of QUOFM (m3/s) 9 14 15 17 18
Upper limit of QUOFM (m3/s) 18 29 35 39 44
Overestimated UOFM quantiles relative to observed quantiles (QT )
55
250 800
200
600
150
400
100
200
50
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) QUOFM QT Return period, T (years) QUOFM QT
250
Flood Quantiles (Station: 02AE001) 50
Flood Quantiles (Station: 02BF004)
200 40
150 30
100 20
50 10
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years)
Return period, T (years) QUOFM QT QUOFM QT
4
Flood Quantiles (Station: 05QD017) 600
Flood Quantiles (Station: 02FF007)
Flood Quantile (m3/s)
3
400
2
200
1
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) Return period, T (years)
QUOFM QT QUOFM QT
30
Flood Quantiles (Station: 02GA043) 50
Flood Quantiles (Station: 02HK008)
Flood Quantile (m3/s)
40
20
30
20
10
10
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) Return period, T (years)
QUOFM QT QUOFM QT
56
For three of the verification stations the observed flood flows (QT) are not within the probable
range established by the upper and lower limit of the regression relation (shown as bars above
and below QUOFM in Figure 14). Two of these stations (05QD017 and 02GA043) have very small
drainage areas (2.62 km2 and 13.16 km2, respectively) and therefore it is not surprising that
prediction errors are more significant. Station 02KH008 has a medium sized catchment
(89.29 km2) but observed flood quantiles still fall outside of probable range. In all three cases the
UOFM method overestimated the size of the flood. This produces conservative flood estimates
that will likely be larger than observed flood flows.
3
QUOFM (m /s) 4 8 11 12 14
3
Lower limit of QUOFM (m /s) 3 6 7 8 9
3
Upper limit of QUOFM (m /s) 6 12 16 19 22
Overestimated UOFM quantiles relative to observed quantiles (QT )
57
Similar to the verification stations the UOFM produced flood predictions are moderately larger
than observed flood flows. It may be noted that the urban water cycle and urban flow regime can
be drastically different from its pre-development state. Caution and judgement must be exercised
if applying the UOFM to urban catchments. Recommendations for urban flood modelling are
discussed further in Section 5.4.
02DD014
QUOFM (m3/s) 11 18 21 24 27
Lower limit of QUOFM (m3/s) 7 12 14 16 17
Upper limit of QUOFM (m3/s) 16 27 33 37 42
QT (m3/s) 31 53 65 74 83
02GE005
QUOFM (m3/s) 34 62 76 87 98
Lower limit of QUOFM (m3/s) 24 42 51 57 62
Upper limit of QUOFM (m3/s) 47 90 115 133 153
QT (m3/s) 24 36 41 45 49
02CF012
QUOFM (m3/s) 22 38 45 51 56
Lower limit of QUOFM (m3/s) 16 25 30 33 36
Upper limit of QUOFM (m3/s) 32 56 69 79 88
QT (m3/s) 28 41 44 47 49
02GH011
QUOFM (m3/s) 19 35 44 50 57
Lower limit of QUOFM (m3/s) 13 24 29 33 36
Upper limit of QUOFM (m3/s) 26 51 66 77 89
QT (m3/s) 34 56 67 75 83
02HC033
QUOFM (m3/s) 18 35 44 50 57
Lower limit of QUOFM (m3/s) 13 24 29 33 37
Upper limit of QUOFM (m3/s) 25 51 66 77 90
QT (m3/s) 19 25 27 28 29
02HA023
3
QUOFM (m /s) 10 20 25 29 33
3
Lower limit of QUOFM (m /s) 7 14 17 19 21
3
Upper limit of QUOFM (m /s) 14 29 38 44 51
Overestimated UOFM quantiles relative to observed quantiles (QT )
59
Figure 15: Observed and Predicted Quantiles for Medium to Large Urban Watersheds
50
Flood Quantiles (Station: 02DD014) 200
Flood Quantiles (Station: 02GE005)
Flood Quantile (m3/s)
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) Return period, T (years)
QUOFM QT QUOFM QT
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) Return period, T (years)
QUOFM QT QUOFM QT
80 80
60 60
40 40
20 20
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) QUOFM QT Return period, T (years) QUOFM QT
80
Flood Quantile (m3/s)
40
60
40
20
20
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Return period, T (years) Return period, T (years)
QUOFM QT
QUOFM QT
60
The result of the limited analysis shown graphically in Figure 15 and presented in Table 16 does
not conclusively reflect the performance of the UOFM on medium to large urban watersheds.
Three of the eight watersheds of varied drainage area were over predicted by the UOFM. For
many of the cases, the UOFM under predicted the flood quantiles compared to the observed
values. Thus, appropriate caution and engineering judgement should be applied if using the
UOFM for urban drainage basins.
Simulation based approach also depends on the availability of relevant hydrologic and basin
parameters of the basin. It may be selected depending on the objective of the project and level of
precision required (which is high in case of urban projects due to its effect on life and property).
These involve detailed mathematical simulation of rainfall-runoff models in either 1-D or 2-D
environment. They are helpful in understanding the interaction between rainfall and flooding.
Models like rational method and unit hydrograph are generally used for simpler applications
whereas SWMM, which has a good applicability for detailed analysis, can be used for complex
problems like design of systems and prediction of peak flows (Watt et al. (1989)). Currently,
flood mapping techniques also integrate runoff models with GIS technology and produce
inundation maps which are necessary for planning in an urban environment. Additionally, there
are only few studies which deal with the condition of a surcharged pipe network and flooding on
the catchment surface together (Mark et al. 2004). This condition is a common occurrence in
urban environments which also needs detailed investigation.
61
In view of the limited availability of flow data (10 HYDAT stations) and incomplete land cover
information for the starting year of the flow series for the available stations, it was difficult to
assess the stationarity of the HYDAT stations. The lack of information about the urban flood
control measures in the catchments and the complex interaction among various elements in an
urban hydrologic cycle, the enormous cost on life and property in an urban environment
necessitates a detailed analysis on a catchment level for urban drainage basins. In view of the
above consideration, the feasibility of a study for regional flood frequency analysis for urban
catchments could not be established. All urban flood frequency analysis should be done
separately for each drainage basin taking into account all the mentioned considerations along
with the climate and physiography of the basin.
10 61 28 38 26 56 -54 -37
25 71 34 47 31 70 -52 -35
50 79 40 52 34 80 -50 -34
100 86 45 59 37 92 -48 -32
2 26 - 19 13 26 - -27
2.33 - 7 - - - - -
02MB006
10 38 12 30 20 43 -68 -22
25 44 15 35 23 52 -66 -21
50 48 18 38 25 59 -64 -21
100 53 20 42 27 66 -62 -20
10 22 25 32 21 47 14 45
25 26 30 38 25 59 13 47
50 29 33 44 28 67 14 48
100 33 38 48 31 75 15 47
2 16 - 25 17 36 - 54
2.33 - 14 - - - - -
02DD015
10 25 21 39 26 59 -16 60
25 28 24 46 30 70 -14 62
50 31 28 51 33 80 -12 64
100 34 31 56 36 88 -9 63
2 28 - 23 16 34 - -18
2.33 - 16 - - - - -
02CA002
10 49 23 38 25 56 -53 -23
25 60 27 45 29 68 -54 -26
50 68 31 50 32 77 -55 -27
100 77 35 55 35 86 -55 -29
2 13 - 13 9 19 - 0
2.33 - 8 - - - - -
02DD013
10 20 12 21 14 31 -39 6
25 23 14 25 16 38 -38 8
50 26 16 28 18 43 -37 10
100 28 18 31 20 48 -35 10
64
25 23 18 25 16 38 -24 8
50 26 19 28 18 43 -26 10
100 28 20 31 20 48 -28 10
2 2 1 2 1 3 -37 -7
10 3 2 3 2 5 -42 4
02BF006
25 4 2 4 3 7 -45 8
50 4 2 5 3 8 -47 11
100 5 3 6 4 9 -49 13
65
25 31 22 24 16 36 -31 -24
50 35 24 27 18 42 -32 -23
100 39 26 30 19 47 -34 -23
2 2 1 1 1 2 -42 -12
10 3 1 3 2 4 -47 -5
02BF007
25 3 2 3 2 5 -51 -4
50 4 2 4 3 6 -54 -4
100 5 2 4 3 7 -57 -5
5.7 Summary
The chapter above presents the results of the verification and evaluation of UOFM to ascertain its
applicability to Ontario watersheds. The smallest drainage area of the watershed analyzed in the
66
Mixed Woods region was 13.16 km2. The range for the Mixed Wood ecozones could not be
lowered further due to lack of HYDAT stations that fit the station selection criterions. Thus,
using the regression equation for watersheds below this range is not advisable. However, such
watersheds may be frequently encountered by MTO; hence a check was performed for a small
urban watershed with area less than 13.16 km2 (Section 5.2). UOFM was also tested medium to
large urbanized watersheds. A comparison study was conducted to examine the performance of
the equations relative to the peak flow prediction methods currently used in Ontario. The
comparisons of UOFM estimates with observed values (Section 5.1 and Section 5.5) produced
instances of both over prediction in few cases while under prediction in other cases. Such a
situation establishes that the regression model may be free from bias and the model can be
suitably applied. The chapter also discussed the case of urban floods and the implications of
climate change on flow regimes.
67
Chapter 6
Conclusions and Recommendations
This thesis presents the development of new regional flood frequency equations for the province
to Ontario, referred to as the Unified Ontario Flood Method (UOFM). Regional equations have
design applications in predicting the peak flows necessary for planning structures on water
courses. Thus, there is a need to keep them updated with the current flow regimes. The sections
below present the most significant conclusions and recommendations from the current study.
6.1 Conclusions
The broad objective of the study was the development of Regional Flood Frequency Analysis
(RFFA) procedures for the province of Ontario. Based on the analysis of currently used methods,
like MIFM and NOHM, some drawbacks were identified and highlighted in Section 1.2. These
drawbacks led to the need of the current study. The important conclusions with respect to these
considerations are discussed.
1. Availability of approximately twenty years of additional stream flow data since the last
study of MIFM and NOHM completed in 1996 and 1994, respectively. The additional
data, when accounted for, will provide a better representation of the current watershed
and stream flow conditions.
The available datasets until Dec 2014 was utilized for the development of the UOFM. The total
number of station analyzed in the province was increased to 271 stations (from 46 in MIFM and
11 in NOHM). As outlined in Section 5.5, the UOFM has shown to provide improved prediction
than the currently used methods, the MIFM and NOHM. UOFM minimized the errors with
respect to the observed station quantiles in comparison to MIFM and NOHM. The range of flood
quantiles, i.e. the upper and lower limit of UOFM, provides additional flexibility to engineers
and designers based on the catchment conditions under consideration. The results of the
verification study tend to provide conservative estimates of peak flows in drainage catchments.
Such a situation may be desirable for engineers and designers given the uncertainty of future
conditions both in terms of extreme weather and increasing urbanization. Overall comparison of
observed values of during the verification and comparison study showed both over prediction by
UOFM in some cases whereas under prediction in other cases. This leads to the conclusion of
68
less bias in the regression model. It is not advisable to use the UOFM prediction for drainage
area outside the size range used for its development. At the same time, UOFM may not perform
for urbanized catchments and regulated streams.
The catchments characteristics are gradually changing in the province, especially in southern
Ontario which is characterized as the Mixed Wood Plain Ecozone. Changes in catchment
characteristics lead to gradual changes in the flow regime. This needs to be taken into
consideration during the development of RFFA procedures. Thus, the RFFA equations should be
updated every 10 years (Vaill, 2000) and the proposed UOFM should also be updated in due
course.
The adoption of one method, i.e. the UOFM will remove the inconsistencies in the area
classification. This is especially relevant for the Shield region where there was uncertainty in the
equation to be used for drainage areas greater than 25 km2, when both the MIFM (Shield) and
NOHM can be applied. It can also be observed from Table 20 and Table 22 for HYDAT station
02DD013, which is tested with both MIFM and NOHM, the flood estimates obtained are slightly
different on comparison of both the methods. Thus, UOFM removes the uncertainty in the
method to be applied for drainage areas in the Shield region.
4. Analysis procedures for urban watersheds, predominately required for the south, are
non-existent. A review of the available data to ascertain the feasibility of such a study
was requested to be performed.
This thesis analyzed the data available for urban catchments. Only 10 urban catchments could be
identified for the whole province (Figure 7). A flood frequency study with 10 catchments was
not practical. Addition data was also provided by the Credit valley conservation (CVC) authority.
The maximum record length of the CVC stream gauging stations was approximately 7 years.
69
Thus, in the current scenario, flood frequency analysis for urban Ontario was not considered
feasible. There is a need to start long term monitoring of stream flows in urban catchments.
There is a need for a tool, such as the Consolidated Frequency Analysis Tool from Ministry of
Natural Resources and Forestry (previously available for Ontario but is no longer supported), for
station frequency analysis and computation of station quantiles. RFFA should be conducted at
least every 10 years (Vaill, 2000) to account for changes in the catchment and the flow regimes
and the current UOFM should be subsequently updated. A tool, specifically designed for Flood
Frequency Analysis, will provide an ease when revising the UOFM, instead of using various
statistical software for different computational tasks.
these stations at the beginning of their respective flow series were not available to ascertain if the
land cover changed drastically with time. Thus, it is recommended to study the urban floods for
each watershed separately rather than on a regional basis. The considerations and simulation
techniques identified in Section 5.4 may be applied here.
Over the course of this study many limitations of RFFA were encountered. Analysis work is also,
ultimately limited by scope, resources and time. Future investigators may wish to compare the
flood frequency values using regression residuals technique (used in studies like Moin and Shaw
(1985)) to classify homogeneous regions. Comparison of the results obtained from the ecozone
classification may either cause a reclassification of these homogenous regions or provide
legitimacy to the current classification which is based on the ecosystem perspective. At the same
time, ecozones extend to neighboring provinces such as Quebec and Manitoba. Stream flow data
from these regions may be considered if reliable sources of physiographic characteristic can be
obtained. The inclusion of additional data may improve the robustness of the equations.
A challenge during the course of the current investigation was a reliable method for distribution
selection. AIC values led to objective decision making during distribution selection. But, small
difference in AIC values between different candidate distributions posed a challenge of selecting
one distribution over the other. A study considering a distribution selection approach based on
analysis of the skew of the data and the log transformed series as recommended in Alberta
Transportation (2001) may provide more insight into distribution selection. At the same time, it
was also concuded during the course of the current investigation that no ‘one’ distribution can be
universally applied to a province like Ontario with wide variation in climate and physiography.
Finally, additional research needs to address ‘climate change’ scenarios which were not
considered during the course of current investigation. This issue categorically warrants further
investigation. The rejection of approximately 41 stations due to non-compliance with
assumptions of independence, stationarity, homogeneity and randomness indicate the need to
consider changes in the climate and flow regimes. Planned infrastructure and urban growth
should be adaptable to future climate scenarios. Further examining the HYDAT stations for
which the hypothesis of the non-parametric tests was rejected may involve investigating parts of
the data series for conformity with the hypothesis of the non-parametric tests. This may
subsequently alter the number of stations available for regression analysis.
71
References
Adamowski, K. (2000). Regional Analysis of Annual Maximum and Partial Duration Flood Data
by Nonparametric and L-moment Methods. J. of Hydrology, 229, 219-231.
Aucoin, F., Caissie, D., El-Jab, N., & Turkkan, N. (2011). Flood Frequency Analyses for New
Brunswick Rivers. Moncton, NB: Canadian Technical Report of Fisheries and Aquatic Sciences
2920.
Burnham, K. P., & Anderson, D. R. (2004). Multimodel Inference Understanding AIC and BIC
in Model Selection. Sociological Methods & Research, Vol. 33,(No. 2).
Capesius, J. P., & Stephens, V. C. (2009). Regional Regression Equations for Estimation of
Natural Streamflow Statistics in Colorado. U.S. Geological Survey Scientific Investigations
Report 2009–5136.
Center for Watershed Protection. (2003). Impacts of impervious covers on aquatic systems.
Ellicott City.
Chow, K. C., & Watt, W. E. (1990). A Knowledge-based Expert System for Flood Frequency
Analysis. Can. J. Civ. Eng, 17, 597-609.
Chow, K. C., & Watt, W. E. (1992). Use of Akaike Information Criterion for Selection of Flood
Frequency Distribution. Can. J. Civ. Eng., 19(4), 616-626.
DMTI Spatial Inc. (2014). Highways. Markham, Ontario: CanMap RouteLogistics Series DMTI
Spatial Inc.
Environment Canada. (2011). The Hydrometric Network. Retrieved 11 27, 2011, from
Environment Canada: https://ec.gc.ca/rhc-wsc/default.asp?lang=En&n=E228B6E8-1
Environmental Systems Research Institute, Inc. (ESRI). (2012, 2 11). ArcGis Resources.
Retrieved 10 1, 2015, from AcrMap Help 10.1:
http://resources.arcgis.com/EN/HELP/MAIN/10.1/index.html#//0031000000q9000000
Gingras, D., & Adamowski, K. (1993). Homogeneous Region Delineation Based on Annual
Flood Generation Mechanisms. Hydrological Sciences -Journal, 38:2, 103-121.
Gingras, D., Adamowski, K., & Pilon, P. J. (1994). Regional Flood Equations for the Provinces
of Ontario and Quebec. Water Resources Bulletin: American Water Resources Association, 30:1,
55-67.
GREHYS. (1996). Presentation and Review of Some Methods for Regional Flood Frequency
Analysis. J. Hydrology, 186.
Grover, P. L., Burn, D. H., & Cunderlik, J. M. (2002). A comparison of Index Flood Estimation
Procedures for Ungauged Catchments. Can. J.Civ. Eng., 29, 734–741.
Hebb, A., & Mortsch, L. (2007). Floods: Mapping Vulnerability in the Upper Thames Watershed
under a Changing Climate. Upper Thames River Conservation Authority.
Insurance Bureau of Canada. (2015). Weather Story. Retrieved 08 01, 2015, from Insurance
bureau of Canada: http://www.ibc.ca/nb/resources/studies/weather-story
73
Interagency Advisory Committee on Water Data. (1982). Guidelines for Determining Flood
Flow Frequency: Bulletin 17B of the Hydrology Sub Committee. US Geological Survey.
Retrieved from http://water.usgs.gov/osw/bulletin17b/dl_flow.pdf
Joy, D. M., & Whiteley, H. R. (1994). Report on the Evaluation of the Modified Index Flood
Method. Guelph: School of Engineering, University of Guelph.
Joy, D. M., & Whiteley, H. R. (1996). The Modified Index Flood Method Justification of
Recommended Improvements. Guelph: School of Engineering, University of Guelph.
Karuks, E. (1961). Flood Flow Frequency and Studies on Southern Ontario Streams. Department
of Civil Engineering, University of Toronto.
Landers, M. N., & Wilson, K. V. (1991). Flood Characteristics of Mississippi Streams. Jackson,
Mississippi: U.S. Geological Survey Water-Resources Investigations Report 91-4037.
Maclaren Plan Search Inc. (1984). Ottawa River Flood Mapping. Carleton: Regional
Municipality of Ottawa.
Mark, O., Weesaku, S., Apirumanekul, C., Aroonnet, S. B., & Djordjević, S. (2004). Potential
and Limitations of 1D Modelling of Urban Flooding. Urban Hydrology, 299, 284-299.
Mathworks. (2015). Runs Test for Randomness. Retrieved 12 01, 2015, from
http://www.mathworks.com/help/stats/runstest.html?s_tid=gn_loc_drop
Ministry of Natural Resources and Forestry (MNR). (2014). Flood Flow Statistics:For the Great
Lakes- St. Lawrence Watershed System Watershed System. Spatial Data Infrastructure Unit,
Ministry of Natural Resources and Forestry.
MNR. (2015). User Guide for Ontario Flow Assessment Tool (OFAT). Retrieved from
https://www.sse.gov.on.ca/sites/MNR-PublicDocs/EN/CMID/OFAT%20-
%20User%20Guide%20-%20eng.pdf
74
Moin, S. M., & Shaw, M. A. (1985). Regional Flood Frequency Analysis of Ontario
Streams:Volume 1 Single Station Analysis and Index Method. Ottawa: Environment Canada.
Moin, S. M., & Shaw, M. A. (1986). Regional Flood Frequency Analysis for Ontario Streams:
Volume 2 Multiple Regression Method. Ottawa: Environment Canada.
Ontario Minstry of Transportation. (2015). Bridge repairs. Retrieved 11 18, 2015, from Ontario
Minstry of Transportation: http://www.mto.gov.on.ca/english/highway-bridges/ontario-
bridges.shtml
Pope, B. F., Tasker, G. D., & Robbins, J. C. (2001). Estimating the Magnitude and Frequency of
Floods in Rural Basins of North Carolina-revised. Raleigh, North Carolina: U.S. Geological
Survey Water Resources Investigation Report 01-4207.
Rao, A. R., & Hamed, K. H. (2000). Flood Frequency Analysis. CRC Press.
Rollings, K. (1999). Regional Flood Frequency Analysis for the Island of Newfoundland. St.
John's, Newfoundland: Department of Environment and Labour, Government of Newfoundland
and Labrador.
Sandrock, G., Viraraghavan, T., & Fuller, G. (1992). Estimation of Peak flows for Natural
Ungauged Watersheds in Southern Saskatchewan. Canadian Water Resources Journal, 17(1),
21-31.
Sangal, B. P. (1981). A Practical Method of Estimating Peak from Mean Daily Flows with
Application to Streams in Ontario. Ottawa: National Hydrology Research Institute.
75
Soong, D. T., Ishii, A. L., Sharpe, J. B., & Avery, C. F. (2004). Estimating Flood-Peak
Discharge Magnitudes and Frequencies for Rural Streams in Illinois. U.S. Geological Survey
Scientific Investigations Report 2004-5103.
SPSS Statistics 21 Help. (2012). Runs Test Options (One-Sample Nonparametric Tests).
Retrieved 12 01, 2015, from IBM Knowledge Center: http://www-
01.ibm.com/support/knowledgecenter/SSLVMB_21.0.0/com.ibm.spss.statistics.help/idh_idd_np
ar_onesample_settings_tests_runs.htm
Tasker, G. D., & Stendinger, J. R. (1989). An Operational GLS Model for Hydrologic
Regression. J. Hydrology, 111, 361-375.
Vaill, J. E. (2000). Analysis of the Magnitude and Frequency of Floods in Colorado. Water-
Resources Investigations Report 99–4190.
Waltemeyer, S. D. (2008). Analysis of the Magnitude and Frequency of Peak Discharge and
Maximum Observed Peak Discharge in New Mexico and Surrounding Areas. U.S. Geological
Survey Scientific Investigations Report 2008–5119.
Watt, W. E., Lathem, K. W., & Neill, C. R. (1989). Hydrology of Floods in Canada: A Guide to
Planning and Design. Ottawa: National Research Council of Canada.
Wiken, E. (1995). An Introduction to Ecozones. Retrieved 07 08, 2015, from National Ecological
Framework of Canada: http://ecozones.ca/english/introduction.html
76
TIMEZ
ID Year HH:MM MM--DD MAX
ONE
02AD010 2006 EST 2:00 04--22 60.3
02AD010 2007 EST 2:50 06--03 41.1
02AD010 2008 EST 13:15 04--29 61.2
02AD010 2009 EST 4:40 05--04 47.8
02AD010 2010 EST 9:00 10--01 13.7
02AD010 2011 EST 9:35 05--05 47.8
02AD010 2012 EST 5:15 06--02 55.1
02AD010 2013 EST 21:25 05--12 52.6
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
BLOOMFIELD
BLUE JAY CREEK NEAR
25 02CG003 29 19 21 2 21
TEHKUMMAH
BLYTH BROOK BELOW
26 02FE014 30 21 21 0 21
BLYTH
27 02HF004 BOB CREEK NEAR MINDEN 18 13 16 3 16
BOWMANVILLE CREEK AT
28 02HD006 54 22 41 19 41
BOWMANVILLE
BOYLE DRAIN NEAR
29 02FE010 47 20 22 2 22
ATWOOD
BOYNE RIVER AT EARL
30 02ED102 47 26 32 6 32
ROWE PARK
BRIGHTSAND RIVER AT
31 04GB005 46 23 24 1 24
MOBERLEY
32 02HB022 BRONTE CREEK AT CARLISLE 25 23 24 1 24
BUCKSHOT CREEK NEAR
33 02KF017 21 15 13 0 15
PLEVNA
BUELLS CREEK AT
34 02MB010 25 19 16 2 21
BROCKVILLE
CANAGAGIGUE CREEK NEAR
35 02GA036 15 12 14 2 14
FLORADALE
CANARD RIVER NEAR
36 02GH003 38 30 36 7 37
LUKERVILLE
37 02KF011 CARP RIVER NEAR KINBURN 43 29 41 12 41
CARRICK CREEK NEAR
38 02FC011 61 39 50 11 50
CARLSRUHE
39 02LB006 CASTOR RIVER AT RUSSELL 66 22 57 35 57
CAT RIVER BELOW
40 04GA002 44 37 39 2 39
WESLEYAN LAKE
41 02GC030 CATFISH CREEK AT AYLMER 27 19 23 4 23
CATFISH CREEK NEAR
42 02GC018 50 42 49 7 49
SPARTA
CEDAR RIVER BELOW
43 05QE008 44 40 43 3 43
WABASKANG LAKE
CHIPPEWA CREEK AT NORTH
44 02DD014 40 37 39 2 39
BAY
45 02HK007 COLD CREEK AT ORLAND 33 28 31 3 31
46 02HC023 COLD CREEK NEAR BOLTON 52 37 40 3 40
COLDWATER RIVER AT
47 02ED007 48 41 45 4 45
COLDWATER
COLLINS CREEK NEAR
48 02HM005 45 32 43 11 43
KINGSTON
COMMANDA CREEK NEAR
49 02DD015 40 29 36 7 36
COMMANDA
CONESTOGO RIVER AT
50 02GA017 23 19 22 3 22
DRAYTON
CONISTON CREEK ABOVE
51 02DB007 34 25 32 7 32
WANAPITEI RIVER
52 02DB004 CONISTON CREEK NEAR 15 0 8 0 0
81
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
CONISTON
CURRENT RIVER AT
53 02AB021 25 21 22 1 22
STEPSTONE
DINGMAN CREEK BELOW
54 02GE005 49 18 40 22 40
LAMBETH
DODD CREEK BELOW
55 02GC031 27 23 25 2 25
PAYNES MILLS
DUCHESNAY RIVER NEAR
56 02DD008 27 13 26 13 26
NORTH BAY
DUFFINS CREEK ABOVE
57 02HC019 54 37 46 9 46
PICKERING
58 02HC049 DUFFINS CREEK AT AJAX 25 22 23 1 23
EAST BRANCH SCOTCH
59 02LB012 RIVER NEAR ST. ISIDORE DE 25 8 20 14 22
PRESCOTT
EAST CANAGAGIGUE CREEK
60 02GA035 15 12 14 2 14
NEAR FLORADALE
EAST HUMBER RIVER AT
61 02HC032 49 34 37 3 37
KING CREEK
EAST HUMBER RIVER NEAR
62 02HC009 61 42 58 16 58
PINE GROVE
EAST SIXTEEN MILE CREEK
63 02HB004 58 43 56 13 56
NEAR OMAGH
EKWAN RIVER BELOW
64 04EA001 47 15 28 13 28
NORTH WASHAGAMI RIVER
ENGLISH RIVER AT
65 05QA002 93 34 91 57 91
UMFREVILLE
ENGLISH RIVER NEAR SIOUX
66 05QA001 61 14 60 46 60
LOOKOUT
ETOBICOKE CREEK BELOW
67 02HC030 48 42 47 5 47
QUEEN ELIZABETH HIGHWAY
ETOBICOKE CREEK NEAR
68 02HC002 18 3 15 12 15
SUMMERVILLE
FAIRCHILD CREEK NEAR
69 02GB007 50 33 48 15 48
BRANTFORD
FAWN RIVER BELOW BIG
70 04CE002 26 19 23 4 23
TROUT LAKE
FISH CREEK NEAR PROSPECT
71 02GD010 69 34 55 21 55
HILL
GANARASKA RIVER ABOVE
72 02HD012 38 26 35 10 36
DALE
GANARASKA RIVER NEAR
73 02HD002 26 14 25 11 25
DALE
GOULAIS RIVER NEAR
74 02BF002 47 43 46 3 46
SEARCHMONT
75 02KF015 GRAHAM CREEK AT NEPEAN 27 15 11 0 15
GRAND RIVER NEAR
76 02GA041 30 15 22 8 23
DUNDALK
77 02HB012 GRINDSTONE CREEK NEAR 49 39 47 8 47
82
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
ALDERSHOT
HALFWAY CREEK AT
78 04KA002 22 12 20 8 20
MOOSONEE
HARMONY CREEK AT
79 02HD013 34 28 30 3 31
OSHAWA
HIGHLAND CREEK NEAR
80 02HC013 58 35 45 11 46
WEST HILL
HOG CREEK NEAR VICTORIA
81 02ED017 26 17 22 5 22
HARBOUR
HOLLAND RIVER AT
82 02EC009 49 41 48 7 48
HOLLAND LANDING
HUMBER RIVER AT ELDER
83 02HC025 52 35 45 10 45
MILLS
HUMBER RIVER NEAR
84 02HC047 33 17 22 5 22
PALGRAVE
HUNSBURGER CREEK NEAR
85 02GA045 16 1 1 0 1
HAYSVILLE
HUNSBURGER CREEK NEAR
86 02GA046 16 1 1 0 1
SCHINDELSTEDDLE
HUNSBURGER CREEK NEAR
87 02GA043 21 19 20 1 20
WILMOT CENTRE
INDIAN RIVER NEAR
88 02KC014 17 8 16 8 16
PEMBROKE
JACKSON CREEK AT
89 02HJ001 52 38 48 11 49
PETERBOROUGH
JOCK RIVER NEAR
90 02LA007 45 31 43 12 43
RICHMOND
JUNCTION CREEK BELOW
91 02CF012 37 30 35 5 35
KELLEY LAKE
KABINAKAGAMI RIVER AT
92 04JA002 38 19 36 17 36
HIGHWAY NO. 11
KAWINOGANS RIVER NEAR
93 04FA002 48 26 28 2 28
PICKLE CROW
KENNY CREEK NEAR
94 02GB009 31 18 28 10 28
BURFORD
KETTLE CREEK ABOVE ST.
95 02GC029 29 26 28 2 28
THOMAS
KETTLE CREEK AT ST.
96 02GC002 69 44 47 3 47
THOMAS
KWATABOAHEGAN RIVER
97 04KA001 46 26 41 15 41
NEAR THE MOUTH
LA VALLEE RIVER NEAR
98 05PC016 27 12 5 0 12
DEVLIN
LA VASE RIVER AT NORTH
99 02DD013 40 38 38 0 38
BAY
LAKE 114 OUTLET NEAR
100 05PD014 24 17 5 3 20
KENORA
LAKE 223 OUTLET NEAR
101 05QD017 21 14 6 2 16
KENORA
83
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
LAKE 224 OUTLET NEAR
102 05QD018 21 14 6 3 17
KENORA
LAKE 225 OUTLET NEAR
103 05QD019 18 15 3 0 15
KENORA
LAKE 226 OUTLET NEAR
104 05QD015 23 17 5 1 18
KENORA
LAKE 227 OUTLET NEAR
105 05QD008 27 18 7 3 21
KENORA
LAKE 239 OUTLET NEAR
106 05PD023 26 20 23 3 23
KENORA
LAKE 239, LOWER EAST
107 05PD024 21 12 6 2 14
INLET, NEAR KENORA
LAKE 240 OUTLET NEAR
108 05PD015 27 20 24 4 24
KENORA
LAKE 303 OUTLET NEAR
109 05PD019 26 21 7 1 22
KENORA
LAKE 470 OUTLET NEAR
110 05PD017 27 19 25 6 25
KENORA
LITTLE CURRENT RIVER AT
111 04JF001 46 31 32 1 32
PERCY LAKE
LITTLE DON RIVER AT DON
112 02HC029 33 25 31 6 31
MILLS
LITTLE DON RIVER NEAR
113 02HC004 21 2 18 16 18
LANSING
LITTLE FRENCH RIVER AT
114 02DD020 32 0 18 0 0
OKIKENDAWT ISLAND
LITTLE PIC RIVER NEAR
115 02BA003 42 34 39 5 39
COLDWELL
116 02GH011 LITTLE RIVER AT WINDSOR 31 23 25 2 25
LITTLE ROUGE CREEK NEAR
117 02HC028 51 38 48 11 49
LOCUST HILL
LONG-LEGGED RIVER BELOW
118 05QE012 34 33 34 1 34
LONG-LEGGED LAKE
LUCKNOW RIVER AT
119 02FD002 35 25 25 0 25
LUCKNOW
120 02MB006 LYN CREEK NEAR LYN 44 23 34 11 34
121 02HC018 LYNDE CREEK NEAR WHITBY 54 33 47 14 47
LYNDHURST CREEK AT
122 02MA001 43 15 15 1 16
LYNDHURST
123 02ED015 MAD RIVER AT AVENING 26 19 23 4 23
MAGPIE RIVER NEAR
124 02BD003 52 34 43 16 50
MICHIPICOTEN
MAITLAND RIVER NEAR
125 02FE011 33 24 25 1 25
HARRISTON
MARIPOSA BROOK NEAR
126 02HG001 32 15 22 7 22
LITTLE BRITAIN
MATTAWISHKWIA RIVER AT
127 04LK001 28 15 16 1 16
HEARST
128 02GE007 MCGREGOR CREEK NEAR 37 27 29 2 29
84
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
CHATHAM
MCVICAR CREEK AT
129 02AB019 29 21 23 2 23
THUNDER BAY
MIDDLE MAITLAND RIVER
130 02FE013 31 24 24 1 25
ABOVE ETHEL
MIDDLE MAITLAND RIVER
131 02FE008 47 38 40 2 40
NEAR BELGRAVE
MIDDLE THAMES RIVER AT
132 02GD004 76 59 68 9 68
THAMESFORD
MIMICO CREEK AT
133 02HC033 48 45 47 2 47
ISLINGTON
MISSINAIBI RIVER AT
134 04LJ001 94 51 94 43 94
MATTICE
MISSINAIBI RIVER BELOW
135 04LM001 42 28 39 11 39
WABOOSE RIVER
MISSISSIPPI RIVER BELOW
136 02KF016 26 26 26 0 26
MARBLE LAKE
137 02HL005 MOIRA RIVER NEAR DELORO 49 43 47 5 48
MONTREAL RIVER AT
138 02BE001 ALGOMA CENTRAL AND 21 0 7 0 0
HUDSON BAY RAILWAY
MOOREFIELD CREEK NEAR
139 02GA042 25 5 4 0 5
ROTHSAY
MUSKRAT RIVER NEAR
140 02KC015 45 9 14 5 14
PEMBROKE
MUSWABIK RIVER AT
141 04GF001 26 11 16 5 16
OUTLET OF MUSWABIK LAKE
NAGAGAMI RIVER AT
142 04JC002 64 41 61 20 61
HIGHWAY NO. 11
NAMAKAN RIVER AT OUTLET
143 05PA006 93 51 91 40 91
OF LAC LA CROIX
NEEBING RIVER NEAR
144 02AB008 60 40 55 16 56
THUNDER BAY
NITH RIVER ABOVE
145 02GA038 42 38 41 3 41
NITHBURG
NITH RIVER AT NEW
146 02GA018 63 53 59 7 60
HAMBURG
147 02GA010 NITH RIVER NEAR CANNING 100 51 71 21 72
NORBERG CREEK (SITE A)
148 02BF005 34 30 31 1 31
ABOVE BATCHAWANA RIVER
NORBERG CREEK (SITE B) AT
149 02BF006 35 31 32 1 32
OUTLET OF TURKEY LAKE
NORBERG CREEK (SITE C) AT
150 02BF007 OUTLET OF LITTLE TURKEY 33 31 29 0 31
LAKE
NORBERG CREEK (SITE D)
151 02BF008 34 29 32 3 32
BELOW WISHART LAKE
NORBERG CREEK (SITE E)
152 02BF009 33 27 30 3 30
BELOW BATCHAWANA LAKE
85
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
NORBERG CREEK (SITE F) AT
153 02BF012 OUTLET OF BATCHAWANA 32 29 30 1 30
LAKE
NORTH BRANCH SOUTH
154 02LB017 NATION RIVER NEAR 37 11 20 9 20
HECKSTON
NORTH CURRENT RIVER
155 02AB014 42 24 31 7 31
NEAR THUNDER BAY
NORTH FRENCH RIVER NEAR
156 04MF001 47 31 46 15 46
THE MOUTH
NORTH MAGNETAWAN
157 02EA010 RIVER ABOVE PICKEREL 45 40 44 4 44
LAKE
NORTH MAGNETAWAN
158 02EA005 99 45 98 53 98
RIVER NEAR BURKS FALLS
159 02ED024 NORTH RIVER AT THE FALLS 26 22 24 2 24
NORTHWEST TRIBUTARY TO
160 05PD022 26 18 6 2 20
LAKE 239 NEAR KENORA
NOTTAWASAGA RIVER AT
161 02ED026 25 19 22 3 22
HOCKLEY
NOTTAWASAGA RIVER NEAR
162 02ED101 47 20 25 5 25
ALLISTON
NOTTAWASAGA RIVER NEAR
163 02ED003 67 38 61 23 61
BAXTER
O.A.C. FARM GAUGE NO. 2
164 02GF001 17 3 4 1 4
NEAR MERLIN
O.A.C. FARM GAUGE NO. 5 AT
165 02GA032 19 15 17 2 17
GUELPH
OGOKI RIVER ABOVE
166 04GB004 43 39 40 1 40
WHITECLAY LAKE
167 02HD008 OSHAWA CREEK AT OSHAWA 55 40 53 13 53
OSWEGO CREEK AT
168 02HA024 26 16 17 2 18
CANBORO
OTOSKWIN RIVER BELOW
169 04FA001 48 27 34 7 34
BADESDAWA LAKE
PAGWACHUAN RIVER AT
170 04JD005 45 36 44 8 44
HIGHWAY NO. 11
PARKHILL CREEK ABOVE
171 02FF008 41 35 39 4 39
PARKHILL RESERVOIR
PARKHILL CREEK NEAR
172 02FF003 15 43 53 6 49
PARKHILL
PASHKOKOGAN RIVER AT
173 04GA003 OUTLET OF PASHKOKOGAN 46 6 7 1 7
LAKE
PAYNE RIVER NEAR
174 02LB022 38 24 30 6 30
BERWICK
PEFFERLAW BROOK NEAR
175 02EC018 27 16 25 9 25
UDORA
PEFFERLAW BROOK NEAR
176 02EC103 18 12 15 3 15
UDORA
86
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
PERCH LAKE INLET NO. 1
177 02KA004 22 20 20 0 20
NEAR CHALK RIVER
PERCH LAKE INLET NO. 2
178 02KA005 22 19 19 0 19
NEAR CHALK RIVER
PERCH LAKE INLET NO. 3
179 02KA006 21 18 20 2 20
NEAR CHALK RIVER
PERCH LAKE INLET NO. 4
180 02KA007 21 18 20 2 20
NEAR CHALK RIVER
PERCH LAKE INLET NO. 5
181 02KA008 16 15 16 1 16
NEAR CHALK RIVER
PERCH LAKE OUTLET NEAR
182 02KA003 30 28 27 0 28
CHALK RIVER
183 02BB003 PIC RIVER NEAR MARATHON 44 34 43 9 43
PIGEON RIVER AT MIDDLE
184 02AA001 79 47 75 28 75
FALLS
PINE CREEK NEAR PINE
185 05OD032 26 20 25 5 25
CREEK
186 02FD001 PINE RIVER AT LURGAN 40 25 31 6 31
187 02ED014 PINE RIVER NEAR EVERETT 47 32 41 9 41
PINEIMUTA RIVER AT EYES
188 04FA003 48 32 42 10 42
LAKE
PINEWOOD RIVER NEAR
189 05PC011 47 31 6 1 32
PINEWOOD
PIPESTONE RIVER ABOVE
190 05PB015 36 12 15 3 15
RAINY LAKE
PIPESTONE RIVER AT KARL
191 04DA001 48 39 42 3 42
LAKE
192 04MD004 PORCUPINE RIVER AT HOYLE 37 19 24 5 24
PUKASKWA RIVER AT
193 02BC005 17 6 8 2 8
PUKASKWA NATIONAL PARK
RAINY RIVER AT MANITOU
194 05PC018 83 65 82 17 82
RAPIDS
RAISIN RIVER AT BLACK
195 02MC027 28 10 8 0 10
RIVER
RAISIN RIVER NEAR
196 02MC001 54 32 49 17 49
WILLIAMSTOWN
RAWDON CREEK NEAR WEST
197 02HK008 32 22 27 5 27
HUNTINGDON
REDHILL CREEK AT ALBION
198 02HA023 15 14 15 1 15
FALLS
REDHILL CREEK AT
199 02HA014 37 27 29 2 29
HAMILTON
RIVIERE BEAUDETTE NEAR
200 02MC026 31 21 26 5 26
GLEN NEVIS
RIVIERE DELISLE NEAR
201 02MC028 29 16 21 5 21
ALEXANDRIA
ROCKY SAUGEEN RIVER
202 02FC004 26 0 25 0 0
NEAR TRAVERSTON
203 02CA002 ROOT RIVER AT SAULT STE. 43 41 43 0 41
87
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
MARIE
ROSEAU RIVER NEAR
204 05OD030 81 45 78 34 79
CARIBOU
ROSEBERRY RIVER ABOVE
205 04CA003 47 30 35 5 35
ROSEBERRY LAKES
RUSCOM RIVER NEAR
206 02GH002 43 37 39 3 40
RUSCOM STATION
SACHIGO RIVER BELOW
207 04CD002 23 18 20 2 20
OUTLET OF SACHIGO LAKE
SAUBLE RIVER AT
208 02FA004 27 16 19 3 19
ALLENFORD
SAUGEEN RIVER ABOVE
209 02FC016 38 24 28 5 29
DURHAM
SCHNEIDER CREEK AT
210 02GA037 22 20 20 0 20
KITCHENER
SCHOMBERG RIVER NEAR
211 02EC010 48 33 41 9 42
SCHOMBERG
SEVERN RIVER AT
212 04CC001 25 5 22 17 22
LIMESTONE RAPIDS
SEVERN RIVER AT OUTLET
213 04CA004 47 14 20 6 20
OF DEER LAKE
SEVERN RIVER AT OUTLET
214 04CA002 49 27 34 7 34
OF MUSKRAT DAM LAKE
SHAMATTAWA RIVER AT
215 04DC002 OUTLET OF SHAMATTAWA 48 28 38 10 38
LAKE
SHEKAK RIVER AT HIGHWAY
216 04JC003 38 22 37 15 37
NO. 11
SILVER SPRING CREEK NEAR
217 02GA044 16 1 1 0 1
WILMOT CENTRE
SKOOTAMATTA RIVER NEAR
218 02HL004 59 48 56 8 56
ACTINOLITE
SOPER CREEK AT
219 02HD007 29 13 24 11 24
BOWMANVILLE
SOUTH BRANCH SOUTH
220 02LB031 NATION RIVER NEAR 16 3 3 0 3
WINCHESTER SPRINGS
SOUTH CASTOR RIVER AT
221 02LB020 36 18 26 9 27
KENMORE
SOUTH MAITLAND RIVER AT
222 02FE009 47 42 45 3 45
SUMMERHILL
SOUTH NATION RIVER AT
223 02LB007 66 44 61 17 61
SPENCERVILLE
SOUTH PARKHILL CREEK
224 02FF004 59 43 47 4 47
NEAR PARKHILL
SOUTH RAISIN RIVER NEAR
225 02MC030 28 9 7 1 10
CORNWALL
SPENCER CREEK AT
226 02HB023 27 25 26 1 26
HIGHWAY NO. 5
227 05OD031 SPRAGUE CREEK NEAR 54 49 51 2 51
88
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
SPRAGUE
STEEL RIVER NEAR TERRACE
228 02BA002 25 22 24 2 24
BAY
STOKES RIVER NEAR
229 02FA002 38 22 37 15 37
FERNDALE
STONEY CREEK AT STONEY
230 02HA022 25 16 17 1 17
CREEK
STURGEON CREEK NEAR
231 02GH001 22 15 18 4 19
LEAMINGTON
STURGEON RIVER AT
232 05QA004 53 44 51 7 51
MCDOUGALL MILLS
STURGEON RIVER AT
233 05QE009 54 47 52 5 52
OUTLET OF SALVESEN LAKE
STURGEON RIVER AT UPPER
234 02DC012 28 28 28 0 28
GOOSE FALLS
STURGEON RIVER NEAR
235 05PC010 35 4 0 0 4
BARWICK
SYDENHAM RIVER AT
236 02GG003 30 28 29 1 29
FLORENCE
SYDENHAM RIVER AT
237 02GG005 48 36 36 1 37
STRATHROY
SYDENHAM RIVER NEAR
238 02GG002 67 43 65 22 65
ALVINSTON
SYDENHAM RIVER NEAR
239 02GG007 18 15 18 3 18
DRESDEN
SYDENHAM RIVER NEAR
240 02FB007 99 42 77 35 77
OWEN SOUND
241 02GD021 THAMES RIVER AT INNERKIP 36 32 35 3 35
THAMES RIVER NEAR
242 02GD006 20 2 10 8 10
INGERSOLL
TRIBUTARY TO NORBERG
243 02BF013 25 8 22 14 22
CREEK AT TURKEY LAKE
TROUT CREEK NEAR
244 02GD019 48 39 40 1 40
FAIRVIEW
TROUT CREEK NEAR ST.
245 02GD009 69 38 50 13 51
MARYS
TROUTLAKE RIVER ABOVE
246 05QC003 43 42 43 1 43
BIG FALLS
247 02GH004 TURKEY CREEK AT WINDSOR 30 29 29 0 29
TURTLE RIVER NEAR MINE
248 05PB014 100 47 95 48 95
CENTRE
TWENTY MILE CREEK ABOVE
249 02HA020 27 22 26 4 26
SMITHVILLE
TWENTY MILE CREEK AT
250 02HA006 57 43 57 14 57
BALLS FALLS
VENISON CREEK NEAR
251 02GC021 48 35 35 0 35
WALSINGHAM
VERMILION RIVER NEAR VAL
252 02CF011 44 23 30 7 30
CARON
89
No. of years
Annual
HYDAT Annual No of
S. No. Station Name Maximum Total
Station ID Years of maximum predicted
average AMI
operation Instantaneous AMI
daily years
(AMI) values
(AMAD)
253 02DD012 VEUVE RIVER NEAR VERNER 41 23 26 3 26
WAUBUNO CREEK NEAR
254 02GD020 36 30 33 3 33
DORCHESTER
WEST BRANCH LITTLE
255 02HM009 CATARAQUI CREEK AT 25 18 14 0 18
KINGSTON
WEST BRANCH SCOTCH
256 02LB018 RIVER NEAR ST. ISIDORE DE 35 10 15 8 18
PRESCOTT
WEST DUFFINS CREEK
257 02HC038 40 17 20 4 21
ABOVE GREEN RIVER
WEST HUMBER RIVER AT
258 02HC031 49 36 43 7 43
HIGHWAY NO. 7
WHITEFISH RIVER AT
259 02AB017 34 30 33 3 33
NOLALU
WHITESAND RIVER ABOVE
260 02BA005 24 22 22 0 22
SCHREIBER AT MINOVA MINE
WHITSON RIVER AT
261 02CF007 54 45 52 7 52
CHELMSFORD
WHITSON RIVER AT VAL
262 02CF008 54 35 38 3 38
CARON
WILLOW CREEK ABOVE
263 02ED009 23 21 22 1 22
LITTLE LAKE
WILLOW CREEK AT
264 02ED010 26 26 26 0 26
MIDHURST
WILMOT CREEK NEAR
265 02HD009 49 37 43 6 43
NEWCASTLE
WILTON CREEK NEAR
266 02HM004 49 32 46 15 47
NAPANEE
WINDIGO RIVER ABOVE
267 04CB001 47 30 33 3 33
MUSKRAT DAM LAKE
WINISK RIVER BELOW
268 04DC001 ASHEWEIG RIVER 48 18 40 22 40
TRIBUTARY
WOLF RIVER AT HIGHWAY
269 02AC001 43 38 39 1 39
NO. 17
WYE CREEK NEAR
270 02GD013 39 16 19 4 20
THORNDALE
YORK RIVER NEAR
271 02KD002 99 34 85 51 85
BANCROFT
90
02BF002
Annual maximum Annual Maximum
Instantaneous Average daily
Base
Year (AMI) (AMAD) AMI/AMAD
factor, K
Flow Flow
Date Date
(m3/s) (m3/s)
1999 04--08 169.00 04--08 162 1.04 1.39
2000 03--27 74.40 03--27 73.6 1.01 1.53
2001 10--14 199.00 10--15 157 1.27 0.59
2002 04--18 278.00 04--18 256 1.09 0.93
2003 04--21 186.00 04--21 179 1.04 1.69
2004 04--19 172.00 04--19 163 1.06 1.64
2005 04--11 88.00 04--11 85.9 1.02 1.28
2006 04--14 111.00 04--14 108 1.03 0.67
2007 10--06 75.60 10--20 63.2 1.20 1.20
2008 04--22 244.00 04--23 236 1.03 1.20
2009 04--25 128.00 04--25 123 1.04 1.57
2010 09--24 216.00 09--24 148 1.46 1.05
2011 04--28 115.00 04--28 109 1.06 1.58
2012 03--20 164.00 03--20 158 1.04 1.11
2013 11--18 271.00 05--03 248 1.09 0.67
r= 1.6 K= 1.26
Predicted Value
92
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
1 02HB021
2 04DB001
3 05PB018
4 04FB001
5 04FC001
6 02CB003
7 05PA012
8 02BF001
9 02FF007
10 02LB008
11 02GG004
12 02GG009
13 02GG006
14 02FC017
15 02EC011
16 05RC001
17 02BF004
18 02GC011
19 02GC010
20 02BB002
21 02EC002
22 02AD010
23 02JC008
24 02HE001
25 02CG003
26 02FE014
27 02HF004
28 02HD006
29 02FE010
30 02ED102
31 04GB005
32 02HB022
33 02KF017
34 02MB010
93
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
35 02GA036
36 02GH003
37 02KF011
38 02FC011
39 02LB006
40 04GA002
41 02GC030
42 02GC018
43 05QE008
44 02DD014
45 02HK007
46 02HC023
47 02ED007
48 02HM005
49 02DD015
50 02GA017
51 02DB007
52 02DB004
53 02AB021
54 02GE005
55 02GC031
56 02DD008
57 02HC019
58 02HC049
59 02LB012
60 02GA035
61 02HC032
62 02HC009
63 02HB004
64 04EA001
65 05QA002
66 05QA001
67 02HC030
68 02HC002
69 02GB007
70 04CE002
71 02GD010
72 02HD012
73 02HD002
74 02BF002
94
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
75 02KF015
76 02GA041
77 02HB012
78 04KA002
79 02HD013
80 02HC013
81 02ED017
82 02EC009
83 02HC025
84 02HC047
85 02GA045
86 02GA046
87 02GA043
88 02KC014
89 02HJ001
90 02LA007
91 02CF012
92 04JA002
93 04FA002
94 02GB009
95 02GC029
96 02GC002
97 04KA001
98 05PC016
99 02DD013
100 05PD014
101 05QD017
102 05QD018
103 05QD019
104 05QD015
105 05QD008
106 05PD023
107 05PD024
108 05PD015
109 05PD019
110 05PD017
111 04JF001
112 02HC029
113 02HC004
114 02DD020
95
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
115 02BA003
116 02GH011
117 02HC028
118 05QE012
119 02FD002
120 02MB006
121 02HC018
122 02MA001
123 02ED015
124 02BD003
125 02FE011
126 02HG001
127 04LK001
128 02GE007
129 02AB019
130 02FE013
131 02FE008
132 02GD004
133 02HC033
134 04LJ001
135 04LM001
136 02KF016
137 02HL005
138 02BE001
139 02GA042
140 02KC015
141 04GF001
142 04JC002
143 05PA006
144 02AB008
145 02GA038
146 02GA018
147 02GA010
148 02BF005
149 02BF006
150 02BF007
151 02BF008
152 02BF009
153 02BF012
154 02LB017
96
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
155 02AB014
156 04MF001
157 02EA010
158 02EA005
159 02ED024
160 05PD022
161 02ED026
162 02ED101
163 02ED003
164 02GF001
165 02GA032
166 04GB004
167 02HD008
168 02HA024
169 04FA001
170 04JD005
171 02FF008
172 02FF003
173 04GA003
174 02LB022
175 02EC018
176 02EC103
177 02KA004
178 02KA005
179 02KA006
180 02KA007
181 02KA008
182 02KA003
183 02BB003
184 02AA001
185 05OD032
186 02FD001
187 02ED014
188 04FA003
189 05PC011
190 05PB015
191 04DA001
192 04MD004
193 02BC005
194 05PC018
97
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
195 02MC027
196 02MC001
197 02HK008
198 02HA023
199 02HA014
200 02MC026
201 02MC028
202 02FC004
203 02CA002
204 05OD030
205 04CA003
206 02GH002
207 04CD002
208 02FA004
209 02FC016
210 02GA037
211 02EC010
212 04CC001
213 04CA004
214 04CA002
215 04DC002
216 04JC003
217 02GA044
218 02HL004
219 02HD007
220 02LB031
221 02LB020
222 02FE009
223 02LB007
224 02FF004
225 02MC030
226 02HB023
227 05OD031
228 02BA002
229 02FA002
230 02HA022
231 02GH001
232 05QA004
233 05QE009
234 02DC012
98
Station Station
Stationarity Homogeneity Randomness Independence
Number ID
235 05PC010
236 02GG003
237 02GG005
238 02GG002
239 02GG007
240 02FB007
241 02GD021
242 02GD006
243 02BF013
244 02GD019
245 02GD009
246 05QC003
247 02GH004
248 05PB014
249 02HA020
250 02HA006
251 02GC021
252 02CF011
253 02DD012
254 02GD020
255 02HM009
256 02LB018
257 02HC038
258 02HC031
259 02AB017
260 02BA005
261 02CF007
262 02CF008
263 02ED009
264 02ED010
265 02HD009
266 02HM004
267 04CB001
268 04DC001
269 02AC001
270 02GD013
271 02KD002
99
Representative station: 02FE009 (station no 222 from the summary list in Appendix B)
Station
222 Parameters
4 Name
Gamma3 /
3
Min AIC 462.508 Normal Lognormal Gumbel / EV1 3PLN LP3 GEV
Pearson III
Distribut Gumbel / Normal(43.636; Lognormal(0.40929 GumbelMax(34.023; Gamma(5.0275; Lognormal(0.33243; LogPearson3(61.4 GenExtreme(0.09754;
ion EV1 105.22) ; 4.5746) 85.584) 18.943; 9.9885) 4.7769; -20.294) ; -0.05282; 7.818) 30.778; 84.191)
Section 2: Presents the log likelihood calculation for the station data. The top row presents the summation of the log likelihood for each of
the distributions.
Section 3: Provides the parameters of each of the distribution as obtained from easy fit and the AIC value.
Parameters
Gamma3 /
Normal Lognormal Gumbel / EV1 3PLN LP3 GEV
Pearson III
Normal(43.636; Lognormal(0.40929 GumbelMax(34.023; Gamma(5.0275; Lognormal(0.33243; LogPearson3(61.4 GenExtreme(0.09754;
105.22) ; 4.5746) 85.584) 18.943; 9.9885) 4.7769; -20.294) ; -0.05282; 7.818) 30.778; 84.191)
100
AIC values for all candidate distributions:
Below is the summary of the AIC value of all the candidate distributions and the difference of the candidate distribution with the model
with lowest AIC.
101
102
(Note:
1. Only the final output is shown in the part.
2. SI1 in the regression output denotes the transformation of lake attenuation index
3. The output below is the results after the log transformation was applied on the variables.)
Correlationsa
logQ100 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ100 1.000 .938 .140 -.069 .724 -.815
CorrelationlogAreaKm .938 1.000 -.069 .134 .710 -.941
logMAP .140 -.069 1.000 -.309 .200 .224
logSI1 -.069 .134 -.309 1.000 -.053 -.311
logShapFactr .724 .710 .200 -.053 1.000 -.683
logChSlpDmls -.815 -.941 .224 -.311 -.683 1.000
Sig. (1-logQ100 . .000 .185 .330 .000 .000
tailed) logAreaKm .000 . .330 .195 .000 .000
logMAP .185 .330 . .022 .100 .074
logSI1 .330 .195 .022 . .369 .021
logShapFactr .000 .000 .100 .369 . .000
logChSlpDmls .000 .000 .074 .021 .000 .
N logQ100 43 43 43 43 43 43
logAreaKm 43 43 43 43 43 43
logMAP 43 43 43 43 43 43
logSI1 43 43 43 43 43 43
logShapFactr 43 43 43 43 43 43
logChSlpDmls 43 43 43 43 43 43
103
Variables Entered/Removeda,b
Variables
Model Variables Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Boreal Shield
b. Dependent Variable: logQ100
Model Summarya,e
Std. Error of the
Model R R Square Adjusted R Square Estimate Durbin-Watson
b
1 .938 .879 .876 .27430
c
2 .960 .922 .918 .22380
3 .970d .941 .937 .19587 2.287
a. Region = Boreal Shield
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logMAP
d. Predictors: (Constant), logAreaKm, logMAP, logSI1
e. Dependent Variable: logQ100
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 22.467 1 22.467 298.612 .000c
Residual 3.085 41 .075
Total 25.552 42
2 Regression 23.548 2 11.774 235.070 .000d
Residual 2.004 40 .050
Total 25.552 42
3 Regression 24.056 3 8.019 209.004 .000e
Residual 1.496 39 .038
Total 25.552 42
a. Region = Boreal Shield
b. Dependent Variable: logQ100
c. Predictors: (Constant), logAreaKm
d. Predictors: (Constant), logAreaKm, logMAP
e. Predictors: (Constant), logAreaKm, logMAP, logSI1
Coefficientsa,b
104
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
c
1 logMAP .206 4.646 .000 .592 .995 1.005 .995
c
logSI1 -.198 -4.341 .000 -.566 .982 1.018 .982
c
logShapFactr .118 1.551 .129 .238 .495 2.019 .495
c
logChSlpDmls .593 4.473 .000 .577 .114 8.741 .114
2 logSI1 -.149d -3.636 .001 -.503 .892 1.121 .892
logShapFactr .016d .234 .816 .037 .433 2.310 .433
d
logChSlpDmls .394 2.891 .006 .420 .089 11.229 .089
3 logShapFactr -.009e -.156 .877 -.025 .427 2.342 .427
e
logChSlpDmls .215 1.469 .150 .232 .068 14.727 .068
a. Region = Boreal Shield
b. Dependent Variable: logQ100
c. Predictors in the Model: (Constant), logAreaKm
d. Predictors in the Model: (Constant), logAreaKm, logMAP
e. Predictors in the Model: (Constant), logAreaKm, logMAP, logSI1
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logMAP logSI1
1 1 1.923 1.000 .04 .04
2 .077 4.992 .96 .96
2 1 2.897 1.000 .00 .02 .00
2 .103 5.310 .00 .98 .00
105
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value -.1068 2.8216 1.7734 .75681 43
Residual -.46754 .43090 .00000 .18875 43
Std. Predicted Value -2.484 1.385 .000 1.000 43
Std. Residual -2.387 2.200 .000 .964 43
a. Region = Boreal Shield
b. Dependent Variable: logQ100
Charts
106
Descriptive Statisticsa
Mean Std. Deviation N
logQ100 1.9853 .38037 75
logAreaKm 2.2409 .34956 75
logMAP 2.9826 .04197 75
logSI1 .0390 .03444 75
logShapFactr 1.1352 .20216 75
logChSlpDmls -2.6722 .32149 75
a. Region = Mixed WoodPlains
Correlationsa
logQ100 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ100 1.000 .770 .137 -.560 .194 -.407
Correlation logAreaKm .770 1.000 .025 -.268 .340 -.532
logMAP .137 .025 1.000 .041 -.111 -.380
logSI1 -.560 -.268 .041 1.000 -.042 -.008
logShapFactr .194 .340 -.111 -.042 1.000 -.388
logChSlpDmls -.407 -.532 -.380 -.008 -.388 1.000
Sig. (1-tailed) logQ100 . .000 .120 .000 .048 .000
logAreaKm .000 . .414 .010 .001 .000
logMAP .120 .414 . .363 .172 .000
107
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter <=
logAreaKm .
.050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter <=
logSI1 .
.050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter <=
logMAP .
.050, Probability-of-F-to-remove >= .100).
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ100
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .770 .593 .587 .24432
c
2 .853 .728 .720 .20116
3 .864d .746 .736 .19554 1.625
a. Region = Mixed WoodPlains
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logSI1
d. Predictors: (Constant), logAreaKm, logSI1, logMAP
e. Dependent Variable: logQ100
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 6.349 1 6.349 106.359 .000c
Residual 4.357 73 .060
Total 10.706 74
2 Regression 7.793 2 3.896 96.292 .000d
Residual 2.913 72 .040
Total 10.706 74
108
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) .108 .184 .584 .561
logAreaKm .838 .081 .770 10.313 .000 .770 .770 .770 1.000 1.000
2 (Constant) .520 .167 3.122 .003
logAreaKm .727 .069 .668 10.469 .000 .770 .777 .644 .928 1.077
logSI1 -4.210 .705 -.381 -5.974 .000 -.560 -.576 -.367 .928 1.077
3 (Constant) -3.151 1.619 -1.946 .056
logAreaKm .721 .068 .663 10.676 .000 .770 .785 .638 .927 1.079
logSI1 -4.287 .686 -.388 -6.251 .000 -.560 -.596 -.374 .926 1.080
logMAP 1.236 .542 .136 2.279 .026 .137 .261 .136 .997 1.003
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ100
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
c
1 logMAP .118 1.594 .115 .185 .999 1.001 .999
c
logSI1 -.381 -5.974 .000 -.576 .928 1.077 .928
c
logShapFactr -.077 -.968 .337 -.113 .885 1.130 .885
c
logChSlpDmls .004 .046 .964 .005 .717 1.395 .717
2 logMAP .136d 2.279 .026 .261 .997 1.003 .926
logShapFactr -.056d -.853 .397 -.101 .882 1.134 .820
d
logChSlpDmls -.079 -1.068 .289 -.126 .692 1.445 .643
3 logShapFactr -.038e -.585 .561 -.070 .867 1.153 .815
e
logChSlpDmls -.010 -.125 .901 -.015 .563 1.776 .563
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ100
c. Predictors in the Model: (Constant), logAreaKm
d. Predictors in the Model: (Constant), logAreaKm, logSI1
109
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value 1.1997 2.6951 1.9853 .32862 75
Residual -.39632 .41307 .00000 .19154 75
Std. Predicted Value -2.391 2.160 .000 1.000 75
Std. Residual -2.027 2.112 .000 .980 75
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ100
Charts
110
Descriptive Statisticsa
Mean Std. Deviation N
logQ50 1.7272 .78809 43
logAreaKm 2.3566 .99527 43
logMAP 2.9348 .05119 43
logSI1 .0575 .02563 43
logShapFactr 1.1180 .25951 43
logChSlpDmls -2.5392 .57989 43
a. Region = Boreal Shield
Correlationsa
logQ50 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ50 1.000 .939 .145 -.068 .727 -.816
Correlation logAreaKm .939 1.000 -.069 .134 .710 -.941
logMAP .145 -.069 1.000 -.309 .200 .224
logSI1 -.068 .134 -.309 1.000 -.053 -.311
logShapFactr .727 .710 .200 -.053 1.000 -.683
logChSlpDmls -.816 -.941 .224 -.311 -.683 1.000
Sig. (1-tailed) logQ50 . .000 .176 .332 .000 .000
logAreaKm .000 . .330 .195 .000 .000
logMAP .176 .330 . .022 .100 .074
111
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Boreal Shield
b. Dependent Variable: logQ50
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .939 .882 .880 .27345
c
2 .963 .927 .923 .21834
3 .973d .946 .942 .18977 2.273
a. Region = Boreal Shield
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logMAP
d. Predictors: (Constant), logAreaKm, logMAP, logSI1
e. Dependent Variable: logQ50
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 23.020 1 23.020 307.849 .000c
Residual 3.066 41 .075
Total 26.086 42
2 Regression 24.179 2 12.089 253.589 .000d
Residual 1.907 40 .048
Total 26.086 42
112
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) -.026 .108 -.238 .813
logAreaKm .744 .042 .939 17.546 .000 .939 .939 .939 1.000 1.000
2 (Constant) -9.600 1.944 -4.939 .000
logAreaKm .755 .034 .954 22.263 .000 .939 .962 .952 .995 1.005
logMAP 3.253 .660 .211 4.930 .000 .145 .615 .211 .995 1.005
3 (Constant) -7.371 1.792 -4.114 .000
logAreaKm .769 .030 .971 25.877 .000 .939 .972 .961 .981 1.019
logMAP 2.572 .602 .167 4.273 .000 .145 .565 .159 .904 1.107
logSI1 -4.520 1.210 -.147 -3.735 .001 -.068 -.513 -.139 .892 1.121
a. Region = Boreal Shield
b. Dependent Variable: logQ50
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
c
1 logMAP .211 4.930 .000 .615 .995 1.005 .995
c
logSI1 -.198 -4.412 .000 -.572 .982 1.018 .982
c
logShapFactr .120 1.608 .116 .246 .495 2.019 .495
c
logChSlpDmls .592 4.545 .000 .584 .114 8.741 .114
2 logSI1 -.147d -3.735 .001 -.513 .892 1.121 .892
logShapFactr .016d .241 .811 .038 .433 2.310 .433
d
logChSlpDmls .383 2.913 .006 .423 .089 11.229 .089
3 logShapFactr -.009e -.158 .876 -.026 .427 2.342 .427
e
logChSlpDmls .205 1.459 .153 .230 .068 14.727 .068
a. Region = Boreal Shield
b. Dependent Variable: logQ50
c. Predictors in the Model: (Constant), logAreaKm
d. Predictors in the Model: (Constant), logAreaKm, logMAP
113
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logMAP logSI1
1 1 1.923 1.000 .04 .04
2 .077 4.992 .96 .96
2 1 2.897 1.000 .00 .02 .00
2 .103 5.310 .00 .98 .00
3 .000 140.226 1.00 .01 1.00
3 1 3.770 1.000 .00 .01 .00 .01
2 .135 5.275 .00 .42 .00 .64
3 .094 6.334 .00 .57 .00 .25
4 .000 168.762 1.00 .00 1.00 .10
a. Region = Boreal Shield
b. Dependent Variable: logQ50
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value -.1801 2.7869 1.7272 .76658 43
Residual -.46136 .42615 .00000 .18287 43
Std. Predicted Value -2.488 1.382 .000 1.000 43
Std. Residual -2.431 2.246 .000 .964 43
a. Region = Boreal Shield
b. Dependent Variable: logQ50
Charts
114
Descriptive Statisticsa
Mean Std. Deviation N
logQ50 1.9393 .37473 75
logAreaKm 2.2409 .34956 75
logMAP 2.9826 .04197 75
logSI1 .0390 .03444 75
logShapFactr 1.1352 .20216 75
logChSlpDmls -2.6722 .32149 75
a. Region = Mixed WoodPlains
Correlationsa
logQ50 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ50 1.000 .782 .147 -.557 .195 -.422
Correlation logAreaKm .782 1.000 .025 -.268 .340 -.532
logMAP .147 .025 1.000 .041 -.111 -.380
logSI1 -.557 -.268 .041 1.000 -.042 -.008
logShapFactr .195 .340 -.111 -.042 1.000 -.388
logChSlpDmls -.422 -.532 -.380 -.008 -.388 1.000
Sig. (1-tailed) logQ50 . .000 .104 .000 .047 .000
115
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ50
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .782 .611 .606 .23535
2 .861c .741 .734 .19319
3 .873d .763 .752 .18643 1.640
a. Region = Mixed WoodPlains
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logSI1
d. Predictors: (Constant), logAreaKm, logSI1, logMAP
e. Dependent Variable: logQ50
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 6.347 1 6.347 114.592 .000c
Residual 4.044 73 .055
Total 10.391 74
2 Regression 7.704 2 3.852 103.212 .000d
116
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) .062 .177 .348 .729
logAreaKm .838 .078 .782 10.705 .000 .782 .782 .782 1.000 1.000
2 (Constant) .462 .160 2.885 .005
logAreaKm .730 .067 .681 10.951 .000 .782 .790 .656 .928 1.077
logSI1 -4.080 .677 -.375 -6.029 .000 -.557 -.579 -.361 .928 1.077
3 (Constant) -3.397 1.544 -2.201 .031
logAreaKm .724 .064 .675 11.245 .000 .782 .800 .650 .927 1.079
logSI1 -4.162 .654 -.382 -6.364 .000 -.557 -.603 -.368 .926 1.080
logMAP 1.299 .517 .146 2.513 .014 .147 .286 .145 .997 1.003
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ50
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
1 logMAP .127c 1.767 .081 .204 .999 1.001 .999
c
logSI1 -.375 -6.029 .000 -.579 .928 1.077 .928
c
logShapFactr -.080 -1.026 .308 -.120 .885 1.130 .885
c
logChSlpDmls -.009 -.101 .920 -.012 .717 1.395 .717
d
2 logMAP .146 2.513 .014 .286 .997 1.003 .926
logShapFactr -.059d -.924 .359 -.109 .882 1.134 .820
logChSlpDmls -.091d -1.265 .210 -.148 .692 1.445 .643
3 logShapFactr -.040e -.634 .528 -.076 .867 1.153 .815
logChSlpDmls -.019e -.243 .809 -.029 .563 1.776 .563
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ50
117
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logSI1 logMAP
1 1 1.988 1.000 .01 .01
2 .012 12.985 .99 .99
2 1 2.643 1.000 .00 .00 .04
2 .346 2.762 .01 .01 .83
3 .010 15.984 .99 .99 .13
3 1 3.612 1.000 .00 .00 .02 .00
2 .373 3.110 .00 .01 .87 .00
3 .014 16.053 .00 .99 .11 .00
4 9.755E-5 192.433 1.00 .00 .00 1.00
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ50
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value 1.1497 2.6462 1.9393 .32722 75
Residual -.39162 .40029 .00000 .18261 75
Std. Predicted Value -2.413 2.160 .000 1.000 75
Std. Residual -2.101 2.147 .000 .980 75
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ50
Charts
118
Descriptive Statisticsa
Mean Std. Deviation N
logQ25 1.6751 .79747 43
logAreaKm 2.3566 .99527 43
logMAP 2.9348 .05119 43
logSI1 .0575 .02563 43
logShapFactr 1.1180 .25951 43
logChSlpDmls -2.5392 .57989 43
a. Region = Boreal Shield
Correlationsa
logQ25 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ25 1.000 .941 .151 -.068 .730 -.818
Correlation logAreaKm .941 1.000 -.069 .134 .710 -.941
logMAP .151 -.069 1.000 -.309 .200 .224
logSI1 -.068 .134 -.309 1.000 -.053 -.311
logShapFactr .730 .710 .200 -.053 1.000 -.683
logChSlpDmls -.818 -.941 .224 -.311 -.683 1.000
Sig. (1-tailed) logQ25 . .000 .167 .332 .000 .000
logAreaKm .000 . .330 .195 .000 .000
119
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Boreal Shield
b. Dependent Variable: logQ25
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .941 .885 .882 .27349
2 .965c .932 .929 .21293
d
3 .975 .951 .947 .18351 2.257
a. Region = Boreal Shield
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logMAP
d. Predictors: (Constant), logAreaKm, logMAP, logSI1
e. Dependent Variable: logQ25
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 23.643 1 23.643 316.097 .000c
Residual 3.067 41 .075
Total 26.710 42
2 Regression 24.897 2 12.448 274.569 .000d
Residual 1.814 40 .045
120
Total 26.710 42
3 Regression 25.397 3 8.466 251.396 .000e
Residual 1.313 39 .034
Total 26.710 42
a. Region = Boreal Shield
b. Dependent Variable: logQ25
c. Predictors: (Constant), logAreaKm
d. Predictors: (Constant), logAreaKm, logMAP
e. Predictors: (Constant), logAreaKm, logMAP, logSI1
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) -.101 .108 -.937 .354
logAreaKm .754 .042 .941 17.779 .000 .941 .941 .941 1.000 1.000
2 (Constant) -10.057 1.896 -5.306 .000
logAreaKm .766 .033 .956 23.145 .000 .941 .965 .954 .995 1.005
logMAP 3.383 .643 .217 5.258 .000 .151 .639 .217 .995 1.005
3 (Constant) -7.834 1.733 -4.521 .000
logAreaKm .779 .029 .972 27.124 .000 .941 .975 .963 .981 1.019
logMAP 2.703 .582 .173 4.645 .000 .151 .597 .165 .904 1.107
logSI1 -4.510 1.170 -.145 -3.854 .000 -.068 -.525 -.137 .892 1.121
a. Region = Boreal Shield
b. Dependent Variable: logQ25
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
c
1 logMAP .217 5.258 .000 .639 .995 1.005 .995
c
logSI1 -.198 -4.488 .000 -.579 .982 1.018 .982
c
logShapFactr .123 1.677 .101 .256 .495 2.019 .495
c
logChSlpDmls .591 4.620 .000 .590 .114 8.741 .114
d
2 logSI1 -.145 -3.854 .000 -.525 .892 1.121 .892
d
logShapFactr .016 .258 .798 .041 .433 2.310 .433
logChSlpDmls .372d 2.940 .005 .426 .089 11.229 .089
3 logShapFactr -.008e -.149 .882 -.024 .427 2.342 .427
e
logChSlpDmls .195 1.448 .156 .229 .068 14.727 .068
a. Region = Boreal Shield
b. Dependent Variable: logQ25
c. Predictors in the Model: (Constant), logAreaKm
121
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logMAP logSI1
1 1 1.923 1.000 .04 .04
2 .077 4.992 .96 .96
2 1 2.897 1.000 .00 .02 .00
2 .103 5.310 .00 .98 .00
3 .000 140.226 1.00 .01 1.00
3 1 3.770 1.000 .00 .01 .00 .01
2 .135 5.275 .00 .42 .00 .64
3 .094 6.334 .00 .57 .00 .25
4 .000 168.762 1.00 .00 1.00 .10
a. Region = Boreal Shield
b. Dependent Variable: logQ25
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value -.2636 2.7478 1.6751 .77761 43
Residual -.45474 .42091 .00000 .17683 43
Std. Predicted Value -2.493 1.379 .000 1.000 43
Std. Residual -2.478 2.294 .000 .964 43
a. Region = Boreal Shield
b. Dependent Variable: logQ25
Charts
122
Descriptive Statisticsa
Mean Std. Deviation N
logQ25 1.8874 .36920 75
logAreaKm 2.2409 .34956 75
logMAP 2.9826 .04197 75
logSI1 .0390 .03444 75
logShapFactr 1.1352 .20216 75
logChSlpDmls -2.6722 .32149 75
a. Region = Mixed WoodPlains
Correlationsa
logQ25 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ25 1.000 .793 .158 -.553 .196 -.439
Correlation logAreaKm .793 1.000 .025 -.268 .340 -.532
logMAP .158 .025 1.000 .041 -.111 -.380
logSI1 -.553 -.268 .041 1.000 -.042 -.008
logShapFactr .196 .340 -.111 -.042 1.000 -.388
logChSlpDmls -.439 -.532 -.380 -.008 -.388 1.000
Sig. (1-tailed) logQ25 . .000 .088 .000 .046 .000
123
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ25
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .793 .629 .624 .22635
2 .868c .754 .747 .18563
3 .882d .778 .769 .17747 1.659
a. Region = Mixed WoodPlains
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logSI1
d. Predictors: (Constant), logAreaKm, logSI1, logMAP
e. Dependent Variable: logQ25
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 6.347 1 6.347 123.889 .000c
Residual 3.740 73 .051
Total 10.087 74
2 Regression 7.606 2 3.803 110.362 .000d
124
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) .010 .171 .058 .954
logAreaKm .838 .075 .793 11.131 .000 .793 .793 .793 1.000 1.000
2 (Constant) .395 .154 2.570 .012
logAreaKm .734 .064 .695 11.458 .000 .793 .804 .670 .928 1.077
logSI1 -3.930 .650 -.367 -6.044 .000 -.553 -.580 -.353 .928 1.077
3 (Constant) -3.680 1.469 -2.505 .015
logAreaKm .728 .061 .689 11.871 .000 .793 .815 .663 .927 1.079
logSI1 -4.017 .622 -.375 -6.453 .000 -.553 -.608 -.361 .926 1.080
logMAP 1.372 .492 .156 2.788 .007 .158 .314 .156 .997 1.003
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ25
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
1 logMAP .138c 1.975 .052 .227 .999 1.001 .999
c
logSI1 -.367 -6.044 .000 -.580 .928 1.077 .928
c
logShapFactr -.083 -1.094 .278 -.128 .885 1.130 .885
c
logChSlpDmls -.023 -.275 .784 -.032 .717 1.395 .717
d
2 logMAP .156 2.788 .007 .314 .997 1.003 .926
logShapFactr -.063d -1.007 .317 -.119 .882 1.134 .820
logChSlpDmls -.104d -1.492 .140 -.174 .692 1.445 .643
3 logShapFactr -.042e -.694 .490 -.083 .867 1.153 .815
logChSlpDmls -.028e -.379 .706 -.045 .563 1.776 .563
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ25
125
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logSI1 logMAP
1 1 1.988 1.000 .01 .01
2 .012 12.985 .99 .99
2 1 2.643 1.000 .00 .00 .04
2 .346 2.762 .01 .01 .83
3 .010 15.984 .99 .99 .13
3 1 3.612 1.000 .00 .00 .02 .00
2 .373 3.110 .00 .01 .87 .00
3 .014 16.053 .00 .99 .11 .00
4 9.755E-5 192.433 1.00 .00 .00 1.00
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ25
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value 1.0931 2.5910 1.8874 .32572 75
Residual -.38583 .38775 .00000 .17384 75
Std. Predicted Value -2.439 2.160 .000 1.000 75
Std. Residual -2.174 2.185 .000 .980 75
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ25
Charts
126
Descriptive Statisticsa
Mean Std. Deviation N
logQ10 1.5928 .81277 43
logAreaKm 2.3566 .99527 43
logMAP 2.9348 .05119 43
logSI1 .0575 .02563 43
logShapFactr 1.1180 .25951 43
logChSlpDmls -2.5392 .57989 43
a. Region = Boreal Shield
Correlationsa
logQ10 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ10 1.000 .942 .160 -.069 .734 -.819
Correlation logAreaKm .942 1.000 -.069 .134 .710 -.941
logMAP .160 -.069 1.000 -.309 .200 .224
logSI1 -.069 .134 -.309 1.000 -.053 -.311
logShapFactr .734 .710 .200 -.053 1.000 -.683
logChSlpDmls -.819 -.941 .224 -.311 -.683 1.000
Sig. (1-tailed) logQ10 . .000 .152 .331 .000 .000
logAreaKm .000 . .330 .195 .000 .000
127
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter <=
logAreaKm .
.050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter <=
logMAP .
.050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter <=
logSI1 .
.050, Probability-of-F-to-remove >= .100).
a. Region = Boreal Shield
b. Dependent Variable: logQ10
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .942 .888 .885 .27563
2 .969c .939 .936 .20594
d
3 .978 .957 .954 .17493 2.232
a. Region = Boreal Shield
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logMAP
d. Predictors: (Constant), logAreaKm, logMAP, logSI1
e. Dependent Variable: logQ10
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 24.630 1 24.630 324.195 .000c
Residual 3.115 41 .076
Total 27.745 42
2 Regression 26.049 2 13.024 307.105 .000d
Residual 1.696 40 .042
128
Total 27.745 42
3 Regression 26.552 3 8.851 289.223 .000e
Residual 1.193 39 .031
Total 27.745 42
a. Region = Boreal Shield
b. Dependent Variable: logQ10
c. Predictors: (Constant), logAreaKm
d. Predictors: (Constant), logAreaKm, logMAP
e. Predictors: (Constant), logAreaKm, logMAP, logSI1
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) -.221 .109 -2.021 .050
logAreaKm .769 .043 .942 18.005 .000 .942 .942 .942 1.000 1.000
2 (Constant) -10.813 1.833 -5.898 .000
logAreaKm .782 .032 .958 24.442 .000 .942 .968 .956 .995 1.005
logMAP 3.599 .622 .227 5.783 .000 .160 .675 .226 .995 1.005
3 (Constant) -8.583 1.652 -5.197 .000
logAreaKm .795 .027 .974 29.052 .000 .942 .978 .965 .981 1.019
logMAP 2.917 .555 .184 5.259 .000 .160 .644 .175 .904 1.107
logSI1 -4.522 1.115 -.143 -4.054 .000 -.069 -.545 -.135 .892 1.121
a. Region = Boreal Shield
b. Dependent Variable: logQ10
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
c
1 logMAP .227 5.783 .000 .675 .995 1.005 .995
c
logSI1 -.199 -4.592 .000 -.588 .982 1.018 .982
c
logShapFactr .130 1.794 .080 .273 .495 2.019 .495
c
logChSlpDmls .593 4.722 .000 .598 .114 8.741 .114
d
2 logSI1 -.143 -4.054 .000 -.545 .892 1.121 .892
d
logShapFactr .018 .305 .762 .049 .433 2.310 .433
logChSlpDmls .357d 2.983 .005 .431 .089 11.229 .089
3 logShapFactr -.006e -.112 .911 -.018 .427 2.342 .427
e
logChSlpDmls .180 1.432 .160 .226 .068 14.727 .068
a. Region = Boreal Shield
b. Dependent Variable: logQ10
c. Predictors in the Model: (Constant), logAreaKm
129
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logMAP logSI1
1 1 1.923 1.000 .04 .04
2 .077 4.992 .96 .96
2 1 2.897 1.000 .00 .02 .00
2 .103 5.310 .00 .98 .00
3 .000 140.226 1.00 .01 1.00
3 1 3.770 1.000 .00 .01 .00 .01
2 .135 5.275 .00 .42 .00 .64
3 .094 6.334 .00 .57 .00 .25
4 .000 168.762 1.00 .00 1.00 .10
a. Region = Boreal Shield
b. Dependent Variable: logQ10
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value -.3970 2.6864 1.5928 .79510 43
Residual -.44448 .41232 .00000 .16857 43
Std. Predicted Value -2.503 1.375 .000 1.000 43
Std. Residual -2.541 2.357 .000 .964 43
a. Region = Boreal Shield
b. Dependent Variable: logQ10
Charts
130
Descriptive Statisticsa
Mean Std. Deviation N
logQ10 1.8054 .36207 75
logAreaKm 2.2409 .34956 75
logMAP 2.9826 .04197 75
logSI1 .0390 .03444 75
logShapFactr 1.1352 .20216 75
logChSlpDmls -2.6722 .32149 75
a. Region = Mixed WoodPlains
Correlationsa
logQ10 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ10 1.000 .809 .176 -.542 .197 -.464
Correlation logAreaKm .809 1.000 .025 -.268 .340 -.532
logMAP .176 .025 1.000 .041 -.111 -.380
logSI1 -.542 -.268 .041 1.000 -.042 -.008
logShapFactr .197 .340 -.111 -.042 1.000 -.388
logChSlpDmls -.464 -.532 -.380 -.008 -.388 1.000
Sig. (1-tailed) logQ10 . .000 .065 .000 .045 .000
131
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ10
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .809 .655 .650 .21424
2 .877c .769 .762 .17651
3 .894d .799 .790 .16590 1.691
a. Region = Mixed WoodPlains
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logSI1
d. Predictors: (Constant), logAreaKm, logSI1, logMAP
e. Dependent Variable: logQ10
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 6.350 1 6.350 138.357 .000c
Residual 3.351 73 .046
Total 9.701 74
2 Regression 7.458 2 3.729 119.690 .000d
132
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) -.073 .162 -.449 .655
logAreaKm .838 .071 .809 11.763 .000 .809 .809 .809 1.000 1.000
2 (Constant) .289 .146 1.976 .052
logAreaKm .741 .061 .715 12.160 .000 .809 .820 .689 .928 1.077
logSI1 -3.687 .618 -.351 -5.962 .000 -.542 -.575 -.338 .928 1.077
3 (Constant) -4.139 1.374 -3.013 .004
logAreaKm .734 .057 .708 12.805 .000 .809 .835 .682 .927 1.079
logSI1 -3.780 .582 -.360 -6.496 .000 -.542 -.611 -.346 .926 1.080
logMAP 1.491 .460 .173 3.240 .002 .176 .359 .173 .997 1.003
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ10
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
1 logMAP .156c 2.330 .023 .265 .999 1.001 .999
c
logSI1 -.351 -5.962 .000 -.575 .928 1.077 .928
c
logShapFactr -.088 -1.205 .232 -.141 .885 1.130 .885
c
logChSlpDmls -.046 -.569 .571 -.067 .717 1.395 .717
d
2 logMAP .173 3.240 .002 .359 .997 1.003 .926
logShapFactr -.069d -1.140 .258 -.134 .882 1.134 .820
logChSlpDmls -.124d -1.859 .067 -.215 .692 1.445 .643
3 logShapFactr -.046e -.794 .430 -.094 .867 1.153 .815
logChSlpDmls -.043e -.602 .549 -.072 .563 1.776 .563
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ10
133
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logSI1 logMAP
1 1 1.988 1.000 .01 .01
2 .012 12.985 .99 .99
2 1 2.643 1.000 .00 .00 .04
2 .346 2.762 .01 .01 .83
3 .010 15.984 .99 .99 .13
3 1 3.612 1.000 .00 .00 .02 .00
2 .373 3.110 .00 .01 .87 .00
3 .014 16.053 .00 .99 .11 .00
4 9.755E-5 192.433 1.00 .00 .00 1.00
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ10
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value 1.0032 2.5038 1.8054 .32355 75
Residual -.37517 .36837 .00000 .16250 75
Std. Predicted Value -2.479 2.159 .000 1.000 75
Std. Residual -2.261 2.220 .000 .980 75
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ10
Charts
134
Descriptive Statisticsa
Mean Std. Deviation N
logQ2 1.3572 .85698 43
logAreaKm 2.3566 .99527 43
logMAP 2.9348 .05119 43
logSI1 .0575 .02563 43
logShapFactr 1.1180 .25951 43
logChSlpDmls -2.5392 .57989 43
a. Region = Boreal Shield
Correlationsa
logQ2 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ2 1.000 .941 .189 -.074 .745 -.815
Correlation logAreaKm .941 1.000 -.069 .134 .710 -.941
logMAP .189 -.069 1.000 -.309 .200 .224
logSI1 -.074 .134 -.309 1.000 -.053 -.311
logShapFactr .745 .710 .200 -.053 1.000 -.683
logChSlpDmls -.815 -.941 .224 -.311 -.683 1.000
Sig. (1-tailed) logQ2 . .000 .112 .319 .000 .000
135
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Boreal Shield
b. Dependent Variable: logQ2
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .941 .886 .883 .29347
2 .975c .951 .948 .19513
3 .984d .968 .965 .15973 2.167
a. Region = Boreal Shield
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logMAP
d. Predictors: (Constant), logAreaKm, logMAP, logSI1
e. Dependent Variable: logQ2
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 27.314 1 27.314 317.156 .000c
Residual 3.531 41 .086
Total 30.845 42
2 Regression 29.322 2 14.661 385.063 .000d
136
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) -.552 .116 -4.753 .000
logAreaKm .810 .045 .941 17.809 .000 .941 .941 .941 1.000 1.000
2 (Constant) -13.155 1.737 -7.573 .000
logAreaKm .826 .030 .959 27.223 .000 .941 .974 .956 .995 1.005
logMAP 4.282 .590 .256 7.262 .000 .189 .754 .255 .995 1.005
3 (Constant) -10.870 1.508 -7.208 .000
logAreaKm .839 .025 .974 33.561 .000 .941 .983 .965 .981 1.019
logMAP 3.583 .507 .214 7.075 .000 .189 .750 .203 .904 1.107
logSI1 -4.633 1.018 -.139 -4.549 .000 -.074 -.589 -.131 .892 1.121
a. Region = Boreal Shield
b. Dependent Variable: logQ2
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
1 logMAP .256c 7.262 .000 .754 .995 1.005 .995
logSI1 -.204c -4.708 .000 -.597 .982 1.018 .982
logShapFactr .153c 2.129 .039 .319 .495 2.019 .495
logChSlpDmls .613c 4.905 .000 .613 .114 8.741 .114
2 logSI1 -.139d -4.549 .000 -.589 .892 1.121 .892
logShapFactr .028d .527 .601 .084 .433 2.310 .433
logChSlpDmls .331d 3.103 .004 .445 .089 11.229 .089
3 logShapFactr .005e .115 .909 .019 .427 2.342 .427
logChSlpDmls .154e 1.416 .165 .224 .068 14.727 .068
a. Region = Boreal Shield
b. Dependent Variable: logQ2
137
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logMAP logSI1
1 1 1.923 1.000 .04 .04
2 .077 4.992 .96 .96
2 1 2.897 1.000 .00 .02 .00
2 .103 5.310 .00 .98 .00
3 .000 140.226 1.00 .01 1.00
3 1 3.770 1.000 .00 .01 .00 .01
2 .135 5.275 .00 .42 .00 .64
3 .094 6.334 .00 .57 .00 .25
4 .000 168.762 1.00 .00 1.00 .10
a. Region = Boreal Shield
b. Dependent Variable: logQ2
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value -.7794 2.5082 1.3572 .84304 43
Residual -.40212 .37227 .00000 .15392 43
Std. Predicted Value -2.534 1.365 .000 1.000 43
Std. Residual -2.518 2.331 .000 .964 43
a. Region = Boreal Shield
b. Dependent Variable: logQ2
Charts
138
Descriptive Statisticsa
Mean Std. Deviation N
logQ2 1.5694 .35051 75
logAreaKm 2.2409 .34956 75
logMAP 2.9826 .04197 75
logSI1 .0390 .03444 75
logShapFactr 1.1352 .20216 75
logChSlpDmls -2.6722 .32149 75
a. Region = Mixed WoodPlains
Correlationsa
logQ2 logAreaKm logMAP logSI1 logShapFactr logChSlpDmls
Pearson logQ2 1.000 .840 .227 -.494 .196 -.530
Correlation logAreaKm .840 1.000 .025 -.268 .340 -.532
logMAP .227 .025 1.000 .041 -.111 -.380
logSI1 -.494 -.268 .041 1.000 -.042 -.008
logShapFactr .196 .340 -.111 -.042 1.000 -.388
logChSlpDmls -.530 -.532 -.380 -.008 -.388 1.000
Sig. (1-tailed) logQ2 . .000 .025 .000 .046 .000
139
Variables Entered/Removeda,b
Variables Variables
Model Entered Removed Method
1 Stepwise (Criteria: Probability-of-F-to-enter
logAreaKm .
<= .050, Probability-of-F-to-remove >= .100).
2 Stepwise (Criteria: Probability-of-F-to-enter
logSI1 .
<= .050, Probability-of-F-to-remove >= .100).
3 Stepwise (Criteria: Probability-of-F-to-enter
logMAP .
<= .050, Probability-of-F-to-remove >= .100).
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ2
Model Summarya,e
Adjusted R Std. Error of the
Model R R Square Square Estimate Durbin-Watson
b
1 .840 .705 .701 .19163
2 .885c .783 .777 .16555
3 .912d .831 .824 .14704 1.763
a. Region = Mixed WoodPlains
b. Predictors: (Constant), logAreaKm
c. Predictors: (Constant), logAreaKm, logSI1
d. Predictors: (Constant), logAreaKm, logSI1, logMAP
e. Dependent Variable: logQ2
ANOVAa,b
Model Sum of Squares df Mean Square F Sig.
1 Regression 6.411 1 6.411 174.573 .000c
Residual 2.681 73 .037
Total 9.091 74
2 Regression 7.118 2 3.559 129.848 .000d
140
Coefficientsa,b
Unstandardized Standardized Collinearity
Coefficients Coefficients Correlations Statistics
Std. Zero-
Model B Error Beta t Sig. order Partial Part Tolerance VIF
1 (Constant) -.317 .145 -2.196 .031
logAreaKm .842 .064 .840 13.213 .000 .840 .840 .840 1.000 1.000
2 (Constant) -.028 .137 -.207 .836
logAreaKm .764 .057 .762 13.375 .000 .840 .844 .734 .928 1.077
logSI1 -2.946 .580 -.289 -5.080 .000 -.494 -.514 -.279 .928 1.077
3 (Constant) -5.483 1.217 -4.504 .000
logAreaKm .756 .051 .754 14.879 .000 .840 .870 .726 .927 1.079
logSI1 -3.061 .516 -.301 -5.936 .000 -.494 -.576 -.289 .926 1.080
logMAP 1.837 .408 .220 4.503 .000 .227 .471 .220 .997 1.003
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ2
Excluded Variablesa,b
Collinearity Statistics
Partial Minimum
Model Beta In t Sig. Correlation Tolerance VIF Tolerance
1 logMAP .206c 3.469 .001 .378 .999 1.001 .999
c
logSI1 -.289 -5.080 .000 -.514 .928 1.077 .928
c
logShapFactr -.100 -1.499 .138 -.174 .885 1.130 .885
c
logChSlpDmls -.116 -1.561 .123 -.181 .717 1.395 .717
d
2 logMAP .220 4.503 .000 .471 .997 1.003 .926
logShapFactr -.085d -1.460 .149 -.171 .882 1.134 .820
logChSlpDmls -.183d -2.917 .005 -.327 .692 1.445 .643
3 logShapFactr -.055e -1.055 .295 -.125 .867 1.153 .815
logChSlpDmls -.085e -1.316 .193 -.155 .563 1.776 .563
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ2
141
Collinearity Diagnosticsa,b
Condition Variance Proportions
Model Dimension Eigenvalue Index (Constant) logAreaKm logSI1 logMAP
1 1 1.988 1.000 .01 .01
2 .012 12.985 .99 .99
2 1 2.643 1.000 .00 .00 .04
2 .346 2.762 .01 .01 .83
3 .010 15.984 .99 .99 .13
3 1 3.612 1.000 .00 .00 .02 .00
2 .373 3.110 .00 .01 .87 .00
3 .014 16.053 .00 .99 .11 .00
4 9.755E-5 192.433 1.00 .00 .00 1.00
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ2
Residuals Statisticsa,b
Minimum Maximum Mean Std. Deviation N
Predicted Value .7393 2.2555 1.5694 .31955 75
Residual -.33889 .31825 .00000 .14402 75
Std. Predicted Value -2.598 2.147 .000 1.000 75
Std. Residual -2.305 2.164 .000 .980 75
a. Region = Mixed WoodPlains
b. Dependent Variable: logQ2
Charts
142
143
Copyright Acknowledgement
This project was supported by the Ontario Ministry of Transportation (MTO) under its Highway
Infrastructure Innovation and Funding Program (HIIFP). The author has also produced a project
report for MTO. The contents of the thesis and results would find similarity with the final project
report submitted to MTO in December 2015. A publication based on this thesis is also currently
in progress.