Professional Documents
Culture Documents
Wavesand WaterLevels:
JOIN.SEA
A rigorousbut practicalnew
approach
ReportSR 537
November1998
(Re-issued
with minoramendments
May2000)
LANCASTE
UNIVERSIT r)
TheJoint Probabilityof Wavesand Water
Levels:JOIN-SEA
A rigorous but practicalnew approach
ReportSR 537
November1998
(Re-issuedwith minor amendmentsMay2000)
lnnwttingtora
AddA and RegisreredOm€ HR Wrlingrnrd Lrd. tlo*hery Park, Watlingtod, OXON OX10 EBA
Tel. +rt4 (lt) 1491835301 Fr* +44 ({l) l49l 83!231
Prepared
by ....,P Hq,tr.keE
r'f*...1.,
(name)
?q*l tl
(Title)
Approvedby
(name)
?rq*or
(Title)
D","19.-IlU.Z.aes..
O Ministryof Agriculture,FisheriesandFood,Copyright2000
Er**,"*.0 tll
nf"-
WallirEford IV
Summary
The Joint Probability of Waves and Water l,evels: JOIN-SEA
Report SR 537
November 1998
(Re-issuedwith minor amendmentsMay 2000)
We presenta new approachto the joint probability of large wave heightsand high
waterlevelswhich:
!"**n,.rro"
Summarycontinued
distributionof theprogramsto industryspecialists in the form of a
lor beta-testing,
briefingworkshopin February2000.at whichthe issuesrelatingto dissemination
wereclarified.
computerprograms
For furtherinformationon this reportandthe associated
pleasecontactDr PeterHawkesof the CoastalDepartment at HR Wallingford.
!Hn*" tngrora V1
Contents
Title page I
Contract iii
Summary
Content:t vii
1. I n t r o d u c t i o .n. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . I. . . .
1.1 Background......... . . . . . . . . . . . . .I. . . .
1.2 Definitionsusedinjoint probabilityanalysis............... .................2
1.3 Dependence . . . . . . . . . . . . . . . . . . .3. . . . . .
1.4 Review of altemativemethods........ .................5
1.5 O u t l i n eo f t h en e w m e t h o d. . . . . . . . . . . . . . . . ...............1
I .6 R e s e a r cohu t p u t s . . . . . . . . . . , . . . . . ...............................7
2. T h en e wa p p r o a c h . . . . . . . . . . . . . ................8
2.1 I n p u td a t ar e q u i r e m e n.t.s. . . . . . . . . . . . . . . . ..................8
2.2 Fitting of statisticaldistributionsto the marginal variables...........9
2.3 Statisticalmodelsfor dependence .................. l0
2.4 L o n g - t e r ms i m u l a t i o n ..................... . . . . . . . . . . . . l.0. . .
2.5 Analysis ofjoint exceedanceextremesand structuralresponse
functions . . . . . . . . . . . . . . . . . . . . .l.l. . . . . .
2.6 S o f t w a r ei m p l e m e n t a t i o n . . . . . . . . . . . . . . . . . . . . . . . . . . . . .I .I. .
3. Applications...................... ..............l2
3.1 l3
Exampleapplicationsto syntheticdata........................................
3.2 Casestudiesusingfield data.... 16
.......................
3.3 Example applicationusing both inshoreand offshorewave data20
3.4 Issuesraiseddurins analvsisof otherdatasets...........
.................24
4. Discussion ...........26
4.1 Pastpracticefor useof resultsin design-..................................... 26
4.2 A d v a n t a g eosf t h en e w a p p r o a c h . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1
4.? New possibilitiesbasedon long-termsimulation........................28
4.4 Recommended optionsforjoint probabilityanalysis..................29
4.5 Performance of the varioustechniques........................................31
4.6 Availability of methodsand stafftime involved......................... 33
4.'7 F u t u r ed e v e l o p m e n t s . . . . . . . . . . . . . . . . . . . . . . . . ...............34
5. References........................,. ............35
Tahles
Table I Direct hindcastingald extrapolationof overtoppingrate,
a p p l i e dt o d a t as e tS i m l . . . . . . . . . . . . . . . . . .................1. .5. . . .
Table 2 Direct hindcastingand extrapolationof overtoppingrate,
appliedtodatasetSimz................. .......................15
Table 3 Direct hindcastingand extrapolationof overtoppingrate,
appliedto datasetSim3 ...........
16
Table 4 Direct hindcastingand extrapolationof oveltopping rate,
a p p l i e dt o d a t as e tS i m 4. . . . . . . . . . . . . . . . . .................1. .6. . . .
Figures
Figure t Jointexceedance andjoint structuralprobabilities..............................
3
Figure 2 Probability density contoursshowing negativedependence
betweensurgeandwave height(off Hythe) ........................................4
Figure 3 Probability densitycontoursshowing positive dependence
betweensurgeandwaveheight(Christchurch 5
Bay)............................
Figure 4 Joint exceedancecontoursofH, (m) and water level (mOD)
for a sitewith very low dependence.. 13
....................
Figure 5 Joint exceedancecontoursofH, (m) and water level (mCD)
for a sitewith high dependence 14
............................
Appendices
Appendix I Comparisonsbetweenjoint exceedanceextremesfor different analysis
methods
!***"u,nuo* vlll
Part I HR Wallingford Report
lHnwat'nsfo.a
1. INTRODUCTION
1.1 Background
Meaning of ioint probabilitv
Joint probability typically refers to two or more partially relatedenvironmentalvariablesoccurring
simultaneouslyto producea responseof interest. Examplesare:
The Start-upWorkshoo
In August 1994,MAFF hosteda Workshop on joint probability (MAFF, 1995) at which MessrsHawkes,
Owen and Tawn presentedtheir own preferredapproaches.As a result of the Workshop, MAFF agreedto
fund a two-yearjoint researchproject at HR Wallingford and LancasterUniversity. The intention was to
combine the best ofthe existing methods,and to develop,test and eventuallydisseminatea rigorous and
practical approachto the joint probability of large wave heights and high water levels.
The prpsell]L_Brqieal
MAFF provided funding for the developmentand testing of new joint probability methodsfrom April 1995
to March 1997,although in practicedevelopmentcontinueduntil September1998. Most of the
developmentand validation of the methodsand computerprogramswas carried out at Lancaster
University, but the processcontinuedwhen programcode was transferredto HR Wallingford in
April 1996. In principle, the computer programsand reportsare freely available,sincethey were
developedentirely with MAFF funding. However, in practicethere are additional costsassociatedwith
training, follow-up advice, NAG licencesand copying of documentation.
t"* Wallingford
1.2 Definitionsused in joint probabilityanalysis
Marginal probability and return period
Marginal probability refers to the distribution of a single variable, for example wave height or water level.
Return period refers to the averageperiod of time betweenoccurrencesof a panicular high value of that
variable. So, for examplethe | 00 year retum period still water level is the level equalledor exceeded
once, on average,in eachperiod of 100 years.
-1.00 -0.?s -0,50 -0,25 0.00 0.25 0.50 0.75 1,00 1.25
NO- IN
(METRES) ROl4
4.00- 4.50 0 0
3,50- 4.00 0 3
3.00- 3.50 0 lz
2,50- :.00 0 11
2.00- 2.50 0 248
1.50- 2.00 0 588
1.00- 1.50 0 1 79 5
0.50- 1.00 I 4071
0,00- 0-50 ) 5 1 00
Figure 2 Probability density contours shorying negative dependence between surge and wave
height (off Hythe)
!"^*",'*"* sR 537o,t/ottx)
SURGERESIDUAI LE"VEL {M)
-1-00 -0.?0 -0.40 -0.10 0.20 0.50 0.80 1,10
Independentanddependentcases
The dependentcaseis trivial, assumingthat the statisticsof wavesaloneandof waterlevelsaloneareknown,
sinceone only hasto combinewavesandwaterlevelseachwith the samematginalprobabilitiesasthe
response.The independentcaseis easywhenworkingwithjoint exceedance probability(which providesan
approximationto the responsewith the sameprobability)asit is just the productof the two marginal
probabilities.As the independentanddependent casesaresimpleto calculate,it maybe worth deriving them
earlyin anyproject. Treatingtheseasthe 'mostoptimistic' and 'most pessimistic'scenarios,respectively,
mayhelp in judging the valueof anymoredetailedjoint probabilityanalysis.
Inhritivejoint probabilit]'assessment
The simplestmethodfor assessment of dependence is an intuitive onebasedon generalexperienceandthe
shapeandsizeof the seaareaaroundthe predictionpoint. The assessment may concludethat thereis a
modestdependence, andfor example,that high wavesandhigh waterlevelswith a 100yearjoint exceedance
retum periodare 10 to 100timesmorelikely to occurtogetherthanthe assumptionof indeoendence would
suggest.It may concludethat thereis a strongdependence, andfor exarnple,that high wavesandhigh watEr
levelswith a 100yearjoint exceedance returnperiodareonly 1Oto 100timeslesslikely to occurthanthe
assumptionof dependence would suggest.(Theprobabilityratio betweendreindependentanddependent
g* Wallingford
casesis 706 x 100.) This is the basisof ajoint probability methoddescnbed in the BeachnuTtxagenwnt
manual (C}F.IA, 1996, Section3.5.3).
Offshoreand inshoreextremes
Joint probability extremescan be calculatedand presentedeither offshoreor inshore. Offshoreresultsare
more generallyapplicableover a largerarea,but may needto be transformedinshorebeforefurther use.
Inshore results are more site-specific, and can take better account of any hydraulic interactions, but may be
applicableonly in one smalI area. Wave predictionsandjoint probabilitiesare often calculatedas a function
ofdirection. This is important,sincegeneralexposureto waves,the relationshipbetweenhigh wavesand
high water levels,and transformationinshore,may all be dependentupon stormdirection.
Depth-limitedconditions
In coastalengineeringapplications,wave heightsmay be depthJimited. For example,theremay be only a
few metres of water at the toe of a structure even in extreme conditions. so that the full force of the waves
!***,"oo.o
may not impact directly upon the structure. In extremecasesofthese circumstances,only the highestwater
levels need to be considered. A possible general approachfor depth limitation is to determine the extreme
water levels, and then to check the probability that the depthJimited wave height could occur at the same
time, However,it is worth noting in this situationthat swell waves,which do not usually havethe largest
wave heights,may be a worsecasefor structuraldesignthan depth limited wind waves.
Presentationof results
Joint exceedanceresultsare normally calculatedand presentedas 'contours' of combinationsof wave heights
and water levelswith givenjoint exceedanceretum periods,from which a 'worst case' can be determinedas
and when required for any particular situation. The sameinformation might be shown in the form of a table
of pairsof valuesof wave height and water level (drawn from the 'contours'). This approachwould allow
'worst
the case' to be identified in terms of run-up (or other designvariables)at the site of interest,but
usuallyrequiresthat severalpotential 'worst cases'be tested. Altematively, resultscan be presentedas
extrapolatedprobability densities,in which 'contours' indicatethe likelihood of occurrenceof particular
combinationsof wave heightsand water levels.This approachis useful for risk analysis,wherethe total risk
is found by integratingprobabilitiesover a wide rangeof wave conditionsand water levels in which 'failure'
rnay occur. Either results format can be converted to other relevant variables, for example run-up,
overtoppingor force on a coastalstructure.Pleasenote that althoughthe joint densityapproachcan provide a
direct estimation of the retum period of the response,tbejoint exceedanceexfemes contours can provide
only an approximationto this probability.
3 . Fitting the dependencebetween wave heights and water levels, and between wave heights and
steepnesses.
4 . Simulation of a large sample of wave height, wave period and water level data, using the fitted
distributions.
Main develonments
The main developmentsof the new method and its apptications are in elements 2 to 5. It is more objective,
pafiicularly in its modelling of dependence,than most existing approachesto joint probability assessment,and
allows wave period (or wave steepness) to be includedas a variable. With carefuluseits long-tem simulation
approachcan deliver more accuratepredictions ofjoint extreme values and structural responsefunction values
(eg overtopping,run-up,force) than methodsalreadyin commonuse. It will also assistin a gradualmove
towardsrisk-baseddesignof seadefences.
1.6 Researchoutputs
Products
Therearethreemaintypesof outputfrom this researchproject,namelyrcports,computerprogramsand
workshops,plusof courseimprovedjoint probabilityanalysisin consultancy
studies.
!***,,'**o sR 5-170.1/0tr)l)
JOIN-SEA software
The software consistsof FORTRAN computer programs, incorporating numerical statistical subroutines
(in NAG). The programs are research-basedand have not been testedto normal commercial standards,but are
available,initially on a trial basis,to appropriateusers.
JOIN-SEA reports
The first part of this document gives a brief description of the new approachand its application in coastal
engineering. The accompanying staiistical methodology and further casestudies are given in the technical
reportwhich forms the secondpart of this document.A companionuser manual(HR, 2000)describinginputs,
outputs and options available to the user is intended for day-to-day use with the new programs.
Workshoosand dissemination
A specialistworkshopwas held at HR Wallingford on 3 December1998,at which this reportand the
associateduser manual were releasedin draft form. The workshop was attendedby those directly involved in
the researchand by a number of invited coastalengineersfamiliar with joint probability applications. It was
hoped that the workshop discussion would guide future dissemination, usageand continued development of
joint probability methods,and ultimately MAFF policy in regard to their use. A subsequentbriefing workshop
was held at HR Wallingford on 4 February 2000. This was attendedby about a dozen industry specialists
interestedin receiving copies of the programs and training in their use. It is hoped that the subsequentbeta-
testingprogamme will yield usefulfeedbackon continueddevelopmentand usageofthe methods.
2. THENEWAPPROACH
2.1 Input data requirements
Prepara[qro:f]hetinputdata
Thefirst of thefive stagesinvolvespreparation of theinputdata.Generation andpre-processing of theinpur
waveandwaterlevel datais probablythe mosttime-consuming part of thejoint probabilityanalysis
procedure.Eachinput recordconsistsof a waveheight,a waveperiodanda waterlevel (or altemativelya
surge)preferably usingidenticalmeasurement or predictionlocationsfor bothwavesandwaterlevels.The
datacancomefrom measurements or hindcasts,but for eachrecordthe valuesshouldrepresentconditionsat a
particularpoint andtime, A convenientway of satisfyingtherequirementfor the recordsto be both
temporallyindependent, andrelevant,is to useonly thoserecordsrepresenting conditionsat the peakof each
tidalcycle(ie onerecordevery12or 13hours).At leastthreeyearsof data"notnecessarily continuous but
representativeof thetype of seastatesof interest,areneededtojustify theeffort involvedin applyingthe new
approach.
Stagesin datapre-pr@qa$sirg
Typicallythefollowingpre-processing
of the datawouldbe carriedout:
- combining of the separatewave ard waier level data into a single sequentialdata file;
- exfacdon and retention of only one record, closest to high water, per tidal cyclei
- removal of obviouslyfaultydata,and
!nnw"rrrgrora sR 53?o4nvll)
- running of the preliminary diagnosticprogram TESTDATAto ^ssessthe type of dependence
and whether independenceor dependencewould be adequateassumptions.
!***u.oo.o sR53?&/05/00
refine the extremespredictionsfor that one variable. For example,there may be 20 years of water level
data but only l0 yearsof wave data; accurateextremevaluesfor one or other variable may have already
beenestablished(eg from spatialanalysisof water levels (POL, 1997)); or there may be anecdotal
evidenceof severeseaconditions outsidethe period ofthe measurements.Good joint probability analysis
usesthis additional information. This might involve modification of the parametersof the fitted
distribution(s)or scaling of the predictedextremesto achievebetter agleementwith the refined marginal
predictions. The presentmethod incorporatesany refinementsby scaling during the long-term simulation
of data,thus permanentlybuilding this information into the synthesisedseastatedata to be usedin
subsequentstructuralanalysis.
Selectionof statisticalmodel
The choice betweenone and two BVNt is determinedby the relative goodnessoffit to the data, which can
be assessedwith referenceto the varying degreeof correlation,expressedas a function of exceedance
level- The single BVN may be adequatefor a location at which all the wave conditions belong to a single
population,althoughfrequently the correlationwill increaserapidly towards the tail and the mixture model
will be needed. However, where tle wave conditions belong to more than one population, for example
wind-seaand swell, the mixture model is always likely to be neededto capturethe different dependences
in the two populations.
User input
In this and in the previous stagesthe user retainssome control over the process,primarily by having the
choice of selecteddependencemodel, and secondarilyby being able to selectboth the thresholdsabove
which the fitting will be applied,and the staning valuesfor optimisation of the fits: this is assistedby
referenceto diagnosticsto assessthe fits.
2.4 Long-termsimulation
The simulation procedure
The fourth stageinvolves simulation of a large sample of synthetic records of I{", T* and water level, basedon
the fitted distributions, and with the same statistical characieristics as the input data. At this stage it is
possibleto re-scalethe marginal extremedistributionsto more establishedvalues,if they are known. For
example,concurrentwave and water level data may be availablefor a period often years,and by
extrapolationof this data the marginal distributionsof the two variablesare calculatedwithin the joint
probability software. However, if either or both of the data setsextendsbeyond the concurrentperiod of
ten years,it is preferableto usethe additional information containedwithin the extra data. Thereforethe
marginaldistribution can be derived from the longer data set and incorporatedinto the data simulation
processthrough the use of the re-scalingoption. This is achievedby altering individual affected wave
heightsand water levels from one scaleto the other betweensimulation from the fitted distnbutions and
writing permanentlyto an output file. Thus, basedon all availabledata,thousandsof years worth of sea
conditions can be simulatedwith fitted distributions,extremesand dependencesfor wave height, water
level and wave period. This provides greaterflexibility in the subsequentanalysisofthe seastatedata.
'count-back'
The extremesanalysis method
Rather than fitting probability distributions to the synthesiseddata, extreme values are estimatedfrom the
appropriateempincal exceedanceprobability in the synthesiseddata. This is achieved by 'counting back'
through the highest values within the simulated data, a method based on the literal definition of retum period
and bestdescribedby meansof an example. If, for a 2000 year simulation,the 100year retum period value
(ie the level equalledor exceededon averageonceevery hundredyears)is required,then this is given by
'counting
back' to the 2000/100= 20th highestvalue, which is then assigneda returnperiod of 100years.
Joint exceedanceextremesare detemined in a similar way, for exampleby 'countingback' throughthe
highestwave heights,but only consideringthoserecordsabovea certainthresholdof water level. Thrs
intuitive approachis reliablefor retum periodsup to aboutone quarterof the simulationlength. Henceit is
necessaryto synthesiseat leastfour timesas much dataas the highestretum period of interest,and in practice
ten times would be a more rypical ratio.
2,6 Softwareimplementation
Introduction
This section briefly describesthe software packageand the functions of the five main programs for the joint
probability analysis: BVN, MIX, SIMBVN, SIMMIX and ANAZYSIS. For a more detailed description and
instructionson the use of the programs,a companionmanual(HR, 2000) is available. In normal usein
User inpul
The ptocedureis largely automated,but requiresquile substantialcomputertime, memory and storage
capacity. Although there is a large number of questionsput to the user during operationof the programs,
the majority can be answeredby acceptingthe default values. The user should be preparedto check for
mixed populationsor faulty recordsin the original data,and to re-run parts of the procedureif indicatedto
do so by the diagnosticinformation. It is sometimesnecessaryto take a view on the importanceofone,
two or three outliers,that is recordsof severeseastateswith apparentlyhigh dependencebetweenwave
height and water level in an otherwiseuncorrelateddata set. As in any joint probability analysismethod,
the eventualresultscan be quite sensitiveto the importancegiven to theserecords,and it may be worth
checkingthe validity of the individual recordsconcemed. More specific advice is given in the JOIN-SEA
user manual (HR, 2000).
3. APPLICATIONS
The test data sets
During validationand comparisonof resultswith altemativejoint probability methodsseveraltestdatasets
were assembled. Some were basedon mea-suredand hindcast wave and water level data - these had the
advantagesof not coming from pre-determinedstatistical distributions and of being representativeof data sets
to be usedin consultancystudies;somewere synthesisedusing specifiedprobabilitydistributionsloosely
basedon measureddata - these had the advantaeethat the 'true' extremesand ioint orobabilities were known.
t"- Wallingford t2
3.1 Exampleapplications to synthetic data
The simulateddatasets
Five setsof waveand water level datawith known statisticaldistributions,but broadlyrepresentativeof
actualconditionsat different points aroundthe coastof EnglandandWales,were synthesised.Details of
the derivationof thesedatasetsaregiven in Chapter5 of the LancasterUniversity part of this report.
(o
- EstimatedModel
--.- Simulation
Model
--- HR Estimates
f<o
C{
0123456
WL
Figure 4 Joint exceeilancecontours of H, (m) and water level (mOD) for a site with very low
dependence
tr)
.!4 0
=+;+J.
r- - - StiCIeiBglt%#l
t-tH Es males
4 5 6 7 .8 9
WL
Figure 5 Joint exceedancecontours ofH" (m) and water level (mCD) for a site with high
dependence
Additionalcomoarisons usinqdirecthindcasting
Onefurther analysismethodwas appliedto the simulateddatasetsaftercompletionof the Lancaster
University (LU) part of the report,to illustrate an analysismethodusedsatisfactorilyby HR Wallingford
severaltimes in the past. The remainderof this sectioneffectively forms a post-scriptto LU's Chapter9.
It involves direct hindcastingand exhapolationof overtopping,aiplied in the way that hasbeenusedin
previousHR consultancy studies.This is comparable with LU's 'SVM appliedto Q.', referringto the
StructureVariable Method appliedto overtoppingrate,which suggestedthat the rnethodwasflawed due to
its greatsensitivity to threshold,but that the SVM appliedto log (eJ wasreasonable.
Table I Direct hindcasting and extrapolation of overtopping rate, applied to data set Siml
!Hnw"rr*sro.a l6 SR53?l).Vl'sX)
data came from a numerical hindcasting model, but the sequentialwind and water level data are from actual
measurements.) Details of the data setsare given in Table 5.1 of the Lancaster University part of the report.
The structuralresponsefunction
For the purposesof this study the samestructuralresponsefunction was used as in the Lancaster
University comparisons, namely sea wall overtopping. Idealised seawalls were devised and the
overtopping dischargewas calculatedusing the formulae of HR (1980) at eachof the sites. Further details
of the formulae and the seawall parameterscan be found in Table 9.1 of the LancasterUniversity part of
this report. However, it is not so much the valuesof overtoppingcalculatedthat are of interest,more the
comparisonbetweenthe different methods. Tables 6-11 give overloppingdischargeratesusin-gthe
different analysismethodsdescribedbelow. All dischargesin the tablesbelow are given in m'7s/m,but
pleasenote that the idealisedseawalls usedin the testsdo not correspondto the actual seadefencesat any
of the five sites.
JOI{PROB analvsis
The JOINPROB combinationscolumn in Tables 6-l l refers to the existing HR analysismethod whereby
different combinationsof wave height and water level, with the samejoint exceedanceretum period, are
extractedfrom a j oint exceedanceprobability contour plot. Thesecombinationsare then applied to the
structuralresponsefunction and the 'wofst case' is found. The wave steepnessis calculatedfor the top few
percentof wave conditions and is assumedto have a constantvalue, which is then applied to the extreme
conditions to calculatethe wave oeriod.
JOIN-SEA analysis
The new joint probability methodswere applied in threedifferent ways in order to test different aspectsof
the procedure. Two involved the joint exceedancecombinationsapproachfor comparisonwith eadier
methodsand one involved direct simulation of the struchrralresponsefunction. The mean wave period
(T) for the JOIN-SEA analysiswas calculatedin three ways:
Empirical extremes
Column 6 in eachof Tables 6-11 gives e.,ttremevaluesderived directly from the original data. Each
record, in terms of significant wave height, meanwave period and water level, was convertedto an
equivalentrate of overtopping. The I year return period extremevalue was determinedfor eachlocation
using the 'count-back' method describedin Section2.5 (so, for example,in a 28 year data set, the required
value would be the twenty-eighthlargestovertoppingvalue). This is perhapsthe most accurateexlreme
E***u"*.0
value estimatein eachtable. The 10 year, and in somecasesthe 20 year retum period values,were also
estirated from the data, but theseare subjectto greatuncertainty.
!"**u,n*'. 19
Table 11 Comparison betweenovertopping rates (m'/Vm) for different analysismethods at North
Wales
!"**",r"o.0 20
deep water conditions at the structure. To illustrate the differences an example study was carried out for a
hypotheticalsite on the Somersetcoast. This locationwas chosenfor a numberof reasons:it has a high tidal
range; it is exposed to a mixhrre of locally generatedwaves and waves arriving from the Atlantic; wave
refraction effects cause a significant increasein T* as the waves prcpagatefrom offshore to nearshore;there is
increasing dependencebetween wave heights and water levels as wave height increases;and 28 years of
simultaneousmeasuredwater level data and hindcast wave data were available.
- offshore wave data representativeof conditions severalkilometres away from the coast in
about 20m water denth:
- MIX-SV analysisusing two fitted BVN's - ii) working directly in terms of the structural
responsevariable, and now using a variable wave steepnessderived from the orjginal data;
a plain seawall with a crest level 2,8m aboveMIIWS and deepwater at the toe of the
wall:
a plain seawall with a crestlevel 2.8m aboveMHWS and a toe elevation2.7m below
MHWS.
!"**n,"ro.o 21 SR53?0,U05/00
The structural responsevariables
Two structural responsevariables were tested:
- overtopping rate using the SWALLOW equationsin HR (1980) for a wall slope of l:4;
- armour size using equations(5.44) to (5.46) in CIRIA/CUR (1991) for a wall slope of l:2.
Table 12 100 year overtopping rates (m3/s/m) for Somerset (no limit on wave steepness)
E* Whllingford 5R5-r7ov0t/(Xr
Lower llmit on wave steeDness
The importance of wave period was dramatically demonstrated,in the form of much increasedovertopping
rates for the struchlre variable calculations using nearshoredata where wave steepnesswas allowed to vary
over a wide range about its mean value, basedon the variability seenin the original data. To some extent this
is a genuine effect, reflecting the existenceof swell waves at the site, but it is over-emphasisedin the results
seenin Table 12. However, in practice this would be corrected by setting a lower limit on wave steepnessin
the synthesiseddata as seenin the much improved results in Table 14. The variability of the results illustrates
the care that should be taken in selecting input data and methods of analysis and interpretation.
Tees Bay
The overtopping hazard was assessedat three different locationsaroundthe TeesEstuary. Wave
conditions were calculatedat the three locationsusins numerical models. Waler level data from North
Shields was convefted to the sites of interestusing faitors basedon differencesin the tidal range. The
converslonofthe water level data to the TeesEstuaryraisedseveralimportant issuesregardingwind and
wave set-up.
Wind set-upis an increasein water level that occursover a large areadue to the direct effect of wind stress
on the water surface. Wave set-upis a much more localisedeffect that causesincreasedwater levels in the
surf zone as waves break. It was thought that wind set-upwould havebeen presentin the measuredwater
level data at North Shields,as it is a widespreadphenomenon. However, the extent to which it should be
included in the converted water level at the TeesEstuary was unclear. Wave set-upwas thought to be
excludedfrom the measurements,sincetide gaugesare generallyplacedwith a view to excluding wave
set-up.
As a result of this assessmenttidal flow modelling was carried out to assessthe extent of wind set-upat
Tees Bay in relation to North Shields. An empirical formula was derived that allowed the water level data
at Tees Bay to include the effects of wind set-up;additionally an allowancefor wave set-upwas made. As
it transpired,the inclusion of theseprocesseshad little effect on the joint probability extremesas the wind
was always from the west or south-west(ie off the land) at the time of the maximum measuredwater
levels. However, there was a small positive corelation betweenlarge wave heightsand overall water
levels in the modified data,which was usedin the subsequentanalysis.
Lyme Bay
JOIN-SEA was also used at a location in Lyme Bay where flood risk was to be assesse.d
over an area
comprising of severaldifferent seadefencestructures. The JOIN-SEA method of exffapolatingthe joint
The nearestmeasuredwater levels to Lyme Bay (with sufficient length of data) are at Devonport, which is
a considerabledistanceaway. The tidal range changessignificantly along this stretchof coastline,causing
uncertainty in the interpolationof water level data to Lyme Bay. To overcomethis problem, extremesat
Devonport and Weymouth (to the eastof Lyme Bay) were derived, and also variationsin the spring range
considered,before calculating factors to be appliedto the measuredDevonport data. The converteddata
was then compafed to severalmonths of measuredwater level data at the site of interest,which showed
good agreement. However, this study highlighted the problemsthat can arisein 'moving' measure.dwaler
levels along a stretchof coastlinewhere the tidal range varies significantly, and reinforced the preference
for having measureddata directly at the site of interest.
An interestingpoint to make in passingis that it is not strictly necessaryto 'move' the time seriesdata,
either on waves or water levels, from the measurementor prediction point to the point of application. It
would be possibleto determinethe dependencebetweenlarge wavesand high water levels using data for a
nearby location and then, as in the intuitive approach(seeSection 1.4) to apply it at the new point of
interest. However, it would still be necessaryto determinethe marginal extremesat the new point.
Korea
A consultancyproject off the west coastof Korea provided a good oppomrnity to use the new software in
environmentalconditions that were considerablydifferent to UK conditions. The wave and water level
data were split into two seasons,summerand winter, ald consideredas two separatecases,since there is a
distinct difference in climate betweenthe two. The highestwater levels were expectedto occur in summer
and the highest wave conditions were expectedto occur in winter. The analysisfor eachof the two
different data setsrevealeda negativedependencebetweenwave heightsand water levels. This was
thought to be feasible,if unexpected. However, as a conservativemeasurefor the design calculationsthe
relationshipbetweenthe two variableswas taken to be independent.
This study highlighted the requirementfor a relatively long simultaneousdata set. As the original three or
four years of data was split into two populations,the lenglh ofthe time serieswas effectively halved, thus
creating a relatively short period of data on which to basethe fitted distributions. There was thereforea
Sreaterthan normal degreeof uncertaintyin the calculateddesignconditions,prompting the inclusion of a
higher than averagemargin of safety allowancein thejoint probability results.
Truro
Until now the only primary variablesconsideredhavebeen wave height and water level. However, there is
no reasonwhy JOIN-SEA cannotbe usedfor different variables. With MAFF funding, HR Wallingford is
currently involved in researchinto the prediction of extremewater levels in estuaries.In parts of many UK
estuariesthe combined effects of wave conditions,tidal surgesand river dischargesneed to be considered.
As part of the researchJOIN-SEA has beenused to assessthe dependencebetweenriver flows and wave
conditions, and river flows and tidal surges,as well as wave conditionsand tidal surges.
!*o*"r,*"'.0 25
data on flow rate or wind speedto an equivalentincreasein water level remainsas a separateproblem
probably best addressedusing hydraulic modelling.
Humber Estuarv
As part of a major strategystudy, HR Wallingford undertookwave hindcastingat sixteenpoints within the
Humber Estuary (basedon | 4 years of winds measuredat Spurn Point) and a j oint probability analysis
basedon water levels measuredwithin the Humber. A number of interestingpoints are noted.
Although there are many tide gaugeswithin the Humber, HR's client neededto undertakea spatialjoint
probability analysis(of tides and surges)similar to that describedin POL (1997) in order to achievethe
desiredresolution in the water level data.
Although river flow and wind set-upwill affect the water levels within the Humber, they were not
explicitly accountedfor in thejoint probability analysis,since they would alreadybe presentin the
measurementsand in the spatialanalysisbasedon those measurements.
The great majority of the recordsshowedno dependencebetweenwave heightsand water levels, but for
most of the prediction points a small handful of recordsshowedlarge wavesoccurring simultaneouslywith
high water levels. This ratheruncertainindication of a significant correlationin the high tail of the
distribution meantthat some manual smoothingof the derived conelation coefficients was necessaryto
maintain spatial consistencybetweenprediction points.
Anzrlysisof extreme water levels basedon over forty years of water level data in the Humber gave
sufficiently different extremevaluesthat refined valueswere determinedby HR's client for use in
re-scalingduring the simulation (otherwisebasedon only fourleen yearsof data).
4, DISCUSSION
4.1 Past practicefor use of resultsin design
Joint exceedancecombinations
In the past,joint probability analysishas usually resultedin a rangeof combinationsof wave heights and
water levels, eachwith the samejoint exceedancereturn period (and these 'design seastates'are a valuable
output from tle new method). Each combination is expectedto be equalledor exceededonce, on average,
in each retum period. In designing or assessinga structure,one would needto ensurethat it could
withstand every 'design seastate' for the return period being used. In other words, for eachstructural
responsevariable of interest,eachcombinationof extremewater level and wave condition should be
tested,to determinethe worst casefor that resDonsevadable. The retum Deriodof the value thus derived
for the responsevariable(eg overtoppingtwiligenerallybe lessthanth" r.tu- periodof thejoint
exceedanceextremes,as illustrated in Figure I and discussedin the LancasterUniversity part of this
report. This discrepamcybetweenthe two retum periods is usually addressedby inclusion of a small
margin of conservatismsomewhereelse in the assessment.A convenientmetiod sometimesusedin
previous studiesis to use marginal extremewave conditions with the standard3-hour event duration, as
opposedto the 12-hourduration which is more strictly correct for extremesconditional upon being at high
tide. The resulting wave conditions are then appropriatelyconservative.
1. Incorporation of the variability of wave period (or wave steepness)both in the analysis and in the
simulationof long samplesof data.
2. Assimilation into the analysis of improved marginal predictions basedon additional (non-simultaneous)
data on any of the vanables.
3. More direct analysis of struchrral responsefunction(s) such as run-up or overtopping, thereby addressing
the risk of failure or damagemore directly. Oncethe long-termsimulationhasbeencarriedout, several
!"**n,"u*o 27
different structural responsefunctions and/or different designscan be testedconsis0entlyand relatively
easily.
Al altemative risk-base.dapproach
Design and assessmentcould thus be done directly in terms of any structural responsevariable(s) of interest.
Using this approach,it would not be necessaryto detemine a designseastate,but insteadthe designwould be
basedmore directly on the overall risk of structural damageand/or unacceptableovertopping etc. A further
point is that if a number of altemative designsand a number of different failure modes are to be analysed,then
the risk-based approach may be quickest. This would imply a major change in design practice, but it is
consistentwith a gradualmove towardsmore risk-baseddesign.
t* VVallinglord
28
4.4 Recommendedoptionsfor joint probabilityanalysis
There are a range of techniques available: from an intelligent choice of a single water level to use with
establishedwave conditionsto a rigorousjoint probability assessment using long time seriesdata; from a
general offshore study which might be valid over a wide area to a site-specific prediction of overtopping
volumes.
Simplerioint probabilitvmethods
In mostcoastalengineeringprojectsit is worth giving somethoughtto joint probability,evenif it doesnot
justify a fbrmal study. At the simplestlevel, an engineermight decideto test the designwave conditionsat
Mean High Water Springs (if waves and water levels are almost uncorrelated) or at Highest Astronomical
Tide (ifthey are well correlated).Altematively (or additionally)the independentand dependentcasesfor
waveszmdwater levels might be consideredto determinethe maximum rangeof uncertaintyin water level
for use with any given extreme wave condition. A better, but still empincal, approach would be to determine
the degreeof dependencebetweenwavesand water levelsand then to apply the intuitive methoddescribedin
the third paragraphof Section 1.4. The necessary'correlationfactor' can be estimatedfrom general
experienceor with referenceto earlier studiesin the samearea. (In view of the uncertainty,a conservative
value should be taken,as the methoditself is not inherentlyconservative).Armed with this 'factor', extrcme
combinationsof wavesand water levels (expressedin termsof their marginalrehrm periods)can easily be
determined. These approachescarry a wide margin of uncertainty in terms of joint return period, but they are
betterthan not taking any accountofjoint probability andcheaperthan a more rigorousstudy.
Summary
To summarise, consideration of the dependencebetween high waves and high water levels should never be
neglected, where both variables are important. However, the scope of eachjoint probability assessment
shouldbe decidedon its own merits,in termsof input dataavailable,the intendedend use,and the potential
benefitsto be derived. HR's presentpracticein consultancyshrdiesinvolvingjoint probability analysis
follows the recommendations above. Just over half of the studies are carried using the one of the methods
outlined in the paragraphheaded'Simplerjoint probability methods',with most of the remainderbeing
undertaken using JOIN-SEA. Onshore or offshore wave data, wave period ald/or direction, better
knowledge of marginal extremes,joint exceedanceextremes and/or struchrral responsevariables, and
allowance for clinate change, are included as and when appropriate.
Dependencemodelling
JOIN-SEA's statisticalmodelshave sufficientflexibility and robusmessto capturethe featuresofthejoint
distribution of seaconditions producing extreme sea statesfor a range of observational and simulated data
sets. As with all methods,experienceis requiredin handlingdata setswhich do not conform to expectations:
examples are an unusually wide distribution of wave steepnessor more commonly a very small number of
severeseastatesexhibiting high dependencewithin al otherwiseuncorrelateddata set. With care,it is
possibleto treat suchdata setslesssubjectivelyusing JOIN-SEA than with other methodstestedduring this
proJect.
!"**"u.oo.o 3l
Wave period modelling
The importance of wave period was demonstratedin both parts of this report. Where a standard wave
steepnesswas applied to obtain the wave period to use with extreme combinations of wave height and water
level, significantly different overtopping rates were predicted for a site-specific value of wave steepness
compared to a 'global' value. The use of a global value tended to under-predict the oveftopping rate, due to
its sensitivityto wave period. The facility to allow a distributionof wave steepness within JOIN-SEA
worked well, but somecare is neededto preventover-predictionof low steepness wavesand consequent
over-predictionof overtoppingrate.
. Intelligent estimate of dependence,leading to sensible water levels at which to test derived extreme
wave conditions:% day
' Rigorous site-specific derivation and extrapolation of force or overtopping, based on long time senes
data on wavesand water levels: 5 days
The times given above refer to analysis at the first point within a study area. Each subsequentanalysis point
within the same areaand samerepoft would add only about one third to the staff time for the first point.
Acquisitian of data, and wave generation and transfornwtion modelling are rct included in the times given.
Since about 1998,HR Wallingford has stoppedusing the earlier JOIMROB programs, and has switched
entirely to the new JOIN-SEA methods. Resultscontinue to be usually quoted in terms of joint excervdance
extremes,but direct simulationof responsessuchas overtoppingis sometimesused.
Application to othervadables
Most of this report deals with joint probability as it affects large waves and high water levels. However, it is
worth mentioning that, in principle, the methods have wider application, for example to waves and winds,
swell waves rmd stom waves, waves and currents, and water levels and river flows. However, it remains
necessaryto have good data on the marginal extremes,and sufficient data to estimatethe dependencebetween
the two variables. Also, the new progams are specificto wave heightsand waterlevels,with wave periodsas
a related third variable, so some re-programrning would be neededbefore use with other variables.
!"**"u,*",0 34
5. REFERENCES
CIRIA (1996). Beachmanagementmanual. CIRIA Report 153.
CIRWCUR (199I). Manual on the use of rock in coastal and shoreline engineering. CIRIA Special
Publication83.
HR Wallingford (1994). Valifution of joint probability methodsfor large wavesand.high water levels.
HR ReportSR 347.
Ministry of Agriculture, Fisheries and Food (1995). Joint probabilities - a workshop to further research.
MAFF Flood and CoastalDefenceNewsletteqJuly 1995.
lHa w"ttinsrora
AppendixI
betweenjoint exceedance
Comparisons eKremesfor differentanalysismethods
FigureAl: Ckistchurch
FigureA2: Dover
FigureA3: Dowsing
FigureA4: Cardiff (0-190"N)
Figure,{5: Cardiff ( 190-360'N)
FigureA6: North Wales
Notes:l) Dotsshoworiginaldata
2) Predictionpointsshowjoint exceedance with given
extremes
retum oeriods
!*^
Christchurch
a.1
v)
E
tn4
0)
E
C)
6
F
0
0.0 0.5 1.0 1.5 2.0
'lVater
Level (m ODN)
FigureA1 Christchurch
l!"***noo-
Dover
tr
->4
a
E
bo
'6
o
>2
=
/?
'lVater
Level (m oDN)
Figure A2 Dover
lHnw"nngora
Dowsing
rl
E
m^
a4
uo
0.)
o
(!
=
4 o
Water Level (m oDN)
Figure A3 Dowsing
!***","rr*
Cardiff (0-tOO aeg)
(4
a'1
bD 1.v
C)
T
0)
d
=
0.5
0.0
4
Water Levei
[*^*n,*o,"o
Cardiff (t0o-zzo aeg)
a'i
a
E
a'i
u0
a)
:T:
G)
>
(n n
' ' '6
=
4 6
'lVater
Level ( m o D N )
E*
North Wales
.1
a':
un=
q)
E
o
6
=
tz
Water Level (m cD)
tno*,'*o*
Part II LancasterUniversity Repoft
!"^*r,"0"'.0
JOINT PROBABILITY METHODS FOR
EXTREME STILL WATER LEVELS AND WAVES
In collaborationwith
HR Wallingford.
1997.
September
CONTRACT
Introduction I
1.1 Frameworkof the Study 10
L2 CurrentImplementation 13
1.3 Commenton the Current Implementation t4
1.4 Objectives t5
1.5 Outlineof the Report 16
I.6 GoodStatisticalPractice 17
2.3 Comparison
of Methods 24
3.3.1 Models. 35
3.3.2 Diagnostics 40
COI\iTEIVTS
3.3.3 Illustrativeexamples 4l
3.4 TechnicalCalculations 42
3.4.1 Independence 43
3.4.2 Bivariate extreme value dependence 43
Marginal Estimation 89
6.1 Still Water Level 89
6.1.1 ObservationalData 89
6.1.2 SimulatedData o<
6.2 Surge. 96
6.2.1 ObservationalData 96
6.2.2 SimulatedData 101
6 . 3 SignificantWaveHeight 101
6.3.1 ObservationalData 101
6.3.2 SimulatedData 120
6.4 Steepness 120
6.5 Wave Direction 124
CONTEAI"S 5
L1 References ?7s
A Background information 221.
A.1 Joint Distributions 22L
223
A.2.1 UnivariateNormalDistribution 223
A.2.2 MultivariateNormalDistribution
A.3 The Probability Integral tansform 226
A.4 Likelihoodsfor Dependence Models 226
A.4.1 BivariateNormalModel 227
A.4.2 BivariateNormalThresholdModel 227
COIf?EAITS
Notation
Structure parameters
Input variables
TYansformed variables
Diagnostics
Introduction
Since 1991, MAFF have funded a study of joint probability methods at HR Walling-
ford. HR have developedpractical methodsof estimatingthe degreeof correlationbe-
tween wavesand water levelsfor practising engineersto apply to the designof coastal
flood defences.Under a separateMAFF-funded project, the Proudman Oceanographic
Laboratory and LancasterUniversity have developedstatistically rigorousmethods for
estimating the joint probability of tides and surges,and more generally,techniquesfor
handling joint probabilitiesof extremesof any environmentalvariables. However,these
rigorousmethodscan only be appliedby specialiststatisticians.
In previous work by Michael Owen, whilst at HR, joint probability methods were
identified as being important in estimatingthe probability that coastalflooding occurs,
asthis is determinedby overtoppingdischargerates (givenby sea-levels and waves)rather
than by sea-levelsalone.
In August 1994 MAFF hosteda Workshopon joint probability chaired by Michael
Owen. As a result of the WorkshopHR and LancasterUniversity submitted a joint
researchproposalto MAFF entitled The use of joi,nt probabili,tgdata on wauesand,water
leuelsi,n coastaleng,ineering This proposeda 2 year study starting April
appli,cati,ons.
1995.
MAFF funded this project with the aim of bringing together the best aspectsof ex-
isting approaches,combiningstatistical rigour with engineeringpracticality, to provide
more powerful tools for routine use by practisingengineersand consultants.This report
describesthe work in the first year of the study focusingon the progressat Lancaster
University into producingrigorousyet applicablemethods.
1. the selection of a design (either the existing design or one basedon some preliminary
analysis),
2. the identification of all possible types of failure for that design - the modes offailure,
3. for each mode of failure, the identification of the combinations of sea condition
variables which cause failure,
o overflow when the water level exceedsthe level of the crest of the defence,
r overtopping - when the combined effect of waves and water levels results in waves
running up and breaking over the defence,
For any particular mode of failure we term the associated variable of interest the
structure variable. Generallv the structure variable is related to the sea conditions
o tide,
. Sulge,
A more useful notation is that, at time t, we }ave seaconditions,X;, and the structure
variableX, with
Yr = A(&) for all t.
HereX1 is multivariate(5-dimensional) , (Xr,r,... , X5,1),with X1,1denotingthe iide, X2,1
denotingthe surge,X3,sdenotingsignificantwaveheight,Xa,1denotingwaveperiod, and
X5,1denotiugwavedirection.
Throughout the study the methods that are developedapply to any mode of fail-
ure/structure function. However,to illustrate the methods in application we consider
only overtoppingas the modeof failure and take the structurevariableto be the overtop-
ping dischargerate. For this examplethe structure function is
for a sea-wallof height a and where g is the accelerationdue to gravity. This form
was suggestedby HydraulicsResearchStation (1980)basedon wave tank studiesfor a
simple type of sea-wall.Here the sea-wallis a slopingdesignand the constantsa1 and a2
in equation(1.1.1)dependon the characteristics of the sea-walldesign.From equation
(1.1.1)it is clearthat overtoppingwill occuronly if at leastoneof the variables
. significantwaveheight, or
r waveperiod
is sufficientlylarge.
Aside: note that Jor thi,ssturlg the structure function ignoresthe directi,onalityof
the uaues but waued,irectionts retai,nedin the sea conrl,iti,on uectorX2 to ai,d,the latter
stati,sticalmodellingof thejoi,nt distri,butionof the otherseacorul,itr,on
uari,ables.Srmi,larly,
the decompositi,on of the sti,ll uater leuel into tide and,surgecoTnponents i,snot essential
other than to ar,dthe statr.sticalmod,elli,ng.
Now, the probability of somecritical dischargerate, u say,beingexceededat time f is
given bv
Pr{Y1> u}.
12 CHAPTER 1. INTRODUCTION
2. The use of the structure function beyond the data: this knowledge is ignored by
the structure variable method but, if known, can be fully exploited by the joint
probabilities method.
3. The generality of the analysis: the statistical extrapolations for the structure vari-
able are specific to the site, design, and mode of failure under consideration, whereas
fbr the joint probabilities method the design and site-specific features are excluded
from the extrapolation as they are features of the structure function. Thus the
statistical analysis for the joint probabilities method applies to any design analysis.
1.2. CURRENT IMPLEMENTA'IION 1t
Pr{SWL>r,Hs>A}
A more refined versionof this method follows the format abovewith certain steDs
modified as below:
1/. The probabilityF(z,g) -PI{SWL) n,Hs > g} is estimatedas aboveand from this
the probabilityof (SWL,11s)fallingin a rectangularcell [16,r1j x lAo,yt]is obtainedby
2 . Wave period is taken to be given exactly by the significantwave height and the
estimated constantwave steepnessvalue. Waveswith longer periods for a given
wave height can occur and thesewill typically producelarger valuesof the struc-
ture variable. Sometimesin applicationsof the refined method this restriction is
overcomeby incorporationof variationsin S.
a A feature of the more basic current approachis that the failure region used to
estimate the probability of failure, Pr{X e ,4,}, is a subset of the true failure
region ,4,. Thus, even if the joint distribution of the seacondition variableswas
1.4. OBJECTIVES li)
I'rom work prior to the commencement of this project it was unclearhow important any
ofthese deficiencies/approximationsin the existingmethodswere. They may canceleach
other out, or they may causeadditive errors. However,evenif the errorsare additive, the
approximationmay be suflicientlygood to obtain the optimal design.
The only evidenceprior to the project comesfrom inconsistentoutputs from current
implementationsof the structure variablemethod and the joint probability method. Dif-
ferent approacheshavebeenusedfrom time to time, dependenton budget,availability of
input data and intendedend useof the results. However,when differenceshavearisenit
has not beenclearwhetherthe disprecancylies in the estimatesprovidedby the structure
variable method, due to its poor extrapolationfeatures,or is due to the weaknesses in
the implementationof the joint probability method. This feature has resultedin some
doubts about the eiven estimates.
L.4 Objectives
This study proposesto examineeachof the two statistical methods as aids to design
assessment. The methodswill be developedgenerallyand applied,as an exampleapplica-
tion, to the estimationof the probability of overtoppingdischargeratesexceedingcritical
Ievels.In so doing, the study objectivesare:
The way that we shall approacha simple solution to the problem is to adopt the
rigorousframeworkof Colesand Tawn (1994)but insteadofusing their complexstatistical
modelsfor dependence in the extremejoint tail of the distribution we shall explorea much
simpler but more flexible approachbasedon useof the multivariate normal distribution
for the seacondition variablesafter suitabletransformation.This differs from the Coles
and Tawn approachwheremultivariateextremevaluedistributionalmodelsare usedafter
a different marsinal transform.
2-4 are applied to thesedata. Specifically,for both types of data set, in Chapter 6 each
of the separateseacondition variablesis analysedwith differentstatistical modelsbeing
considered;in Chapter 7 the dependence betweenextremesof the seaconditionvariables
is analysed;and in Chapter 8 the joint extremesof still water level and significantwaves
are estimated.
Part III comprisesChapter 9 where the distribution of the overtoppingdischarge
variable is derived from both joint probability and structure variable methods. A key
featurein this context is the assessment of the impact of extra knowledgeon the separate
sea condition variableson the estimation of the distribution of the structure variable.
This chapter comparesthe structure variable and joint probability methods relative to
eachother and against the methods usedby coastalengineers.For the simulateddata
the joint distribution of the seacondition variablesis known (givenin Chapter 5) so the
true distribution of the structure variable (e.g. the overtoppingdischargerate) can be
inferred. Thus the true performanceof the methodscan be assessed for the simulated
data. In Chapter 10 someconclusionsare presented.Throughout,all technicalstatistical
details have been kept to a minimum, with details when necessarygiven in a technica,l
aooendix.
Incorporatlonof scientificknowledge
r Assessmentof sensitivity
o Quantificationof uncertainty
from the frameworkwe have tried to follow the rationa.lebehind the framework. The
particular area where this study is weak, is on the quantificationof uncertainty. There
are three sourcesof uncertainty:
When assessingthe probability of failure for a singlesea condition variable, the events
which give failuresare easilycharacterisedas failuresof the structure causedby extreme
valuesofthe variable,i.e. failuresoccurwheneverthe variableexceedssomelevel. When
the sea condition variable is inherently multivariate evaluation/estimationof the prob-
ability of failure is more problematicas there is no generalordering which determines
which are extremevalues.However,giventhe context ofthe problem,a natural approach
is to basethe orderingon the associatedstructure variable,sincea failure is deemedto
have occurred,in a multivariate problem, if the structure variable,A(X), exceedssome
critical level.
An added complicationto the problem is that a designmay have to satisfy several
di{ferentdesigncriteria, i.e. withstand the extremesof severalstructure variables. For
simplicity we will focus on there being a singledesigncriterion. The work in the second
year of the project will extendthis frameworkto more generalcases.
Thereforefor a structurefunction A, wherefailuresoccurwhenA(X) ) z, the extreme
valuesof X are the set ,4r, given by
P r { X 1e A " ) : P I { A ( X 1 )> z }
: pr{y, > z},
where Y1- A(Xr) is the structure variable. The left and right hand sidesof the above
equation provide alternative formulationsfor the probability of failure. Starting from
eachof theseexpressionsin turn providesthe basisfor the derivationof the two statistical
27
CHAPTER2, OUTLINE STATISTICALMETHODS
3. Completeobservationsare availableas the wave data are hindcast over the same
time periodsas the tide and surgedata are available.
The choiceof which of these approachesto use dependson how simple the true
distribution of the structurevariableis. Clearly what is neededin practiceis a good
fitting statistical model which can be accuratelyestimatedand extrapolateswell.
The method to chooseis the approachwhich most adequatelymeetsthesecriteria
in practice.
1. Estimation of the joint density ft of the sea condition variables. This requiresa
selectionof a statistical model for
o Simolicitv.
Extrapolation properties,
Flexibility to coverdifferentdesigns,
Simplicity
The SVM is much the simplerto useas the statisticalcomponentof the methodis routine
relative to the JPM. For the SVM the analysisand extrapolationis for one variableonly,
the structure variable,whereasfor the JPM analysisextrapolationis requiredfor eachof
the separateseaconditionvariablesand for the dependencebetweenthe separatevariables.
Extrapolation properties
When extrapolating the distribution of the structure variable the SVM makesno as-
sumption about the form of the structure function beyondthe observeddata. Therefore
information about more extremestructure variablevaluescan only be inferredfrom the
observeddistribution of the largeststructure variablevalues,i.e. a purely statistical ex-
trapolation. By comparisonthe JPM extrapolatesthe seacondition variablesand then
builds in knowledgeof the stmcture function beyondthe data when integratingthe joint
density. This omissionof the knowledgeof the structure function from the extrapolation
of the SVM can influenceto the extrapolationas shownby the followingexample.
If A1 and A2 are two structurefunctions,which a.resuchthat A1(x1) : A2(x1) for all
t : I,... , n then the associated
observationson the two structurevariablesare identical.
Applying the SVM to the two data setsgivesidenticalextrapolationseventhough ,A1and
A2 maj be quite different outsidethe observedrangeof the data. Thesedifferencesin
A1 and A2, if known, are incorporatedinto the JPM through the use of different failure
regionsover which the commonjoint densityof seaconditionsis integrated.
In conclusion,the SVM is much the simpler of the two methodsto apply but has
drawbacksconcerningthe requirementto have simultaneousdata on the sea condition
variables,and the necessityto assumethat the form of the structure function doesnot
changebeyond the data. The JPM is dificult to implement which has often led to
independenceor completedependencebeing assumed.Howeverif it is properly applied
the JPM is preferablesinceit separatesthe statisticalanalysisfrom the oceanographic and
engineeringcomponentof the assessment of the probability that the structure variableis
extremeand hencehas wider applicability.
Chapter 3
There is much literature on the subject but the most suitable reviewsare Davison
and Smith (1990),Srnith (19B9)and Thwn (1992)on univariateextremes,and Colesand
Tawn (1994)and Ledford and Tawn (1996)for multivariate extremes.
27
28 CHAP'|ER3. EXTHEMEUALUEMETHODS
+ e@- Alrl;'/e),
Pr{Y < y} : exp{-11
r ;r a location parameter,
r { a shapeparameter.
The level, 9p, exceededwith probability p, i.e. which satisfiesPr{Y > yo}
bv
'u.p
= tt+ o{[- log(t - d]-t - I]l€,
Letting ) : P(X > u), it followsthat the likelihoodfunction associatedwith the Thresh-
old Method is given by
where1: {1 < i 1n X; > rr}, N is the numberof elementsof 1, i.e. the number
of exceedances of the thresholdu. The parametersS and ) are estimatedby maximum
Iikelihood,whichinvolvesmaximizingthe likelihood,L(^, d, with respectto the unknown
parameters.Sometimes,,\ is unconstrainedand treated as a parameterof the statistical
model (seebelow); in this case,the maximum likelihood estimateof ,l is -nf/n,,i.e. the
proportion of exceedances of the threshold. In other situations,) is explicitly linked to
the parametersof the exceedance distribution(cf. Section4.1.2).
So far no mention has been made of how to chooseG
6, and, in fact, no unique
specificationfor G4 can be given,unlessthe true distribution is known.
However,basedon essentiallythe sameasymptoticargumentsthat justified the use
of the GEV for the annualmaximum above,it can be arguedthat the natural family of
distributions to use in order to describethe excesslevelsof observationsabove a high
thresholdis the generalisedParetodistribution (Pickands,1975).That is, if the threshold
z is sufEcientlyhigh, the conditionaldistribution of a randomvariableX given X > z,
F(z) - F(z)
P(X <rlX >u) : T)M, (3.1.2)
I-F(u)'
30 CHAPTER3. EXTREMEUALUEMETHODS
with F being the true distribution of X, can be well approximatedby the generalised
Paretodistribution,GPD(o,{), with distribution function
. o (o > 0) is a scaleparameter
r { is a shapeparameter.
A specialcaseof this distributionis the GPD(a,0), i.e. when{ : 0: this is the exponential
distribution with distribution function
+r- u)lollltq
Pr{X-u<tlX >t}: t- 11+{(d
[1+
-l
lr + fulo.l+I/E (3.1.4)
o+(iL-u)t
E(X-ulx>u): (J.r.J.l
1-f
Equation (3.1.5) showsthat the mean excessof X over d is a linear function of z. This
result suggeststhe construction of a graph in which the empirical mean excessesof are
plotted against fr, with a taken to be the smallest value of d over which the mean residual
lile plot exhibits approximately a linear behaviour.
G ( r 1 ; ;a
) g a i n st t/ ( n + t ) f o r e = 1 , . . . , n ;
o doesnot differ substantiallyfrom studieson still waterlevel (Dixon and Tawn, 1994,
iee5).
There are additional benefits of this declustering scheme as it also largely removes the
still water level (or surge) variable non-stationarity due to tide, and it enables data sites
which only possesshigh water level data to be analysed.
ILLUSTHATIVE DEPENDEATCEEXAMPLES
Y: min(Xr,Xr),
Y - ma-r(Xr,X:),
these are idealised casesto illustrate extreme design conditions. They are given to aid the
illustration of the methods in a problem where we can explicitly evaluate the probability
of failure, and do not necessarilyrepresent realistic structure va,riables.For example, the
secondstructure function correspondsto failure from two unrelated failure types, in which
an extreme of one variable and/or the other leads to failure irrespective of the associated
value of the other variable.
We also consider three forms of dependencebetween (Xt, &)
o independence,
r complete negativedependence.
Again these are idealisedand are presentedwith illustration, rather than practice, in
mind.
Throughout we take the structure to fail when Y > u, for large z, and to simplify
notationlet Pr{X1 > z} - Pr{& > u} : p.
1 completepositivedependence
under
Pr{& > ulXl > u} - Pr{X, > u} independence
under
0 under
completenegativedependence.
(3.2.2)
Combiningthe expressions
in equations(3.2.1)and (3.2.2)givesthe probabilityof desrgn
failure to be
tp under completepositivedependence
Pr{)' > u} : \ p ' under independence (3.2.3)
Io under completenegativedependence.
seeSection3.4 for working. Similarly,for the designto havea 100year return period, u
must be chosento give marginal return periods (in years)for X1 and X2 of
Aside: note that Jor complete negati,uedependenceang leuel is adequatefor the marginal
return leuel as d,esignfai,Iure i,s impossi,ble.
under completepositivedependence
Pr{Y > z} :
{;r - p) under
under
independence
completenegativedependence.
(3.2.5)
To illustrate this, considera, to be taken as the 10 year return level for eachmarginal
variablethe return period (in years)of designfailure is
10 under completepositivedependence
5 under independence
5 under completenegativedependence,
seeSection3.4 for working. Similarly,for the designto havea 100year return period z
must be chosento give marginal return periods(in years)for X1 and X2 of
3.3 MultivariateMethods
3.3.1 Models
This sectiongivesa simpleintroductionto dependence modellingin multiva.riateextremes.
We focuson bivariateextremaldependence only and restrict attentionto Pr{X2 > ulXr >
u) where (Xr, Xr) are identicallydistributed randomvariables.In Section3.2 we consid-
eredthree specialcasesof dependence betweentwo variables(Xr, &), and found that for
la.rgez
( t under completepositivedependence
P r { X 2 > z l X r> u } = j p r t x r > u } u n d e r independence
[ 0 under completenegativedependence.
Clearly many diflerent statistical models for intermediate degreesof dependencebetween
(Xr, &) exist, however for joint probability methods it is the dependencebetween the
extreme values of X1 and X2 that a,reof interest, and the form of this dependenceis well
identified by considering how Pr{X2 > ulxl > z} behavesfor large z, i.e. how does the
knowledge that X1 is big influence the probability that X2 is big?
Here we will give this conditional probability for two statistical dependencemodels
that are often used in ioint nrobabilitv studies.
36 CHAPTER3. EXTREMEUALUEMETHODS
o a: 1 corresponding
to independence,
. o -+ 0 correspondingto completedependence,
o negativedependence
beingimpossible.
Pr{x1
>u'x2>u} <u'x2<u}
: l;"ii; ;1, i;.4:= l;""'t"'
- (2 - 2") pr{x1 > u} + z'-1(2" - t) pr{X1 > u}2, (J.3.2)
Pr{xr < u}2' - (t - P.{x, > u})"" * r -2"pt{X1 > u} +2"-1(2" - 1)Pr{X1 > z}2.
Hencethe conditionalprobability is
P.{& > zlXl > z} x (2-2") +2a-1(2a- t)Pr{& > z} for Iargez.
of this conditional probability is zero and the secondterm reducesto Pr{X, > u}. For
completedependence(i.e. when o -+ 0) the conditionalprobability is 1.
FinaJIy,in Figure 3.1 we give the joint density contoursfor the bivariate logistic de-
pendencestructure whenthe marginaldistributionsfollow standardnormal distributrons.
The figure showshow dependence, particularly in the joint tail region,growsa"str is de-
creased.When a : 1 the variablesare independentand herethe contoursare circles. For
a < 1 the contoursare elliptical in form but with a distinct pointednesson the diagonal
in the joint upper tail.
for large z, where C, is a constant and -1 < p < I determines the degreeof dependence
with
. p :0 correspondingto independence,
. It follows that
Figure 3.1: Joint density contoursfor the bivariate logistic dependencestructure when
the marginaldistributionsare standardnormal,i.e. Z;: O-1(exp(-l/Xa)) for X; a unit
Frdchetvariableand i : 1, 2. The dependence parametera : 1,0.9,0.8,0.7,0.5,0.3.
alpha=1
-3 -2
alpha=0.8 alpha=0,7
-3 .2
alpha=0.5 alpha=0.3
-3 -2 -1
3.3. MUI.JTIUARIATE
METHODS 39
Figure 3.2: Joint density contoursfor the bivariate normal dependencestructure when
the marginaldistributionsare standardnormal,i.e. Z - O-1(exp(-l/Xr)) for X; a unit
Fr6chetvariableand i : 1,2. The dependence parameterp: -0.5, -0.2, 0.0,0.2,0.5,0.8.
rho=-o.5 rho=-o-2
rho=o.0 rho=0.2
rho=o.5
40 CHAPTER3. EXTREME VALUEMETHODS
3.3.2 Diagnostics
In applicationsit turns out to be important to distinguishbetweenthree forms of depen-
dencestructure illustrated by
3. independence.
for large z and all z > 1. The diagnostictest is to plot ?(z), estimatedby replacing
the probabilitiesby the associatedempirical proportions,againstlogz for a range of .a.
If Pr{X2 > ulXr > u} -+ c > 0 then 7(z) is approximatelyzero for aIl z, whereas
for Pr{X2 > ulXr > z} -r 0, we have fhat T (z) is linearly increasingwith log z, with
the gradient related to the rate at which P.{& > zlXl > u} -+ 0. Specifically,the
3.3- MULTIVARIATE METHODS 4I
faster the convergence of the conditional probability to zero the steeperthe gradient,
with independencecorrespondingto a gradientof one.
To illustrate this theoreticallyfirst considerthe bivariate extremevalue distribution
with logistic dependence structure. For largez this distribution gives
?(z)r-rog : -rog1,to-'111t+o)}
-fi^"", (3.3.5)
{ ,,*e*+#}
so is linearlyincreasing in logz, with gradient(t - p)lQ + p) for all p < 1. When p - 0,
i.e. independence, then expression (3.3.5)givesthe gradientto be one.
In Section7.1.2we usethis form of diagnosticstatistic plot prior to dependence anal-
ysis to identify the line of attack that can be taken.
Pr{Y> u} = 2p-Cnp2l(t+o)
x 2p for large u.
3.4.L Independence
First consider]' : min(Xr, &). Then
Pr{Y < z} : Pr{Xr < u} +Pr{X, < z} -Pr{X1 < u,X2 < u},
( 1- p a ) " x ' l - a p + p 2 a ( a - 1 ) 1 2 f o r s m a l l p .
and
P r { a n n u am
l a x i m u mY < u ) : l l - p o 1 ' t " " " 1 " ": ( t - p o ) " x L - 2 p n ,
which implies that the return period (in years)of the structure variable exceedingz is
(2pn)-t. For the exampleof po :0.1 the return periodis 5 years.
Pr{Y < ?r}: Pr{Xr < z} +Pr{X2 < u} -Pr{X1 < u,X2 < u},
and
Pr{ annualmaximum Y < u} : - 'po1tt""'- (l - ,o12"/",'1"-
lZlt
44 CHAPTER3. EXTHEMEUALUEMETHODS
and
which implies that the return period (in years)of the structure variableexceedingz is
(z"pA)-t. The exampleof pA : 0.1 givesa return periodof 5.7yearsfor the design.
Chapter 4
r SignificantWaveHeight
o Steepness
r Wave Direction
^ = r - w(u):."o
' t {- (' I ")'} .
\ b t)
Three motivationsfor this approach/fittingmethod are:
wave heights over some threshold level z. Consequently,if all significant wave
heightsfollow a Weibull distribution, then the distribution of theseobservedvalues
is a truncated Weibull distribution.
Quantiles r - r l : . 0 2 1 - 1 1 1 0 3r - L 1 r c 4 1 - r l t 0 5
Bias 0.001 -0.0107 -0.06 -0.1451
GPD
Mean SquaredError 0.0012 0.0074 0.0448 0.1535
Bias -0.0023 -0.0033 -0.0041 -0.0045
Weibull
Mean SquaredError 0.0011 0.0048 0.0118 0.0221
4.1.3 Steepness
We model the joint distribution of (1{s,S) insteadof (Hs,Ts) for the wavecharacteristics.
Wavesteepness, S, and Tz arc rclatedby the followingequation
q- 2nHg
--_-=- (4.1.3)
st2
whereg denotesthe gravitationalconstant,so a statisticalmodel for (Ils, S) determines
alsothe distributionof Q.
There are a number of reasonswhich support an analysisbasedon the variable S
insteadof ?2.
o It is commonpracticein oceanographic studiesto estimateextremesof waveperiod
by fixing a typical value of steepnessand employingthe distribution of ffs and
equation(4.1.3)to derive?2 (Alcock,1984). The average,3", of the observed
steepness of the 1% largestwaveheightsis calculatedand the valueof Q associated
to an estimatedextremelevel of I{* is obtainedfrom
rr1_ - (a't 4\
r For most of the sites analysedin this work the relationshipbetweenHs and Tz
tends to assumea quadraticform, whereasthe relationshipbetween1{s and ,5 is
more linear, and hencestatistically modelledmore easily.
r swell waves alone are rarely believed to pose a significant threat and, furthermore,
they are deemed to be independent of the other sea condition variables.
/ x 2 o s . / Z n -) u \ - r l (
Pr{Ts< o}: r - ^ (t **t"t#)', rorr> (Zrufs3")t/2. (4.r.7)
PlTz<d : 1*}Fr":fu's)o,
Jo ds
P{rz
s"}::8","(#) (4.1.e)
54 CHAPTER4, STATISTICALMODELS USEDFOR JPM
Tz
Christchurch
oo CHAPTER 4. STATISTICAL MODELS USED FOR JPM
There are essentially two physical mechanismswhich generate the sea condition variables
1. astronomical
2. meteorological.
Variables generated by the different physical mechanisms are independent unless they
interact with each other. Offshore, the water depth is sufficient for the waves to be
effectively independent of tides (not exactly independent becauseof the effect of currents),
but the surges typically depend on the tide. In shallower water, waves become dependent
on the tide, as the water depth influences the speed of the waves and through depth-
dependent frictional effects, causesthem to break.
That the tides and surgesare dependent processesin shallow water areasis well known.
Prandle and Wolf (1978) discuss the nature of observed interaction, which subsequent
numerical model results show to be remarkably well captured by the known dynamical
processesof depth basedfriction effects (Flather, 1987). However, if only high water levels
are considered then, to a reasonableapproximation, the associatedtide and surge levels
are independent.
Since surge and waves (significant wave height, wave period and wave direction) are
both influenced by the wind climate, it should be expected that there is a relationship
between these variables. The form of dependenceat a site dependson the coastal location
of the site, its associated fetch length and direction. For some sites positive dependence
between surge and significant wave height is expected. At other sites negative dependence
(or independence) between these variables is expected.
DEPENDENCEMODELS
o IIs and S,
t . strongerdependence
betweenthe pairs (SWL,Hs) and (ffs,,9) than betweenthe
pair (^9I7.L,
S);
2. strongerdependence
betweenthe pairs (SWL.,Hs) and (IIs, d) than betweenthe
pair (SWL,0).
o (SWL,Hs)
o (fIs, S).
z)
f swt,ur(n,a)f ur,s(Y,
fo"(a)
wherethe subscriptsindicate the marginaland joint distribution variables.Equivalently,
the joint density can be rewritten as
x-: o-1(rx(x)),
The second and third models are both extensions of the bivariate normal model.
'- t pl
"-\o/ ti
This statisticalmodelis fitted usingthe likelihoodgivenin AppendixA.4.1. The maxi-
mum likelihood estimatorof p, the singleparameterin this statisticaldependence
model,
is approximatelythe samplecorrelationbetween(SI4zI-, Hi) pairs.
This model is only completelyexplicit in this joint extremeregion. The known marginal
distributions and information derivedfrom this joint extremeregionimposestructure on
the positionsof observationsin regionswhereone or other variableis large,and also the
region whereneither is large
For a given threshold this statistical model is fitted using the threshotd likelihood
given in Appendix A.4.2. The maximumlikelihoodestimatorof p,,, the singleparameter
in this statisticaldependence model,is essentially the correlationbetween(SWL-,Htr)
pairs which are simultaneouslyabove the threshold z in each variable. However,the
estimation of this parametertakes into account the number of observationsin each of
the four regionsdeterminedby the thresholds,and for the regions(,9W.L.> u, Hfi > u),
(SW L- > u, HI < u,) and (SI4z.L.< u, HI > u,) the sizesof the thresholdexceedance
valuesare accountedfor.
We can treat p,, as a function of the common marginal threshold, a. For u less
than the minimum data value this estimateis approximatelythe estimateof p for the
bivariate normal model, i.e. the correlationcoefficient,whereasfor large z this measures
the dependencebetweenthe extreme pairs. If in fact the data were from a bivariate
normal distribution then p,, would be independentof a.
A thresholdhas to be chosenfor this statisticalmodel. What is requiredis that u is
high enoughso that the degreeof correlationdoesnot changeabovethat threshold. A
suitablethresholdcan be identifiedfrom the estimatedp.,function, by taking the smallest
z abovewhich the function is constant.
from the (SW L, H s) data only, we cannot identify which of the different mechanisms
produced a particular data pair.
4.2, DEPENDENCEMODELS
1. regionally generated waves and still water levels are dependent, but
A consequenceof such a structure is that the dependencebetween still water level and
significant wave height is a mixture of two different dependenceforms. If one component
of the mixture describesthe dependencebetween low values and the other between high
values then the correlation function pu will change with z to give the correlation of the
second form for the largest events.
The statistical model here is more complex than for the above cases,being given by
Appendix A.4.3. For each form of dependencea bivariate normal distribution is used to
describe the variations. The overall statistical dependencemodel has seven parameters,
which can be fitted by maximum likelihood:
r one parameter pM: determining the proportion ofthe data of each dependencetype
(pu : 0 or pt,r : 1 suggests that there is only one type of dependence and a
bivariate normal model would fit well);
. two parameters p1 and p2: which measure the correlation associatedwith each type
of dependenceform; and
o four parameters p, = (gzt, Fzz),ozt, o22: which explain the differencesin mean level
and variation between events generatedby the two mechanismsafter transformation
to the standard normal marqinal scale.
This statistical model has two maior benefits over the bivariate normal threshold model:
1. the model is fitted with all the data having equal weight contrasting with the thresh-
old model where the most extreme values are given most weight;
On the negative side the main disadvantageis that the statistical dependencemodel
is more complexto fit and parameterestimatesmore dificult to interpret. Critical to
its application is testing whether two, or one, bivariate normal dependenceforms exist.
Unlessthere is significantevidencefor two forms, only one (i.e. the standard bivariate
normalmodel)shouldbe used.
62 CHAPTER 4. STATISTICAL MODELS USED FOR JPM
s - l ( H ; - h ) - l r ( o + b ( h - u ) , o z ) ,f o rh > u (l) L\
a:0
b-- p
tt2 : I-p2.
Thus the statistical model we propose is slightly more general than a bivariate normal
dependencestructure as our model does not require a zero intercept to the linear model
or a link between the residual variation and the gradient of the linear model.
Finally we have the problem of threshold selection again. The choice of threshold is
less important than for other aspects of the modelling of the joint distribution as results
are reasonably insensitive to threshold choice here. To aid threshold choice we produce
plots of
E(S-lIlj > z) and Var(S-lI1} > z)
EVALUATION OF THE PROBABILITY OF FAILURE USING?HE JPM 63
where / is the estimated joint density of X and A., is the failure region. This may seem
simple, but the joint density of the sea condition variables is (4/5)-dimensional and the
set over which the integration is required (the failure region) is a complex set, so this
integration requires some care.
No analytical expressionis obtainable in these problems so numerical methods of some
form need to be used. There are two approaches:
e numerical integration;
o simulation.
There are many numerical methods for integration, but in practice these are not ideal
when the dimension of the integral is relatively high and the form of the failure region is
non-linear. Therefbre the simulation approach is proposed, which involves a three stage
procedure:
1. simulation of a large number, n, say, of pseudo observations from the fitted joint
distribution of X;
Step 1 aboveis generaland independentof the form of the structure function. Thus if
a wide rangeof designsare to be consideredthis step only needsto be undertakenonce.
The output in the form of a seriesof eventsso is of interestfor a rangeof designstudies.
By itself this is a valuableoutput from the analysisfor coastalengineers.
Increasingz, improvesthe precisionof the evaluationof the estimatedprobability of
failure. In practicen, shouldcorrespond to 10-100times the length of the return pe-
riod of interest,so the largestsimulationsamplesizesare requiredfor the most extreme
extrapolations. This can be computationallyintensivealthough straight forward to im-
plement. More generally,methodsexist for improvingthe precisionwithout increasingn".
Thesemethodsuse importancesamplingin the simulationstage,which involvessimulat-
ing a dis-proportionatenumber of extremeseacondition observationsat the simulation
stageand appropriatelydown-weightingthe proportionof failuresin the estimationstage.
More specifically,pseudoobservationsare generatedfrom a suitably chosenjoint distri-
bution with density,9(x), giving a disproportionatenumber of extremeX values. The
importancesamplingfactor for a simulatedvalueX is
/x(x)
e(x)
i.e. the fitted joint density relative to the joint density used for the simulation of the
pseudoobservations.So Step 1 involvesgeneratingn, valuesX;, from g, with importance
samplingfactorsft(X6)/9(X;) for i: I,. .., n,. The estimatedprobabilityof failure,i.e.
Pr{X e ,4"}, is then givenby
1i i1x,e A,)ft(x;)/e(x,).
where
if X,€A,
IIY.cA.'_I'
.,.-,-,,,,_
I 0 if X,#,4,.
This method is less computationallyintensivethan the direct simulation method but
requiressomeexperiencein the selectionof an appropriatejoint densityfunction g.
In this report we haveusedthe importancesamplingtechniquesin Chapters7 and 8,
but for applicationsof the JPM to the estimationof the probability of failure in Chapter
EVALUA'TION OF THE PROBABILITY OF FAILURE USING THE JPM
67
Chapter 5
Study Data
1. Observationa.l
data consistingof measureddata and hindcastdata;
In this chapterwe describethe two differenttypes of data setsusedin the study, and for
the simulateddata give the statisticalmodel usedto generatethese.
69
70 CHAPTER5, STUDYDATA
Table 5.1: Information about the observationaldata sites: SWL - still water level,Waves
significantwaveheight, waveperiod, wavedirection.
r still water level (SI4/Z). (In addition for Sim3, the tide and surge constituents were
separately generated);
In the simulations wave period was obtained by generating Ils and the waue steepness.,
,5, and using the relationship
^1z: 1znl:.,rltl'
| -----;-
gJ
| ,
\ J
5.2. SIMULATEDDATA
Siml data
Marginal distributions
The marginal distribution of d is taken to be the sameas the equivalentempiricaldistri-
bution of the Cardiff data. For both SW L anclHs a mixture of the empiricaldistribution
and parametricmodelsareused.For ,9172belowthe 95%empiricalquantileof the Cardiff
SW L data, the empirical distribution is used;abovethis threshold,the distribution fol-
lows a generalisedParetodistribution(seeChapter3, equation(3.1.3))with parameters
o-0.362and{:-6.264.
Both ^9and f15 havedistributionswhich dependon d:
oWhen0'<d<110":
For f15: belowthe 95%empiricalquantileofthe Cardiff ffs data from this direction
sector,the empirical distribution is usedwhereasabovethis threshold,the distri-
bution follows a generalisedPareto distribution with parameterso : 0.1958and
€ : -0.102.
For ,9 the empiricaldistribution of the Cardifl data in this sectoris used.
o When110'<d<360':
For ff5: belowthe 95%empiricalquantileof the Cardiff Ils data from this direction
sector,the empirical distribution is used,whereasabovethis threshold,the distri
bution follows a generalisedPareto distribution with parameterso : 0.1417and
( : -0.0604.
For S, the empiricaldistribution of the Cardiff data in this sectoris used.
Dependence Structure
After transformationof eachmarginalvariable(Hs, SW L, S) to a standardnormal distri
bution, the form of the dependence
betweenthe three variablesis a trivariate normal dis-
tribution with zero-meanvector (seeAppendix A.2.2). Howeverthe variance-covariance
matrix, X, dependson wavedirection,with D givenby
1 0.3 0.62
0.3 1 0.3x0.62
0.62 0.3x 0.62 1 )
whend<1l0",and
1 0.3 0.742
0.3 | 0.3x 0.742
0.7420.3x 0.742 )
72 CHAPTER5. STUDYDATA
Sim2 data
Marginal distributions
The marginal distributions of d and S are taken to be the sameas the equivalentempir-
ical distributions of the Dover data. For both SWL and F1sa mixture of the empirical
distribution and the parametricmodel is used. Specifically,for SW L belowthe 95% em-
pirical quantileof the DoverSW L data,the empiricaldistribution is used,whereasabove
this threshold,the distribution followsa generalisedParetodistribution with parameters
o : 0 . 1 8 6 6 a n d€ : - 0 . 0 7 6 4 .
Significantwaveheight has a distribution which dependson the wavedirection:
For0'(d<110":
belowthe 95%empiricalquantileof the DoverIls data from this directionsector,the
empiricaldistribution is used,whereasabovethis threshold,the distribution follows
a generalisedParetodistributionwith parameters o :0.731 and { : -0.4215.
For110"<d<360':
below the 95% empiricalquantile of the Dover fls data from this direction sector,
the empirical distribution is used whereasabove this threshold, the distribution
followsa generalised Paretodistributionwith parameters o :0.46 and { - -0.112.
Dependence Structure
The direction,d, only influencesthe marginaldistributionsof the variables,but not their
dependencestructure which is assumedto be constantacrosssectom.'Ihe (Hs.,SW L, S)
variablesare taken to be dependent.After transformationof eachmarginal variable to
a standard normal distribution, the form of the dependencebetweenthe (11e,SWL, S)
variablesis a trivariate normal distribution with zero-meanvectorand variance-covariance
matrix, X, given by
1 -0.2 -0.1
F- -0.2 1 -0.2x-0.1
- 0 . 1 - 0 . 2x - 0 . 1 1
SimS data
Marginal distributions
The marginal distributions of d, ,9 and tides are taken to be the sameas the equivalent
empirical distributions of the North Wales data. For both Su,rge arrd /1s variablesa
mixture of the empirical distribution and the parametricmodel is used. Specifically,for
botrhSurge and IIe belowthe 95%empiricalquantileof the associatedNorth Walesdata,
the empiricaldistribution is used,whereasabovethesethresholdsthe distribution follows
a generalised Paretodistribution,with parameters o :0.L446 and { : 0.0803for ,9urge
and o: 0.6119and {: -0.1583for I/s.
Dependence Structure
The wavedirectionand tide are takento be independent ofthe (Hs,Surge,S) variables,
which are taken to be dependent. After transformationof eachmarginal variable to a
standardnormal distribution, the form of the dependence betweenthe three variablesis a
trivariate normal distribution with zero-meanvector and variance-covariance matrix, D.
given by
( t 0.16 o.o49T \
x:l 0.16 1 0.16xo.oaezl.
\ 0.04970.16x 0.0497 | l
This dependencestructure correspondsto positive dependencebetween (Hs,Surge) a,nd.
very weak positive dependencebetween (fls, S), and with (Surge, S) almost independent.
Sim4 data
Marginal distributions
The marginaldistributionsof I and S are takento be the sameas the equivalentempirical
distributions of the Christchurchdata. For both SWL and./{s valuesa mixture of the
empirical distribution and parametricmodelsis used. Specifically,for both SW L and
,I/s below the g5% empiricalquantile of the associatedChristchurchdata, the empirical
distribution is used,whereasabovethesethresholdsthe distribution followsa generalised
Paretodistribution,with parameters o : 0.126and {: -0.15 for SWL and o:0.6685
and { : 0.15for fls.
74 CHAPTER5. STUDYDATA
Dependence Structure
The wavedirection and ,9 variablesare taken to be independentof the (Hs,SWL) vari-
ables,which are taken to be dependent.After transformationof eachmarginal va.riable
to a unit Fr6chetdistribution,(i.e. with distributionfunctionPr{X < r} : exp(-I lr)
for r > 0) the form of the dependencebetween the two variables (X, Y), is taken to be a
bivariate extremevalue distribution (with logistic dependence
structure) grvenby
where0<a{ldeterminesthedegreeofdependence.Wetooka:0.8.Thisdependence
structure ccrresponds to a strong form of positive dependencebetween (Hs,SWL) wifh
particularly high levels of dependencebetween the most extreme values of each variable,
seeSectiou3.3.1.
Simb data
Marginal distributions
The marginal distributions of d and ,9 are taken to be the sameas the equivalentempir-
ical distributions of the Dowsingdata. For both ,SWtr and f15 valuesa mixture of the
empiricaldistribution and parametricmodelsis used. Specifically,for both SWL and Hg
belowthe 95% empiricalquantileof the associatedDowsingdata, the empiricaldistribu-
tion is used,whereasabovethesethresholdsthe distribution followsa generalisedPareto
distribution, with parameters o:0.2091 and{: -0.0711for SWL and o:0.8091 and
{ - 0.1121for I1s.
Dependence Structure
The wavedirection and S variablesare taken to be independentof the (Hs,SWL) var\-
ables,which are taken to be dependent.After transformationof eachmarginal variable
to a standardnormal variable,the joint distribution of thesevariablesis given by by the
mixture of bivariatenormaldistributionsdescribedin AppendixA.4.3. The parameters
of this dependencemodel are
p2: ) ,2 1- - o z z : 1 , p : 0 . 9 ,
(0.3,0.3o h:0, a1dpz
5.3. PRINCIPAL DIFFEREATCESBETWEEN THE SIMULATED DATA SETS 75
c the SWL and 1{5 tails determinedby the valueof the shapeparameter,{, usedto
simulatevaluesfrom the GPD distribution for the tail of the variable,
r the distorted tails for Sim4 are a consequenceof the way that the tails of /{s and
Sll/.L have been deliberatelylengthenedand shortenedrespectivelyfrom values
estimatedat Christchurchto illustrate how the SVM and JPM comparewhen the
two variableshavequite diflerent tail forms. The simulateddata do not reproduce
the hindcastdata well at this site.
. snrgeand 115
In addition, for both Cardiff and Siml data sets,wavedirectionis plotted againstF/s and
Featuresof interestare:
For Cardifr and SimL LargerTs valuesoccurin Siml than the observationaldata. The
reasonfor this is that the simulationmodel for dependence betweenf/s and ,9 has
not sufficientlyrestrictedsmall ,Svaluesto occur only with small I15 values.Other
than this the simulateddata reproducesthe key characteristicsof the Cardiff data
well, possiblywith the exceptionof not giving largeenoughTz v^ltes for directions
lessthan 100".The probableexplanationof this is our useof 110"ratherthan 190'
as a basisto separatedirectiontypes whensettingup the Siml data. Later analysis
of the Cardiff observationaldata takesthe split at 190'.
North Wales and Sim3 There is a good agreementwith the observationaldata. Note
the outlier in the observationalsurgedata.
Christchurch and Sim4 Here the key differenceis in the scaleof the 7z upper tail,
which is much longer (almosttwice as long) for Sim4 than the observationaldata.
This is due to the (fls,,9) valuesbeingtaken to be independent,thus allowingsmall
,9 valuesto occur with largeIls values(to producevery largeT2) in the simulation
model. The ft valuesare unrealisticallylarge for Sim4. Sim4 was the simulated
data set wherethe ,[1stail wasdeliberatelylengthened.However,this doesnot seem
to havemadea notablechangefrom the observationaldata set (as seenfrom Figure
5.15). As a result of thesefeatures,Sim4 cannotbe considered to be a realistic
representationof seaconditionsat Christchurch.
COMPARISOAI OF OBSERVED AAID SIMUIATED DATA 77
For Dowsing and Sim5 The main difference between these sets is that the simulated
data have a few much larger values of f15 (almost twice as large) than appear in
the observational data. Correspondingly, these observationSproduce a few larger
ft values for Sim5 than the Dowsine observational data.
78 CHAPTER5, STUDYDATA
2468 234567
SWL swL
:
g.
t
l,
t!. i3;
o2468 2 4 6 I 10
1z
5.4. COMPARISONOF OBSERVEDANDSIMULATEDDATA 79
=J
2468 2 4 6 I 10
qll:---*
100 150
01234 01234
swL swL
TN
2468 2 4 6 I t0
B2 CHAPTERS. STUDYDATA
a., ;N
2468 2 4 6 I 10
fz
-1 0t
Surge
5,4. COMPARISONOF OBSERVEDAND SIMLILATEDDATA 83
Figure 5.11: North Wales observationaldata and SimS data: IIs vs SWL
Observationaldata Simulateddata
11 12 13 10 11 12
swL SWL
tz
CHAPTERs. STUDYDATA
I'igure 5.14: North Wales observationaldata and Sim3 data: F1svs Surge
Observationaldata Simulateddata
68
Iz
86 CHAPTERS. STUDYDATA
Fiqure5.17:Christchurch
observational
data and Sim4data: SWL vs T"
Observationaldata Simulateddata
=
o;
.^ =-
I 10 14
Surge
5.4. COMPARISONOF OBSERVEDANDSIML|LATEDDATA 87
45670
swL SWL
2 4 6 I 10 2 4 6 8 10 12
fz
BB CHAPTERs. STUDYDATA
Figure5.21:Dowsingobservationai
data and Sim5data: SWL vs ft
Observational
data Simulated
data
3r4 3
2 4 6 8 10 2 4 6 I 10 12
Marginal Estimation
In this chapter we apply the models proposed in Chapter 4 to each of the marginal
variables of the sea condition vector.
89
90 CHAP'IER 6. MARGINAL ESTIMATION
Cardiff
A
o
o
i: (\l
-c
F(u)
Christchurch
o
(DY
o- c;
-c
oO
F(u)
92 CHAPTER 6. MARGINAL ESTIMATION
Dowsing
c.i
rrJ
bF
!!o
=F
(! rc)
o)
o
c;
F(u)
Shoreham
-i
c
g'-
<d .-i
o;
o
-c
CfJ
F(u)
STILL WATER LEVEL
NorthWales
O-i
E
O.|l)
(D-i
o'i
F(u)
Dover
{
E c\l
c
o-Y
c)v
(s
F(u)
94 CHAPTER6, MARGINALESTIMATION
Figure 6.4: Probability and quantile plots for the GPD fiited to SWL data at Cardiff
and North Wales.Also shownin eachplot is the o: grhne.
eF.
-o c
I (d
o- a^
Y-
c) oF-
q)
_c
-c?
-ci o
a
d+
Ed
o
0.0 0.4 0.8 12.813.213.6
emp.prob. emp.quanr.
STILL WATERLEVEL 95
6.2 Surge
6.2.1 Observational Data
The ThresholdMethod wasappliedalsoto declusteredsurgedata. Figures6.5-6.7contain
sensitivityplots on the basisof whicha thresholdequalto the empiricalg5%surgequantile
was chosenfor all sites. Maximum likelihoodestimatesare shownin Table 6.3. With the
exceptionof Christchurch,estimatesof the shapeparametertend to be higher than the
va.luesobtained from still water level. This is consistentwith the Sim3 data results
above. Dixon and Tawn (1994,1995)noticed that for sites at which still water level is
dominated by the tidal variation, extremevalue modelsapplied to extremesurge data
indicate longertail distributionsthan the samemodelswhen fitted to extremestill water
levels. In contrast, they observean agreementin the resultsfor sites in which the surge
variation is dominant. Christchurchbelongsto this latter classof sites,and as seenfrom
Tables6.1 and 6.3, the shapeparametersare of the samesign in eachcase. Graphical
assessments of goodness-of-fitfor Cardiff and Christchurchare containedin Figure 6.8.
For still water level,somedepa,rturefrom linearity is observablein the Cardiff Q-Q ploi.
At the other sites,the GPD seemsto providea reasonablefit to the surgeexceedances of
u-
Table 6.3: Maximum likelihood estimatesof the parametersof the GPD fitted to Surge
level data at varioussites (standarderrorsin parentheses).
6.2. SURGE 97
Cardiff
a,l
6 c.'l
(E
o.
x,..i
(!'
E
Christchurch
.:
o
(I)
E
.i
^, -':
dY
o
,n
F(u)
CHAPTER6, MARGINALESTIMATION
Dowsing
Cr?
(trv
XF
F(u)
Shoreham
\t
o
(I) 'i
lY nr
8c;
o-
o-
F(u)
6.2. SURGE
NorthWales
(o
:Il, o
6
(Es ( \ 1
Eo
6-' c.l
6Y
F(u)
Dover
ga
Po
c
E-
q O "-l
O6
o
F(u)
100 CHAPTER6. MARGINAL ESTIMATION
Figure 6.8: Probability and quantileplots for the GPD fitted to Surgeleveldata at Cardiff
and Christchurch.Also shownin eachplot is the r: g line.
C
(g
it'
;-i
o
_c
a
0.0 0.4 0.8 0.4 0.8 1.2
emp.prob. emp.quant.
Christchurch
: P-PPLOT h ristch
urch:Q-Q PLOT
-l
95
d oql
o ol
o
_c
=
.1
"l
OJ
zv
-c
N
1 . Weibull;
2. TruncatedWeibull;
J. GPD,
the latter two being fitted through the thresholdmethods,but all fitted using maximum
likelihood. The first two modelscan be comparedby examiningtheir parameterestimates,
the latter two by comparingestimatestability to thresholdlevel. In eachcaseP-P and
Q Q plots are usedto assessgoodnessof fit.
First considerthe Weibull distribution fitted to all /1s values.Table 6.5 showsmax-
imum likelihood estimatesof the Weibull distribution parameters(o, b,c) for eachsite.
The goodnessof fit at each site is most easily assessed using the Q-Q plots shown in
Figures 6.9 6.11. The model fits well for the Dover data; for Cardiff and Christchurch
the fitted model has a slightly longer upper tail than the empirical upper tail, with the
t02 CHAPTER6, MARGINALESTIMATION
one be]'ond the rangeof the data, as functionsof thresholdunder eachtail model. For
both quantilesthe GPD seemsmore stable,giving empiricalsupport for the preference
of this tail model. Furthermore,with the exceptionof Dowsing (which was the site for
which we estimatedno upper endpoint),the GPD leadsto lowerquantile estimatesthan
the TruncatedWeibull, suggestingthat the latter is unduly conservative.
Christchurch 0.00827
(0.00057) 1194(0.0106) 0.972(0.0107)
Dowsing 0.0565
(0.00158) 563 (0.0149) 1.443(0-0126)
Shoreham 0.00598
(0.001) 144(0.01286) 0.e45(0.01124)
Dover 0.0271
(0.00141) 61 (0.016) 1.231(0.0104)
Table 6.5: Maximum likelihood estimatesof the parametersof the Weibull distribution
fitted to -FIsobservationsat varioussites (standarderrorsin parentheses).
104 CHAPTER 6. MARGIIfAL ESTIMATION
Table 6.6: Maximum likelihood estimatesof the parametersof the Truncated Weibull
distribution fitted to Hs exceedances
at varioussites (standarderrorsin parentheses).
Table 6.7: Maximum likelihood estimatesof the parametersof the GPD fitted to lls
exceedances at varioussites (standarderrorsin parentheses).
6-3. SIGNIFICANTWAVEHEIGHT 105
Figure 6.9: Probability and quantileplots for the Weibull distribution fitted to.F/s obser-
vations at Cardiff and Christchurch.Also shownin eachplot is the r : ? line.
q
-o C
fY
o
o a o
E 9!o
!d
Christchurch
: P-PPLOT :Q-Q PLOT
Christchurch
(o
-o c
(g
o +
0) d)
E c{
o
0.0 0.4 0.8 0246
emp.prob. emp.quant.
106 CHAPTER 6. MARGINAL ESTIMATION
Figure 6.10: Probability and quantile plots for the Weibull distribution fitted to 115
observationsat Dowsingand Shoreham.Also shownin eachplot is the r = g line.
^o c(6
E
a- E
o o
E
emp.proD. emp.quant.
(d+
o+ Y(")
pcn
o
0.0 0.4 0.8 012345
emp.proD. emp.quani.
6.3. SIGA/IFICANTWAVEHEIGHT
Figure 6.11: Probability and quantile plots for the Weibull distribution fitted to 115
observationsat North Walesand Dover. Also shownin eachplot is the z: z line.
-o *r
I (!
-q
q) O'!
E Q)
Dover:P-PPLOT O-OPLOT
(f)
o-
o .N
E
Figure 6.12: Probability and quantile plots for the TruncatedWeibull distribution fitted
to lls exceedancesat Cardiff and Christchurch.Also shownin eachplot is the r : grline.
_o
(t
CD
o- q(\
.i
c) Or
o
-c
emp.prob. emp.quanl.
Christchurch
: P-PPLOT : Q-QPLOT
Christchurch
_o c
(! rf)
J
u
o q)
-c -c
o
Figure 6.13: Probability and quantileplots for the TfuncatedWeibull distribution fitted
to FIs exceedancesat Dowsingand Shoreham.Also shownin eachplot is the r: g line.
Dowsing:P-P
PLOT Dowsing
:Q-Q PLOT
o)
.o
E r...
q.@
+
cr)
Shoreham:P-PPLOT Shoreham:Q-QPLOT
to
€o c
YOt d
+o Y.A
o o_1
-c o(r)
-c
qt
o |r)
cri
0.95 0.97 0.99 2.5 3.5 4.5
emp.prob. emp.quant.
110 MARGINAL ESTIMAI:ION
Figure 6.14: Probability and quantile plots for the TruncatedWeibull distribution fitted
to Ils exceedancesat North Walesand Dover. AIso shownin eachplot ir trhgs : E line.
. -.1' .q
o E
(s{
-l F. qu)
O
(l)
.ri
--
-c
tc,
u?
o, (\
-o
o-
o
emp.prob. emp.quant.
6.3. SIGNIFICANT WAVE HEIGHT 111
Figure 6.15: Probability and quantile plots for the GPD fitted to I{s exceedances
at
Cardiff and Christchurch.Also shownin eachplot is the r : rr line.
c
d
Er (!
oF
0)
(\l
oq
emp.prob. emp.quant.
Christchurch
: P-PPLOT : Q-QPLOT
Christchurch
llt rO
f
4) +
=
(r)
emp.prob. emp.quant.
t12 CHAPTER6, MARGINALESTIMATION
Figure 6.16: Probability and quantile plots for the GPD fitted to I{s exceedances
at
Dowsingand Shoreham.Also shownin eachplot is the z: E line.
Dowsing:P-PPLOT Dowsing
: Q-QPLOT
o
a-
o
emp.prob. emp.quant.
Shoreham
: P-PPLOT Shoreham: Q-Q PLOT
(r)
_a
o
f
q) ctd
-c C)
-c
rq
(v
0.0 0.4 0.8 2.5 3.5 4.5
emp.proo. emp.quant.
6-3. SIGNIFICANT WAVE HEIGHT 113
Figure 6.17: Probability and quantileplots for the GPD fitted to IIs exceedances
at North
Walesand Dover.Also shownin eachplot is the r: y line.
lt c
d
:l
6
u?
(v)
o o
q)
c
ro
C!
emp.proo. emp.quant.
Dover:P-PPLOT Dover:Q-QPLOT
6
_o o
o
+ {a;
o o
- v.
!t)
-i
ni
0.0 0.4 0.8 2.5 3.0 3.5 4.0
emp.prob. emp.quant.
114 CHAPTER 6. MARGINAL ESTIMATION
Figure 6.18: Cardiff: Estimatesof two extremequantilesunder the Weibull and the GPD
modelsversusthe empiricaldistribution function calculatedin u.
Cardiff Quantile=
99.9%
s
Weib u
- - - - - - -G P D
-:
'F (O
q(f)
(\I
F(u)
Cardiffguantile=
99.999%
e.l
C\I
c.i
OO
c\\
(d
f.n -:
Er
- WeiDU
GPD
\
F(u)
SIGI /IFICAAI? WqVE HEIGHT
= 99.9o/o
Christchurchquantile
+
|r)
Weibull
ro GPD
a?
6lo
=
ddi
F(u)
quantile=
Christchurch 99.999%
EN
F(u)
116 CHAP'IER 6. MARGINAL ESTIMATION
DowsingQuantile=
99.9%
C!
tr
tr
ar@
:=
c
<S re.
=-
s
- Weibull
C\l GPD
F(u)
DowsingQuantile=
99.999%
Weib u
GPD
(D
:=
(Ev
fr
F(u)
SIGATIFICAAITWAVE HEIGHT '1.r7
Figure 6.21: Shoreham: Estimates of two extreme quantiles under the Weibull and the
GPD models vercus the emDirical distribution function calculated in z.
ShorehamQuantile=
99.9%
n"
- Weibull
\| GPD
l.E
gv
c
(d
+
c',1
.V
F(u)
Shorehamguantile=
99.99%
ro
.; Weib u
GPD
o(o
ou)
ri
0.90 0.92 0.94 0.96 0.98
F(u)
118 CHAPTBR 6. MARGINAL ESTIMATION
Figure 6.22: North Wales:Estimatesof two extremequantilesunder the Weibull and the
GPD modelsversusthe empiricaldistribution function calculatedin z.
NorthWales Quantile=
99.9%
v _ weibull
+ GPD
ot o?
Esf
c
a$l
F(u)
NorthWales Quantile=
99.999%
- Weibull
1() GPD
,^
F(u)
6,3. SIGNIFICANTWAVEHEIGHT
Figure 6.23: Dover: Estimatesof two extremequantilesunder the Weibull and the GPD
modelsversusthe empiricaldistribution function calculatedin z.
DoverQuantile=
99.9%
Weib u
+ GPD
(t)^,
cv
J
+
0.90 0.92 0.94 0.96 0.98
F(u)
Doverguantile=
99.99%
q - Weibull
ro GPD
.;
o*,
=
c@
g+
+
n
s
F(u)
r20 CHAPTER 6. MARGIAIAL ESTIMATION
6 . 3 . 2 Simulated Data
For the simulated/1s data we only fitted the GPD thresholdmodel. In eachcasewe took
the 95% empiricalquantile of fls as the threshold. From Section5.2 we can seethat for
Sim3-Sim5this is the correct specificationof the distributional form, whereasfor Siml
and Sim2 the marginaldistribution of Ile is more complexthan this as, in eachcase,the
distribution of I/s dependson the associatedwavedirection. Table 6.8 givesestimatesof
the fitted GPD parametersfor eachsimulateddata set. For Siml-Sim3/Sim4 Sim5 the
IIs distribution is estimatedto havea finite/infinite upper endpointrespectively,which is
consistentwith the true simulationmodels.For Sim3 and Sim4 the estimatesare closeto
the true values(seeChapter 5) whereasfor Sim5 the shapeparameterestimateis slightly
too large,correspondingto an over-estimated upper tail. Comparisonsfor Siml and Sim2
are more difficult. For Sim2.the fitted model approximatelyis an averageof the models
for the two directional IIs distribution. This is reasonable,as eachdirection sector is
approximatelyequally likely to produceextremeI1s values. For Siml the fitted model
closelyresemblesthe lls distribution for d > 110". Again this is reasonable,as most
waves,including the largestwaves,comefrom this direction sector.
6.4 Steepness
Figure 6.24 containshistogramsof steepness
for Cardiff, Christchurch,Dowsing,Shore-
ham, North Wales and Dover. Our approachfor modelling the distribution of 5 is to
STEEPI{ESS L21
1. locally generated(steeperwaves),and
Cardiff Christchurch
(\
0.0 0.02 0.04 0.06 0.08 0.0 0.02 0.04 0.06 0.08
Dowsing Shoreham
0.0 0.02
E
,l!!t
0.04 0.06 0.08
(\l
NorthWales Dover
0,0
,xXXXXxx,,,
0.02 0.04
S
0.06 0.08
N
Morecambe
0.0
C\I
()
0.0 0.08
724 CHAPTER6. MARGINALESTIMATION
Cardiff Christchurch
i|u.,,,,,,,J
100 200 300 100 200 300
Dowsing Shoreham
NorthWales
Dependence Estimation
7.L (SWL,115)Dependence
Figures 7.1 and 7.2 show the correlationfunction, p", plotted against the threshold
non-exceedance probabilityp : Pr{IIs < u} : Pr{SWL < z} for (Hs,SWL) and
(Hs,Surge) respectively(note on the plots p, is shownas p(p)) The main featuresof
theseplots are:
727
128 CHAPTER 7. DEPENDENCE ESTIMATION
5. in all cases, as the threshold is increased, the confidence intervals (not shown on
these plots) grow so any irregular changesin the function for large z can be ignored.
For Cardiff the direction of the waves is known to be important to the form of the
joint distribution of the other sea condition variables, so in Figures 7 .3 and 7.4 the cor-
responding correlation function plots are produced conditionally on the wave direction
variable, d, being 0 < I90" or d > 190". Here we see a distinct difference in the degree of
dependencefound, with strongest dependencefor 0 > 190', and that the feature of rising
correlation at more extreme levels observedin Figures 7.1 and7.2 is retained.
These findings suggest that the mixture of bivariate normals model should be consid-
ered as a feasible candidate at all sites. Tables 7.1 ard 7.2 give the pa.rameterestimates
of this statistical model for each site.
Although the parameters of the dependencemodel are of some interest, it is really
how they combine to produce a changing degreeof dependencewhen looking further into
the joint tail of the distribution that is important. To examine this we evaluate lhe pu
function for this dependencemodel. This cannot be evaluated in closed form so we use
simulation techniques. Figures 7.5 7.10 show the resulting pu plots for the mixture of
bivariate normals model together with the threshold model estimate of p, for compa,rison.
These plots are shown for each site and for both (Hy,SWL) and (-FIs,Surge) d.ata. ln
each case the mixture model gives a very good approximation to the pu curve. This
suggeststhe use of the mixture of bivariate normals model, as this has the added benefits
of:
1 . the inference uses all data, not just the values of threshold exceedancesand down-
weighted (censored) values of other observations;
3 . the fit provides extrapolations for dependencewhich allow for an increased, or de-
creased,degreeof dependencebeyond the data.
,------"')
{
11.)
CHAPTER7. DEPENDENCEESTIMATION
Table 7.1: Parameter estimates for the mixture of bivariate normals applied to observa-
tional (I{s, ,9tr7.L)data.
Ch slchurch:
Fho(Hs-Surg€)
Figure 7.6: Dowsing: Estimated correlation function, p.,, between (Hs, SW L) and
(Hs, Sur ge) versusthe threshold(in probability of non-exceedance)
for the fitted mixture
of bivariate normalsand the thresholdbivariate normal
Dowsingi
Fho(Hs-SwL)
Dowsing:Rho(Hs.SulgB)
3
e"
E:
134 CHAPTER7. DEPENDENCEESTIMATION
oover:Rho(Hs-SurgB)
Nodhwales:Rho{Hs'suee)
f-
7.1. (SW L, H s) DEPEI{DEI\ICE
f!
Carditl:BholHs-Surge)
Sholeham:
Rho(Hs-sues)
s
136 CHAPTER7. DEPENDENCE
ESTIMATION
with correlationcoefficient-0.21;
Sim2 bivariate normal dependence
Figure7.11:Siml: Diagnostictest for the form ofthe dependence in the extremesof IIs-
SWtr. The solid line is the test statistic and the dotted line its pointwise95%confidence
interval.
log z
7.1. (SWL,Hs) DEPE TDEI'ICE
Figure 7.13: Sim3: Diagnostictest for the form ofthe dependence in the extremesof Ils-
SW L. 'Ihe solid line is the test statistic and the dotted line its pointwise95% confidence
interval.
1
log z
Figure 7.14: Sim4: Diagnostictest for the form of the dependence in the extremesof 11s-
SWL. The solid line is the test statistic and the dotted line its oointwise95% confidence
interval.
140 CHAPTER7. DEPENDENCEESTIMATION
0.03;
Figures 7.20 and 7.21 show the fitted and true/simulationjoint density contoursfor
Siml and Sim2respectively.In eachcasethe bivariatenormalmodelis of the correctform
for the dependencemodelling,so inconsistencies in the estimatesare due to parameter
estimation rather than mis-specificationof the statistical model. The agreementis very
good for Siml, and reasonablefor Sim2. In the latter the upper tail of the fls variableis
a bit short in the estimatedjoint densitymodel.
For SimS we have dependercemodelsfor borh (H5,SWL) and (I15,Szrge). Joint
densitycontoursare shownin Figures7.22 and7-23. "the agreementis goodin both cases,
with the only obviousdisagreementarisingfrom the estimatedjoint distribution having
a shorter upper tail for Surge and SWL than the simulatedvalues.
For Sim4we havetwo fitted statisticalmodelsto compare.Figures7.24and 7.25show
joint densityestimatesfor the thresholdand mixture bivariatenormal modelsrespectively.
For the former the estimation of the dependencestructure is extremely poor, with the
contoursin poor agreementover most of the distribution. By comparison,the mixture
model is in good agreement,with the exceptionof the SW L upper tail, which is under-
estimated.
Finally considerestimatesfor SimSshownin Figures7.26and7.27. Againthe thresh-
old baseddependence modelfits poorly, and the mixture modelis a markedimprovement.
However,whateverdependencemodel is fitted the joint density estimateis poor as the
marginal modelsover-estimates/under-estimate the tails of .FIsand SW L respectively.
In ChaptersB 9 we examinehow important thesediscrepancies are when considered
from the designperspective,and by taking the marginsto be both known and estimated,
we examinethe importanceof marginal and dependence modelling/estimation.
7.1. (SWL,Hs) DEPE TDE^ICE 141
Figure 7.15: SimS: Diagnostictest for the form of the dependence in the extremesof f15-
SWL. The solid line is the test statistic and the dotted line its pointwise95% confidence
interval.
logz
a;
E
7.1. (SWL,Hs) DEPEI{DE TCE 143
Figure 7.20: SimT Hs-SWL joint density contours: the contours of joint density value
0.01,0.001,0.0001,0.00001ale shown. The estimatedmodelis the bivariatenormal model.
45478
Figure 7.21: S1m2H1-SWL joint density contours: the contoursof joint density value
0.01,0.001,0.0001,0.00001
areshown.The estimatedmodelis the biva,riate normalmodel.
7.1. (SW L, H S) DEPENDENCE L45
Figure 7.22: Sirn3 I{5-Surge joint density contours: the contours of joint density value
0.01,0.001,0.0001,0.00001 are shown. The estimatedmodel is the bivariatenormal model.
I
Surgo
Figure 7.23: Sirr'J Hs-SWL joint density contours: the contours of joint density value
0.01,0.001,0.0001,0.00001are shown. The estimatedmodel is the bivariatenormal model.
11 12 13 14 15
146 CHAPTER7, DEPENDENCE
ESTIMATION
Figure 7.24: Sim4 Hs-SWL joint density contours: the contoursof joint density value
0.01,0.001,0.0001,0.00001
are shown. The estimatedmodel is the thresholdbivariate
normal model.
IF
Figure 7.25: Sim4 Hs-SWL joint density contours: the contoursof joint density value
0.01,0.001,0.0001,
0.00001are shown. The estimatedmodel is the mixture of bivariate
normalsmodel.
IF
SimL the rising mean showsa strong positive relationshipbetweenthe variables. The
variancestabilisesabovea thresholdz:0.5, so this level is taken for subsequent
analvsis.
SimB the rising mean showsa weak positive relationship,with stability of the variance
obtainedabovea thresholdof u : 0.5.
Now we fit the linear regressionthresholdmodel to ,S* given I1}. Estimatesof the
intercept, a, gradient,b, and residualvariance,o2, for Siml-Sim5 are given in Table 7.4.
The estimatesconfirm the empiricalevidencefound above:highly significantdependence
for Siml, significant(but small) dependence for Sim2- Sim4,and independence for Sim5.
Sim4 is surprising as the data were generatedusing an independencestatistical model,
while the estimateddependence is non-zerobut still very weak.
7.2. (S,Hs) DEPE TDETCEFORSIMULA'.|EDDATA L49
For Siml the data simulatedfrom the fitted model reproducethe main featuresof
the original data very well. The correspondingplots for Sim2-Sim5are lesseasyto
assess,but in eachcasethe original data structure appearsto be well replicated.
Data Thresholdz a b 02
Siml 0.5 0.43(0.02) 0.67(0.03) 0.43(0.03)
Sim2 -0.5 0.16(0.02) -0.11(0.02) 0.80(0.06)
Sim3 0.5 -0.02(0.03) 0.11(0.04) 0.8e(0.12)
Sim4 -oo 0.00(0.01) 0.05(0.01) 0.93(0.10)
Sim5 -oo 0.00(0.01) 0.00(0.01) 1.oo(0.10)
Table 7.4: Parameterestimatesfor the linear regressionof S* on I{ applied above a
thresholdz (on the Gaussianscale).Standarderrorsare given in parentheses.For Sim4
and Sim5 the valueof u usedin the resressionmodel is u : 0.
150 CHAPTER7. DEPENDENCE
ESTIMATION
Figure 7.28: Siml: S versusff5 and regressioncurve (plotted on the original scale).
01234
(5, HS)DEPENDENCEFORSIMUL,+TEDDATA 151
Figure 7.31: Sim4: 5 versusIls and regressioncurve (plotted on the original scale).
r52 CHAPTER7. DEPENDENCE
ESTIMATION
Figure 7.32: Sim5: S versus .t15and regressioncurve (plotted on the original scale).
.t a :r:i1-riJ:i.-::'-:-.i:::i,:jj:i.lii:i..r;.:l.l
j,
-2 -1 012 3
-202
1i)4 CHAPTER7. DEPENDENCE
ESTIMATION
-2 -1 012 3
HS
-202
HS
7,2, (S,11S)DEPENDENCEFORSIMULATEDDATA -Li)i)
Figure 7,38: Siml: Mean and Va,ria,nceof S (on the Gaussian scale) versus threshold for
I/s (on the Gaussian scale). Dotted lines give 95% confidence interval.
MeanversusThreshold VarianceversusThreshold
A9
tt
uJ
-1 0 1 2 -1 0 1 2
u (gaussian
scal€) u (gaussian
scale)
Figure 7.39: Sim2: Mean and Variance of .9 (plotted on the Gaussian scaie) versus thresh-
old for 1y'5(on the Gaussian scale). Dotted lines give 95% confidenceinterval.
MeanversusThreshold VarianceversusThreshold
i
E
\)) .
-'f012 -1 0 1 2
u (gaussianscale) u (gaussian
scal€)
156 CHAPTER7. DEPENDENCEESTIMATION
ao !
I
uJ Eo
-1 0 1 2 -1 0 | 2
u (gaussian
scale) gcale)
u (gaussian
I 4 .,i
ul
'''''' ''''''''''
''''-
'i'"^,..''.i
.1 0 1 2 .1 0 1 2
u (gaussian
scale) u (gausslan
scale)
7,2, (S,HS)DEPENDENCE
FORSIMULATEDDATA 157
Figure 7.42: Sim5: Mea,nand Variance of S (plotted on the Gaussian scale) versus thresh-
old for 1{s (on the Gaussian scale). Dotted lines give 95% confidenceinterval.
MeanversusThreshold VarianceversusThreshold
ol
I
E 6o.
I!
.1 0 1 2 -l 0 1 2
u (gaussian
6cale) u (gaussian
scale)
Figure 7.43: Siml: ,9 versus /1s for the original data and the data simulated from the
fitted resression.
Originaldata Dataslnulatedtrem
lhe lrned modet
ct
d
1.0 1.5
158 CHAPTER7. DEPENDENCEESTIMATION
Figure 7.44: Sim2: S versus f1s for the original data and the data simulated from the
fitted reEression.
Originaldata DataI
tne
2 2
Hs t"1s
Figure 7.45: Sim3: S versus 115 for the original data and the data simulated from the
fitted resression.
Originaldata Datasjmulatedtram
tne flned model
8
7.2. (5, HS) DEPENDENCEFORSIMULATEDDATA 159
Figure 7.46: Sim4: .9 versusI1e for the original data and the data simulated from the
fitted regression.
Originaldata D"ASif|?{{etfi%fr3r
HS t-is
Figure 7.47: Sim5: S versus Ils for the original data and the data simulated from the
fitted resression.
Originaldata D+asifluFtedJr?m
10 I 10 14
Fts
CHAPTER7. DEPENDENCEESTIMATION
Chapter 8
Pr{SWL>r,Hs>gl:
Probabilitiesof this form, thought of as a function of c and g, are termed the joi,nt sur-
uiuorJunctionof the joint distribution ol (SW L, H s). Herewe are particularly interested
in finding all the possiblecombinationsof (r,y) suchthat the joint survivor function is a
small fixed probability, i.e. extrapolationfrom the sampleis required. We presentthese
combinationsas curves,which correspondto contoursof equaljoint survivor function.
Thesecontoursare the main output of the standardapproachof the current implemen-
tation of JPM (seeSection1.2) and are often used for initial designcalculations. The
refinedversionof current implementationusesthis information to derivethe probability
of failure.
In Sections8.1 and 8.2 we presentthe contoursof the estirnatedjoint survivorfunction
for Siml-Sim5 for caseswhere
respectively.Throughoutthesesections,contoursareshowncorresponding to 5, 10,50,100
and 1000year return periods (i.e. for eachpoint (2, g) on the associatedcontour curve,
the probability of the event{SW L > n, Hs } g} is suchthat this eventoccurson average
oncein the specifiedreturn period). Suchcontoursare shownfor
2. the joint distribution of the actual simulation model used to generate the Sim data
sets.
161
162 EXTREMESOF (SWL,HS)
CHAPTERB. SIMULTANEOUS
Limit values
The limits of the joint survivor function as either r -+ -cc with g fixed, or y -+ -cc with
r fixed, are important specialcasesas
Independence
When the variablesare independent,the joint survivor function simplifiesas
These three features are helpful in assessingthe contour curvesof the joint survivor
function as:
o for completepositivedependence,
the 7 year contourconsistsof the valuesof (r, g)
which give
I
min(Pr{X > z}, Pr{r > s}) Ar. .r'
where No6, is the number of observationsper year. Defining r" and g/" by
I
Pr{X > or} : Pr{}'> g"}
B.l. MARGI iAL DISTHIBUTIOAISK rOWI\r 163
i.e. they are the marginal ? year return levelsfor the two variables,we have that
the contour takesthe form:
This is just two perpendicularlines at the marginalT year return levelsfor X and
Y. For practical purposes,it is suficient to consideronly the intersectionpoint for
thesetwo lines,i.e. (rr,Ur).
2. estimateddependence
structure(i.e. as estimatedin Chapter7).
This joint densityestimateis modifiedfrom the full fitted form to havethe samemarginal
densitiesas thoseusedto simulatethe data. From this estimatedjoint densitythe associ-
ated joint survivor function, PI{SWL ) r,Hs > gr} is evaluatedby integrationover the
region{,SW-L> r,Hs > y}.
In practice,had contoursof the joint survivor function alreadybeenestimatedbefore
incorporating the knowledgeof the marginal distribution then the modification to the
joint survivor function contoursis simply obtainedby re-calibratingthe marginal scales
of the previousestimate.For example,changingthe 100year marginal retutn level from
the estimatedvalue to the known value. Consequently, estimatedcontoursof equaljoint
survivor function should agreeexactly with the contoursfrom the simulation model at
the limits (c -+ -oo and alsog -+ -oo) of the curves.
Unfortunately,in this study someinconsistencyarosein the evaluationof theseknown
marginalsso that the valuesusedin the current implementationwereslightly incorrectin
164 CHAPTER8. SIMULTANEOUS
EXTREMESOF (SWL,HS)
a few cases.The reasonfor this wasa changein the handlingof small waveheightswhich
the removalof resultedin extra observationsbeing generatedin somecases(lengthening
the tail of .I{5. Neverthelessthis error does not substantiallydetract from the inter-
comparisons.
Figures8.1-8.8showcontoursof equalioint survivorfunction estimatesfrom the fitted
model and derivedfrom the simulationmodel. Any significantdiscrepancybetweenthe
two setsof contourswould showthat either
SimS Here we have plots for (Surge,Hs) and (SWL,11s) in FiguresB.3 and 8.4 re-
spectively. The contoursagain seemwell reproducedby the fitted model. Some
discrepanciesoccur for (SWL,H1\, due primarily to the fact that as we have not
modified lhe SWL marginal distribution to follow the simulationmodel (which is
complexas it arisesfrom the combinationof independenttide and surge).
Sim4 Here we have plots for two different fitted dependencestructures: in Figure 8.5
the thresholdbivariate normal model is used,in Figure 8.6 the mixture of bivariate
normalsmodel is considered.In both casesthe Iitted statisticaldependence models
under-estimatethe contourcurves.For the thresholdmodelthe contoursare further
from the contoursgiven by the simulationmodel than for the mixture of bivariate
normals model. This is particularly obvious at the more extremecontours. For
thesedata the diagnosticplots suggestedthe variablescould reasonablybe taken
as completelydependent.From Figures8.5 and 8.6 we seethat a very closeagree-
ment with the simulationmodel is given by perpendicularlines - correspondingto
completedependence, and that completedependence is a better model than either
of the fitted models.
Simb Here we have the sametwo figrres as given for Sim4. We seethat the threshold
model averagesthe two forms of dependencethat occur in the simulation model,
the compromiseleadingto a poor fit (seeFigure 8.7). In contrast, the mixture of
bivariate normals givesan excellentfit. Note that here, although the dependence
8.1. MARGINAL DISTHIBU'ilOI\IS KI\IOWI\I 165
Once the marginal features have been corrected for, the main sourceof error in estimat-
ing contours of equal joint survivor function arises from not having a sufficiently flexible
parametric dependencemodel. The bivariate normal (and threshold fitted version) were
fine for Siml-Sim3 when this was the correct family of dependencestructures, but were
poor for both Sim4 and Sim5. On the other hand, the mixture of two bivariate normals
(which includes the bivariate normal as a special case) gives good fits in Sim4, when it
is not the correct form, as well as for Sim5, when it is the correct family. Note a.lsothat
differencesthat have been observedbetween the contours of the estimated and simulation
models occurred for combinations (r,g) where r > maxSWL and g > max Hs, and so
were due to long range joint extrapolations.
1()r) CHAPTER 8. SIMULTANEOUSEXTREMES OF (SWL,HS)
Figure 8.1: Siml: H1-SWL curvesof equaljoint survivor probability expressedin return
periods (in years)when the marginal distributionsare known
Figure 8.2: Sim2: Hs-SWL curvesof equaljoint survivor probability expressedin return
periods (in years)when the marginal distributionsare known
Iv
2345
8.1. MARGINALDISTRIBUTIOIISKI\rOm\r
Surge
-- 5flfi31"${P"dfJ,
Figure 8.6: Sim|: H s-SW L curves of equal joint survivor probability expressedin return
periods (in years) when the marginal distributions are known. Here the dependencemodel
is the fitted mixture of biva,riate normals model
-._Eliii[]EBnY,r*o-Jr
M ARGINAL DISTRIBUTIONS
KNOWN 169
Figure B.7: Sirn5: -[{5 SWL crtwesof equal joint survivor probability expressedin return
periods (in years) when the marginal distributions are known. Here the dependencemodel
is the fitted bivariate normal threshold model
Figure 8.8: SimS: fls SWL strves of equal joint survivor probability expressedin return
periods (in years) when the marginal distributions a,reknown. Here the dependencemodel
is the fitted mixture of bivariate normals model
170 EXTREMESOF (SWL,HS)
CHAPTER8, SIMULTANEOUS
Sim2 The HR contour estimatesagreevery well with the simulation and fitted model
contours.
Sim4 The HR contoursseempoor in this case. The strong form of dependencein the
simulated data has not been captured, and the contourshave been badly under-
estimated.For the 1000yea.rcontour,it is worrying that the error is so largein the
HR estimatedcontourwithin the rangeof the SW L data. Specifically,the contour
decreases rapidly in the Il5 value. This feature was not observedin either of the
statistically fitted models,or for the other four Sim data analyses.
-
5Pilfl3lfJ"{f$"J
Hti cslrmates
f,,
23456
sftqffiiBq{#$
172 CHAPTER 8. STMUT,iTANEOUS
EXTHEMES OF (SWL,Hs)
- EstimatedModel
- -- Simulationlvlodel
--- HR Estimates
8.1. MARGINAL DISTRIBUTIOIVSKl\tOWl\r 173
Table 8.1: HR estimatesof points on the 10 and 1000year contoursof the joint survivor
function of (SW L, Hs) for Siml: separateestimatesconditionalon the wavedirection,d,
satisfying0" < 0 < 110' and 110"< 0 < 360'.
Table 8.2: HR estimates of points on the 10 and 1000 year contours of the joint survivor
function of (SW L,Ils) for Sim2, when the marginal distributions are known.
174 0HAPTER 8. STMULTANEOUSEXTREMES OF (SWL,Hi)
Table 8.3: HR estimatesof points on the 10 and 1000year contoursof the joint survivor
function of (SW L,.FIg) for Sim3,when the marginaldistributionsare known.
Table 8.4: HR estimatesof points on the 10 and 1000year contoursof the joiut survivor
functionof (SWL,Ils) for Sim4,whenthe marginaldistributionsare known.
8.2. MARGINAL DISTRIBUTIONSUI\rIfl{OWl{
Table 8.5: HR estimatesof points on the 10 and 1000year contcursof the joint survrvor
function of (SI4zL,fls) for Sim5,when the marginaldistributions are known.
Siml The two setsof curves(shownin Figure 8.13) exhibit good agreement.The only
obvioussmall differenceis in the extrapolationof the f/5 variable,with the fitted
model over-estimatingmarginal return levels.
Sim4 Herewe haveplots for the two differentfitted dependence structures:in Figure8.17
the bivariatenormal thresholdmodelis used,whereasin Figure 8.18the mixture of
bivariate normalsmodel is considered.In eachcasethe samemarginal distribution
fits are used,so differencesbetweenthesefigulesare due to differencesin the fitted
dependencestructure. In Section8.1 we found that the mixture model fitted much
the better. That is still apparent here despitethe marginal return levels being
under-estimatedfor each margin (most notably for I1s which has the contrived
longer tail here). The dependencefeaturesare best consideredby comparingthe
shape of the contour curvesin the region of greatestcurvature. Most notable is
that in Figure 8.17this part of the curveis further from the true form derivedfrom
the simulationmodel than in Figure 8.18. This is particularly obviousat the most
extremecontours. The generalshapeof the contour of the fitted model in Figure
8.18seemsto representthe dependercewell (recallthat the simulationmodel here
had a strongerform of dependence in the extremevaluesthan any of the modelswe
are fitting).
SimS Herewe havethe sametwo figuresas givenfor Sim4. The most striking featureof
thesefiguresis the very poor estimateof return levelsfor "I1s,with the tail being
substantiallyover-estimated(althoughwithin the uncertaintyof the estimatesit is
not statistically significantlydifferent). The likely reasonfor this over-estimation
can be seenin the figures,as the data showthat the 300 year return level occurred
by chancein the sample (correspondingto a 10 year sample). For ,9Wtr, return
levelsare slightly under-estimated.This large distortion of the ma.rginalfeatures
makesdependenceaspectsof the fit hard to assess.
In conclusionwe have found that the main sourceof error in estimating contoursof
equaljoint survivor function is from marginal estimation (a feature largely outside the
remit of this current project). If no additional information were available,then relative
to the marginal estimation uncertainties,the dependencemodelling proposedhere for
8.2. M ARGINAL DIST EIBUTIOATSUI{Ifl\rOWl\r t77
r extra data for the site: for examplewhen .FIsis hindcastfor a longerperiod,
r dependence
estimation.
Figure 8.14: Sim2: Hs-SWL curves of equal joint survivor probability expressedin return
periods (in years)
I
8,2. MARGINAL DISTRIBUTIOIVS
UA'-I(NOI,4/'N 179
Figure 8.15: Sim3: fls-Szrgre curves of equal joint survivor probability expressed in
return periods (in years)
1
Surge
Figure 8.16: Sim3: 115-SWI curvesof equaljoint survivor probability expressedin return
periods (in years)
180 CHAPTER 8. SIMULTANEOUSEXTREMES OF (SWL,HS)
- Esllmatedlrodel
--- SimulationModel
0.5
Figure 8.18: Sin4: Hs-SWL curves of equal joint survivor probability expressedin return
periods (in years) for the fitted mixture of biva.riate nolmals model
- 5f'lllEif,n{&da"J'
0.5
8.2. M ARGINAL DISTRIBUTIOATSUI\rI{IVOWN 181
Figure 8.19: Sim5: fI5-,9WL curves of equal joint survivor probability expressedin return
periods (in years) for the fitted bivariate normal threshold model
Sim4 Here the HR contouris good at this return period. In Section8.1 the strong
form of dependencefor thesedata was not captured by the HR approach,so we
know that this good estimateis not sustainedfor larger return periods. Also the
marginal return levelsfor I19 at higher return periodsare under-estimated.The
SWl marginal distribution seemsto fit well.
Sim5 Again the HR contouris adequateat this return period. However,from Section8.1
we know that the strong form of dependence has not beencaptured (the curvature
of the contour is not sufficientlypeaked). The Ils return levelshave been under-
estimated (with the estimatedmarginal 10 year level the same as the simulation
model 5 year level) but is still - better than our estimate (given in Figure 8.19)
which is an over-estimate(correspondingto the simulation model 50 year level).
The SW L marginaldistribution seemsto be well fitted.
8.2. MARGINAL DISTRIBUTIOATSUI\rI(NOWI{
SWL Hg
(0"sd<1e0") (1e0'<d<270")
4.0 1.82 r.25
4.5 r.82 r.24
5.0 1.80 !.zo
Table 8.6: HR estimatesof points on the 10year contourof the joint survivor function for
Siml: separateestimatesconditionalon the wavedirection, d, satisfying0" < d < 190'
and190'<0<270".
si4/, | 1.50 r.75 2.00 2.25 2.50 2.75 3.00 3.25 3.50 3.75
Hs | +.or 4.bT 4.4s 4.28 4.06 z.zg 3.48 8.01 2.r7 o.sg
Table 8.7: HR estimatesof points on the 10 year contour of the joint survivor function
for Sim2.
Table B.B: HR estimatesof points on the 10 year contour of the joint survivor function
for Sim3.
184 CHAPTER 8. SIMULTANEOUSEXTREMES OF (SWL,HS)
SWL I 0.4 0 . 5 0 . 6 0 . 7 0 . 8 0 . 9 i . 0 1 . 1 1 . 2
H5 | 7.42 7.34 7.27 7.23 7.t5 6.88 6.48 5.86 4 . L 4
Table 8.9: HR estimatesof points on the 10 year contour of the joint survivor function
for Sim4.
Table 8.10: HR estimatesof points on the 10 year contour of the joint survivor function
for Sim5.
Part III
The methodologydevelopedin the first two parts of this report can be applied to any
designproblem. In this chapterwe examinean applicationwherethe mode of failure is
an extremeovertoppingdischargerate, and the analysisis requiredfor an existingdesign.
We fbcus on the problem of estimating return levelsof extremeovertoppingdischarge
rates,which can be rephrasedin terms of estimatingthe probability that the overtopping
dischargerate exceedssomecritical level.
In this chapterwe considerfive applicationsites (correspondingto the five simulated
data sets,Siml Sim5). For eachhypotheticalsife, the overtoppingdischargerate, Q6:,is
givenby equation(1.1.1),with the sea-r,r'all
designcharacteristics(a1,a2,o) givenin Table
9.1. This specificationof the problemmay be unrealistic,as the seaconditiondata are
offshoredata (as the waveshavebeenhindcast),whereasthe overtoppingdischargerate,
equation(1.1.1),presumes the wavesto be asmeasured at the sea-wall.Asaresultasmall
number of high waveheights,with very low steepness, tend to dominatethe overtopping
results.On the otherhand,the analysisis primarilyto
enable comparison between the methods developed in this report and methods cur-
rently implemented, such as the two different levels of analysis described in Section
187
188 CHAPTER 9, AATALYffS OF OVERTOPPING DISCHARGE RATES
Simulated data Ul A2 u
Siml 0.019246.96 8.5
Sim2 0.019246.96 5.0
SimS 0.019246.96 14.5
Sim4 0.019246.96 2.5
Sim5 0.019246.96 9.0
Table 9.1: Information on the structural designfor the simulateddata. Herethe sea-wall
is a simpleslopingwall with a 1:4 slope,n1 and a2 dimensionless constantsand u is the
crestelevationin metres(relativeto ODN exceptfor SimBand Sim5which are relativeto
Chart Datum). Seeequation(1.1.1)for detailsof how thesedesignparameters influence
overtoppingdischargerates.
First we outline special featuresof these methods in their application to the specific
problem of estimatingextremesof overtoppingdischargerates, Q6:.
SVM applied to Qs
The standard SVM, as describedin Section2.1, is applieddirectly to the structure vari
able, Q" the overtoppingdischargerate. The resultswerederivedby applying the GPD
thresholdmodel to a rangeof thresholds(correspondingto upperquantilesover the range
e0- ee.5%).
9.1. BACKGROUNDMETHODS
JPM statistical model the estimateis a slight over-estimateof the level (giving the
actual 140 year level). Here the statistical model is basedon estimatedma.rginal
distributions. Known marsinal distributionsare not usedbecauseof the direction-
ality.
JPM current implementation estimatesof the 100year level are givenfor two drrec-
tion sectorsonly, so direct comparisonis not possiblefor thesedata. Herewe report
only estimatesobtainedusing known marginal distribution for fls and SWL. For
the 0' < d < 110"directionsector,HR take ,9 : 0.06which givesthe estimated100
year overtoppingdischargerate of 0.016(corresponding to the actual 30 yea.rrate).
Also for the 110' < d < 360"directionsectorHR take S:0.025 obtainingthe 100
year overtoppingrate of 0.019(corresponding to the actual 40 year rate).
1. for this site, the estimatedjoint survivorfunction givenby the statisticalmodel and
current implementationseemsadequatelyestimated,
3. the variation of ^9given I1s is large,is relativelysmall, so taking this value as fixed
is a reasonableapproximation.
rn
SWL HS alc
7.28 7.47 3.66 0.01676
7.48 0.99 3 . 1 0 0.00397
6.59 l.D4 J./O 0.00237
6.98 1 . 2 5 3.45 0.00221
o.J1 0 . 5 9 6.82 0.00142
Figure9.1: Siml Q6 100year return valuefor the SVM appliedto logQ5' and the JPM
basedon the bivariate normal dependencemodel for (SWL,Hs). The thresholdaxis is
relatedto applications
of the SVM
E
9.3. APPLICATION'IO SIM2
JPM statistical model the estimategiven for the statistical model very slightly over-
estimatesthe rate (giving the actual 140year rate). Here, the statistical model is
basedon estimatedmarginal distributions.
Of all the methods applied to the data, only the JPM statistical model and the JPM
current implementation(with ,9 speciallychosen)appearto produceacceptableestimates.
Possiblereasonsfor this poor performanceof the other estimatorsare that
IT
SWL tts rz Qc
2.38 3.34 10.31 1.1465
2.18 3.22 9.28 1.0920
2.46 3.42 10.44 0.939
1.88 3.89 11.00 0.9314
3.26 3.64 7.65 0.8798
Table 9.5: Sim2: Return levels for Q6 (in metre3/sec/metre) from simulation model
Figure 9.2: Sim2 Q6 100year return value for the SVM applied to Q6 and logQc, and
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs\. The threshold
axis is related to applicationsof the SVM
- vadabt€
Slructure on ec
.. gifri'nffi$ffi,liiiHl*
0.94 0.96
Throshold
196 CHAPTER 9. A^IALYSISOF OVERTOPPI]VGDISCHARGERATES
JPM statistical model the estimateis very good (the estimatedvalueis the artual 90
year rate). Here,the statisticalmodelis basedon estimatedmarginaldistributions.
JPM current implementation For ,9 : 0.06 the estimateof the 100year rate is 1.86
(correspondingto the actual 25yearrate), whereasfor the derivedvalueofS : 0.048
the estimated100year rate is 2.3 (correspondingto the actual 70 year rate). Here,
the known marginaldistributionswereused.
SWL Hg Tg Qc
1 3 . 1 14-J I 8.57 2.20413
12.88 4.95 7.92 1,8589
12.45 4.30 8.70 r.2745
12.78 3.62 7.76 0.9231
1 1 . 8 3 5.05 7(:)0 0.8149
Table 9.7: Sim3: Return levels for Q6 (in metre3/sec/metre) from simulation model.
198 CHAPTER 9. AAIALYSIS OF OVERTOPPING DISCHARGE RATES
Figure 9.3: Sim3 8c 100 year return value for the SVM applied to Qc' and log Q5rand
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs). The threshold
axis is related to applicationsof the SVM
Et!ilffifi,[q'ffi[,9$&,""
.E
0.94
Threshold
9.5. APPLICATION TO SIM4
JPM current implementation For ,5 : 0.06the esti.mateof the 100 year rate is 15.0
(correspondingto the actual 18 year rate), whereasfor ,5 - 0.04 (derivedfrom the
data) the estimateis 20.8 (corresponding to the 45 year rate). Here,the known
marginal distributions were used. Holding ,S constant has been beneficial,as the
sensitivity to Ts has beenreduced.
For these data, in Section 7.1.2 we identified that without being overly conservative
(SWL,Hs) couldbe takento be completelydependent.Howeverincreasingthe depen-
dencemakesthe JPM statistical model estimateworse.In contrastfor the JPM current
implementation, with S:0.04, with thesevariablestakento be completelydependent
the actual 100year rate is estimatedalmostperfectly.
Detailed comparisonof methodsis unjustifiedhere. Again the JPM (at a1llevelsof
implementation)seemsto be preferableto the SVM.
9.5. APPLICATION TO SIM4
SWL Hs IZ Qc
I.I7 6.64 1 2 . 1 3 8.003
0.50 6.38 t4-75 7.924
0.75 6.54 72.t5 6.444
0.89 6.89 7t.22 6.413
0.97 5.75 12.24 O . U O J
Figure 9.4: Sim4 Q6 100 year return value for the SVM applied to Q6 and logQg and
the JPM based on the bivariate normal dependencemodel fot (SW L, Hs) with estimated
and known marginal parameters. The threshold axis is related to applications of the SVM
.?
g
0.94. 0.96
Threshold
Figure 9.5: Sim4 8c 100year return valuefor the SVM appliedto Qs and log Q6 and the
JPM basedon the mixture of biyariate normalsdepeldencemodel for (SWL,H1) with
estimatedand known marginal parameters.The thresholdaxis is related to applications
of the SVM
- Slructur€
Vadableon Oc
ilflYFl90#8{,"l,Xilis?.
ixtureof BVN
--- f;HiFI""#811ff
ffi3,lid.Pxtureol BvN+fixed
6
E
;V
0.94 0.96
Threshold
APPLICATION TO SIM\ 203
JPM current implementation For S : 0.06the estimateof the 100year levelis 10.32
(correspondingto the actual 20 year level),whereasfor ^9- 0.05 (derivedfrom the
data) the estimateis 15.1 (corresponding to the actual 45 year rate). Here, the
known marginal distributionswereused.
SWL HS 41
Qc
5.89 10.48 1 1 . 3 1 6.236
7.64 7.02 9.87 5.982
5.90 9.83 1 1 . 3 4 5.680
6.01 8.63 10.34 3.845
6.47 7.64 o o o 3.650
Table 9.10: The largestfive seacondition valuesfor Simb in terms of the structure func-
tion, overtoppingdischargerate Qs givenby equation(1.1.1).HereSWL is in terms of
metresrelativeto ODN, Ifs is in metres,and Q in secs.Overtoppingdischargerale, Qs,
is in metre3/sec/metre.
9.6. APPLICATIONTO SIM\
Figure 9.6: SimSQ5' 100 year return value for the SVM applied to Qs and log Q5 and
the JPM basedon the bivariatenormal dependence model for (SW L, Hs) with estimated
and known marginalparameters.The thresholdaxis is relatedto applicationsof the SVM
- S!rucluleVaiabtg.qn Qc
----- ilSY'Fl!"8.U'P'R"l,Ytr$?-"ur
-- slruclurevadableon loooc
JolntProbabllity
l\rethod+BvN+lixed
marglns
-l
0.94 0.96
Thfeshold
206 CHAPTER 9, ANA.LYffS OF OVEHTOPPINGDISCHARGERATES
Figure 9.7: Sim5 Q6 100year return valuefor the SVM appliedto Q6 and log Qc' aud the
JPM basedon the mixture of bivariate normalsdependencemodel for (SWI, f/s) with
estimatedand known marginal parameters.The thresholdaxis is related to applications
of the SVM
- V?riabte
Struclut€ 0n Oc \
o,BvN
---- trfriFii{rli$&:bY$'d,?r.M*,ure \
JJiHiF'1"%XB'iifrl'&&llod+ffixt're
otevru*nxeomarsins \
\
\
6^ \
8- \
\
.
__:___-_-_:___:_-,-:---
0.94 0.96
Threshold
9-7. COMMEN? OAI CURREAIT IMPLEMENTA?/OT\TRESUL?S
Figure9.8: Siml Qc 10 year return valuefor the SVM appliedto logQ6 and the JPM
basedon the bivariate normal dependencemodel for (SWL,11s). The thresholdaxis is
related to applicationsof the SVM
- Shuclurevafiableon loo
- tJtYFt#0Jltfveilv&l+e:
o
9-
Figure 9.9: Siml Q5' 50 year return value for the SVM applied to logQ6, and the JPM
basedon the bivariatenormaldependence modelfor (SWL,Hs). The thresholdaxis is
related to applicationsof the SVM
Figure9.10: Sim2 Q6 10 year return valuefor the SVM appliedto Q6 and logQg and
the JPM basedon the bivariatenormaldependence modelfor (SWL,,11s).The threshold
axis is related to applicationsof the SVM
c2
E
E c'l
E
ll
0.94 0,96
Threshold
Figure 9.11: Sim2 Qc 50 year return value for the SVM applied to QL- and 1ogQ6 and
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs). The threshold
axis is related to applicationsof the SVM
-?^
9:
Eq
0.94 0.96
Thteshold
9.8, OTHERFIGURES 271
Figure 9.12: Sim3 Q5, 10 year return value for the SVM applied to Q6 and log Q6r and
the JPM based on the bivariate normal dependencemodel fot (SW L, H s). The threshold
axis is related to applications of the SVM
.:
0.94 0.96
Threshold
Figure 9.13: Sim3 Qc 50 year return value for the SVM applied to Qg and 1ogQ6, and
the JPM basedon the bivariatenormal dependence model fot (SWL,Hs). The threshold
axis is related to applicationsof the SVM
0.94 0.96
Threshold
2t2 CHAPTER 9. AATAIYSIS OF OVEHTOPPING DISCHARGE RATES
Figure 9.14: Sim4 Q6 10 year return value for the SVM applied to Qg and log Q5, and
the JPM basedon the bivariatenormal dependence model for (SW L, /1s) with estimated
and known marginalparameters.The thresholdaxis is relatedto applicationsof the SVM
Structure
Vafiableon Oc
Simulation
ModelValue
JoinlProbabilitv
frethod+BVN
$ilHf
FISbYBiif,
9rffin$.gfN+rxed
marsins
3r
0.94 0.96
Threshold
0.94 0.s6
Threshold
9.8. OTHERFIGURES 213
Figure 9.16: Sim4 Qg 50 year return value for the SVM applied to Q6 and logQ6 and
the JPM based on the bivariate normal dependencemodel for (SI4IL,.I15) with estimated
and known marginal parameters. The threshold axis is related to applications of the SVM
- StrudureVadableon Oc
-:
--
- !3ifftrtt'8uU,rl&"r[,rv"i[uo?.ev'r
structurevrnaDl€ on loo uc
- Joinl PfobabiljtyMethocf+BvN+ljxed
maeins
5H
E
G
Figure 9.17: Sim4Qc 50 yearreturn valuefor the SVM appliedto Qg and 1ogQ6 and the
JPM basedon the mixture of bivariate normalsdependence model for (SWl, t15) with
estimatedand known marginal parameters.The thresholdaxis is related to applications
of the SVM
E
i
E6
P
o
0.94 0.96
Threshold
2r4 CHAPTER 9. AAIAlYSff OF OVERTOPPING DISCHARGE RA.IES
Figure9.18: SimSQc 10 year return valuefor the SVM appliedto Q5' and logQ6 and
the JPM basedon the bivariatenormal dependencemodel for (SW L, Hs) with estimated
and known marginalparameters.The thresholdaxisis relatedto applicationsofthe SVM
:
E
E
0.94. 0.96
Thfeshold
of BVN
margins
of BvN+fixed
.?
0.s4 0.96
Threshold
9.8. OTHER FIGURES
Figure 9.20: Simb 8c 50 year return value for the SVM applied to Qs and logQs and
the JPM basedon the bivariatenormal dependence model for (SW L, Hs) with estimated
and known marginalparameters.The thresholdaxis is relatedto applicationsofthe SVM
- Vadable
Slructure on Oc
-- IJH'tt$"H8$"l,rY?t'""a.ev',l
slruclurevaiableon loooc
JointPrcbability
MethodlBvN+fixed
marglns
Ei6
E
0.94 0.96
Threshold
Figure 9.21: Sim5Qc 50 year return valuefor the SVM appliedto Q6rand log Q6rand the
JPM basedon the mixture of bivariate normalsdependencemodel for (SI71,,t15) with
estimatedand known marginal parameters.The thresholdaxis is related to applications
of the SVM
$ffifldlHfi{#fl{tu?1
ijiHfHISb:8r'lf'y'fi
sllid-
Xixtureol BVN
Mlxtureof BvN+lixadmarqins
0.94 0.s6
Threshold
216 CHAPTER 9. ANAIYSIS OF OVET{TOPPINGDISCHARGER.ATES
Chapter 10
277
218 CHAPTER 10. COMPARISOATS
AI\rD COI\rC, USIOArS
The simplilicationof the failure regiontor (SW L, Hs) causesrelatively little contri-
bution to any under-estimation(typically the return period of an overtoppinglevel
is over-estimatedby a factor of 1.5-2.0,which may be important in someapplica-
tions). Conversely,the refinedversionof the current implementation,which re-uses
resultsfrom the basicmethod to producea better estimateof overtopping,has po-
tential for someover-estimationdue to the methodsusedto evaluatethe probability
of failing in a rectangularregion.
The need for future comparisonsof the methods, basedon a range of carefully
implementeddesignproblems.
Chapter 11
References
2t9
CHAPTERll. REFERENCES
Prandle,D. and Wolf, J. (1978).The interactionof surgeand tide in the North Seaand
River Thames.Geophgs. J. R. Astr. Soc.,55,203-216.
Smith, R.L. (1989).Extremevalueanalysisof environmental time series:an application
to trend detectionin ground-level
ozone(with discussion),Statist.gci,.,4,367-393.
Tawn, J.A. (1992). Estimatingprobabilitiesof extremesea levels,AppL Statist., 41,
77 93.
Appendix A
Background information
E l:ft(x)do1...d,ra:
I: I:/x(x)dx:1
r The probabilityof someevent,E say,is givenby
That is, marginal distributions are obtained by summing over all the possibilities of the
other random variables. Similarly, by summing over a set of the random variables, the
joint probability density function of the subset of the remaining variables is obtained, i.e.
the joint density function of (&+r, . . . , Xa) is given by
" f x ( r ; * ' ,. . . , r a ) : [*
J -c<r
. .[*
J-m
1;*)d.q...dni.
221
222 APPET{DIX A. BACKGROU ID il\TFORMATIO T
"f"(*)
. -.,,o)
f 6r*r,...,xo1(r;+t,
where the denominator is the joint probability density function of (X,;+r, . . . , Xa)
AII the information about the variation of the vector of random variables is contained
in the joint probability density function. Summariesof this function are helpful in compar-
ing different distributions, so summary statistics a.reoften evaluated. The most common
summa.riesare the marginal mean and variance of each individual variable and, as mea-
sures of dependencebetween the variables, the couariance and,correlation between each
pair of variables.
Correlation and covariance are measuresof linear dependencebetween two va,riables.
Let J (r;, r) denote the joint probability density function for the pair (X', X ) Let p; and
oo2denote the marginal mean and variance of variable e, then Cov(X;, Xi), the covariance
of (&, Xj), is given by
The correlation, p, is a non-dimensional quantity always lying in the range -1 < p < 1.
When p:0 there is no linear dependencebetweenthe variables;when p > 0 there is a
positive linear association between the variables, with large values in each variable (and
small values in each variable) coinciding more often than if there were no relationship
between the variables; and when p < 0 there is a negative linear association between the
variables, with large values in one variable coinciding with small values in the other van-
able, and vice-versa, more often than if there were no relationship between the variables.
4.2. THE NORMALDISTRIBUTION
| (/,, -,r)?'l
pt',r)- ------;- exp { _i_f,| for -all -co<y<oo
J \y/ (2n1rtro_.., , },, -".
2o2 J
where o > 0. The expectation of the variable is p and the variance is o2, hence the
standard notation V - N(p,,oz). The distribution function for this random variable
cannot be evaluated analytically, but symbolically is expressedas
* : I:_#^*'{-#}*,
Pr{Y<
: o[(s- r")1"].
When p:0 and o: l then the distributionis said to be in standardform, i.e. Y -
1r/(0,1). To transformY - N (p,,o2)to Z - N(0,1) we usethe relationship
Y-p.
Z:
o
1 - - ''zJ)
) : --. - - * - . . - * e * p r
f z ( z ) : d ( z"t L 1
en)d/\der(D))U2'-i'p\-iz-
wherethe rangeof zi, for i : L,. ..,d, is -m ( z;.--cr.).Heredet(D) is the determinant
of the d x d matrix X, termed the variance-covariancematrix,
lott
":l I
\ oar
224 APPENDIXA. BACKGROU\rDINFORMATIOI\r
with (z,j)th element corresponding to the pairwise covariance of variables Zi, and, 2,1.
Thus, as
p,; : Correlatio Cov(z;'zi) - --
ru r' -
n(2,. Z,\
(var(Z;)Yar(2,))'t'
we have
(' " " nra)
'-l^ 0"
s'-|"'"'Pij " I
;J
\ Par " " L )
where p;3 is the correlation of Z; and 27. Therefore, the diagonal of D is a seriesof entries
of ones and the matrix is symmetric.
To illustrate the notation, we now give this joint density in the bivariate case:
1 ( 1 )
rl'2'1)
fzr,z"(21,22):((21,22):
G$: t;+/,exr{-4f-a1tz!-2p2""+A}'
where-oo < (zr, zz) < m. Herep: prz: p21,wilh -l < p ( 1, is the correlation
between21 and 22, so in terms of D,
'-/t o\
"- t )
\,
and
t ( t -P\
F
' -r--(r-t")\-P
1)
Property l: An importantspecialcaseis whenp: 0, i.e. no correlation.Then
z2): *o
lz,,z"(zy, * =
# {-t t": "3)\ f ,,(",)f,,("r),
so (fi, 22) are independent.
Property 2: The marginal densityof Zt - 11719,1).
This result followsfrom
1r7,.,'l
'"P * Pz2)'J
@F4W t-41 - l)\zt
so that z1lz2: zt * N(pzt, (1 - pt)) The resultfollowsfrom
f- - (t. z \
: 4i#(!.
f z,lz,=,"(zrltr)
l 221r'21
4.2. THE NORMAL DISTRIBUTIAN
:
/Y(y) 6V+rE r*o {-}r" - p)rL tv- r)},
w i t h g q: - *
l At< oo for i:1,...,d. H e r ep : ( h , . . . , p a ) r a n d X i s t h e v a r i a n c e -
covariance matrix with (rt,j)th element corresponding to the pairwise covariance of vari-
ables )t and Yi. Thus, as
Cov({, Y3)
nri: Con(Yu,Yt): : oijl@ioi),
(Var(4)Var(r1))1/'?
we have
("t
D: I
t
\ olodqd|
wherep;7is the correlationofY; and Y7.To illustrate the notation we againgivethis joint
densityin the bivariatecase:
( t -/i,)'?)],
J Y t y ' r: r P x l l - 2 1 1 - p e 1 ( 1 y , , -u , 1 ' _ r o ( a r - p t \ ( ! 2 - t r , \ + f ? ,
qr ot / \ ot /\ 02 / \ 02 / /)
with
1
c: - p'Y
21To1oil1
and -co < (Ayyz) < cc. Herep: pr2: p21is the correlationbetweenYi and Y2. Now,
Y - MVNa(p,D)
Y - BVN(p,x).
226 APPE TDIX A. BACKGROU\ID ilIFORMATIOAT
"i##{: d@i,o)
A2d\1."r ."+ \
where /(ri,ri) is the joint density function of the bivariate Normal distribution with
standardmarginaldistributions.
Normal Marginals
o(zl,uj);
}a@\,u).
0'i
d@i,"\).
228 APPET{DIXA, BACKGROU,ryD
IATFO.RMATION
where .\; : Pr(d; > u;), and o,; ) 0. Then for ri > ui, the marginal variablesare
transformedto normality through the probability integral transform (A.3.1), giving
x;: o-'(r",(&))fori:r,2,
and in particular,
ui: Q 1 ( F x , ( u e )f)o r i : 1 , 2 ,
o(ul,uj);
)a(ri,u)) d,rI.
)ri drt'
,b@;,';)##
for i : 1, 2,
In the aboveexpressions,
T:eZt+(7-e)22
Le.
Tj : eZti + (I - e)221fot j :1,2
where
o?,Ptoton
,,-(
"'
\ Ptottotz o1z )
- ( pzoztotz\
03,
"':
\ Prorrorz oEz )
it follows that
r; : a-r (Frj(]:j)) for j : 1,2
wheret3(ij): IE'(o({)).
In practicethis statisticalmodelis over-parametrised
for statisticalpurposes.
To avoid
parameter redundancv we take
! 4 = 0 , o n = 1 a n do n : 7 .
List of Figures
231
232 LISI: OF FIGURES
6 . 8 Probability and quantile plots for the GPD fitted to Surgelevel data at
Cardiffand Christchurch.Also shownin eachplot is the z: g line. 100
6 . 9 Probability and quantile plots for the Weibull distribution fitted to I15
observationsat Cardiff and Christchurch. Also shownin eachplot is the
r:u line. . .
LIST OF FIGURES
6.10 Probability and quantile plots for the Weibull distribution fitted to flg
observationsat Dowsing and Shoreham. Also shown in each plot is the
s : y line.
6.11 Probability and quantile plots for the Weibull distribution fitted to Ils
observationsat North Wales and Dover. AIso shownin each plot is the
r - y line. 107
6.12 Probability and quantileplots for the TruncatedWeibull distribution fitted
to Fls exceedancesat Cardiff and Christchurch.Also shownin eachplot is
the z :3r line.. 108
6.13 Probability and quantileplots for the TruncatedWeibull distribution fitted
to -FIsexceedancesat Dowsingand Shoreham.Also shownin eachplot is
the z: g line.. 109
6.14 Probability and quantileplots for the TruncatedWeibull distribution fitted
to I{s exceedancesat North Whlesand Dover. Also shownin eachplot is
the r : 9 line. . 110
6.15 Probability and quantile plots for the GPD fitted to /1s exceedances
at
Cardiff and Christchurch.Also shownin eachplot is the r :9 line. 111
6.16 Probability and quantile plots for the GPD fitted to Ils exceedances
at
Dowsingand Shoreham.Also shownin eachplot is the r: grline. . t72
6.17 Probability and quantile plots for the GPD fitted to lls exceedances at
. l s os h o w ni n e a c hp l o ti s t h ec : y l i n e . . . . . . 1 1 3
N o r t hW a l e sa n dD o v e r A
6.18 Cardifi: Estimatesof two extreme cuantiles under the Weibull and the
GPD modelsversusthe empiricaldistribution function calculatedin u. 1r4
6.19 Christchurch: Estimatesof two extremequantilesunder the Weibull and
the GPD modelsversusthe empiricaldistribution function calculatedin t. 1 1 5
6.20 Dowsing: Estimatesof two extremequantilesunder the Weibull and the
GPD modelsversusthe empiricaldistribution function calculatedin u. 116
6.21 Shoreham:Estimatesof two extremequantilesunder the Weibull and the
GPD modelsversusthe empiricaldistribution function calculatedin z. tL7
6.22 North Wales: Estimatesof two extremequantilesunder the Weibull and
the GPD modelsversusthe empiricaldistribution function calculatedin u. 1 1 8
6.23 Dover: Estimatesof two extremequantilesunderthe Weibull and the GPD
modelsversusthe empiricaldistribution function calculatediu z. 119
6.24 Histogramof Steepness
at varioussites.
6.25 Histogramof measuredsteepness
at Morecambe,Boygrift and L1.me. 1t1
7.14 Sim4: Diagnostictest for the form ofthe dependence in the extremesof 1{s-
SWL. The solid line is the test statistic and the dotted Iine its Dointwise
95% confidenceinterval.
7.15 Sim5:Diagnostictestfor the form ofthe dependence in the extremesof fls-
SW L. The solid line is the test statistic and the dotted line its pointwise
9 5 %c o n f i d e n icnet e r v a l . ...141
7.16 Correlationfunction,p,, between(Hs, SW L) versusthe threshold(in prob-
ability of non-exceedance) for the differentsimulateddata sets. r42
7.17 SimS: Correlationfunction, pu, between(Hs,SWL) and (Hs,Surge) ver
sus the threshold(in probability of non-exceedance) r42
7.18 Sim4: Correlationfunction,p,,,between(Hs,SWL) versusthe threshold
(in probability of non-exceedance) for the fitted mixture of bivariate nor-
mals and the thresholdbiva.riatenormal . . 143
7.19 SimS: Correlationfunction,p,,, between(Hs, SW L) versusthe threshold
(in probability of non-exceedance)
for the fitted mixture of biva,riatenor-
mals and the thresholdbivariatenormal . 1,43
7.20 Siml Hs SWL joint densitycontours:the contoursof joint densityvalue
0.01,0.001,0.0001,0.00001
areshown.The estimatedmodelis the bivariate
n o r m am
l odel. ....I44
7.2I S\m2 Hy-SWL joint densitycontours:the contoursof joint density value
0.01,0.001,0.0001,
0.00001areshown.The estimatedmodelis the bivariate
n o r m am
l odel. ....144
7.22 Sim3 Ils Surgejoint densitycontours:the contoursof joint density value
0.01,0.001,0.0001,
0.00001areshown.The estimatedmodelis the bivariate
n o r m am
l odel. ... . ...145
7.23 Sim3 H7-SWL joint density contours:the contoursof joint density value
0.01,0.001,0.0001,
0.00001areshown.The estimatedmodelis the bivariate
n o r m am
l odel. ....145
7.24 Sim4 Hs SWL joint density contours:the contoursof joint densityvalue
0.01,0.001.0.0001.
0.00001are shown.The estimatedmodelis the thresh-
old bivariate normal model.
7.25 Sim4 Hs SW L joint densitycontours:the contoursof joint density value
0.01,0.001,
0.0001,
0.00001areshown.The estimatedmodel is the mixture
of bivariate normals model.
7.26 Sim5H;-SWL joint densitycontours:the contours of joint density value
0.01,0.001,
0.0001,
0.00001
areshown.The estimated model is the thresh-
old bivariate normal model.
236 LIST OF FIGURES
9.1 Siml Qc 100 year return value for the SVM applied to log Qg and the
JPM based on the bivariate normal dependencemodel for (SW L, Hs).
The thresholdaxis is relatedto applicationsof the SVM . . . . . 192
9.2 Sim2 Q6' 100year return valuefor the SVM appliedto Qs and.IogQ6' and
the JPM basedon the bivariatenormal dependence model for (SW L, H s).
The thresholdaxis is relatedto applicationsof the SVM . . . . . 195
9.3 Sim38c 100yearreturnvaluefor the SVM appliedto Q6 and logQg and
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs).
The thresholdaxis is relatedto applicationsof the SVM . . . 198
9.4 Sim4 Q6' 100year return valuefor the SVM appliedto Qs and logQo and
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs)
with estimated and known marginal parameters. The threshold axis is
related t o a p p l i c a t i oo
n fst h e S V M .......202
9.5 Sim4 Qc 100year return valuefor the SVM appliedto Qc and log Q6 and
the JPM basedon the mixture of bivariate normalsdependencemodel for
(SW L, Hi with estimatedand knownmarginalparameters.The threshold
axis is relatedto applicationsof the SVM . . . . 202
9.6 Sim5 Qc 100year return valuefor the SVM appliedto Q6 and logQ6rand
the JPM basedon the bivariatenormal dependence model for (SW L., Hs)
with estimated and known marginal paraneters. The threshold axis is
related t o a p p l i c a t i oo
n fst h e S V M ...... .205
LIST OF FIGURES 239
9.7 Simb Q6 100year return valuefor the SVM appliedto Qc andlog Qa and
the JPM basedon the mixture of bivariate normalsdependencemodel for
(.SWL, Hs) with estimatedand knownmarginalparameters.The threshold
axis is related to applicationsof the SVM . . . . 206
9.8 Siml Qc 10 year return valuefor the SVM appliedto log Q6,and the JPM
based on the bivariate normal dependencemodel for (Str;tr/],Ifs). The
thresholdaxis is related to applicationsof the SVM . . . 209
9.9 Siml Q,?50 yearreturnvaluefor the SVM appliedto logQ6 and the JPM
based on the bivariate normal dependencemodel for (SW-L,I1s). The
thresholdaxis is relatedto applicationsof the SVM . . . 209
9.10 Sim2 Q6 l0 year return valuefor the SVM appliedto Qs ald logQo and
the JPM basedon the bivariatenormaldependence modellor (SWL,Hs).
The thresholdaxis is relatedto applicationsof the SVM . . . 210
9.11 Sim2 Qcr 50 year return valuefor the SVM appliedto Qs and log Q6: and
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs).
The thresholdaxis is relatedto applicationsof the SVM . . . . . 210
9.12 Sim3 Q6r10 year return valuefor the SVM appliedto Q6' and log Qc, and
the JPM basedon the bivariatenormal dependence model fot (SW L, H s).
The thresholdaxis is relatedto applicationsof the SVM . . . . . 211
9.13 Sim3 Qc 50 year return valuefor the SVM appliedto Q6 and 1ogQs and
the JPM basedon the bivariatenorrnaldependence model for (SWL, Hs).
The thresholdaxisis relatedto applications
of the SVM . . . . . 2I-
9.14 Sim4 Q6, 10 year return valuefor the SVM appliedto Q6 and log Q6 and
the JPM basedon the bivariatenormaldependence modelfor (SWL,Hs)
with estimated and known marginal parameters. The threshold axis is
relatedt o a p p l i c a t i oo
n fst h e S V M .......212
9.15 Sim4 Q6 l0 yearreturn valuefor the SVM appliedto Qa1and logQ6 and
the JPM basedon the mixture of bivariate normalsdependencemodel for
(SWL, Hs) with estimatedandknownmarginalparameters. The threshold
axis is related to applicationsof the SVM . . . . 2I2
9.16 Sim4 Q6 50 year return valuefor the SVM appliedto Q5rand log Q6: and
the JPM basedon the bivariatenormaldependence modelfot (SWL,Hs)
with estimated and known marginal parameters. The threshold axis is
relatedto applicatioon fst h e S V M ...... .213
240 LIS'| AF FIGURES
9.17 Sim4 Q6: 50 year return valuefor the SVM appliedto Qs and logQ6 and
the JPM basedon the mixture of bivariate normalsdependencemodel for
(SW L, Hs) with estimatedand knownmarginalparameters.The threshold
axis is relatedto applicationsof the SVM . . . . 2I3
9.18 Simb Qg 1-0year return valuefor the SVM appliedto Qs and log Q5' and
the JPM basedon the bivariatenormaldependence modelfot (SWL,Hs)
with estimated and known marginal parameters. The threshold axis is
related t o a p p l i c a t i oo
n fst h e S V M .......274
9.19 Sim5Qc 10 yearreturn valuefor the SVM appliedto Q6 and.logQg and
the JPM basedon the mixture of bivariate normalsdependence model for
(SW L, Hs) with estimatedand knownmarginalparameters.The threshold
axis is related to applicationsof the SVM . . . . 2I4
9.20 SimSQc,50 yearreturn valuefor the SVM appliedto Q6'and logQ6'and
the JPM basedon the bivariatenormaldependence modelfot (SWL,Hs)
with estimated and known marginal parameters. The threshold axis is
related to applicatioo n fst h e S V M .......2I5
9.21 SimSQ6: 50 year return valuefor the SVM appliedto Q5rand log Q6rand
the JPM basedon the mixture of bivariate normalsdependence model for
(SWL, Hs) with estimatedandknownmarginalparameters. The threshold
axis is relatedto applicationsof the SVM . . . . 21,5
List of Tables
5.1 Information about the observationaldata sites: SWL - still water level.
Waves - significant wave height, wave period, wave direction. 70
5.2 Differencesin the statisticalmodelsfor the simulateddata setsSiml-Sim5.
24r
.,,4n
LIST OF TABLES
9.1 Information on the structural designfor the simulateddata. Here the sea-
wall is a simple sloping wall with a 1:4 slope, a4 a;nda2 dimensionless
constantsand u is the crestelevationin metres(relativeto ODN exceptfor
Sim3and Sim5which are relativeto Chart Datum). Seeequation(1.1.1)
for details of how thesedesignparametersinfluenceovertoppingdischarge
rates.. ......I88
9.2 The largest five sea condition valuesfor Siml in terms of the structure
function,overtoppingdischargerale Q6 grvenby equation(1.1.1). Here
SWL is in terms of metresrelative to ODN, I1s is in metres, and Ts in
s e c sO
. v e r t o p p i ndgi s c h a r gnet e , Q 6 , i s i n m e t r e 3 / s e c / m e t r.e. ... . . . 1 9 1
9.3 Siml: Returnlevelsfor Q6 (in metre3
/sec/metre)from simulationmodel.. 192
9.4 The largest five sea condition valuesfor Sim2 in terms of the structure
function,overtoppingdischargerate Q6 givenby equation(1.1.1). Here
SW L \s in terms of metresrelative to ODN, lls is in metres,and Tz in
s e c sO
. v e r t o p p i ndgi s c h a r gr ea l , eQ, s , i s i n m e t r e 3 / s e c / m e t r.e. ... . . . 1 9 4
9.5 Sim2:Returnlevelsfor Qc (in metre3/sec/metre)
from simulationmodel.. 195
9.6 The largest five sea condition valuesfor Sim3 in terms of the structure
function,overtoppingdischargerate Q5:givenby equation(1.1.1). Here
SWL is in terms of metresrelative to ODN, Ils is in metres, and.Ts in
secs.Overtoppingdischargente, Q6, is in metre3/sec/metre. . . . . . . . 197
9.7 Sim3: Return levelsfor Q6 (in metre3/sec/metre)from simulationmodel. . 197
9.8 The largest five sea condition valuesfor Sim4 in terms of the structure
function,overtoppingdisch-arge rate Q6 givenby equation(1.1.1). Here
SW L is in terms of metresrelative to ODN, J{s is in metres, and T2 in
s e c sO
. v e r t o p p i ndgi s c h a r gr a
e t e ,Q c , i s i n m e t r e 3 / s e c / m e t r.e. ... . . . 2 0 I
9.9 Sim4: Return levelsfor Qq (in metre3/sec/metre)from simulationmodel. . 201
9.10 The largest five sea condition valuesfor SimS in terms of the structure
function,overtoppingdischargerate Q6: givenby equation(1.1.1). Here
SWL \s in terms of metres relative to ODN, Hs is in metres, and Tz in
s e c sO
. v e r t o p p i ndgi s c h a r gnel e , Q 6 , i s i n m e t r e 3 / s e c / m e t r.e. ... . . . 2 0 4
from simulationmodel.. 205
9.11 SimS:Returnlevelsfor Q6 (in metres/sec/metre)
244 LISTOFTABLES