Professional Documents
Culture Documents
Libro de Pierre Goovaerts
Libro de Pierre Goovaerts
Geostatistics for
Natural Resources
Evaluation
PIERRE GOOVAERTS
Risk u Risk [3
Dcparlinent of Civil and Envil~on~nenldHnginecring
The University o f Michigan, Ann Arbor
Nrrrhnlie,
Maxime,
and Xuvier
Copyrighl 1997 by Oxf<rrdIlnivcrsily Press. Inc.
As a last remark, heware that uncertainty . . . arises horn our imperfect krlowletlgc
of that phenon~enon,it is data-dependent and most importantly model-dependent,
that model specifying our prior concept (decisions) ahout the phenomenon. No
rnndcl, hence no uncertainty rneasurc, can ever he objective.
Pierre Goovaerts closes no authoritative work with a comment that inany oias-
ters could envy, showing his co~ntnandingdistance from a suhjcct he otherwise
meticulously articulated over 400 pages. 'i'lris is the signature of the very best.
'l'his (still) young Inan is neither French nor Soutlr Africa~r.Hc is neither mining
cr~gir~cer nor pctnilc~~rncngineer Hc is an agroa~mistwillt ;I tradem:irk no-now
sense practicality that, from the beginning, t m d e geostatistics diSSercnt and
hencc, 1 dare say, successful. An author must iiave rare insight lo present such a
con~pendiumof tncthods and tools witl~outletting them lakc cornn~ando f the
hook. Picrre built his hook ;troond the study goals, giving to data aiialysis m d
data integration the lend they deserve. Never mistake tlrc tools Sor the goals-
this is the recipe S~ira good geostatistical study, and Pierre Ins demonstrated that
it also works for a good hook. His illustration of every concept, every kriging
system, by an cxarnplc from the same data set (heavy ntetal contamination in the
Swiss lura) is an er~chanttncntand a necessary hrenthc~:I" .: siiine trwnh read-
ing.
Characteristically. exploratory dat;~analysis (chapter 2) precedes the intro-
duction of the random function (chapter 3). The chapter on inference and niodel-
ing of a multivariate model (chapter 4) is, in my opinion, tlre best ever written OII
the suh~cct.Pierre could have heen holder hy giving prccedencc to uncertainty
assessolent over estinratioo, but Ire chose to present first the estimation tools
(cl1;tpt~r~ 5 and 6). l'lris is tlrc most complete yet cohcsive expos6 of all the vari-
ous llavors of kriging and cokriging. The en~pllasisis not on the illusory kriging
variance hut rather on building esti~nalorsthat can account for the large diversity
11f information types clraractcristic of earth sciences. The very reason for geo- Acknowledgments
statistics and the future of the discipline lie in the modeling of uncertainty, at
each node through conditional distributions (chapter 7) and globally through
stochastic images (conditional simulations, chapter 8). In modern geoshtistics,
which is driven by conditional simulations, kriging is an engine, and not the
only one, to build models of conditional probability distributions. Kriging esti-
males and kriging variances have lost their original luster: the fonner hccause of
thcir uneven s~ooothingand the latter because of thcir data independence.
The practicc of gcostatistics has always been ahead of academic publications.
This hook linally may I w e caught up with the use of random function models in
l l ~ cearth sciences, hut gcoslatislics has already freed itself from tlie frame of
such models. It belits that the door of an era he closed hy a man o f the future.
A.G. Jorrn~el
Most of this hook was written in 1994 during my second year as 21 postdoc at the
Department of Geological and Environmental Sciences (St;rol'ord University).
'The idea of a geostatisticd hook was launched by AndrC Journel on a flight hack
from the forum "Geostatistics for the Next Century," held in Montreal in June
1993. In addition to initiating the project, AndrC was an enthusiastic adviser and
a tireless revicwcr, tracking any sentencc that failed to follow his golden rules of
clarity and practicality. Witl~outhis ceaseless support, this book probably would
never llave appeared on the shelves.
711e 20 months of writing were followed by a 6-tnonth peer review. I wish to
thank the IiAlwving persons for their time and palicncc in reading all or part of
the 550-page draft manuscript: Clayton Deutsch, Jennifer Dungan, Guy Girard,
J;rinrc Ci6111cz-llerndodcz.Ricardo Olea, Plrilippe Sonnet, Mohan Srivastava,
fl;nis W;rckcr~i;tgcl,i ~ n t lRic11;rrd Wchstcr. Spccial tllanks to Molian Srivastava
for Iris thorough review of the book and Iris I06 comments, sontetirncs irritating
but always pertinent. 'The draft manuscript also was used as a textbook for pre-
qualifying exams of Stanford University Ph.D. students and benefited from the
careful reading of Phacdon Kyriakidis, Srinivas Rao, and Tingling Yao.
Most of my knowlcdgc ahout indic;rtor gcoslalistics ;uid stochastic simulation
was gained during my 2 years at Stanford, and I am gratcful to Ciilles Bourgault
and nrany graduate students for their stirnulatittg discussions and seminars. This
time at Stanf6rd would not have heen possible without the linancial support of
S ~ m f o r dUniversity, the Belgian Naliooal Fund for Scientilic Research, the Bel-
gian American Educational Foundation, NATO, and a Fulhrigltt-Hays grant. I
a, ,ally indebted to the Belgian National Fund for Scientilic Research for
$"I .;my research for 7 years.
6 , i~listicscannot exist without data. Throughout the book, case studies
have proved valuable cornplcrncnts to thc theoretical introduction of concepts
and algorithms. I thank Jean-Pascal Duhois of the Swiss Federal Institute of Contents
Technology at Lausanne for providing me with a priceless environmental data
set and for autliorizing the publication of these data.
Lorrvain-la-Neuve, Belgirrn~ P.G.
December 1996
1 Introduction 3
Appendixes 443
A Fitting an LMC 443 Introduction
B List of acronyms and notation 447
B.l Acronyms 447
B.2 Common notation 448
C The Jura data 457 Earth scier~ccsd a t a are typically distributed in space and/or in t i n ~ c .Knowl-
C.1 Prediction and validation sets 457 edge of all attribute value, say, a mineral grade or a pollut.a~rt.conccnt.r;r-
C.2 Transcct data set 464 t.ion, is l,lrus of litlle interest unless locatio~iand/or tinrt: of ~ r ~ c a s n r c ~ are ~~ent
k~rownand accuul~tetlfor in the dat,a analysis. (:coshI.ist,ics provides ;I srl.
Bibliography 465 of statistical tools for incorporating the spatial and t e n ~ p o r a lcoordinal.cs of
ol?servations in d a t a processing.
Index 477 ,,
I he developrne~~t. of geostatistics i l l the 1960s rcsultcd from the n e d for a
n r c l l ~ o d o l o gto~ evaluat.r the rccoverahle reserves in m i n i ~ ~ g d c p o s i t sPriority
.
was given l o practicalil,y, a cnrrellt trademark of gcostatisl.ics tlrnt explains
it,s succr:ss ;rnd application in such divt:rsr firlds as r r ~ i l r i ~pel,rolculn,~g, soil
science, occanograpl~y,irydrogeology, remote scnsi~rg,and c ~ ~ v i r o n n r e sci- ~~td
ences. Ilntil the late 1080s, geostalistics was r:sscnt,ially viewed as a rrreaus 1.0
dcscrilx spatial patterns and ioterpol;~tethe valhr of l,l~eattril)ote of i111.t:r-
est a t ~ ~ n s a m p l eIocaiions.
d (kostatislics is now irrcreasi~rglyr~setlto morlcl
tllc rmcertainty about I I I I ~ I I ~ values ~ I I t,lrrongh the generation of allkrnat.ive
inrages (realizations) that all honor tlw d a t a and reproduce aspcct,s of llre
patterns of spatial depcntlerrce or olher statistics d c e ~ ~ r ecorrstxll~enlialtl for
t . 1 1 ~prot'lenr a t i~;tntl.A give11 scenario or I.r;tnsf(:r funclinn (re~llcdiat,iolrpro-
ccss, flow sin~ulabor)call iw ;,ppliad 10 t . l ~se(, of roalie;~l.ions,nliowillg 1 . 1 ~ :
t r r ~ c e r t ; ~ i of t y r q m l s l : (rcnrt:di;itio~r cllicicncy, (low p r o p d i r s ) 1.0 bo as-
~ ~Llre
scssed St.ocllast,ic i m n g i ~ ~isgow! of (.he ~nosl,vihriltll. :wd pronlisit~gx r w s of
rcsearcl~in gcost,atistics.
O f t m there are o ~ ~ al yfew tneasurc~ncnlsof the a1.t.ril~irl.cof i~rl.ercsl,;l.l~i!
resultant predicted maps t,hus provide poor resolution, and the correspo~~rling
uncertainty may bc very large. In strch situat,iorls, it is critical Lo nccollnt. fol.
informat.ion t h a t is more densely s a n ~ p l e d .For example, insr~fficicnt,p o l l n t , a ~ ~ t
d indirect, yet exlraustive, in for ma ti or^ 11rovidcrl
d a t a can be s ~ ~ p p l e t n c n t ewit11
by LIE calibration of a soil or land nsc m a p . 111l.cgrationof secondary 11al;r in
prediction and simulation algorithms is a ~ ~ o t l active ~ e r aveunc of resmrcl~in
geostalistics.
,,
Il~corct.iraldcvcloplncnls and i ~ p p l i c a t i oof ~ ~gwstatisI,iciil
s 1.0ols i t r ~11t:ing
p ~ ~ l ~ l i s li ~n call
t l wcr-increasing variety of jour~lals;mrl coirgress p r o c e e d i ~ ~ g s .
Yt!I., a s of 1997, t.l~crcwas no tcxthook to provi(lr~st.~ldc~~f.s RIICI p r i ~ t i t i o ~ w r s
with a colnprclmrsive as well as practicd ovrrvicw of tl~r:ever-growir~gpaleblk
of gcost~atisticdconcepts and a l g o r i t h s . 'len years after David's (1077) a l ~ d
Jo~rrncland Iluijbrcgts' (1978) books olr nlit~inggeoslkitistics, Isaaks m d Sri-
v;tst.itva (1989) wrot.e a rernarkahle i ~ ~ t r o d u c t i oto n applied geost,atist,ics a t
lllc n l ~ r ~ r r g r a d u a icvcl.
tc 'I'lrcir foc~rswas 011 tmplr~r;tl.orydal;t a d y s i s and
spat,ial prt:dict,io~~, wit,l~liI~l.11: < I I ! \ , ~ I ~ O I I uf
I ~ ~111orf:
I I I , advmced t,opics, such ;IS
;rsscssnm~f,of unccrl.ainty slid stocl~ast,icimaging. 'Thr: goi(lebouk ;~ssociatr:d
wil,ll lllr goost.atisLical soft.~varclil)r:rry (:Sl,lll (Ijeulsclr and Jourucl, 1992a)
@VP a co111p1cLo{,rfwrtl.ahio~rof rccc~rlgr:oshlisI.ic;rl dcv(\lrrl,~l~e~ll.s, p;lrticw
M y i l l 1 . 1 ~ : ttrcn of condil.ion:d s i ~ ~ u ~ l i i t~ i co ~Lit~ was , not i n t ~ ~ ~ 10i d I)(.
~ da
t.heorctica1 n!f~!rt:~~cc l.extbook. 'I'lkis book aims a t I~ridgilrgthe gap 1~t:tweeo
Isaaks and Srivaslava's introductory hook and GSLIR's more complt:l,e user
guide.
'The main text of the book is divided into seven chapters, covering the
n ~ o s ilnport,itl~l
t arras of gcostalistical n~ethodology'I'l~e prescnlatiorr follows
tllr l.ypical steps of a geosL;tt,istical analysis, ir~l.roducingtools for description,
quantitative modeling of spal.ial colrt.inoity, spatial prediction and unc.ertaint.y
a s s c s s n m k 'To lacilitnlc reading and as art attenlpt a t standardizat~ion,this
book llses Lllc lota at ion of the GSLIR guirlcbook.
,.
I lic various tools are illr~stratedusing a nlullivari;ttc soil d a t a set relat.ed
to l ~ e a v yn~et,;tlc o ~ ~ t a ~ ~ l iof~ a~ a14.5 ~ ~ regi011 i l l the Swiss J u r a . TIIC
t i okltlZ
geostalistical analysis was carried out using the (;SLID software. Alll~ougll
tliis d a t a s e t gives t l ~ el ~ o o ka definite environmental flavor, presentation of t.he
a l g o r i t l ~ n ~iss gcl~crala ~ l diutelded for str~dcntsand practitioners desiring to
gain itn undersla~ldingof the n~etl~odology. Mathematical developments un-
derlying most interpolation algoritlrms are given; therefore, the reader should
lravc some prior i~otionsof lil~earalgebra, in addition to an undergradunto
Figure 1.1: Map showiug the split of the 359 data locatio~tsinto a test set (closed
k~lowlcdgeof slutistics. 'fhese tl~eoretical dcvelopmcnls may be skipped, circles) and n prediction set (open circles).
Ilowcvcr, OII first reading, without altering cornprcllcrrsion of the case studies.
1.3 Terminology
3 . E s t i ~ n a t ethe inct;tl conct:~rLrat~ions
;it lest loca1,ions. Altlrougl~all statislical concepts and ~ ~ o l a t i o l are
r s defined in the Lcxt, and
are suirmrarized in Appcndix B, a few terms used extensively t l ~ r o u g h o u the
t
4. Model the probability distributions of nlctnl co~~cenf.r;ttions ;tt lcst lo- book are now introduced.
cations, and assess the risk of exceeding critical tlrresholrls.
r Attribute. 1'hysic;tl properties arc called "attrilrutcs" a d denoted by
lowcrcme lellcrs, silch as z or s. Continrtu~i~s al.I,ril)utcs SIICII a s I I I ~ ~ I
concentrations are rncasr~rcdon a c o n t i ~ ~ o oqo ~s ~ a ~ ~ t i t . ; ~ t i v ewharws
sci~lc,
6. Model joint spatial n n c c r t a i ~ ~ tofnirl.;tl
y c o ~ ~ c e n t r a t i o illrough
a sct of ~rs categoricd att,ributes take only a limit,ed nunrbcr of stales, usually non-
Prediction and classificat~ionof t,est locations are checked against the trne
. ordered, e.g., rock types or land uses.
Variable. T h e variable % or S, dcnoted by capital letters, is defined as
the set of possible values or states that the attribute 2 or s can take
valnes from the v a l i d a h n set. over the study area or a t a location will, coordinates vector 11. III the
1.2 Plan of The Book . latter case, the variable is denoted %(I]) or ,S(II).
Individual. T h e attrihutc valnc is n~easnreilou ;I pltysical s;~nrple,s u c l ~
a s a piece of rock or a core of soil taken from the field. l o scct,ions 2.1
and 2.2 where n o accon~ttis taktw of d a t a locatiorrs, that physical sarrrpl(!
T h e book starts with an exploratory analysis of tile J o r a d a t a set i l l C l ~ a p t e 2.
r
is referred t,o a s an individual. In subsequent chapters, each physical
T h e nnivariate distrib111.ionof calcgorical and corrl,inuous attribulks and their
sample is associat,ed l o a precise lucntion 11, in thc stndy area.
relationslrips arc first described ignoring data locations. D a t a locations itre
tlrerr considered for niodrling spatial conthuity and cross d e p t ~ ~ d c n chetwce~l e is delil~ndas the sel, of all ~ ~ ~ e a s ~ r l ~ c nol cf l , I . s
r f'opa/niioa. ' n ~ apcq1111alio11
altributt:~. the attrihutc of i~~t.ercstt,lral, could I)<:
111;1dcovcr LIIC sl.udy itrc;l. '1'111'
Chapter 3 introdnces the r a n d o ~ ~function
r concept, wlricl~allows a p r o b finite collection of measurenlenls availa1,le is referred to as a smnyle or
ahilistic presentatior~of the various tools irrtrodoccd i n t,he preview cllaptcr. sample set.
Clrapter 4 addresses the prohlcnr of irrferri~~g stathtics that, arc reprcsonta-
tive of the sl.udy area and not only of the availal)lc sample. 'I'l~coretici~l and Parameter. Parameters arc constant (not random) q ~ ~ a n t i t iof
e sa trio11~1,
practical issues related l o ~ ~ r o d e l i ~ ~ g c x p e r i ~ ~
(cross)
l c n t a srmivariograrrrs
l are for example, the rallge parameter of a seriiivariogra~nmodcl or tlie l u m n
paranrclcr of a lognormal proldrilily ~iistribubionfikncl.io~linodeling i t
.
discussed.
,
1 he sub~cq11e111 two chapters introdi~cetlie proble~rlof i n t e r p o l a t i o ~; ~I I I ~ I~istograrn.
the nrulliple versions of the kriging (i~~terpolation) paradigm. Chapter 5 Stnlislics. Stal.istics are qrrantities s u ~ n ~ n a r i z i nagdistril~uhion,wlricll
presents krigirrg techniqrles t i n t utilize only values of the attribute under may involve sevcral attributes and/or several loci~tionsin space. Uai-
study. Algorithms for i n c o r p o r a t i q secondary infornlation are discr~ssedin vuriaie, bivaviate, and maltivaviute statistics relate, respectively, to one,
Chapter 6. two, and multiple a1,trihutes. 'l'be terminology one-poi71t, two-point, and
Chapter 7 introdnces thi: Gaussiau a n d indicator a1go:orithrtlsfor modeling multiple-point statisl.ics is used when the varial~lesrelate 1.0 1.lre same
local probability distributions of either corrtin~~ous or categorical attributes. attribute a t one, two, and multiple locations. For example, t,he corre-
T h e use of these rrrodcls to assess the uncertainty allout unknown valr~esa ~ d lat,ion coefficient is ;I bivariatc stat,isl.ic, whcrcirs thc scrnivariogra~tris
determi~ratiorrof optirrtu~nestinrat.cs is discussed. a 1 . ~ 0 - p o i nst.atistic.
t T h e cross scrnivariogra~r~ is a bivariak! two-point.
Chapter 8 preserrts algoritl~nisto gcuerate n~nltiplercalizirtio~~s distril~~~t~cd statistic ljecause it i~rvolvr!~ two different at,l.ril)nl.es al, l\vo difbronl. Icl-
in space of either c o n t i ~ ~ or
~ ~cat,egorical
o~~s attribntes. 'I'lrc simulation tt:clr- cations.
niqties presented inclndc s c q ~ ~ e n t i aGaussian
l m d indica1,or algorillrn~s,t.he
Chapter 2
S n ~ n m a r ystatistics
l ~ n p o r t a ~frrt a l ~ ~ r of
e s a distribution are its central value and measures of its
spreiid and syrrunetry.
i:hI1
Concentration (pprn) 'I'lrr cwt.ral valrlc of a dist.ril~t~tion is u s ~ ~ i d tl ayk m its t.lw ;tritlniietic nlean,
<lcfi11m1as
16 Ni data
." n = 1
Yor highly ;tsytt~t~~r%ric d i s t r i l ~ u t i o ~;I~ utorc
s, appropriate ceut,rnl value is the
K rncdiar~,M , wlriclr is the value c o r r c s p o ~ ~ d i rto ~ ga cun~ulativefrequency of
' 004
0.5, i.e., the value t h a t splits the distril~utioninto two halves: lower-valued
<!atkt and l~ighcr-valueddab.
'I'he p-qoantile value of the distribution, noted q,,, is Ll~evalue that a
0 00
0 10 20 30 40 50 60
p r o p o r ( , i o p~ ~of l,lle &ita docs not cxca:d; i.e., ql, is sr1c11that F(q,,) = p. T h e
Concentratlon (ppm) Concentralion (pprn) rr~ediarlis 1.he 0.5 quantile of the distribution, M = q0.5. Other qllallliles of
2Throegl~oottbe hook, these individuals or sites will be referred t u a- "contaminated"
Figure 2.1: liistograms of mctal coscentr.itions or "polluted," although the critical threshold may he exceetletl as a result of rocks that are
naturally rich in that metal, in the nbrencr of any man-made pollution.
A 2. EXPLORATORY DATA ANALI'SIS
"&
.. 0.6 t
a, w
-5 0.4 Reladion ('2.8) er~tailsthat the rneaii and median of the dist~rihntionare equal,
m
m = M. A measure of asylometry is the coefficicrrt of skcwr~css,defined as
0.0. ,..~--,
0 40 80 120 160 0 50 100 150 200
Concentration (pprn) Concentration (pprn)
Table 2.4: Avera, concentrntions of seve~rheavy metals for each 'I'iiblc 2.5: l'crcc~~l,agc of i ~ ~ d i v i d ~ l t11;rl.
als
land use and rock m). excrrd t~olcrnhle111axi111a ( G I , 0 1 , or 1'13)
- - rvitllin mcli 1;rlld use iwd rock type. 'I'hii I Y -
Crl
- - 1'1,
suits for tillagc m d l ' o r l l a ~ ~ d irocks
a ~ ~ rclnte
Lam1 use
to less t,hat~10 individuals (see 'I'ahle 2.1).
Forest 1.40 50.6
Pasture 2.0 61.8
Meadow 1.06 52.0
Tillage 0.96 52.6
Rock lype
Argovian 1.14 41.3
Kinrnmidgian 1.35 56.4
Sequani;tit 1.51 62.9
Portlandian 1.85 48.0
Quaternary 1.16
- 52.3
-
E
-:: .........
: prank : 0.64
. .. wlrrrc mi iuld 111; are the aritlnnetic means of variables Z; and Z,, respec-
tively. 'The covariance beco~ncsthe variance if i = j , see relation (2.7).
,,
111ec o r r r l a l i o ~coefficient,
~ pjj is readily dorlncrd as
0 10 20 30 40 50 0 50 100 150200250
Nickel (ppm) Zinc (pprn)
rvllcrc n, F I I I ~n j are the standard rlevialiorls of %; and Zj, respectively.
'l'llc IIII~L-bcr: correlation cor.llicic111is easier 1.0 i~~lr:rprel t l ~ a t.lle
~ i covariance,
w l ~ i c lrleprnds
~ 011 tl~r:rr~easnrcn~cnt scales of LIE lwo variables.
.>
1l1e t o l l o a i ~ ~a gr r g~~irlaliur!~ for llsing t l ~ cli~w;trcorrelatiotr c o e f i c i e ~ ~1.0t
~ [ u a ~ ~ t ~i l~' y~ P I I ~ C ~I ~I ~ C~ IP, w w I lwo
I \~:tri:d~lc:s:
I . 'The quant.ity pij provides ;t nmrsure only of lirrear r e l a l i o ~IAween
~ two
variablcs. It ~ ; I I I bc n~islcadinyiC i~rberprcterlotl~crwisc(see discussion
in section 9.2.2). I w o variables may be highly d e p e n d r ~ ~and l yet have
zero linear correlation: a classic exarnplc is %; = [[;,I2, where Z, has a
syn~rnctricdislril~nl,ion.
0 10 20 30 40 50 0 50 100150200250
Nickel (pprn) Zinc (pprn) 2. Like Ll~cvariance, t l ~ ecorrelation coe&ient is st,rongIy afFected by ex-
t,ren~evalws. A nlorr: robust rncasnre is {,lierank correlalion coetlicient
wlliclr crr~rsid<.rs
Lllc ranks o l tlre data, v(rj(n)) aud v(r,(n)), rather
t11il11llw origi11;11va111r.s:
pi,11 = -1 ] [ I ( ( ) ) - l , ] . [ ( ( ) ) - llll(,]
(2.13)
.Prank ' 035 ' . Prank n ~ r i- o n j
~
.. . 11
0.0 1.'
,
0.0 0.2 0.4 0.6 0.8
Linear correlation
Figure 2.4: Srattcrgrarv of the rank rorrelnlion cocffirienLvcrsus the lincar corrp-
l ~ l i o t tcoefliricltt far f.lm 5 1 pairs of lboi~vy~ r w t a l h .
Co data Cr data Cd data
type. Starlillg from eacll of tlrcse 38 nodes, the surveyors chose a first location cluster. Such clost.criug could reflect potential sourccs, surll irs originating
100 In away. Front tlral, locatiolr t h y chose asccond location 40 1x1 away; from from llre rock or mart-rnarle pollubion. In general, observatiol~sthat arc close
that second locatiotr, a third location was chosen 16 m away; and finally from to each other on the ground arc also alikc in metal concentrations.
t h a t third location, a i'o11rtI1 location was chosen G rn ;way. 'fhe dist~anr:es
were fixed but tlre directions were ra~ldonr. TIE ol,jective of this nest,cd
sampling wrrs to cover ;i wide range of spatial scales ht:twtwr 0 aud 250 ni.
2.3.2 The h-scattergram
111sectiou 2.1.2, tnany sites were found to exceed the tolorable maxinmrn Sp;tl.i;d rc1;~lionsbetween d;tt;~r;ru be displayed nsing an 11-scatt.crgralrr. .lust
I I , I , n l I . 'I'IIc indicator n ~ a p sin I'ligure 2.7, wl~irltsliow I.he as l.ltc sc;tttcrgranl is a plot, of :ill pairs of v;tlnes relnbcd t.o two different
contanri~iatedloc;tt,ions ill l~lxck,conrplclc t l ~ cdescript.ion. ' f h dcnsily of atlrihotes rrreasnrcd a t the sirme loc;lt,ion, t l ~ el ~ - s c a l . l m g r ais~ a~ ~plot of all
black d o h reflect,^ the greater extent of contarninahio~thy c a d n ~ i u and
~ ~ r lcarl. pairs of nmtsurenrcrrts ( ~ ( I I , ) ,z ( I I , + ~ ) ) on LIE sa111eaLIri1)ute :a t locations
,,
l h e pi~t.ternof spatial distribution of conta~ninatcdsites is also informative. separated by a give11 distance h ill a particular direcliou 0. 'fhe vector nola-
'The lrlack dots in I'igurc 2.7 are not rnndon~lydistribntcd L h c y t,e~rd1.0 Lion 11 ;tcconnts for both dist,;mce and direction Hy convention, 1.lre valuc a t
26 CHAPTER 2. EXPLORATORY DATA ANA1,YSIS
the start of the vector 11, r(u,), is called the tail value, whereas the valne at.
the end, z(u, + h), is the head value.
Data pairs are typically grouped into clases of distances (lags) and a~rgles,
[h i A h ] and [B & AO], so that each h-scattergram is built on a sufficient
number of pairs. For example, consider the relation between Cd values in the
east-west (E-W) direction. Figure 2.8 shows the h-scattergrams of C d values
in the easterly3 direction ( A 0 = 22.5O) for six different classes of distances
( A h = 100 m). T h e abscissa corresponds to tail values and tlie ordillate
to head values. Because the two axes relate to the same variable, a perfect
correlation would entail that all points lie on the first bisector (46' dashed
line). Tlre spread of the cloud of points around this 45' line reflects tlre
variability between data values. T h e increasing irrflat,iol~of tlle cloud wit,h
increasing separation disl.ance h reflects the increasing dissimilarity between
measurements farther apart.
Tail value (ppm) Tail value (ppm)
T h e h-scattergrarn also draws attention to possible outliers or n~isrecorded
values. Isolated pairs on the 11-scattergram should he identified, located 0 1 1
the map, and then analyacd more carefully. For small separatiou d i s t a ~ ~ c e s ,
outlier pairs involving t l ~ esaltle location, say, u,,, i l ~ ~ l i c a that
t c r l a t t ~ nz(11,)
~ h = 388 rn 1
' h=616rn
is very unlike its i~eighbors.For example, ill Figure 2.8, a few pairs fall far
from tire 45' line 01, tile 11-scattergram for lhl= 214 rn. Three of these pairs
involve the same location wit11 Cd concentration of 4.50 P ~ I I I ;tile s~?cond
measurements of t i m e pairs are 0.40, 0.77, and 1.43 ppm. If tlieri: is a sound
physical reason for collsidcring the extreme value (4.50 ppm) as erroneous,
then it should be removed from tlie data sct. Such a decision slrould not,
however, be based on a single lag h , since the same datum may not yield
outlier pairs for other lags h or in otlrcr directions.
Distinct clouds of points on a h-scattergram 111ayindicate the presence
Tail value (ppm) Tail value (ppm)
of separate populations wit11 d i f f e r o ~spatial
~t conlinuil.y. 'I'llc data coi~ldlrc
split into more hornogeneo~~s suhscts if suflicient d a t a are avai1al)le w i t l ~ i ~ l
each subset.
'7 h = 787 rn 1
' h=1024m
2.3.3 Measures of spatial continuity and variability
Each h-scattergrani displays the relation between pairs of z-valncs for a given
class of distance and dirrction. The similarity or dissimilar it,^ between dat,;t
separated by a vcctor 11 call he quantified by several measures (Dentscll and
Journcl, 1Y'32a, p. 40).
Covariance function
Tail value (ppm) Tail value (ppm)
Tire covariance and correlation coellicient inbroduced in section 2.2.2 can be
extended t o ~neasrrresimilarity between non-colocatetl d a t a . 'I'l~e covariance
between d a t a valurs separated by a vecl.or 11 is computed ,zs follows: The scrlriv;~riogra~ri vrluc a t a givm lag 11, so~iietirriescalled semivari-
nnrc, c;tli 11c i~itrvprekrlas the moment of incrliii of t.l~r11-srat,t.ergraniabout
its lirsf. I~isecl.or(k'iplrr 2.9). Indeed, the orll~ogolialdisLancc of any point.
(z(u,), z(u, + h)) to the first biscctor is d, = Iz(u,,) - i(n, -1- 11)1. COB 45'.
t 45' line is 1,lie ;werage of all such squared
'The rirolr~rntof iiiert.ia a h ~ tlte
distances:
wl~crcc -h11 w d c2 are the variarices of the tail and Imd valrir:~(lag vari-
ill
) . ordcrcd sct of correlat,ioii c.oe%cie~iLsp(111), p(lr2), . . . is ct~llcd
a ~ ~ c e s 'Tile
tlic r:xpcri~ncnt~;~laulocorrr:lation fnircl.ioil or r o r r ~ I o g r a ~ ~ i .
Uiilikr tlic cov;tri~~lc,e and corrrlat.ion frurctions, wlrich arc mcasures of sinli-
larity, t l ~ cexperilirental seniivariogra~ny(11) nrcasorcs the average dissir~iilar-
ity betxvee~idata separated by a vect,or 11. It is couiputcd a s half the average
s q ~ l a r c ddiffaencc hctwccti the components of every d a t a pair:
Figure 2.9: Int.erprctation ol the semivariogram value 7(h) as the moment of inertia
of the 11-scuttcrgrarn urouad the first bisector. y(h) is the average of all squared
orthogonal distances d , to thst bisector.
30 CNAI'TEN 2 ISXPLORATOtiY UA'L'A ANALYSIS
1 ~(11)
%(I,) = ;----
iN(11)
IZ(II,,) - z(u, + 11)j" rvil.ll w t [O, 21
o=1
(2.18)
For w = 2, one retrieves the traditional scn~iv;rriogranry(11). 'I'lre
srrtallerw, the lesser tlre i~lfluenceof exlrerrte valucs on tlre meitsure
Figure 2.10: kkpcrinreatal covariance functiuns (kit c o l ~ m a )correlograms
, (cei~lcr y,(li). Two co~nniorrlyused rneaslrres arc the nrirdogrn~r~ (w = I )
column), and semivnriogmms (right colnrrie) of cadmiam i n thi: atsterly dircctioa. and rodogra~n(w = 112) (Ifcntscl~and .lor~rncl,1!1!12a, 1). 41).
,,
I lie ll~reetop graplts are coml,uted irmu the original values, whereas tho o t l m
7 7
I hcse n~easurcsarc itol suljstitut?s for t . 1 1 ~ t,r;rdiliotr;ll s i : n ~ i v ; ~ r i ~
graplts relate to tlre indicator data disphyed in Figure 2.1 1, page :XI. 'I'lw indicator ograln; riil.l~er,they slro~tldbe i~sed1.0 inC<,rf m l . ~ ~ r csuch
s , ;is ritngc,
semivariogram vnlars are standardized try the varianct: oi the indicator data. itnd anisotropy (scc rclatcd discr~ssionin soct,ion 4 2 . 4 ) .
34 CIlAP?'Eft 2. EXP1,ORATOJW DATA ANALYSIS
23 UNIVA RIATfl' SPATIAL L1E:SCRIPTION 35
Graphical interpretation
where F - l l ( ~ k )a d F+ll(zk) arc tlie proportions of tail and Iread values not
exceeding l l ~ chlrresl~oldvalue zn
l l r c intlica1,or covari;rncr (,'l(li; zt) ;rppc;trs ; ~ Ll~c
s " c u ~ l . c r i ~ ~ofg "bl~cl,wr)~
poilrt c ~ ~ ~ m t l a tfreq~rency
ivc l~'(11;zn). Tl~;tlfrrqtrency tlreitsurcs h v a f h ~
two values of tlrc sarrrc atlrihulc z separated l,y a vcclor 11 are jointly no
g r e e l h t t w ~the tl~rcsholdval~rt:"z t . For a s ~ n a l tl~reslrold
l vd111: zk, I;'(h; zr)
measures the co~~~~ct:tivit,y l)trt.weeu s~llallvalrrtis sclmral.cd by ;L v d o r 11: the
larger F(h;zk), the better c o ~ ~ n e c t eill d space are t,l~csnrall z-valr~es. '1,Irc
spatial connectivity of large values ( e g . , v;~lursgreater tharr a largc tlrresl~old
a,)call be measured by I " ( l ~ ; z p ) , where the iridic;tl.or t r a ~ ~ s f o ris~ nIIOW
j(u,; zr,) = 1 - i(u,; znr). Note that C>(h; zn) = Cr(11; zr), V z r .
Indicator correlogra~n
'Die indicator corrclogran~is t,lte standardized for111of the prcvions iodic;btor
covariance function:
I n d i c a t o r senrivariograrn
Figure 2.12: Graphical iaterprotatio~iof the indicator spatial statistics. 'l'lte nor>-
centered indicator covarimrco F(h;I*) and the indicator variogranr value 2yr(l1; z k )
are the proportions of points idling in the horizonlal inid vertical hi~tchedareas of
tlie h-sci~ttcrgranr,respectively.
Fd data
0.0) , ,
0.0 0.4 0.8 1.2 1.6
Distance (km)
Cu data Pb data
I I
0J 0)
0.0 0.4 0.8 1.2 1.8 0.0 0.4 0.8 1.2 1.8
Distance (km) Distance (km)
available for each lag. Cobalt conceritration, lrowever, appear to vary niore Cd data
continuously (smaller semivariograrn values) in the 67.5' direction (SW-NE). I
S e n s i t i v i t y to o x t r e m c va111cs
Figure 2.14 (solid line) slrorvs tlie experimental o r ~ ~ ~ ~ i d i r c c t isenrivari-
onal
ogralns of rnethl conce~rtrations. For the four positively skewed variables
(Cd, Cu, Pb, Zn), the seniivariogran~is also conrput,ed on llie logaritlrr~r 0.0
transform and plottc:d o n the w m c grapli after rescalii~gto t,lw z-v;tri;rnct: 0.0 04 0.8 1.2 1.6
(dashed line). 'f'lre se~nivariogramsof t,lii. lognritluns ;ippc;ir slighLly lcss cr- Distance (km)
ratic wil,h a s~iiirllernugget clft,ct. 'I'lic dif~ercncrsarc, i m w v r r , s~rr;rlland do
not justify tire l o g - t r a ~ ~ s f o r ~ n a l i o r ~ Cu data P b data
Tillage
Meadow
Pasture
1
--
] Forest
Sequanian Quaternary
Geology
Quaternary
Portlandian
1"igure 2.16: Indicator maps showing, is black, tllc locntions 1xlo1lgin.gto a par-
ticular ruck tgpc.
U Argovian
'J'lte indicat,or vxriogmm value 2yl(11; sk) mcasures liow o f k n two locations a
vcclor h apart bclong to difirrul, cntcgories sn.t # sn..'Tlrc smaller Zyr(h;sn.),
tlir hcttcr the spatial co~~rlectivily of category sn.. T l ~ eralrges and shapes
of tltc directional indicat,or semivariogrants reflect t.lre g e o ~ ~ ~ e tpatterns
ric of
category sk.
I2igure 2.17 sl~owstllc i~rdicat,orsi:~~~ivariogr;r~lrsof i.he I I I O S ~c o ~ ~ r r r 1a11d
ro~~ Forest
~ ~ s and
c s rock types c o ~ n p ~ ~ in
l k fdh r rlircctioris with all ir11g111i~r toler;~nceof
22.5', with the following resolts:
C r o s s c o v a r i a n c e firnction
0 10 20 30 40 50
Tall N I value (pprn)
with
-11 tll
ordered set of cross covariances Cjj(lrl), (>;(lr2), . . . i s callrd tllc c x p c r i ~ i i c ~ ~ t a l Figure 2.20: Cross 11-scattergrains of Ni a d Cd conr:cntr;~tionsin thc: westcrly
cross covariauce funct,ioo. 1x1 g ~ n c r a l (,' , ,,(11) # C;,(-It), ~ L I I , I I ~ I I(,';;(h)
~II = (left column) and easterly (right column) directions (angle t,olcrmre = 22.5') lor
four classits o i distances.
( 7 . . l - l > l 2 c Ak",9.x,~,lu,31,c" , , , I *>,,ll"
'I'lrr cross c o r r e l o g r ; ~pij(11)
~ ~ ~ is givrn hy
Nickel-Cadmium Nickel-Cadmium
.I 0.94
-2.
00 0.4 0.8 1.2 1.6 00 0.4 0.8 1.2 1.6
Distance (km) Distance (km)
6-Nickel-Cadmium Nickel-Cadmium
- . 0 I I O . O
00 0.4 0.8 1.2 1.6 00 04 0.8 1.2 1.6
Distance (km) Distance (km)
I n c h a t o r cross corrologra~n
!
-11
I n d i c a t o r cross sclnivariogranl
T l ~ ouly
c data pairs t h a t have non-aero contributions to the indicator cross
sclnivariogralr~arc those wl~crethe valrles of both attrihutes zi and z, are on
opposite sides of their ihresl~oldvalues ( q k , zjk,). 'l'l~rcontribut,ion of a data
pair t.n y f j ( l ~zjk,
; zjk,) can be positivr: ( + I ) or negative ( - I ) , rlepending on
I e t l ~ Ir : ;rt~dz ; - v ; ~ l ~ ~joint,ly
es decrease (i~lcrmse)frmn 11, to 11, 11, +
'The indicator cross cov;rriirnre is co~npot.edas or vary ill opposite ways. 'l'licrcfore, t.lie indicator cross scmivariogram value
cannot. he interpreted as a joint f r e i p e ~ ~ cofy transition, that is, it does not
Ilieasure horn o f t c ~values
~ of hot11 athribotes z; and z; are 011 opposite sides
of I.11reshold values.
Figure 2.25: 1Sxperimeutal onrnidircctiotinl indicnlor cross corre1ogr;~rnshetweetr 6. ' l h c slrort.-rimge strrtctrsre relates to the spatial dist.ribntion of rock
the two cx1t;uistively sampled rnelnls (Ni, ZN)and Cd: second- ( ) , lift11 ( .), types ;and laltd uses irr the sludy area. '1'111: long-range structure re-
;rnd viglt01-clccilc(- - -) tl~rcsholds. flecl.s the inlluencr of Argoviart altd I<ir~~lttr:ridgiatr
rock types on nretal
ctrrtccrttratiolls.
9. 'I'h t ~ ~ c l k with
~ l s widespread c o ~ i l ~ ~ ~ t ~ i (Cd, i o t tatrd 1'1)) show a
~ ~ a tCu,
sIrorl,-range cross dcpc~tdcncewith the bet,tcr sampled Zn. There is a
I O I I ~ - ~ ;cross i)el.wetw Cd atld N i concnnlrations.
L I ~ dt:pcttrlc~~~c
~C
,.
Data description is rarely, if ever, the ultimate goal of a stabistical s l t ~ d y .
lypically, o m w m l s to go beyond the d ; ~ t atri clraraetcriac t . l ~populat,iorr
from rvlricll the sample 118s fxxn drawn. One asscnlinl step i l l llris process
is t.he q u k i t a l i v e modelillg of the spatial slatislics of thal, population irorn
the data avitilahle over t,he study area. A11 s~~bseqtienl applical.ions, such as
prcdict,ion or risk a ~ ~ a l y s irely
s , oil the model (represeutatio~~) choserr.
*,
1Ire del,ernri~tislic;md prolral~ilist.icappro;~clreslo r~ro(lr:lir~g
arc col~iparcrl
~l . Scct,iuu 3.2 int.rod~rccst h rmdw!r ft~n,:l,iot~
i l l s c c t i u ~X I I I O f
~ r~o ~ wlrirl~
r~
~nosl.gm"St;~l.isl.icalitlgoril.h~i~s ;rrc llt~ilt,.
uil or nrgligiblc. Such in~plicildisrcgnrrt for llrr polential error is just.ifi:~blc Cd concenlratlon (ppm)
if the estimate z*(n) is b a d on eitlrcr many d a t a or some k n o w l d g e of the
Figure 3.1: 'l'lic problem of rnodeliag the spatial clistrihution of a conti~iuousat-
physics governing the spatial distribuf.ion of the attribute z. Unfortor~ately;in
tribute (Cd concentration) along a NE-SW trsnsect. TIE detcrn~inisticmodel as-
c:~rtltsriurcw such li~mr~lctlgr is l i ~ ~ ~ iand k d ,tlrc i~suallysparsc i~rforniation sociates to the nasampled location u n single estimated value, say, a. concentration
;~vailel,lcclors not allow o~rr:lo iglrorc Ll~ccrror ;lssociatcd lo ally t'stinralr of 0.7 ppm, whereas thc probabilistic approach provides a probability distribution
z * ( u ) , ~ r onmtter how the variable is estimated. for the possihlc valuss at tlmt location
Chnsidcr, for example, the problem or rnodeling the spatial disdrilrr~tio~r
of C d concentrations aloug the NE-SW transfcl slmwn in I'ignrc 3.1 (lop
gr;iplr). A deter~~rinistic represcotation would attribute to Llre onsan~plcd
the r ~ ~ ~ s a n r pv;rlr~c
l e d ~ ( 1 1 )and, n ~ o r egenerally, of (.Ire distribution of z willrin
, e d say, ~'(11)-0.7 p p n . Suc11 a n c s l . i i ~ ~ ;is
location u a single r s t i ~ ~ ~ ; t tvalue, ~tc
llre are;$. For example, Figure 3.1 (hotlonr graph) slrows the distribution of
likely t o be in crror 0ec;t11sefew data are available and our knowledge of (,Ire
probabilily for C d concentrations a t locatio~r11. T h e unk~~owrr concentrat,ion
physical processes that control the spatial dist,ributiou of Cd values ovcr the
has a 0.2 probability of exceeding llrr critical threshold 0.8 ppm. T h e loca-
area is impcrfcct. In decision making, it is critical to assess the potential error
Lieu 11 would be classified as safe on the basis of the sir& estimated value
~ ' ( I I )- ~ ( 1 1 )or, better, l o know lhe range of values t h a t the onk~rownz ( u )
z'(u) =0.7 p p m , ycl. the rr~odelof nrrcertainty indicates that there is a sig-
may take ;tnd the prolxrhility of occurrence of ench outcome. In the e x a n ~ p l e
nificant, risk for the Cd co~~centr;ition to exceed the tolcrahle m a x i ~ r ~ u an ti
co~rsid~:red, it is critical l o cvaluate the probahilily that t.l~cCd c o ~ ~ c c ~ ~ t r a t , i o n
a t loc;rtion 11 excer:ds the toicrahlr I E I ~ ~ ~ I I I 0.8 I I I ~ppru.
I
u. 'L'hc dc~isionof declaring tire l o c a l i o ~u~safe or contatrrinated would the11
d e p ~ m don wlretlrcr one chooscs to ignore that risk.
Whereas dotern~inisticinodels usually rely on the pl~ysicsof the phe-
Probabilistic: m o d e l
nomenon, most of the infor~rrai.ionused in a probabilistic model comes from
Irrstcad of a single cst,imated value for the unknow~rz(u), the probabilistic the d a t a . T h e spatial distribution of Cd values may he nlodeled witlrout any
approach provides a set of possible values with the corresponding probahil- knowledge of the r~nderlyir~g physical processes; one would then capitalize on
itics of occurrence Such represcntalion reflects our imperfect kr~owledgcof the spatial dependence between any two Cd values as inferred through the
3.2. T H E R A N D O M FUNCTION MODEI, 63
s5
s4 i lypes
s,: Kimmerldgian
$3: Sequanian
sa: Portlandian
Good discussions about the pl~ilosoplryand practice of rrrodeling spa1.ial da1.a
can be found in Journel (1986a, 1994a), lsaaks and Srivastava (1989, p. 196.-
236), and Matllerorl (1989)
11,
, >
1 Ilc qua111.it.yp(n; s r ) represents t,lre probahility for the category st to Figure 3 . 3 sliorvs cdf and pdf represerrtal.ions for both categorical (t.op
prwail at. loc'<t1'I O I I 11: graphs) and colitir~uousItVs ( b o t l o n ~grapl~s)a t location 11. T h e cdf rep-
resentation is typically used for co~itir~uous variables, whereas tbc pdf rep-
resentation is rnore appropriate for categorical attributes wit,]] nowordered
states.
'flic qrrantity i.'(ll;sn) is the probability for ally one of the categories sr.,
ordcretl lesser or i:qu;tl to s k t,o a t u, Accounting for c o r ~ d i t i o ~(3.2)
~s
o t ~ d(3.3), ally cu111111;rt~ivc
probabilily Ic'(11; s r ) t~irtst.lie witl~in[O, I], and the
cwnulative probability distriliution of t.ype ( 3 . 4 ) must he a non-decrcasi~~g
frrrlcthr~of t,hc tlm3sltok! s:,:
Categorical RV S(u)
C o n t i n r ~ o l r~asn d o n r v a r i a b l e
Continuous RV Z(u)
' f l ~ ederivative of the cdf, wl~etlit exists, is the probability density function
( p d f ) f(u; z) = F ' ( I ~2).
; Ry analogy wit11 the case of discrete random vari-
ables, t.he following order relations are satisfied:
Any cun~ulativcprobability F ( u ; z) u~rlstlie within [0, 11, and the cdf must
111: a nolt-dccrr;wi~igfi~~rctiotr
of the t h r ~ s l ~ o lzd.
Similarly, the probal~ility(0.7) that the variable Z ( u ) is no groatcr t.li:r~~ No uncertainty
any given threshold z can be expressed as
T w o extreme p r o b a b i l i t y d i s t r i b u t i o n s
Maximum uncertainty
Each d a l n n ~~ ( I I , . )ar ~ ( I I , ,is) vicwed as a particular rc;iliz;~tio~~
of t l ~ crii11d0111
variable S(11,) or Z(11,). Provided nieasurcnterrts are precise, t,l~ercis no
uncertainty about the sample valne ~(11,) or z(11,). 'f'lins, the probability for
the outcornc v;ilue a& = ~ ( I I , , )to prcv;~ilat (I,, is 011~;the prnl~;tl~ilil,y is zero
for 1111: r e ~ ~ ~ i ~ i(lin i u-g I) ~:al,cg(~rics:
I!y s ~ r h s t i t r ~ t i nthe
g first expression itrto rclatiot~ (:1.25), tlrc indcpendcl~ce
condition hecorncs
?'lrus, tlrr set of all i n d i c a l h cross covaria~rcesLj(11, u'; L , 2') for a11 lltresh-
ulds z , z', provides a irreasurc of deperrdcnce Retwce~rthe t.wo HVs Z(u) arrd
Z(u'). 111 c o ~ ~ t r x sthe
t , Z-covariancr C(II, 11') is R measlrr~only of linear cor-
relolion betwce~ithe two RVs, k . , a rne;tsnre of the ability of a straiglrl line to
clescrilre the rclatiou between these two variables. Tire %-covariance G ( t ~ , u ' )
arid all indicator cross covariances Cj(u,11'; z , 2') vtu~ishwhen the two RVs
,~
I l~t:one-poiel and luio-point ter~ni~rology
rcrrrids us l.lmt the two ra~idonr are indcpcrrde~rt,. Ifowcvcr, the couditio~rC ( n , n l ) = O docs not necessarily
variables relate l o tlic sanre atlrihute ia t iwo dilferent points or locations imply relation (3.26): two li~rrarlyr~ncorrclatedltVs may st.ill he dependent,
r d l ~ c rtlratl to two differel~tat,lrihotrs. i l l wl~icllcase t,Iie depcndcncc relalion is non-linear.
70 CIlAI'Y'ER 3. THE RANDOM FUNCTION MODEL 3.2. ?'HE HANDOM r'UNG??ON MODEL 71
The d e c i s i o n of s t a t i o n n r i t y An I t F model is said to be stationary of order two when (1) the expected
value E {Z(11)} exists and is invariant within A, and (2) the two-point covari-
T h e one- aud two-poinl cdfs of the H.F and their niomeots as rfelirred by ance G ( h ) exists and depends only 011 the separation v e c t h 11. T l ~ covariance
c
relations (3.19)-(3.24) are location-dependent. Their inference thus requires f u o c t i o ~ correlogran~,
~, and semivariograr~~
of a slatimary 1W are relat.txl by
repetitive realizations a t each location u. For example, the inference of the Z-
covariance C(11,11') between 1.l1etwo RVs Z(11) and %(II') separated by a vec-
tor 11 = 11' - u calls for a set of repetitive ~neas~lre~rrerlts {z(')(u), z(')(11'); 1 =
1 , . . . , L ] , which is never2 available in practice. T h e idea is 1.0 use all pairs of
measurements a vector 11 apart w i l l ~ the i ~ ~s l t ~ d yarea A, {z(llo), Z(II@ 11); +
a = I , . . . , n], as a set of repetitions. T h e iniplicit a s s t ~ ~ n p l i oisn that. the cor- As the separalion distance 1111 increases, tlre correlation between any lrvo ILVs
responding pairs of RVs {Z(u,), Z(u, + h ) ; u = 1 , . . . , 7 1 1 originn1.e fro1111 . l ~ +
Z ( n ) and Z ( u 11) generally tends to zero:"
same two-point disl.ribul,io~~. S i ~ c pooling
cations calls for t,lte phe~io~r~erron
l~ of data pairs regardless of l l ~ c i rlo-
under s t , ~ ~ dtoy be spatially "l~onrogeneo~~s"
wit,llin A. In probabilistic terms, the R.F rr~orlelZ ( o ) m r ~ s the clroserr to i ~ c
C(11) - O for 1111 - a2
Recall t h a t the vector notation 11 accoonls for 110th distance 111/and dirt:ctio~~.
' n ~ efnnctions C ( h ) and 2r(ll) are said to 11c irnasolropic if (.hey d e p c ~ ~ou
both distance aud direction 'l'hcy arc, said to 11i: isolrupiv if they <lcpi.~~rl
only on the r~rodulr~s of Ir.
d . T h e avail;rl~ilil,yof er~ouglrd a l a to iufw t.111: p a r m ~ ~ c l e r( sn ~ c n n ,
covariance f1111c1.iorr) of ear11 scparatc 1il2
,\
l l l c cross semivdriograrn yij (11) incorporates only tlre even t e r ~ of r ~ the cross
covariance fr~ncliotr,see rel;rl.io~r(3.38), lle~rccit is ~ y ~ n n ~ e till~ r(11,
i c -11).
In ~w;i':l,ice,asylilnietry of t l ~ cross
r covariance funct.iot~(recall sccl.iorr 2.4.2)
is ii~osl,o f l m igl~oredfor t,hcse reasolrs:
2yij (11) = Cov {[Zi(ll) - Zi(11 ll)], [Z,(u) - 2, (u ll)]) Accolmting for relatio~r(3.38), thr sill value o f a horr~~tlerl
cross setnivariograr
+
= I< {[Z;(11) - Zi(ll El)] , [Zj(ll) - Z j ( l l + ll)]} v i , ;
(3.37) ; j ) -
t,rmds toward I.lw cov;rrin~ii.i.value Cij(0):
( 0 ;IS lhJ - a
1>he t m n s j o i n t or cross apply when i
7
# j, rvhereas the terms direct or a u t o T h~-
e cross c,qVm5&~prat~
apply wlrcri i = j .
.-:P A unit-free measure of linear correlation bctwt:en ally two RVs X i ( ~ )and
Lag effect +
Z j ( u 11) is the cross correlogra~r~,
drfi~redas
defines:
+ T h e mean-value vector:
Chapter 4
T h e covariance function matrix:
4.1.1 Preferential sampling ,( co~rccvtrationsin soils bordering roads trray be larger than in opeu fields he-
/ cause of road trafficpollution. Similarly, oversartrpliug of accessible farmlands
I'hc sarnplirrg is said to he pvefereatial wllcrrever data locatiorrs are neithsr relative t o forest soils, which Iiave low levels of contaminatiorr, would lead l o
regularly ]lor r a n d o ~ d ydistrihutcd over tlre study area. Several fact.ors may ovcrestirr~atio~r of the average concentrations of rrrost metals.
cause specific subareas to be prefcrentially saltrplcd: l'referential sar~lplingof specific classes of values can be del.ected by corn-
I. (:(,nditio~rs of accessibilit.y; fields Imrdcring roads or farnrlatrds arc, casier pariltg tlrt: qrl;rrrtiles of the distribrrtiolis of clt~stcrcdversus uon-clustered
t,o sarj~plrt , l ~ a r~ ~r ~ g gkrraiu
d or d m s r for?st,s. (griddctl) d a t a , Recall that. the pqoarrtile value of a rlist~ribul.ion,q p , is 111~
v a l w l~clmvwlricl~a proportion p of the da1.a falls; tlral. is, qy is such t h a t
( q , ) = . 1,et qb")
and d c ~ r o ktlrc p-qua~rlilrsof (.he distributions of
grid(lcd and clnstertd (lath, rcspcctively. Sirtrilar distrihutions should have
sit~rilarqt~:r~~t,iIm; I,l~tist,Iw gr;tph of 'q: vmstls q/:), c ; ~ l l ~a dQ Q plot, s1to11Id
:I. S;rrrlpli!q2 str;rhcgy; cl~rsl.eredloc;~t.iot~s
rll;ry have heerr salripkcd to char- appear as tlrc st,r;tiglrt line &) = qF) V 1,.
at:t.<,rizcsl~orl.-rirngc:vi~riithility. 't'l~e J u r a sample set was split, i11t.o 11, = 152 clustered data (38 clusters
of 4 locat.ions caclr) and n9 = 107 gridded data (71, = 207 for nickel and zinc
I'referential sanipli~rgsliould always he clearly documented by surveyors, co~rcr~~t.rations). 1-'igorc 4.2 (Icft, graphs) s l ~ o \ r stlrc Q-Q ylols for llie three
since i t may skew the results of any cxplor;rtory dat,a analysis. For Ll~cJ u r a nkctals with widespread aml.artrination ( ( A ,( l u , and PB) and for chromium.
d a t a set., 38 clusters' of 5 locatio~lseach were sarrlpled to characterize tllc 1\11 111r1;tlssltorv discrcpancics of the r~ppcrtails of the two distributions: the
small-scale variabilit,y (scc section 2.3.1). Fnrtlrermore, t,lre sarr~plitrgdesign large quantilc values of t.lre cl~rstcrcddat.a are larger t.li;ru the correspondirrg
was such t.l~attlrc clusters are random stratifictl, whiclr explains their some- quantilc valr~esof t h r grirlded d;rt.a (dot,s are ahove the 45' line). Such Q-Q
what ollrvt:lr ilistkiljutio~rovcr the area (Figure 4.1). plots irrdicatc thztt t.hc larger metal corrcerrtratio~ls are nicasured prefererr-
tSvcn if l~iglr-or low-valued arcas were not purposely targeted, any !>ref- t,i;rlly ;it c l ~ ~ s l r r c1ocat.iorrs.
d 'l'ltis dilfcrcnce is caused hy prcf<:rcr!lial Iocalior!
crerrl.i;~lsa~rrpli~rg is likcly t,o impact salr~plr,st;rl,istics. For exa~rryle,lcad of t.lic clt~st,orsi n far~r~larrd or gr;tsslanil with iriglier metal colrceutrations (see
conditiorral ~rteairsill Table 2.4, page 18). lotlcrd, only 8.39)of the clustered
d&a lravr beer1 collect,cd trndcr forest, whereas that proportion is 18.4% for
gri<lrlr(ld a l . ; ~(glol,;rl proport.io~lof foresL is 12.8%). 'rlris rliffrrc~rce,I~owcver,
is tlol pronoor~crdC I I O I I ~ ~toI sig~lilicanlly:tllkcl st.;rl.ist.ics stlch as meall cow
ec~rtr;tt.ionor porcentrrgc of colrl.;utlinatcd locat.ions ('l'able 41). Note t , l ~ a t
the standard deviation of the clustered d a t a is also larger than that of the
gricldd dt1t.a.
' A I ~ l m u ~the
b a c r d ~ a ~ r ~ p l iwas
r r g nl:strrl, ion-gridded locatinrrs are lrcrenftet. cons id^
r ~ c as
d clustered for illostratiun where t.lrr indicat,or d a t u m i(11,; z) is defi~rcdas in equat,ion (2.19).
78 CHAPTER 4. INFERENCE AND MOIJISIJNC; 4.1. S?'A'I'ISTICAI, INFEI<.ENCE
.. .. ..
.... .
m I'
u
u Men71
Clustcrcd d a t a
Gridded da1.n
0 2 4 6 0 2 4 6 Sld de1iintio71
Gridded data (ppm) Gridded data (ppm) Clustered d a t a
Griddt!d d a t a
% co7tln~r~.
Clnstercd d a t a
Gridrlrd d a t a
0,
0 0 .'
0 20 40 60 0 20 40 60
Gridded data (ppm) Grldded data (ppm)
sm;tll polygoirs of influence rcccive less weight t l ~ a risr)lal.cd
~ locations will^
large p o l y g o ~ ~ofs inlluencr.
, .
I l w cell-decl~isterir~g approach calls for dividing the study area A into rect-
angular cdls, and co~mtingLire rrunrhcr 11 of cells that, c o n t a i ~ ~a st least one
daLu111and the I I I I I I I ~ 7~ cell 6. Each datum loca-
) ~ ~of data falling wit.l~ine a d ~
ti01111, then receives a weight A, = l / ( B .na), wliich gives inore importance
l o isolated locat,ions:
wllcrc liia and bi,(z) arc tllc cqu;~l-weight,cdii~e;tr~ (4.1) and eqii;&weigbted
c u ~ ~ r u l a t i vdistrihrrtiolr
c (4.2) of z-values within cell b. Figure 4.3 (bottom
graph) slio\vs t . 1 ~swell d a t a locations and fonr decli~sleririgcells (B=4).
E a c l ~of (,lie four clust.rred dat,a valires receives a weight. A, = 1/16 since they
share t l ~ csame cell (na = 4). 'Tl~ethree other cells contain o d y orie dal,u~rr
(??a= I ) , hence t11at isolatsd dalum receives a weight of 1/4. 'l'l~c total weight,
s ~ m tso I ;is it, sl~orlld.
'Two kcy p;iralrlcters of llie ccll-dcclusterirrg tcclmiq~icare the cell size
and t,lre location (origin, o r i e ~ ~ t a t i o nof) the grid. Clnsters are frequently
added on an or~derlyirrgpseudo-regular g r i d A natural cell size would tlie~r
he the spaciiig of (,lie grid: a, where ng is the i ~ u ~ r hof
e rgridded d a t a .
l'roviileil d a t a arc rcgolarly spaced, the celltcr of the cells should correspond
t o grid nodes.
Wlrcn the s a ~ ~ ~ p l pat.tern
ing clors not suggest a natural ccll size, several
rrll s i m ;1111l origins I I I I I S ~IIP
. t,ricd '1'11~ro1111)ini~t.im
tlml. yiclds the sn~allest.
or largrsl, rleclt~sl.t:rcd ir~c;trris rctair~edaccording 10 wl~r:l.l~cr t l ~ ehiglr- or
low-valued arcas werc preferer~li;tllysampled. 'Ib avoid erratic results caused
by ext.renie vallres falling into specilic cells, i t is useful to average results for
several din'ererlt grid origins for each cell size.
T h e J u r a dst.zt were declnstered using square cells of 250 in. T h e sarnple
grid was rol,atrrl 1.0 hc p;tr;dlel to the N-S direction, tlle~leach ccll was centered
OII a g r i d node. 'i'lrc ileclt~sl,eretlrlistrihution is compared rvilll the distribution
of tlic 107 griddrxl d:d;t using a &-Q plot (Figure 4.2, right graphs). Using only
Figurr 4.3: Two teclutiques that correct lor the clust,ering oC data locations (black the gridded r1al.s~or applying the cell declustering t.ecl~niqt~e provide sirrrilar
dots): llrc polygonal method (top graph) and the cell-declustnring tecll~~iqnr! (bot- rcs~ilts:hotlr distrihntions plot close to the 45' line with similar stat,istics.
tom graph). 'Yhe declustering wcights are proportional to the area of the pulygoll
of inlluencc of each datum in the first case; they are inversely proportional to the
number na, b = 1 , . . . , 4 , of data that fall within the same cell in thr secor~dcase.
82 C K A P T E H 4. I N F E R E N C E A N D MODI<LING
II
I
type (4.1) or (4.2) if the cell size is very small (each cell contains a t
most o w datum) or very large (all data fall into same cell). i
i
2. By arralogy with the meall, t.lre declustered variance is a ~ r n p u t c da s ?
!
P r o p o r t i o n a l effkct
Most often, ttte local variability of d a t a changes across tlre study arca, a
feature known as hrteroscednslicity. Tlic proporlionnl cffecl ( J o u r ~ ~ cand l
Huijbregl,~,1978, 11. 1 8 6 189) is ;L particular form of Ircl,eroscedast,icity wlrerc
the local variance of rla.l,a is rclated t.o 1ht:ir local itrcatl. For positively skewed
distributions, the local v;triance i~icreimcswit.11 the local r t i r w i (dircct. propor-
tional elfect). Corrversaly, if the ~listril~nliorr is r~cgativelyskcwed, lwgtir
variauccs gc~trri~lly c ~ r r ~ s p o ntod s~trallcriit<:ans (i~rwrst!proporl.ioititl cff~:d,).
A proportiorial enbcl, can 1)e &!tcctctl froirt a scnt.i.eq~lotof 1oc;J ~ ~ ~ c ; t l t s
versus local varimces as calculated from lrtovi~rgwindow statistics. 'I'lrc a r m
is divided into wi~tdowsof equal siac, and tlre mean a11d variance are compt~terl
within each wirrdow. Each wi~rdows h o ~ ~ linclode d ctrouglr (lat,a for relialilr:
inference, yet titere slrould be enough wi~~rlows to detect ally spatial irer~d.
For srnall d a t a sets, t,lrc w i ~ ~ d o wmay
s have t,o overlap.
Fourteen oo~~-ovt:rla~,ping :k n ~x I kur W ~ I I C ~ O Wcir(41
S, i n ~ l u d i n gI J C ~ W C C I I
Local mean Local mean
10 and 28 d a t a values, were r l e f i ~ ~ over a l the study arca. '1'11,: positivt.ly
skewed variables ((31, (:!I, I'b, Zn) show a clear direct rclaliot~bct.wcen local
average concentr;rtio~~s i n metal a 1 1 1 local variaocc (l'igrrrc 4.4). 'f'l~ercis 110
significant proportional clrect for the otlrer rr~rtals.
4.2. MODELING A REGZONALIZATION 8'7
and cross correlogrartr (2.26) provide more robust cross correlation mcasurcs
than the traditional cross se~nivariogratn(2.28).
G a r ~ s s i a rtrrodel
~ with practical range n Figure 4.6: Boarded serniv;~riogrammodels with the same practical range (top
and power inodcls for dilkrerrt values of tire pararncter w (bottom graph).
' f l ~ n ?t,ypcs
~ of 111~lraviorilrar tlic origin art. dist.ing~~islrr:d:
1. I'araholic hcliavior, c.g., a G a i ~ s s i amodel.
~~ Such behavior is cliarac-
teristic of highly reg~tlarplicnoniena such as topographic elev;tlioti of
gently onrlulating Idls.
2 . Linear behavior, e . g , splierical or exl~onential~ n o d r l . Note that. for
the s a ~ l i epractical range, the expo~rc~itial
rrrodel st,;irt.s iucrensit~gfaster
t.lian the splwric;~lntorlel (Figurr 4 . 6 t,op graplr).
3. D i s c o t r t i ~ ~ r l obehavior,
~~s e g . , nugget effect niodel.
T h e bcllavior near tlie origin of the power iilodel cl~ungcswit.l~the valne of
the parameter w . It is linear for w=l (li~war~riodel)and approaches l~aralmlic
beliavior as w incre;rses t.ow;rrd 2 (Figure 4.6, I)ott.~,~ii
gr;ipl~).
2. the rose diagralrr of ranges2, that is, tlrc plot of raugc v i t l ~ ~ cversus
s t.lir
ahimoth B of the direction, is an ellipse.
, elliptical rose diagram of ranges in l i i g ~ ~ 4r e7 (top
Consider, for e x a ~ n p l ethe
right graplr). T h e rnajor axis of tlic ellipse corresporitling to tha dircction ,I> he anisotropy corriicliori cousists of L r a ~ i s f r ~ r n i tlic
i ~ ~ gvcrt.or of ,)rigillirl
of niaxinlu~ncontiunity forms an angle 0 with the coordinate axis y (north o vcctor 11' = (It;, hi)'', so t h a l tllc value
cooriliriatrrs 11 = (h,, hy)?' i ~ ~ al new
direction). By convention, the azimuth angle B is iire;isured in degrees clock- of t.lis ailisotropic sernivariogrim~iriodcl g ( h ) irlmt.ilics thal, o f ;ti1 isot,rol)ic
wise from the y-axis. T h e minor direction of itnisotrapy is p q ~ e n d i c u l a rto inotlcl gl(/ll'I) in the ~rewsystrrrl of coortlir~atas:
tlie major axis of t.lie ellipse and has an azimuth 4 = B + 90'.
T h e two semivariograms computed in the directiot~sof ;tailnutlr 0 and 9 arc
fitted by splrerical models with the sanlc unit sill but differeut range values;
see Figure 4.7 (lop left graph). The major and minor ranges of anisotropy,
an and am, are plotlad as the major and minor radii of tlrc ellipse. 'I'lrc w11crc: g t ( . ) is t i u isot,ropic 111odc1wit,h EL r m g e c y t ~ d10 t.he ~iiinorr m g c a4 of
anisotropy factor h is dcfit~edas the rsl,io of l l 111i11or
~ r ; q e 1.0 t.lw tll;~jor auisotropy.
range, h = amJan < 1. ,~
I lie coordin;rt.e tririlsfr,rtn;riii,~i cells I'w t.wo kcy ~ ~ i ~ r i i ~ ~t,l~o
~ c ivait~~ullt
krs:
Z170rlinear scmivariogreo~nrorlcls y ( h ) = I . Ir, it is tltc d o p e I llml rllitngrs witl, angle B of tlic direction of n~irxininmcoot.il11ii1.y;rnd Llic anisol,rr~pyf;trt.or A.
direction, and the rose diagram of the irwerse of slope 1 / b is considered 'I'lrc t . r ; i r l s f o r ~ ~ ~ nproccwls
t i o ~ ~ i r t t.wo st.eps:
2. 111 the minor direction of azimutlr 4,the distance hs is equal t o zero,
hc~rct: Ih'I = Irg. 'I'l~c model g'(111'1) resuttws to a spllcrical model of
range a+:
An anisotropy t , l ~ a ir~volves
t sill values varying with direction is said t o be
o ( i r 4.8 r i g r ) '1'112 sm~ivariogra~rl
i n tllc direction of aa-
illluth d hzis a longer range ag and also a larger sill than in ot.ller directions.
Snclr anisotropy can be ~nodtilrdas tlic sum of an isotropic t.ransitiotl model
y l ( J h J )and a "zot~id"I I I ~ O P I y2(hg), wl1ic11 depends o ~ l yon the distance hg
i n HIP ~lircrt.iorlo f g r r ~ ; ~ l ov;~riaucc:
r
2. 'L'llc ncw axcs ;ur t,Iwn rcscalcd so t,l1;111,lic zor~allilodcl does not r o w
1,rillul.e 1.0 t.lw dircctio~lof tn;rxi~~lnln
conl.innity ( a a i ~ ~ ~ n0).
t l i Such
roscaling amounts 1.0 setl.ing t,lw range a" in that direction to infinily,
hcncc t.lie ;ulisotropy factor X = og/na 1.0 zero:
CHAPTER 4. INFERENCE: ANII MODELING 4 . 2 . MODELING A RECIONALIZAT1ON 95
Tlre directional semivariograrrr y ( h < ) would theu plot brtwcerr the two
n ~ o d e l sof Figure 1.8.
Consider, for cxa~nple,the aoi~alconlpo~~cril ya(hg) in Figure '1.8 (right do111 function. 'I'he covariance f~mctiono r scrnivariogram of l l ~ a trandom
graph). This model can be seen as an anisotropic model y~(11)that ident.ifies fi~nctionis then, hy definit,ion, permissible.
a spherical rnodel of range urnin tlic <lircct,ion of greater variance hut does I I O ~ 'rlic linear model of regionaliaatior~bnilds l l ~ craniloni f~i~icl.ioti Z(11) as
contribute irk the p ~ r p e r l r l i c ~ ~direct.ioi~
lar of wi1m1LI10. Wi1.h 1l1c I I ~ WsysI.crri a 1i11r:ar conlbinaliorr of ([, + 1) iridcpendcnt r;tndom functio~lsY 1 ( u ) , t:xh
of coordinates (It;, hi)'", LIE a~lisotropicniorlr:l gz(h) c;rn l x nxprmxd as an wit11 EWO mean and h s i c covariance function cr(li):
isotropic splieric;d n d n l g:([hfl) wit,l~;L rauge a*:
willr
Basic models I
Linear model of regionalization
Distance h Distance h
A si~rrilnrd e v c l o p m e ~ c;m
~ t be dotre in tornts of semivariograrr~s.Let gl(11)
~ l c n o t cthe sernivariogram of the ILF Y i ( o ) , with <.he cross sernivariogr;mm
hetween any two dilkrent RFs Y 1 ( u ) and ~ " ( 1 1 )equal to zero: Remarks
1. Fhch i h i c rnoricl of ;r lil~ear~tlodelcan br isolmpic or can display
cit,l~crt,ypc of auisotropy. I'or exnn~ple,the ~ ~ ~ o in d cFigure
l 4.8 is a
c o ~ n b i ~ ~ a tofi oit11
~ r isotropic nroclcl and a zonal model. Accounting for
t,lre cor~ceptof linear rcscalit~gof coordinates introdr~ct!din section 4.2.2,
tlw g c ~ c r a llinc;rr ir~odclis rlcfi~~crl as
with the three correspo~~ding semivariogram rnodels: a nugget cffect, a splrer- 1, experimcnlal serniv;triogram or covariance valoes,
ical model of range n = 1, and a power n ~ o d c lwith w = 1. Tlle three basic 2. permissible se~rrivariogramor covariance niodels, see section 4.2.1, and
st:rnivariogranr n ~ o d c l sare isotropic and arc slrown in Figure 4.9 (Icft graph).
According to relati011 ( 4 2 8 ) the s c n ~ i v a r i o g r amodel
~ ~ ~ is
. knowledge of the area and plteoomcno~~ under study (e.g.,
where go(h) is the nugget effect inodel (4.16). 'The rnodel y ( h ) is, hy con-
st,rur:t.ion, pcrrrrissiblc rind is dcpict,rd in Figure 4.9 (right graph).
. rlirrrt,iolis of anisot.ropy, ilnporl,once of ~ ~ ~ c a s r ~ r eerrors)
l~tcl~t
robust rneasorcs, such as the maclogra~n(for ~valuitlionof anisotr01)y
directions, ratio, and range) or relative scmiv:triograrns
T h e art of n ~ o d e l i l ~consists
g of c:rpitaliaiug on t l m c diflcrcl~tsotirccs of
information Lo build a per~nissiblernodcl that captures t.l~ei t ~ a j o rspal.ial
features of the atl,ril>t~te under study. 'I'lrc following d i s c ~ ~ s s i ~o ~t ~C I I S Pon
S tltc
sernivariogralrt, w l ~ i c lis~ the most frequetltly l~seclstructural tool. A sinrilar
approach can be adopted for covariaure functions.
A u t o m a t i c or v i s u a l i n o d e l i n g ?
Too often, the modeling process is viewed as a mere exercise it1 litting a curve
to experimental valnes. T h e scrnivariogran~is tlrm inodclcd itsing statisl.ical
fitting procedures, wlticll cat, he roughly classified into two catcgorics: Distance (km) Distance (km)
Black-box procedures tnusl be avoided because they cautrot take ir1t.o ac-
count ancillary information tlrat is cri1,ical when sparse or prcfcrcntial saln-
pling rnakcs the cxperitrrcntal sernivariograrn unn:liable. S e ~ ~ r i - ; r n t o ~ ~pro-
~at,ic
cedures may facilil.ate thr: det,crmination of n ~ o d c lparalneters wl~ettt h r forill
of the experirr~entalse~nivariograrnis clcar. IIowcver, snch procedures cow
tribute little t o the aclnal nrodeling, since the most importal~t.decisions re-
garding number, type, and ania~t,ropyof basic s t r u c t ~ ~ r emuel,
s be t.aken by
the user. With t,l~ehelp of a good interactive graplrical program (e.g., C l u ~ ,
1993), the user wo~rlddo better than sop1listicatr:d fully ;rul.on~aticfitt.ing
procedures.
The modeling process relies crilhAly O I I a series of user's deczszons, which
must be backed hy experinrental d a t a or ancillary informatiot~,as follows:
3, which paranreters (sill, ;tnisotropy, range or slope) for each l)asic sctni-
variogranr n ~ o d e to
l use.
I s o t r o p i c or a n i s o t r o p i c m o d e l s ?
T h e conventional approar:lr for delecli~tgar~isolropyis 1.0 compare cxpcriinen-
tal sernivariograms comp~lt.edi n several rlin!ctiol~s(Figure 4.10, l.op graphs).
To decide wltctl~crguorncl.rir: a~tisot.ropyis presr:111.,a t [cast t l ~ r c ?rlireclious
must he considered. 1Iircct.ioos of nmjor and minor spat,i;rl rot~I.i~ruit.y arc
often suggested by banding of large or small valucs 011 a lomtion rnap, or hy
aucillary information, sllclr as orientation of lithologic fornratious or prwailing
contour lines wliose major axis indicates the d i r e c l i o ~of~ maxinrum c o ~ ~ t i m - In earth sciences, there is rarely physical ground for cl~oosirrgany partic-
ity (Figure 4.10, hottom right graph). Note that cobiilt. co~icentrationssl~orva ular basic i~rodelg,(h). For interpolation, a critical feature is llie behavior of
c o ~ r ~ b i n a t i oofn slrort.-range isotropy (sm;ill c.ircl<,s) ;md lorrg-rmgc g c o ~ n d r i c the st:nrivariogra~nn ~ o d e al t the origin. Models wit,]] a parabolic. behavior a t
;u~isot.ropy(largo cllipscs) (Figurc 1.10, right grapl~s). the origin sl~o~rlcl
i,e used only for p l ~ c n o n ~ e nthat
a are know11 to be lriglrly
'I'lre computation of.a sen~ivariogran~ rrlal, requires co~rsidcringnr;t~rydi- continuous, for example, the surface of a water table. 111 most other cases,
rections and lags. Thus, such a represcntatio~~ is best suited for large griddcd any model with a linear bchavior a t the origin wonld be appropriate.
dat,a sets. When d a t a arc sparse and irrcgr~larlyspaced, large angular and
lag tolcranccs are needed to fill irr the s c r ~ ~ i v a r i o g r anriq~, ~ n lre~rceI.Ir<:spatial
resolntiorr of t,hc m a p niay be drastically redur.etf. Beware that, the scmi-
v a r i o g r a ~ rinap
~ provides i ~ i f o r ~ r ~ aonly t i w about. I.hc directions of major and T l ~ clast sky ill the rr~odelirrgprocess consist,s of dctcr~uiningt,he parameters
nrirror spatial co~~linuil.y. Anisot,n~pictr~odclsd i l l nrusl. 11e fitted t,o rlircclional (sill, rarrge, anisot.ropy ratio) of the selocled models. It l m r s repealing t h a t
sc~~rivariogr;i~ns. a good inl.eractive graphical prograrrr is rnore usehl t l ~ a nany sophisticaled
,411ar~isoI,ropyl.h;it,is I I cltwly ~ ;ipparc;~~l, O I I q ~ ~ ? r i n ~ c rsr~t ~
wa~Ii v t ~ r i o g r a ~ r ~ s sti~lislicirlfitting p r o c e d n r ~ .'l'h foll~wingtips s l i o ~ ~ guide
ld I.he user in the
nor 11acked hy i u ~ yq~~alit.at,ivc i ~ ~ l i ) r n r ; t t ,isi ~bct,lter
~r igt~orfcl. Co~~v?rst:ly, inf(:rr:t~ceof the paratr~et,ersof t,lw scn~ivariogrammodel
sl.ro~rgprior qr~alil,alivei ~ ~ f o r n ~ h t imay o n lead one Lo adopt all anisotro$)ic I . Nlrggel rfrecl
~~roclel cvtw i l d;il.;r sp;rrsil,y prcvct~l.sswing ar~iw)l.ropyfronr l.lre c x p e r i l ~ ~ c n l a l
'l'lre nugget elfect is usually determined by extrapolating the behavior of the
se~rrivariograms. first few sernivariogram values to the vertical axis. T h e following should be
taken i11t.o con side ratio^^:
0
0.0
C
0.4 0.8 I
Distance (km)
- 1.6
0J
0.0 0.4 0.8 1.2
Distance (km)
1.6
00
0.0 0.4 0.8 1.2 1.6
Distance (krn)
Figure k12: E f ~ of t lag tolerance or, thc iafcrence of the nligget effect. A large
lag tolerancc (Ah = 'LOO to, solid line) Inads to overestin~ationof the relative nugget
component, compare wit11 daslred line (Ah = 50 m).
4.2. MODELING A IlEGIONALIZA'I'ION 105
(see section 5.8.1). Tllos, total sill valt~escantrot bt: cross validated froin
re-estimation scores.
Values of the seniivariogram niodel for lags smaller than the shortest
sampling interval do not intervene in interpolation algorithms. Hence,
critical model parameters, such as relative nugget effect and the serni-
variogram behavior a t the origin, cannot be cross validated.
Sample data, particularly when they are scarce arid preferentially lo-
cated, may not be representative of the study area. Therefore, the
model that produces the best cross-v;~lid;ited resnlt.~may not yield the
best prediclions a t ol~sanlpledlocations.
Remarks
T h e "goodness" character of a model is elusive and cannot be nreitsured by seniivariogran~. A l.ltird structure is needed to reprodncc both sliort-r;~~kge
rigorous tests; there is no best semivariogram model. Rather than relying on and long-range cornporrelit.s of Ni and Zo stmivariogra~ns.
an elusive objectivity provided by questionable statistical criteria, the user 'l'lre semivariogran~sof the three other meials (Cd, Cn, aud 1'11) arc more
should decide on a niodel ;recording to the followi~~g: erratic and could be modclcd e i t l m as a combination of a 1111ggeteffect it1111it
lortg-rangt: exponential nrodrl or ;IS a con~hin;ttionof a nugget ellixt aud two
1. T h e user's experience and the inforniation available; particular features spherical models with short and long ranges. 'Tltc lat,ler rr~odelwas cl~oserr
of the experimental semivariogram may be deemed spurious and not hecanse the short-range strncturc conld be related to the slrort-range distribtt-
worth modeling, whereas ancillary inforination may lead l o model fea- lion of human contaminant sources as fcat.~rredby indicnt,or scmivariogr;ttns
tures that are not apparent on the experimental curves. of land oscs in Figure 2.17 (page 43).
a t ~ d{n,, o = 1 , . . . , 7 1 1 he a set of 11 data locatio~ls. 'I'hc variance of any one particular set of itltercorrelated 1tf"s Zi(11). O t l ~ c rsets have been built,
bite linear c.orrrbirration Y of the ItVs &(urn), 11, E A, i = 1 , . . . , N u ,can Ire
cr~slom-madefor specific needs, resulting in different ~r~orlcls of coregionaliza-
expressed as a linear colnhination of cross covariatice values:
lion; for exa~nplc,see Zliu and Journcl(lSS3) or the Markov model itttroduced
subseqne~rtlyin sectiot~6.2.6.
,%
I Ire lirlear rrrodel of coregiotraliaation builds each ranclorn function Z,(u)
as a liuear cornbil~at.ionof i~~ciepetldent ratrdorn f u n c t i o ~ ~Ys~ ( I I )each
, with
zero rneari and basic covariance functio~lcl(11) (Jollrnel and Ilrrijbregts, 1978,
[I. 172):
for any clloicr: of 11 locat,ions 11, t A and any weights A,,. lising matrix
notation, t,Irc variance of 1' is writ.ten as
e All principal minor deterrrrir~anlsof order 2 are non-negative aud cross se~nivariograminodels yij(h) are tlrerl ohtained by si~rlplcrcscaling
of the same standardized linear inodel of rcgionaliaat.io~ryn(11):
= b',, b', - [b',,I2 >0
= a;, bi, - [bi,]' 20
= b;, 6$, - [6ill2 >0 For exalriple, Figure 4.15 (page 112, bottom graphs) sl~owsan intrinsic core-
gionalization model where all direct a ~ imss ~ d sernivariogram n~oriclsarc pro-
T h e determinant of order 3 is nun-uegalive: porl.iona1 t,o the sanle standardized nrodcl yo(11) = 0.5g&) + O.5Spl1( / h / / 5 ) .
?'he nugget cffect thus represents 50% of the sill valuc of all tl~rer: n~odels.
Note I.liat, any lincar nod el of coregio~~alization t.llat corrsist,s of a single hasic
struct.ure is an i~~I,rir~sic nro~lcl.
,~
I he i n l r i ~ ~ s icon:gio~~;tlia;rli<~~i
c iriodel is n n ~ r l lnioro rrstriclivc t,lrau t . 1 1 ~
linear rrrotlel of c o r c g i o ~ ~ a l i z a l i Lire
o ~ ~N,(N,
: + 1)/2 direcl, all11 cross s e ~ u i v i ~ r -
+
iogram models must include all ( L I) structures in the same proportio~~s
6'. This is not the case for the linear model (see Figure 4.15, lop graplrs). 11)
These inequalities provide four practical rules for choosing basic structures
particular, note the difference in the rclat.ive rr~~gget effect betwecr~tlre t h e e
gl(h) in the linear model of coregionalization:
direct a11rl cross se~nivariogran~ ii~odi:ls.
,,
R u l e 1: Every insic sl.n~clurcappearing on a cross scn~ivariograrnyij(11) 1 lie ~n;itrixf o r m u l a l h ~of t11c iutri~isiccor<:gi(~~~;iIiz;~tio~t ~rio(Id( 4 4 1 ) is
Innst bc present in both direct sernivariogram models yii(11) and yjj(ll):
R u l e 2: If a basic struclrm gl(11) is absent on a direct semivariograrn, it where = [pij]is the matrix of coefficients p,j. 7'11e intrinsic coregio~ializa-
must be absent on all cross semivariograrns involving this variable:
tion nod el is similarly expressed in terms of basic ~ o v i ~ r i i t nI IcI O~~ C I S ri(lr)
RS
The i n t r i n s i c c o r r ~ g i o n a l i z a t i o rm~o d e l
T h e intrinsic coregionalization model is but a particular linear rnodel(4.37) in
wtrich all the N,(N, +
1)/2 coefficients bfj of any basic sernivariogram model The proportionality coefficient p,j is equal to the variitricc C,,(O) of 1 . 1 1 ~Itfr
gl(h) are proportional to each other, tl~;ttis, bf, = p,j . 6' V i, j , I. All direct Zi(11) for i = j and is equal Lo the cross covariance (Aj(()) for i # j .
4.3.3 T h e practice of tnodelilig requires that all expcrin~entaldirect and cross se~nivariograrnsbe proporl.ional
to each ot,lier. T I E proportiotialit,y ciiirrlition can be vis~lallychecked from
( : l ~ o o s i ~ t.hc
~ g I I I I I I I ~ );rnd
~ ~ cIi;tr;rcl.crist,ics (typc, rsrigc, :inisot.ropy) of basic
the shape of experimental curves. An alternative consists of verifying t h a t
st,ruct.urcs and detcrn~ittingt.ltt>ir cot~l.rib~~t.ioo (sill, slope) to caclr l ~ ~ o d is el the ratio hetween any two semivariograms is a coustaot independent of h:
~ n ~ r~iorc
~ c hdifficult t l t a ~in~ the rurivariate casc. I)iffic~~ltics lie in the r~ontbcr
of t x p ~ i ~ r ~ e ~ r tC.Oi LV IM ~ ~ I ~ Cf C~ ~ ~ c t i o( sn~s~ i r i v a r i o g r i t ~and
~ ~ st,ltt:
) (;o~~st.rai~~ts
that. thr. ~ ~ ~ K i c i ehij n t nrust
s satisfy. As in scction 4.2.4, the followi~rgdiscus-
sion focuses o t ~scn~ivariograms,which are nrost frequently used in pract,ice.
A siruilar approach could he adopted for covariance firrtctio~~s. Similarly, all auto arrd cross correlogranrs are ahout equal
Linear nkodcl of corcgioanlirntioe
S n l c x t i o n of vnriahlos T h e intrinsic ntodel rarely fits expcri~rientalcoregio~~alizaliorrs. One rewon is
t h a t the filtering of rir~correlatedmicro-structures by the cross semivariograni
Ally i~rl~ltivariate
rr~odeli~rg
st,arts with a selecl.io~~ of the N, variables to in- (see section 4 . 2 . 4 ) usually leads to asnialler relative nugget effect on the cross
clude in the corr:gio~~aliaationa ~ ~ a l y s i In
s . most situat,ions, t l ~ e r ris no point sernivariograni than on direct semivariograms. T h e more flexible linear model
i ~ rethi~~irrg
r all measured atl.ributes for these reasons:
of coregionaliaatio~rwould tlierr he preferred. Note the following:
1. T h e nntnher of variables that can be ha~idledj o i ~ ~ t lhy
y i~rterpolation Direct or cross selriivariograrr~sneed not include all hasic structures; see
a l g o r i t ~ l ~is ~li111it.14.
~~s rule 3 ( l q c , 114).
2. Some variables are likely to be redundant and rrerd uot he considered, Variables that, arc well cross-corrrlated arc more likely to show similar
whereas i~tdcprndentvariables are l)rt.ter Irandlerl scparatcly. pal,t,crns of slxitial variahi1il.y.
3. Incrcnsi~rgtlrc 11rrli11)crof vari;tbles ~trakesthe fitting of all dircct and Slight dilliwmccs ~ I the
I slrape of expcri~ncntaldirect and cross semi-
cross scmivariogmrirs tnnch more t k d i o ~ ~ s . variogratirs rriay be disregarded, in particular when data sparsity affects
tlrc reliability of the estimates.
It, is good practice to defirte a hierarclry of v a r i a l h according to the
ol>jec.tivc pursued. Consider, for example, the ol)jectivc of cstitnaling con- l s N i , and Zn), a quick i o s p c c t i o ~a~t the direct
For the three r ~ ~ r t a(Cd,
centrat.ions of Cd, which is called tlre mnin or primary variable. One should semivariograms suffices to i~~vnlirlate the proporl.ior~ality co~rdition(4.44) of
prefererrtially retain as secondary variables the one or two other metals better the intrinsic model (Figure 4.14, page 107). Indeed, the shape of the Ni
correlated wit11 that prirmry variable and bet,ter s a n ~ p l e dsay,
, Ni and ZII. scniivariogram is dominated by the long-range structure, whereas the short-
Different models of corcgionalization could be built depending on which range structrtrc is the r~lajorcomponent for the otlrer rnetkils.
variable is selected as primary. A single rnodel of coregio~ializationthat poorly
lits the direct and cross semivariograms of all attributes is better replaced by F i t t i n g a linear m o d c l of coregionalization
sevcral i~rodelstltat providc hctter fits t,o fcwrr varinhles dce~ncdi m p o r t m t
for l.lre objective. 'Ih fit a linear ir~odclof corr~gion;rliz;~I.io~r,
proceed as follows:
1. Select the smallest, set of basic structures g((11) that captures the major
Selrction of t h e i n o d d fenthrcs of all N, onr~ridircctionaldirect sc~nivariograms. There is no
treed to look a t the experirnental cross selnivariograr~~s in this step since
Once tlic variables have beer^ selected, the next decision concerns the type of
their models cat! i ~ ~ c l u donly4
c I,hc structures that are apparent on the
r ~ ~ o dof
e l coregio~~aliaat.io~r:
i~rtrinsicor linear."
direct sernivariogr;ttr~s;see rule 2 (page 114). For Cd, Ni, and Zn, three
Intrinsic eorcgiosnliznlion model str~~ct,uresare retained: the nugget effect, a splierical model with a
T h e intririsic corcgionalization niodcl is rasicr to fit than the h e a r model rangn of 200 in, and a splrrricnl model with a range of 1.3 km.
in tlre scnsr that only a sirtgle coregionalizatiort r ~ ~ a t r @i x ueed he infrrrrd
(see secl,ion 4 3 . 2 ) Ilowever, such ;L rrrodel is 111uclimore restrictive brc;rnsa it 2 . For oxlt s t h r t u r e ~ , ( I I )irmsidcr
, anisotropy only i l that a~tisotropyis
clearly evident on all dirccl.ional sernivariogra~trs. As for the onivari-
3 A s mentioned previously, other models of corcgionilliration,possibly nodinear, mighl ate case, ancillary infor~r~abiorr may help to dcterniinc the directions of
br coasidcred. I n the fdlowing, only the wirlely used linear model of roregiooalizntion is
retained 'Thir represents ?I definite limitation of the linear model of coregiot~aliziltion.
4.3. MODELING A COKEGIONALIZATION
( 4. Appreciate visually the goodness-of-fit for all direct and cross seniivar-
iograms. One may then decide to modify a range or to change the type
of basic scririvariograoi model, say, try an exponnttt.iitl ratltcr than a
.; spl~ericalt~todel,to in~provt?1 . l ~ : overall q~iiilil.yof the lit.. TIic lit of Figure 4.16: Experinlentirl ornrtidircctionirl clirscl and cross semivariogratns for (:d,
@'
y' any particular one of the N,(N, +
1)/2 exp~:riniet~taldircct and cross and Ni, and t h e linear model of coregionalization fitted. 'The three experimental
se~nivariograrnscall always be improved, but it, is generally dolie a t the sernivaringrams werc co~nputedusing the same set of 259 concentrations.
expense of poorer fits for other semivariogranrs. Wlre~~ever a cornpro-
mise is needed, one s l i o ~ ~ lgive
d pri0rit.y to the direct scniivariograrns,
particularly that of the primary varial~lc. and the determii~antsof the three coregionaliaat.ion matrices are all posit.ive
or zero:
E s t i m a t i o n of t h e cort?giorraIization irrntricr~s
Bivnrinte case (N,, =i!)
In the hivariate c;rsr, the corcgionalimtion m;~l,rixBi is posit,ive semi-deliuitrr hence, the linear nrodel of coregiotializatio~i(4.47) is positive se~iii-defiuitc
if the following tbrce inequalities are s;itisfied:
M d l i u a r ~ a t ecase (N, 2 3)
Tlre hivariate approach is ge~teraliaedto N, variables as fbllows:
1. Tlre N,, direct se~nivariograrnsyii(h) are first modeled as linettr cottibi-
nations of selected basic st.ructures g,(h).
Thus, the littear nrodel of coregionalization is fitt,ed in two steps:
selected basic s t h c h l r e s gr(h). T h e latter condition is, Iiowevcr, insufficient to ensure positive se~rii-deli~rit,cness
of matrices Br = [bij] for N,, > 2. Irrtleed, all princip;il rnirtor d~t.wrt~i~~irtil.s.
2 . ' 1 ' 1 ~sariie l):tsic strrrctures art: t.lrert lilted t,o t l ~ ecross sc~rtivnriogranl 11ot otrly those of order 2, tmlst he r~oti-ireg;tt.ive(rcc;~IIpxgc 114).
u ~ ~ d the
e r r:omtrairit. (4.46). , example, the linear model of coregiooalizatir)rt i ~ I'igurc
C h ~ s i d e rfor r 417.
r,
A good interactive graplrical prograw (e.g., Clru, 1993) and ir pocket c;tlcw I he dirocf. and cross scmivariogranis of (kl, N i , and Ztt were r~~odelcrl using
lator sqfice to n ~ o d e lthe corcgioodia;ttim ljet,wecn t,wo v;~ri;tl)les. the two-slep approach, yielding the followittg iiioclel:
12igure 4.16 slrows the linear niodel of coregionaliz;ilio~~ filled to llic pair
Cd-Ni.. T h e model is as follows:
+ [ 01 00
0.0 0.0 1 (")
~ p h 200rn + [ 0213 38
7, ] (A)
~ p h 1.3km
(4 47)
where go(h) is the nugget effect model (4.10). Only llie s c r ~ ~ i v a r i o g r rttodel
a~n
for Cd inclodcs tlre sl~ort-r:trrge slrrrclr~re(200 ni). l'lle rli;rgor~alelements
T h e burdc~iof cl~eckiuga poskriori each ~rratrixB, tlien applying empirical
corrections is alleviated by an iterative procedure that fits l l ~ elinear model of
coregiona1iz;ttiou dircclly rnidc~rtllc consl.rai~itof posiLive serni-defi~~iteness of
all n ~ a t r i c s sI31 (Goulard, 1989; Goul;trd and volt.^, 1902). As with the u n -
variate criterio~t(1.29), tlie algoritlirn atterrlpts lo nrii~i~nize a wcigl~tedsum
of differences betweell experirrrental ?jj(ht) and model y;j(hk) semivariogram
values (see Appendix A).
'filis iterative procedure is used to model the corcgionaliaation of the three
rnclals Cd, Ni, and Zn (Figure 4.18). Altlrongh it fits the experi~ncntalcurves
as well as the nrodcl shown in Figure 4.17, the rnodel of coregionalization is
now positive s e ~ r ~ i - d c f i ~ ~ i t , ~ :
C o r e g i o ~ ~ a l i z a t i oofn o t l r e r ~ t l e t i r l s Warning
A similar approach is applied to the two other ttrctals with widespread con- T h e reaso~ifor the wide ut,iliaatioo of tllc h e a r model of coregio~talia. stt'1011
tamination (Cu, 1%). The linear model of coregionalization now it~clrtdes is that its pcrmissibilit.y can he ~ a s i l yclreckerl by verifying the positive srnti-
four variables: the two better s a n ~ p l r dnrctals (Ni, Zn), and copper and lead, definiteness of coregionalizatior iriatrices. T h e price to p:iy is ilie rccll~ircn~ctlt
which are closely related (f=0.78). Figure 4.19 shows the ten experirr~ent;d that all direct and cross re~nivariogratrlssharethe same set of basic structures
direct and cross semivariograms. The corcgior~aliaatior~ rrlodel is fitted using gr(h), wliicli may represent a linritation.
the iterative procedltre. Dcspite t.he number of setnivariogra~ns,the overall Alternative linear models that do not rcquire l~sitrga cormrori set of basic
fit is satisfactory. st,ructures have been proposed (e.g., Myers, 1982). Their greater flexibility
is halanced by the ~ieedfor cltecking t l ~ epositive s c ~ ~ ~ i - ~ l e f i ~ r icor~ditirnt
l,c~~ess
of C ( h ) for a l l lags to be used (Goovacrts, 1 9 9 4 ~ ) .Beware tlrat, wet1 in the
bivari:atc case, there is no easy way to pcrfornr s l ~ c al ~cllcck, cspeci;rlly it, t,Ilc
presence of anisotropy. Tlrerefore, it is i)ettt!r to stay wit.l~t . 1 1 ~well proven
and yet rensonahly flexible linear tllrrclel of c.orcgion;~liznl.i(~~~
80.
... .
3 200
E ,
0.0 0.4 0.8 1.2 1.6 00 0.4 0.8 1.2 l.8 0.0 0.4 0.8 1.2 1.6
D l ~ l e n c(km)
~ D ~ S I B ~(km)
CB Uislsnca (km)
Figure 4.19: Exprriiscstal omnidirectional dircct and cross semivariograms for (h,
Pb, Ni, and Zn (n=259). The linear model of coregionaliaatior~is fitled using tltr
iterative procedure, l~ertceis pertoissiblc
Chapter 5
Local Estimation:
Accounting for a Single
Attribute
,,
l l ~ eexisl~eliccof ;t t i ~ o ~ l cofl spatial depnttdeltce allows onz to tackle the
problcrri of eslirriat.i~~g attribute values a t ur~sarnpledlocations. This chapter
presents leasl-squares h e a r regression (kriging) algoritllri~st,l~ataccount for
d a t a wlatcd solely lo the coutinooos attribute being cstiri~al,ed.A l g o r i t l ~ n ~ s
o r ~ introduced in Clraptcr 6.
for incorporating srco~ldaryi n f o r ~ ~ l a l i are
Scctioris 5.1 5.4 introduce (,he lincar regrcssion pararlig~naud t h e e of its
n o s t import.a~~t. variant,^: simple kriging, ordinary kriging, and kriging with
a trend ntodel. Estinration of average values ovcr larger areas (block kriging)
is addressed in section 5.5.
Section 5.6 presents factorial kriging, a rncl.liod for filtering low- or high-
freqr~encycornpo~rr~rl.s frotn the spatial variatiou of the attribute. Filtering
propwlies of krigitig :iIgoritli~nsarc furt,licr discusscd in section 5.7 in the
fr;ltt~cworkof I.lir: drml forttii~lis~ir of kriging.
Scclion 5.8 recalls some importaut properties of thc kriging weights and
kriging varianrc. 'I'l~eperformancc of ordiiiary kriging in estimating metal
conccntratioos a t trst locations is discusscd.
defined as 2. Ordinary kriging (OK) accounts for local flnct,uations of the urea11 by
limiting tlre d o ~ n ; ~ iofn stationarity of thc mean to the local r~eiglrlmr-
hood W ( u ) :
T h e coefficie~~ts
nb(11') are U I I ~ I I ~ W and
II deemed c o n s t a ~ ~witlriu
t each
local neigtrborliood W ( u ) . Uy conventiorr, Jo(u') = 1, IIPIICC the case
is nrinirnieed u~rderthe constraint that K = 0 is equivalent to o r d i ~ ~ e rkriging
y (constant hut unknorvn r~reali
(10).
T h e kriging estimator varies dcpcl~dil~g olr Lire riiodel adopI.l!d for t . h 5.2 Simple Kriging
random function Z ( u ) itself. T h e R.F Z(II) is usually dccomposerl ir11.o a
residual c o ~ r r p o ~ r R
e ~( u~)t and a trend coiiq~orrentm(u): , .
llle modeling of the 1.rend component m ( u ) as a known st a t lonary
' rrrean
in allows oue t,o write the lincar esli111a1,or(5.1) as ;t linc;rr c o ~ ~ r l ) i r ~ n or
l.io~~
+
( n ( u ) 1) picccs of information: the 411) ILVs Z(II,) and the meall value in:
,.
l h e residual component is inodeled as a stationary RF with zero mean and
covariance C R ( ~ ) :
Matrix n o t a t i o n
IJsing matrix notation, t,he simple kriging system (5.10) is written as
K,, Xs,(u) = ks, (5.13)
Tlre error variance (5.8) appears as a quaddralic form Q ( . ) in the n ( n )
wl~ereK,s, is the n ( u ) x n ( u ) matrix of data covariances:
wt:igl~Ls XZK(11). 'Tire optimal wciglrts, i.e., those that r r ~ i ~ ~ i r nthe i a e error
variimce, itrc ol~l.;lincrl1)y setting lo zero c i d t of the n ( u ) partial first drriva-
t,ives:
C(l1,,(11) - 11) ]
CJr(h) is equal to the stationary covariance function C(11) of t . 1 ~RF Z(U):
T h e kriging weights required by the SIC estimator (5.7) are obtained by mul-
tiplying tlre inverse of tlre data covariance matrix by the vcct,or of data-to-
r~nknowncovariarlcm:
Tlrus, t l ~ siruple
e krigingsystem (5.9) can be writ.ten in terms of %-covarianccs
as 'I%c matrix f o r m u l a t i o ~of~ Llle simple krigirlg variance (6.11) is correspond-
ll(U) ingly
X$,(II) C ( u o - q ) = C ( u n - 11) u = I , . . . , n(11) (5.10)
o:,<(u) = C(0) k,, = C(0) - k:, K;: ksti
- Az,(rl)
0=1
T h e simple kriging system (5.10) has a unique solution and the resulting
Tlre rnininn~rnerror variance, also called tire SIC variance, is deduced by kriging variance is positive if the covariancc matrix K,, = [C(u, - UO)] is
substituting expression (5.10) into the definition (5.8) of the error variance: positive definite, in practice if:
no two d a t a are colocated: 11, # 110 for 0 # P
the covariance modrl C ( h ) is permissible (see section 4.2.1)
130 CHAPTER 5 . ACCOUNTING FOR A SINGLE ATTKIBV'I'E 5.2. SIMPLE KRIGING 131
.
transect, and
u = 1, . . . ,71(11)
T h e unique solution is then
Xz?(11) = 1 and A;"(II) = 0 V up # 11,.
Weight o f the m e a n Tlte c s t i ~ n a t i o lis
~ performcd every 50 111 using a t each location 11 the five
closest d a t a , n ( u ) = 5 V n .
l'lre simple kriging estirnator (5.7) can be rewrittell as Figure 5.2 sllows the SI< cstintat,es of Cd concentrations (top gr;rp11) a d
n(ll) the wciglrt of the stat.ionary mean (bott,orrr graph) t h k r l r as the aritltmelic
Z:,<(II) = A:(iz) Z(ue) + A::(II) 7r1 (5.14) rneen of tllc 10 Cd values, r n = 1.4!) ppnr. 'l'he SK estilr~;~t,cs (solid line):
*=I
where the weight X:(n) assigned to the stationary m:alt irt for estimation
a t location 11 is equal to 1 minus the sum of the n ( u ) (lath wcights AZx(lr):
rr(11)
A;T:(u) = 1 - ~F(II)
o=r
_~_ . , ._ -,--
1
I 2 3 4 5 6
Distance (km) 'l'he n~e;to,tr(u) is filtered lroru thr: linear csti~rtatorby forcing
U I ~ ~ I I O W local
II
'Tlre exactil,tlrle property of the SK esl.i~rlator~ creates arlifact, rlisco~rtinu- Tlre rrii~~i~r~iaatiorr
of the error variance (5.8) ot~derthe no~r-biascondition
itirs (peaks) a t d a t a locations (Figure 5.2, top graph). Disconti~~uities are ~fi(=):) XZK(u) = 1 calls for the delittition of a Lagrangiarl I,("), which is a
i~nportant.here because Cd co~~ce~rtratiorrs vary in the sernivi~riograrnof Fig- fu~ictioriof the data weights h g x ( n ) , and a Lagrange parameter 2/1,,(u)
ure 5.1 (right graph). Indeed, such short-range varialrilily entails that the (c.g., see Edwards and Peoney, 1982):
d a t a weigl~t,sX r . ( n ) rapidly dccresc as the locatior~u being e s t i n ~ a k dge1.s
f a r t l ~ c raway from data locat.ions and t,ha e s t i t ~ ~ agets
t e closcr 1.0 the station-
ary lucan f r ~= 1.49 ppor. Large disconli~~uilies occur nexl, to ext.rclnc d a t a
v a l u c s t l r o s e t l ~ a depart
l most from the global 11iea11,for exanrple, near the
corrccntration of 4 ppm ri~easureda t 3.25 k ~ n Alterrratives
. for rcmoving such
discontinuities are discussed it1 the next sections. 'The optimal wciglits Xz"(u) are obtained by setting to zero each of the
(n(1i) + I ) partial first derivatives:
In contrast, the SI< systcnr (5.10) call he expressed i l l terms of ollly covxi-
A l t h o ~ ~ g the
l i mean m ( u ) is assurned stationary only rvitlri~~ the local ances since tlrere is no similar co~rstraii~t 011 Llie simple kriging weiglrts.
neighhorlrood W ( n ) , in llre practice of ordinary kriging tllc rcsir111;tl covari- ? ,
I lie c o n r r r ~ opractice
~~ coosisf.s of inferring and 1nodi:lillg i l ~ nscrr~ivari-
ance is assimilated to the global z-covaria~~ce irrfcrred from all data ;~vi~ilal)lr: ograrn ratlicr than the covi&.nce fr~llctionh d c e d , blic sc1niv;iriograln ~ ( 1 1 )
(Journel and Iluijbrcgts, 1978, 11. 33- 34), learliug 1.0 Lire following systenr: allows one to filter Llie n n k ~ ~ o \ vlocal
n ~rreanm ( u ) tl~;rtis h n r e d coosh;rnt
but unknown over t.11elocal ncigl~borlloodW(n):
T h e resnlting I I I ~ I I ~ I I I I I Ierror
II variimc~:,c;lllctl O K variaurc, is o l r t a i ~ ~ c11y
d since 71l(1l1) = ,ll(llf + ll), v 11') 11' + 11 € W'(l1).
substituting the first ~ ( I I equations ) of tlie ordinary kriging system (5.18) Itowever, for reasons of conrl~ntationalefRcit:ncy, krigil~gsysI.t:~~rs are USII-
into the crror variancc of type (5.8): ally solved in terrns of covariances. As disc~~ssed in section 1.2.1, there arc
SIKII ,as I,hc p v c r n~o(lcl,I h I , II;IVI: I I O cov;~rimc(~
s c n t i v a r i o g r a ~n~odcls,
~
co~nrterpart. For ~ ~ ~ ~ b o u nsr~~rivariogr;rur
ded i~rodcls,a " ~ W I I ~ covarimu:"
O
C ( h ) is defined by subtracting the sen~ivariagrai~r nrodel y ( h ) fro111ally 110s-
ilive value A , such that A - y(11) 2 0 , V h. Again, the no^^-hias condition
allows the constant A to cancel out, frorrr the ordinary krigitrg systerr~,which
is then written in terms of psertrlo covari:tnces. In sulnrnary, (.he conllrlon
practice corrsist,~of (1) inferring and modcling the srn~ivariograrn,and (2)
solvilig all ordinary kriging systelns in tcr~ilsof (pseudo) covnrimccs.
Acconntirrg ibr tire relation C ( h ) = C'(O) - y(ll), t l ~ cordinary kriging sys-
tem (5.18) is expressed in terrns of srrnivariograms as
Kriging t h o l o c a l m e a n
,>
J Ire error variance nj$ = V ~ ~ ( I ~ L ~ , ~ ( I I ) - I IisI (expressed
II)) as a double linear
cornbirration of cov;lri:~nce values:
wlrcre A::(II) = 1 - c~!!~)A~K(u). Accottnting for t11e definition (5.14) of
111s sitnple krigittg cstint;tt.or, one drduccs the following relation between the
SK ntld OK cst.i~ttat.ors:
'I'lrc last two tcrtns of t.11~error variance an: arro since t,lle trerld in(11) is
viewed as a deterministic conlponenl. 'I'lrr differel~cebetween the SK ar~rlOK rstintat,cs of z a t 11 is caused by a
T h e niirrirniaationof the error variance (5.21) under the non-bias condition depar1.11reof the local meat, rnblc(u) horn the global mean in. More precisely,
+
yields the following system of (n(u) 1) linear equations: since X::(II) is ~ ~ s u a l lpositive,
y tlte OK est,irnatc is smaller than the SK
cstinratr: in low-valued areas wltcre the local data tnearr is srnaller than the
global meall. Conversely, the OK est,irnate is largcr than the SIC estirnate in
high-valued areas wltcrc the local mean is larger than the global mean. In
t.lris sense,, the corri~rtonlyused OK algorithm with local search neighborhoods
already accounts for trends (varying mean) in tlre z-data values (see discussion
in seclion 5.4).
,
%
I Be discrepancy hetwcen the two esLinrates ziIc(u) and z ; ~ ( u ) increases
as the weight A:l(u) of tire mean increases, i.e., as the location u being
estimated gels farther away from data locations (section 5.2).
System (5.22) is identical to the OK system (5.18) except for the r i g h t - l r a d
side covariances C(11, - 11) being set to eero.
Because all data-to-nnkrtown covariance terms C ( u , - 11) are zero, t l ~ e Example
location 11 being eslirnat,ed does not appear in the ordinary kriging sys- Figure 5 . 3 (lop graph) shows tert Cd concentr;ttions a t locations 11, to ulo
t c n ~(5.22). Provi(ler1 the same sct. of dalx is used to estirnate tlrc local rticatr along t . 1 NE-SW
~ transect. 'I'ltc local mcan is estimated every 50 rn along that
a t two different 1ocat.ior~s11 and n', tlre sysktn (5.2") rcrrlains rtncl~anged. 1.ransecl using tltct five closest data values; the resrtltitrg estimate is depicted
'I'lrtts, the two sets of kriging wt:ights and the t,wo trend cstirnntris are idrnti- hy a solid line ott the middle graph of Figure 5.3. T h e vcrtical dashed lines
cal: Az:,(u) = X ~ ~ ( I I 'V) ,n and T ~ I ~ ) ~ <=( I~IL) & ~ ( I I ' ) . delineate the OK trettd estimates that are based on the same set of five
neigl~l)oring<lala. For exantple, (:d co~~a:ntratiotis at, local.ions 111 to rri, nrc:
used 1.0 cslilnate the local 111eat1a t all locations wit.lri~~ lhe scgnrent 1 ~ ~ 2k. n1 ~ .
Cd data Sin~il;rrly,tlto next segnrent, 2.1 2 . 5 kln, i n c l ~ ~ d eall s t,rr,nd val~res1l1;tt. are
esl~io~aI.t:dfro111(:<I concenbrat.io~~s itt I O C L ~ ~ O ~112
I S to 116.
As ~ni~ntiolted previously, tlre ordinary kriging S ~ S I . C I I I (5.22) is i i l r ~ ~ t i cat.
al
all locations where the same neighboring data are il~viilvcdin thc e s t i n ~ n t i o n
Consequcnlly, the 01< t,rend estirrmte rniK(lx) is cot~stalllwit.hin each scg-
I I I C I I ~and cl~augcsfrom one segrnerrt to arrotlw depending or^ the neighboring
dnta rctaioctl. 'I'ltis procedure yiclds a trmd estinmte l.l~;~t. follorvs t.hc gel~wal
incrcase ill (>d concentralions along the tr;rnsect. In cout,rast, t,he lllcall of
10 d a t a valncs, rn=1.49 ppm (I~orizont,aldashed line in t,Ire i ~ ~ i d dgraph), le
ovrrcsti~natcsthe local rnean in the low-valued (left) pitrt of i,l~ct.ransect m d
underestilriatcs 1.111: local me;ir~i r ~lllc hig11-va111cd(right,) part oJ'1.h~transcct.
Figure 5.3 (bot.torn graph) sliows both OK (solid line) and SK (d;rsllcd
5'
Trend estimates
I
. .
I
.
I i i
.
liur:) cst,i~natcsof Cd concentr;rl.ions along tlre NE-SW 1.ralrsccl. Notc t.lral.:
&,th estinlalors art: exact..
OK cst.iniales arc smaller tl~nrrSK cstimalks i l l the lcfi, part of t l ~ c
S3 i i i i i
-
C
0
.-
2-
i i i i i transect where the local intiarl 7n;,,((11) is sln;tllcr than t,he overall nle;tii
VI = 1.4;J.
(5.26)
Accounting for the first n ( n ) equations in system (5.26), the minimized error
v;iriancr Iiccon~cs
n=1
X;7,(~~)Z(u,.,) (5.25) IZriging willt a trend model requires a prior detcrrrrir~ationof (1) the I(
trend hrnctiot~sJ ~ ( I I )and , (2) the covariance of t.ltc resirli~alcomponent R ( u ) ,
wil.11 Cl<(ll).
rr(11)
A z ' r ( ~ ~Jk(ue)
) = Jk(u) k = 0,.. ., It'
n=1
'I'lte l.ypc of hnctions fk(rr) niay be dircctly s ~ ~ g g c s t cby
d tlte physics of the
Tire krigit~gwith trend cst.irnator (5.25) is unbiased since the error mean is problem Fur c x a t ~ ~ p lae ,stxirs of sine and cosine f~~ncl.ions can he used 1.0
eq~ralt o zcro: rnodel a periodic lrcnd of nu attribole; for ex;unple, see Skguret and Ilnclron
{ ( I ) -( I ) ) = x
?.(U)
*=I
XZT(ll) m(uo) - rn(11)
(1990). 111 nlost spatial situations, tlic earth scientist has no physical ground
for clroosir~ga particular type of analytical tret~rffutrcliol~.ljccause the con-
cept of trend is t~suallyassociated wilh a smootlily varying corrlporie~~t of
the z-varial~ility,Ion-order ( 5 2) polynomials are typically used l o model the
I,rt:nd, for CX:LIII~IC,
I n f e r r i n g the r e s i d u a l covnriauct:
011evcrifics that tlre previous linear combination ofz-valr~esreverts to a l i ~ m n
In practice, llic residunl st:n~ivariogranry1((11) is first ir~fcmed,l.11e111 . l rcsicl- ~ comhinat~io~r of residual valr~cs,t,Iiereby filtering the trcnd component,:
nal (pseudo) covariance Clr(11) is dc(li~cedas A -- y11(11). Likc i l l or(1in;try
kriging, the first ui~l)iasr:dl~essc o ~ d i l . i o l~;::)~i;.'.(ir)
~ = I filtcvs o i ~ ttlw
arbitrary constaut A f r o r ~the ~ first n ( u ) c q u a t i o ~ ~ins tile K'T system (5.26).
,Ixhe computation of tlie residoal semivariograni ~ ~ ( 1 is1 110t ) straightfor- +
More generally, a difference of order (k I) filters any polynomial trcr~dof
ward because availahle data arc z-values, 1101, rt,sidnal vrtli~rs. 'l'lre expcri- ordcr I.. T h e variance of such differences is rcfcrrctl t o as tlrc generalized
~ related to ylr(li):
mentally avail;il)le z - s c r t ~ i v a r i o g ry; ~( h~)~is covariance of order k m d is rlsrd as rcsidiriil cov;~ri;~ncc ill s y s k m ~(5.26).
1h:wnre (.hat:
r Such Iriglr-order differences isre not readily available wl~endata are 11011
griddetl
A I I ~~nreli;rl)le
pract.ice would cousist of:
T l ~ residr~al
e serriivariogratncal~hllen be inferred directly from tlra z-sen~ivariograw
.
conipnted frorn sr~clipairs. 'I'lre followi~rgg11icIe1irn:sare nscf111:
1telat.iorr (5.28) is generally sat,isfied Tor sn~;illscparatio~rdistances Ih/.
Thus, tlre residual semivariogram for Ll~rfirst h g s rimy be identified
2' . applying the K'I' algoril11111usil~gthe residiial serniv;~riogr:~m
ferred from the estimated values ~ ( I I , ) .
-,-(11)
11%
in
(5.31)
= K;: k,<.. aud the kriging
,
I l ~ ekriging system (5.31) has a unique solution if t l ~ r s etwo conrlitior~sare
\
met:
where (r,, :I*) are t,he coordi~ratcsof any datum location 11,. Using nratrix
iotat ti on, the K T system (5.29) is written as 1 . T h r covarimce matrix [(&(II, - up)] is positive d c f i ~ ~ i tinc , practice if:
K ,<. X,.r(rz) = k,. (5.30) no t,wo d a t a are colocatcd: 11, # u p for n # P.
the residual covaria~rcenrodel C?,i(l~)is permissihlt:
w l m c K,.- is the (n(11) + 3) x (n(11) + 3) ~iml~rix: 0
+
2. l'lw (I{ 1) fi~~rctiorls f L ( u ) arc linearly independent o n the sel of 71(u)
K,<, = (
(:I~(~II,- Ill)
1 1- 1
,
..
.,
..
.
1
: 1 i
1
- 11"(1)
1
1
-11(1]
1
1
0
l
I I
l
0
]
;I
y,,(11)
0
data; that is, the relabions
;tlig~~ed
K
Q Jk(u,) = 0 V n = 1 , . . . , n(u), would
tn;(T(t~) = x
n(11)
*=I
X~:,(U) Z(u,) (5.32)
'1'11~: matrix K,, in systerr~(5.30) is oht,aincd by adding three rows and where X;$(u) is the weight associated with the datum z(u,). The kriging
i x, , = [C(II, -up)]
bhnr: colurt~nsto the covariance ~ r ~ a t rK [ ~ R ( -u")]
u ~ -- weights are obtained by solving a kriging system identical t o the K T sy8-
146 CHAPTER 5. AGCOIIN'I'ING 1"ON A SINGLE A7'1'HlU1IT15
+
where ,LL~,';(II)is the 1,agrauge pararnetcr tlrat accounts for l.l~t: ( k 1)I.lr 1SI'
constraint on the kriging weights. For 1i = 0, s y s t m ~(5.33) rcverts to t l ~ c
ordinary kriging systelri (5.22) for cstinratir~gthe local nmtn.
Gstimat,ion of the trend conlponent m ( u ) amounts to first esti~ilatitrgLIE
( I i + l ) trend coefficients a k ( u ) , then computing the trcnd csthmate as a l i ~ e a r
co~nhinationof the know11 trend fnnctiorrs Jk(u):
.
mated as linear combinations of z-val~~cs. For example, the li~~e;rrrsti~~~ath
of the coefficier~trrr. a t location 11 is
Ijot,l~kriging systcn~s(5.33) and (5.36) ;trc ihrt.ical lor 1i I0. I ~ ~ d c e r l ,
the co~istantt.rend component in(11) is t , l ~ c t,he
~ i single I.rfmil coellicic~~l,
(111(11).
'The locat.ioo 11 being estirr~;it.eiIdoes ~ ~ appear o t i n thc kriging sys-
where X:~,(U) is the weight msociatcd with tire z-datum a t location 11, for
ten1 (5.36). I'rovirlcd the sarrrr: set of data is used to csti~nabcany trend
the K'r estimatior~of the trend coefficient at 11. 'l'lie csti~riatorn;,(u)
coclficient nr a t t,wo diffment locations 11 and XI', their estimates are
is a RV, being a linear combination of the 1tVs Z(II,), wlmeas ar(11) is ;t
ident,ical: n;(o) = a ; ( d ) Q k = 0 , . . . , I{. For Ii = 0, tllc t.rentl es-
deterministic, tliough uuknnw~r,valoc.
l.irnates at. 11 and 11' arc then ids~itical(secI.iou 5.3). For l,olyrro~~rials
T h e e s t i ~ r ~ a t (5.34)
or is orrhi;~sedif the error mean is zero, that is, if
of lriglicr order, say, a l i ~ w r rt,rcnd irr R 1 , t,hc trt:~rd cst.i~r~ntcs a t 1oc;r-
tiotrs II = ( ; l ! , O ) ;ttrd 11' = ( I : ' ,0) appear as t . 1 ~s;rnw linear t ~ ~ n c t i o nors
c o ~ r d i n a t c sJ : and z', rr~;,.~.(o) = a; + a ; -;I!atrd ~ ~ L ; ~ . , ,= +
( I I ~ ) (1; .c'.
- ( I ) [111((11) -7 t ( l l ) ] (5.39)
o=1
where IVI is the measure (length, area, volume) of block V. T h e integral is,
in practice, approximated by a discrete sum of z-values defined a t N points
u: discretizing the block V ( u ) . For example, the block V(u) in Figure 5.7 is
discretized 1)y the four points u', to 11;.
1 2 3 4 5 6
Distance (km) 'Thc block valne ~ ~ ( 1 conld
1 ) be estimated as the linear average of the N
point estimates, say, OK estimates z&,<(11:):
I'ignre 5.6: O K a n d K'festima1.e~ol tlte trend (topgraph) and olCd concentrations
(lmttotn gritplt). Note tlic siaiilarity betwecrt O K v r d K'1'estimates in interpolation
sitnation 1-5 km, and trow extrapolation results depend critically on the prior choice
or a trend i~~odcl: ccrnstant lor OK or litwar lor K'I'.
where t f ~ esame n(u) d a t a are userl for all N point estimates, and X,(U~)
canse it provides results similar to llmse of KT, but it is easier to i1np1cni~:nt.
is the weight%sociatcd wit11 the datum z(u,) for Llte OK estimation of
111 ext,rapolation conditions, the KT estimator slrould be userl wlrenever the
physics of tlie plrenolrrenon snggesls a particular functional form for extrap-
olaling a trr!rlrl fitted from witllin the salnpled area. In most cart11 science
applications, however, there is usr~allyno such physical hasis for choosing
a particular trend model, and the user slror~ldbe aware that thc eslirnat.ed
values depend heavily on the arbitrary trend being extrapolated.
Though the choice of a particular trend model cannot be validated when
n o d a t a are available (extrapolation situation), mapping the trend estimate
may draw attention to aberrant extrapolation results. For example, Figure 5.6
(bottom graph) indicates that the linear trend model is inappropriate much
beyond the extrerne right datnrn sihce its extrapolation would yield negative
concentration estimates around 7 km.
5.5 Block Kriging Figure 5.7: A two-dinrmsiosal block V centered on location u and its discretizalion
by four points 11; to 11;.
Any rneasurcment z(u,) relates t o a non-zero, finite sample volume, such a s
a piece of rock or a core of soil. Often the size or "support" of the daturn may "To simplify notation, the upperscript 01< is ren~ovedfrom all notations i n section 5 . 5 .
154 CHAPTER 5. ACCOUNTING 1'011A SINGLE: A'T'I'RlBUTE 5.5. BLOCK KRIGING 155
attribute z a t location u:. Such an approacl~requires solving, a t each of the For exan~ple,the covariance hetweeu location 11, and the t w o - i l i n ~ c ~ ~ s i o ~ ~ s l
N locations PI:, an ordinary kriging systern of dimension (n(u) 1): + block in Figure 5.8 is approxi~~raledby the aritlinletic average of point, co-
variances between u,, and the four discrclizit~gpoints 11; t o 11;:
*,
I h e approach bcconres co~r~pul.ationallyexpensive as the number of blacks
and discretizing points increase, hence it is preferal~leto estimate t,he block
value directly from the data values z(u,) wing an estinrator of type
where the block-1.0-block covariance ~ ( v ( u ) V(II))
, is approximated i)y the
arithmetic average of the covari;mccs C(11: - 11;) defined Letwecu any t,wo
discretizing points n: and 11;:
T h e block o r d i ~ ~ a rkriging
y system is written as follows:
Provided tlrc same n ( u ) da1.a are used for ;dl N point kriging systems (5.12)
and for tlrc block kriging sysleln (5.44), each block krigi~igweight X,v(lt) c;ui
be shown to bc the avcrage of the N poiut kriging weigl11.s X,(u:) (dournel
and Wuijbregts, 1078, p. 322):
'l'hus, the block kriging system yields a n estimate irlentical to that ob-
tained by averaging t.he N point rstirnatcs Z & ~ ( U : ) :
-
(;(v(1tm), I J ( I ~ " ) ) =
1
-
1 % lw ~ ~ 'IL 1,118)
u ~
C(II - u') du'
, ~
where iv,l and ltial are l l ~ emeasures of Lire data support a t locations
1. Virecl block kriging through syste~rr(5.44) yields all estimate of l.lle ;111d ufi.
XI<,
3 . ltetait~itrgdl data makes the kriging syslent very large, lengthens the
c o t ~ ~ p t ~ i t,inw, ~ ~ofteti lcads to insL;drility of 1 . h krigiug matrix.
t t . i ~iitld
,, For all tllcsc rcasorrs, krigitig is pri~riarilyused as a local e s t i ~ ~ ~ a l ialgorithm.
ort
I ire physical average of pll values over the block would he obtained by
perforuing block krigitrg on the concentration z(u) of I f f , not on the A global kriging estimate zk(11) can, however, be obtaincd using a two-step
pII valnr ~ ( I I ) . procedure. First, tlre study area is discretized into small blocks and the
average value of z is estimated within each such block. Second, the global
2 . One sllould strikc a balance bet,rvren too frw ~liscrctizingpoints tlral estimatc is cornpuled as a linear combi~ratiol~ of hlock estitnates z;(u), with
providc a rougl~approxi~nntiol~ of t.lw poilrt,-tri-block m d block-to-I~Iock each cstiolatf: rt:criving a weight proportional to the area JVI of that block. A
cov;trianccs and t,oo many rliscrctizing point,s that are c o ~ r ~ ~ ~ ~ ~ t a t . i o r ~ ; ~ l l yrnorc straightforward alternative consists of computing a rieclustered mean of
vxlm~sivr.A rule of tlntml, is to srlert (4)" discretizing points, wllrre the data using the polygonal method or cell-declnstering teclmiqne introduced
e r d i m ~ i ~ s i o2~ 0~1.s ,3, of the block, scc Jorrrtrcl a11d
n is tlw i ~ w ~ b 01. in section 4.1.2; see also lsaaks and Srivistava (1989), Chapter 10.
Iluijl~regts(1978, p. 95-108) for furt.lm discossions. A good practice
co~~sisbsof computing covariance cstin~atesfor an increasing number of Example
discretizing points: a t some stage increasing tlre dcnsity of point,s will Figure 5.9 sliows the point (small dashed line) and block OK estimates of
not signific;~ntlymodify l,l~cresults of tile approxi~nation.
Cd concrntr:ttiorrs along the NE-SW transect. The blocks are defined on seg-
~rtentsof length 50 nt (large dashed line) and 250 m (solid line). In both cases,
3. T l ~ nblock kriging system (5.41) call he generalizcd to l,lte case where
four discretising p a i ~ ~were
t s used to compute the point-to-block covariances.
1 . h ~di1l.a t.l~enmelvesare defined OH ;<support, say, ~ ( I I , ) ,which cannot
Note the following:
he co~~sidcreil as negligilile with regard to the poi111support of tire
covarimca tnodcl. 'The point-1.0-point C(II, - I I ~ and ) poin1.-to-block 'I'he hlock estimates do not rnatclt point-data valnes (black dots) be-
(,"(IS,,, '~'(11)) covariance ternls in syst.em (5.44) arc Lhrn relrl;tcrd hy the cause the supports are different.
ntcrre rlilfic~~lt 1.0 detcct on Argovia~irocks wlrose rtat.ur;d Co ill10 N i C O I I C C I I -
] PoinVbtock Cd estimates tmtions are half tl~oseineasured on otlier rocks ('l'ablc 2.4, pnga 18).
If the scales a t wl~ichthe dilkrent factors ( h u ~ n a ngeologic)
, operate are
very different fro111one anotl~cr,then they slrould be apparent in t.lrc scnri-
variograms of met;rl concentrations. 'I'lle st,ructr~ralanalysis performcrl in
sectio~i2.3.5 revedcd t t ~ eexistence of tliree scales of sp;tt,ial variation: i~ricro-
scale (range sruallrr tliai~the first s r ~ i ~ i v a r i o g rlag
a ~ ~of
i 50 tit), local sc;ilc
(short range x 200 rri), m d regionid sc;rl<:(range x I k~ri).'l'lic slrort,-rang(>
- block (250 ml s t h c t ~ l r ewas rr:latetl to the slml.iel dis1.ril111tio11
of rock t y p and Iznid rrscs
in lllc stt~rlya r e a 'rhe lo~ig-rnngest.rur1.11rcw;rs i~rti:rprdcd as 1.11~. rcgiw;tl
irdhlcnce at' geology, particolsrly Argovinn and Iiinmmidgia~irocks, 011 metal
concentrations. Sucl~inl.erpretatio~iled us to rnodcl the exptirin~entalse~itivar-
iograms of n ~ e t a lconcent~ratioosas linear corr~binntionsd t,l~rcebasic struc-
tures gl(11):
w l m c all r.stim;~t~cs
arc I).?sr:d on Ll~rs a n e n ( u ) clz~tn~ ( I I , ) . It~dccd,the righl-
Itand-side vcctor of tho OK syslcrrl (5.18) is hut the surn of the right-hand-side
162 CIIAI'TEI1 5. ACCOUNTING FOR A SlNGLlC A7'1'1111jU1'1.:
I OK Cd estimates
wlrich entails tile following equality between leR-lra~td-sidetertns:
Examplc
Figure 5.10 shows the dccompositio~~ of the OK estirtlaf.es of Cd concentril-
tions along the N1':-SW trar~sect(right top graph) into fonr corrlponents: a
n t s i,t t,n:~rd c o ~ ~ ~ p o -
nugget component, slmrt- and long-rangc u j l ~ ~ p o ~ ~ cand
nent. Note the following:
T h e short-rang,: and lorq-range c o r n p o n t ~ ~arc
l s zero wlwt:ver bl~eclos-
est daturrr location u,, is more tlrarr 200 ni or 1.3 kt11 *way, rrspcctively.
,, '
l h e nugget component, a l ~ i r hcorrespol~(lsto a zero mnga, is zero a t
any ~~nsanipletl location.
O
I Long-tango component Trend component
OK Cd estimates
...... No filtering
i .-.-
1
~ --...,
2
~-
3
~-~
4
- Nugget fillered
5
~ ~ -
6
T ~
Dislance (km)
Figure 5.11: O K csti,antas of G I ronrestr;~tionsbrlorc irtd after filtering tlw Figure 5.12: Maps of regional compoecats fur copper a n d l e i d
nugget component.
Geology 5.7 Dual Kriging
Quaternary Llnal kriging is just arrotl~crprcselitatiou of kriging whereby t l ~ eestinrales are
Portlandian cxprrssed as linear co~t~l~inat,ions of cov;wiatrcc valws instcad of data values
(1)11l)rule,1983; doorncl, 1!180). The dual kriging frn~nalisn~ provides insight
Sequanian
into the filtering properties of kriging n ~ reduces
~ d tlre c o n ~ p ~ ~ t a t icost
o ~ ~of
al
Kimmeridgian kriging rnl~cnosrd with a global search neigl~l~orl~ood.
Argovian
D n a l simple krigiug
t c z a t l o c a t i o ~11:
1h11sidrrthe simple kriging c s l i ~ r ~ aof ~
'Tlje SIC weights XF(u) are obtained by solvirrg the system of n(u) linear
cquat~ionsderived in seclion 5.2 atld recalled here:
where dF(11) is t.l~edual weight associated wit11 tlre covariance f:(ri, - 11).
Expression (5.61) is called the d1ml form of the tradit,ior~alor primalsimple
kriging estimat.e (5.59). 'l'he dual weights dy(u) are obtained by identifica-
tion of t,he dat;i values by the kriging expression (5.GI):
Figure 5.13: Spatial dislribatioa of rock types over the study area and msps of
rsgional cornportents of four ractals.
OK Cd estimates
Accounting for the dual form (5.68), one can rewrite relation (5.55) het,weeu
OK tisti~uatesas
2. 1,ocatiou u2: 0 < d,,,,,, 5 200 m where bm is the i>roport.ionof tlrc nuggct clfcct lnodrl go(],). For a zero relative
All covariance terms Co(u, -11) are iero, so only the uugget amrponenl urigget eflixt (6" = O ) , tlte i ~ ~ t e r p o l a t icurve
o ~ ~ (solid line) varies smootlily
is filtered fro111the estinlate (5.70): witlrout apparent disconbint~it,y. IIiscont~i~~t~iLies appear as soon as a small
nugget effect, say, bo = 0.2, is present (large daslied line), and they become
irmensingly importatrt as t . 1 relative
~ nugget effect incrca3cs.
174 CIlAP'I%'II5 ACCOUNTING FOR A SINGLE AlTIIIDUTE 5 8 MISCELLANEOUS ASPECTS 01,'KIIIGING 175
- 0% nugget
...... 20% nugget
- - - 5n% "8 ,noat
1 2 3 4 5 6
Distance (km)
,
l l ~ large
\
e off-diagonal covariarrce value C ( u a - I%,) = 0.85 informs the syslrnl
'I'l~ekriging rvcighting systew ;rcco~n~ts
for: on the r e d ~ ~ n d a n cofy the two data a t clustered locations 112 and 113. On the
other hand, the zero covariance veltres in the right-hand-side vector i ~ ~ f o r r n
1. proximity OI data 1.0 llre l o c a t i o ~II~beilig c s t i r ~ ~ a t et ld~ r o u g lt ~l ~ ncovari- the system that the poil~t11 to be estirn;il.etl is heyolid t.lir correlatio:~range
ance t e r m C ( u , - n ) , and of the two data a t locations n4 and US.
2. data redundancy i . l ~ r m ~ t.l~e
g l ~ data covariauce t ~ ~ a t r [i C
x ( u , - II,~)] T h e following ordinary krigirrg weights are solulio~lsof sytcm (5.72):
1. Force all kriging weights to be positive; see Harnes and Johnson (1984),
Xu (1991).
2. Add to all the weights a constant equal to the ~nodulusof the largest
negative weight,, then reset, the weights to s u m to 1; see Jonrnel and
I ~ (I 996)
I
. .. .
-7.0 -0.5
u5
,
, .
0.0
,. ,_-~
0.5
Distance (km)
1.0
. ,-_ / _
-1.0 -0.5 0.0
, ~-_,-.
0.5
Distance (km)
,'
. ~ ~ ~ T -
1.0
3. lteset any faulty esti~nateto the nearest hou~rd,say, 0, if negative values
arc not arccpt;rble, or I for cxccssive proportions; scc Mallet (1980).
I . I n ~ ~ m scorlstraints
e on the kriging estimates rather Limn on the krig-
ing wcigl~tsthn)ngl~the use of iudicator ro~rslraintiulervals; see sec-
Figlure 5.16: Two two-dimensional data configurations with dilfrrent positious of t,ion 7.1.2 and Journel (19861,).
uz. In both cases, the attribute v d u c at locatiou ua is cstimatcd using ordinary
krigiag and live data at locations ul to us. 'J'ltc radius of the daslred circle ceutercd For a parl.icular data co~~figuration, kriging weights u ~ a ydrastically change,
on 110 corresponds to the correlation range (1 krn). depending on the scrnivariogra~r~ i~rodel. For the two data configurations of
1"igure 5.16, Figllre 5.17 sllows the evolution of the kriging weights s the
relat,ive nugget t:fft.ct incrcas~s.A 1;rrgcr nuggct rfict reduces the impact of
As cxljccted, the kriging weight dccraascs as the rlatom location gets l a r t l m disl,a~rceof dirti~1ocati011sto u0 (left grztpl~);it also reduces tlre screening
from uo. N o k blic following: effect of lociilior~1 1 : ~on 112 (right graph), In the presence of a pure nugget
effect, all weights are eqoal to l/rr(u) = 0.2; the krigirrgestirnate then reverts
1. 130th data a t locatiorrs ul and u ? arc the same distance frorrr uo (same
lo 1.11~arit,l~~nel.ic
average of tlrc data retai~~etl. 'I'lre impact of sernivariogram
righ.-hand-side covaria~~ce t c r ~ n s ) ,yet. the latter receives less weight.
parameters (slmpe, relative nugget ell'ect, anisolropy) on kriging weights is
hecanse of its r e d u ~ r d a ~ ~with
c y the third datum a t us.
cogently discussed in lsaaks and Srivastava (1989, p. 296 313).
2. IMa a t locations u 4 and us gel. a norr-zero weight, although they are
heyorrd the correlation range of data. This nowzero weight is due to
their cont.ribut,io~rto t,lre estirrralion of the t r e ~ ~corrrponent
d a t locatiou
uo. 'I'his treud estimation is implicit to ordinary kriging (see related
discussion in section 5.3).
-
-I
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8
Note t.l~es ~ n a l lt~egativeweight given to the datnrrr a t 112. Negative weights Relative nugget effect Relative nugget effect
typically occur wlmr the inlluence of aspecific d a t u ~ nis screened l)y that of a
closer one. In Figure 5.16 (right graph), the location us screens the infiuence Figure 5.17: Impact of the relative nugget effect on the OK weights for tire two
of uz for estimation at ua. Negative wcigl~lsallow the kriging cstirnate to two-dimrnsional data configurations of Figure 5.16. All weigltts get close to each
take values outside the range of the data, Altl~ouglrthis lion-convexity of the o t h ~ as
r the relative nugget effect increases.
estimator is generally a desirable properly, it rrray yield unacceptable results,
5.8. MISCELLANEOUS ASf'EC?'S O F KRIGING 171)
178 CHAPTER 5. ACCOUNTING FOR A SINGLE ATTRIBU?%
clnstercd so that one or two of the nearest data do not screen all o1.11t:rs. 'I'lle
5.8.2 Search neighborhood maxinlunl number n ( u ) of data values to retain deper~dson the objective
As mentioned in the introduction to the linear estimator (5.1), common prac- pursued. When one aims a t depicting local features of Llhe attribute, t h a t
tice consists of using only the data closest to the location u being estimated, number should be limitcd; wltereas more data arid dala farther away sl~ould
i.e., the d a t a within a given neighborhood W(11) centered on the location u be retained t o depict long-range structures.
.
to be estimated. There are several reasons for such a restriction:
The covariance values for large distances are usually unreliable because
of the few data pairs available for iuference a t suclr large d i s l a ~ ~ c e s .
Before nmning any kriging over an entire area, it is good practice to try
several search strategies on a test subarea. Cross validation teclmiq~~es
then Ire nsed to evaluate the inlpact of dilferent search parameters on the
interpol;lt.ion r e s ~ ~ l t As
can
T h e size of tile kriging system, l~encethe co~opntationtime, drastically 5.8.3 Kriging variance
increases with the nnmher of data rctaincd, approximately in proportion Kriging provides not only a least-squares estimate of the a t t r i l ~ u t ez bnt also
to [n(rr)l3. Llre ;tl.tacllcd crror variarrcc; for cx;tr~~plc,
for ordinary triging:
T h e search ncigl~borhoodis typically taken as a circle ccntered on thc
location being estimated. When the variation is a~~isotropic, it, is l~el,terto
consider an ellipse wit11 its nlajor axis oriented along tile direction of rnaxi-
mum continuity: this provides more relevant data. 'Ih reduce the influence
T l ~ a error
l variance is:
of d a t a clusters (if any), it is good pract,ice to split the search rreigl~horl~ood
into equal angle sectors, say, quadrants in two climensions or octants in thrce I . d e p e n d e ~ ~ont t.lle covariance nod el.
dimensions, and retain within each sector a specified uumber of nearest data. 'I'l~is is 1n1 excellent feature: the estimation precision should in<lwd
When data are gridded and the estimation grid is aligned with the sampling depend on the complexity of the spatial variability of r as modeled by
grid, quadrant and octant searches may cause artifact discontinuities in the the covariance.
estimated maps because of sudden changes in the data retained for estimation
2. dependent on tire data co~~figuration.
from one location to the i ~ e x t .
T h e terms C(n,-u) account for the relative geonletry of data locations
One shonld avoid limit,ing a priori the rrraxirnu~i~ senrcl~d i s t a ~ ~ ctoc t l ~ c
11, and their distances to tile location 11 b e i ~ ~
esli~~mtecl.
g 'TI& is also
correlatio~rrange of data. As featured in Figure 5.16, data beyond the corre-
arr excellent feature.
lation range contribute to the estimation of tlre local mean within the scarcll
neigl~borl~ood. As {,herelat.ive nugget increases, the scrcsning clkct of r1ost:r 3. illdependent of dat,a va111es.
d a t a decreases, I~enceremote data get more weight (Fignrc 5.17). 111 sub- For a given covarial~cemodel, two idcntic;rl rl;ita co11lig11rat.io11.s
would
areas where sampling is sparse, the search distance sl~onldbe increased to yield t.lre same kriging variance n o matler wlrab the 11;rtn were.
retain enough data. AII alternative is to spiral away from the local.ioll 11 Ije- The lal.ter two features are illustrirtcd ill Figure 5.18 ( I ~ o t t o ngraph),
~ wliicll
ing estimated and lo retail] the n ( n ) closest dala wl~atnvertheir rlista~lceto slrows the OK variance along the NE-SW transect. ?'lie pattern of variatioil
u. For data selection, a useful practice consists of using the semivariogram of the kriging variance is fi~llycontrolled 11y the dtlla config~~ratiorr.T h e
distance y ( o - u,) rather t h i the Euclidian distance 111 - n,l, so that data
e r r o r variance is zero a t data l o c a t i o ~ ~il~creascs
s, away from 1 . h data, and
are preferentially sclected along tlrc direction of maxi~nnmcontim~ity.
reacl~esn maxinrum value beyond tlre extreme riglit datum (c:xlrapol;rt,io~~
In addition to size and orientation of the search neighborhood, one n111st
sitllation). Indeed, as the locatio~rrl bcing estin~atedgets fart,l~craway from
specify the i n i ~ ~ i ~ uand
u ~ inn a x i ~ n n ~i ~n ~ m i h eofr dab 1.0 IISC for cstimation.
data locations u,,,1,otI1i.11~covariauce term C(u,, -11) ;md tlic kriging wcigl~t.
+
T h e minim on^ must be equal a t least, to 1 . l ~nnrnber (Ii 1) of cor~strai~tts
XEr'(u) decrease, lleuce the kriging variitnce (5.73) i~lcrcasrs.'
on kriging weights. For example, the modeling of a ynadratic tre~ldi l l two
.
dimensions requires a t 1e;rst six data values. In practice, 10 dirta values w o d d
be a rmsoo;hle niirii~nmn. ,I h a t rnlmber should Ire 1;rrgcr rvltcrc data iire
4
Irere.
Only the common case of positive weights and l ~ o s i ~ i vcovarinrtcr
c values is ro~isi4ered
of d a t a values, tlrr kriging variance retains, frotn the data used, only its
geonretry---not t,he data values. In this sense, the krigilrg variance is otrly
a rartkirrg index of d a l a geo~netry(and size)-not a measure of the local
spread of errors; scc J o u r ~ ~ c(198Fa),
l f)eutsch and Joornel (1992a, 11. 15),
and ClrapI.cr 7.
Large estimated concentratious in PI] are in the central and soulh part
of the rcgion.
1 2 3 4 5 6
Distance (km)
Kriging yiolds s i n ~ i l i ~
error
r v i ~ r i i l ~ ~ill.
w sm y two or IIIOIC I~cal.iotlswit11
siniilar local 11;rta configurations. Int~liI.ively,Irowevcr, the potential f<)rerror
is expectod to be greater a t a location uz sr~rror~~lderl by data that arc very
different thin1 a t 111 surrounded by similirrly valued data (Figure 5.1R, top
grapll).
'l'lrc prohlt:m with the krigiug variarice is tl1;~t it is 1101, corrdit,ioz~edto
Lhc 11;~t.nv;~lucsusctl. 'l'hc proper nraasurc of local precision wor~ldbe the
c o n d i t i o ~ ~estimalion
al variance specific to l l ~ rdata values and dehred as
where the r ( u , ) are t,lre data values. T h e krigiug variance is but hire average
Over all possihle realizations of the n(u) data IWs Z(u,). If the configr~ratiorls
of tllc 11(0(')) data retained a t marly different laces ~ ( ' 1 ,I = 1,. . . , L were the
same, t l ~ e nthe krigirrg variance (5.73) would be an estimate of the variance of Figure 5.19: Maps of OK estimates. The tolerable maxima are 0.8 ppm for Cd
and 50 ppm for Cu and Pb.
the 1, errors z*(u('))-z(II(')), Ikcanse of the averaging over the L realizations
At the 100 t,cst locations u,, b o t l ~esti~natedvalues Z & ~ ( I I ~and ) actual Cd data
values z(u,) are available. Statistics for both sets are g i v c ~in~ Table 5.2.
Besides the similarity of the means (global u ~ ~ b i a s e d ~ ~ enote
s s ) ,that t.he csti- -E Prank : ,.
mated values appear I ~ I I C I I less variable Oran act,u;d valr~cs,'Tl~is "s~noothirrg
erect" is due to tho filtering of lrigI~-Sre~r~e~~cy (sllort,-ra~tgc)c o ~ n p o ~ ~ c l ~ t s ;
-:: . . ,
3~
b,
x . : . .,, .
smoothing increases with ir~crcasiugrelative nuggcl effect and wit11 decreas-
ing sampling rlensity. For cx;m~ple,srr~oot.hi~,g is less p r o ~ r o u ~ ~ cfor
r d coball .3. , .., ,,.. . .
.-E
2~
':n .-I?;.:.
,
-
Y A
relative nugget erect than tlrose of tire tlrrce 0 t h metals; c o r ~ p a r ethe setni- O ,.i-
variograrns of I"igurrs 4.13 and 4.14. 0 1 2 3
The scattergrams of true z(u,) versus c s t i ~ ~ ~ a lvalues
ed ~ ; , ~ ( carryli~) True value (ppm) Std deviation (ppm)
muclr more iofornralion than the sulnmary statistics of 'fable 5.2 (Figure 5.20, Cu data
left graphs). 'Tl~cyall reveal an n~~rlerest.in~at.io~~ of large cot~centrations(val-
Prank 0.01
ues are below tlre 45' lino) a d an overestiu~ationof s~rrallconce~~trat.ior~s
(values are above the 45' line). 'Tlrc l~alanciogof these two effects results
- 120
in dire global onbiasedness seen in Tahle 5.2. Such a bias is called con&-
Ei 80
-
tional because it depends on the class of values considered. T h e c o ~ ~ d i t i o u a l e
bias drastically ir~Ruer~ces the evaluatiou of the extent of c o r ~ t a m i n a t i o(Ta-
~~ 404
ble 5.2). T h e u~~clerestimation of large copper concentrations may lead one to ..
declare safe all test locations, wl~ercirs8% of thcii~actually exceed the thresl~. 0
old value. In contrast, the overestimation of smiill cadnrin~rrc o ~ ~ c t w l . r a t i o ~ ~ s 0 40 80 120 160 15 16 17 18 19 20
may lead one to classify t l ~ cmajority (02%) of test locatious as contatni~mted. True value (pprn) Std deviation (ppm)
The same effect, tllouglr less pronourlced, is ohserved for l e a d
Pb data
0
0 100 200 300 23 25 27 29
True values True value (pprn) Sld deviation (ppm)
OK estixnatrs Co data
Sld devzatzoe
True values
OK estnnates
% contammatto7r
.. . ....
True values 0
OK estimates
0 4 8 12 16 20
% a~isclnssificalion True value (pprn) Std deviation (pprn)
OK estimates
Figure 5.20: Scattergrains of t n ~ cvalues a(u,) versus OK cstim;rtes Z ~ ~ ~ at ( I I ~ )
100 test locations ut (left graphs). For the same locations, note the lack of reli~tion
between the kriging error standard deviation ooli(or) and t.lx ithsoluk estimation
...-. I-. I.. 1 - -1.. ) I I.:,.,,' 1..,"."
184 CHAPTER 5 ACCOUNTING FOR A SINGLE ATTRlBUTE
.
infornratiolr t~iayrelate to two types of attributes:
A catcgorical attribute s wit11 It' ~ n u t ~ t a l lexclusives
y states sk (e.g.,
. rock types)
A smootl~lyvaryiug coirtinooos attribute y (e.g., nickel concentration)
If the secondary inforrnation is densely available but not exhaustive, it is a
*w better yet, by
re;wooal)lr! ;rpproxilnat.ior~to cotnplete it By ioterpolati~,~.
186 CHAPTER 6 . ACCOUNTING FOR SECONDARY INFOIIII.IR??ON
siniulation; see Almcida and Jonr~icl(1994). 111 the exatnple of liigure 6.1, Ni 111 this section, tltrce variants of the kriging pnradigni itrc dcvcloped to
concentratioirs were est,irnated in non-overlapping segnients (blocks) of lengtl~ incorporate exhaustive secondary data in the estinrat.io~lof tlic pri111;lry 8t-
50 ni so as to mirriic a smoothly varying and exhaustively sampled variable. tribuk z:
Iirigiag cuilhin strata an~ortnt,sto first strutifyit~gthe study area based
on the secood;try information then est.in~atingthe primary attriRrlt.e
w i l , l ~each
i ~ ~ sprxilic stratuni using tlie ~ o r r ~ s p o n d i nprimary
g data a ~ l d
, Cd data covariance model.
Sinrplc krigieq wilh tXITyIllg local means and krrging luilh o n el:lerr~n/
drift use sccoltdary data to clraracterize the spatial trend of the prinrary
attril~ule.
OK Cd estimates
i
Figure 6.2 (t.op graph) shows a stratification of the NE-SW trarrsect based
on geology. D a t a sparsit.y prevents considering lrrore tl~arrtwo strata. T h e
first s t r a t u m , denoted A , , i ~ ~ c l u d all
- stratification
e s locations on Argovian and Quaternary --- no stratification
rocks. 'I'lrc tlrrce otlrcr rock types with larger Cd concel~trationsand larger
proportions of c o l ~ t a r ~ r i ~ r a tfields
e d (Tables 2.4 and 2.5, pages 18-19), are
regrouped into the second sbratorn Az, wlriclr cousists of two discon~rt~ct.cd
scgt~~rwl.~.
Lf'igure 6.2: Kriging within strata. 'She transect is first split into two strata A I and
Figure 6.2 (~rriddlegraph) shows the s e n ~ i v a r i o g r a ~ofn Cd concentrations
dz,according to geology (top graph). Within each stratum, the Cd semivariogram
of each straturn. Concentrations appear to vary more continuously within is inferred arid mncleled (middle and Cd coucentrations are estimated using
the s t r a t u ~ nA, (smaller relative r~uggeteffect and larger range of the serni- ordinary kriging and stratnn-specific data (bottom graph, solid line). Vertical
variograrn). Recall t h a t the smallest concentrations were rr~eamredon Argo- arrows depict discontinuities at the strata boundaries. 'The dashed line represents
via11 and Quaternary rocks, rvlrich form the first straturn. Thus, the better the OK estimatc witlwnt stratification.
continuity of Crl concentr;rt.ioos within that st.rato~urelates t o the brt.tcr
connectivity of srnall Cd concentratious revealed by the alralysis of indicator bottom graph. T h e d a s l ~ e dline corresponds to the OK estimates computed
sernivariogranls (Figure 2.19, top graph, page 45)
.
without stratification ill section 5.3. Note tlre following:
C a d m i o t r ~coucet~lmt.iousarc cstiln;ttcd ~ r s i r ~ordinary
g krigirrg, st.ratil111-
specific primary data, and the selnivariogriur~models shown in Figure 6.2 Kriging w i t t ~ i ts~t r a t a yields estimates t h a t suddenly cl~arlgeat the
(rniddlc graph). ' h e resulting estinrates arc depicted by the solid line in t l ~ e s t r a t a boundaries depicted by vertical arrows. Indeed, estimates 0x1
different sides of a stratum boundary are based on t,wo different sets of
190 CHAPTER 6. ACCOUNTING FOR SECONDARY INFORMATION 6.1. EXHAUSTIVE SECONDAHY INFORMATION 191
Uuder the decision of stationarity, the mean 711 does not depend ou loca-
tion u but represents global information c o ~ n m o nt o all unsnrnpled loc a t.~ o n s . T h e kriging weights X r ( u ) in expression (6.2) are obtained hy solvirrg a
T o account for the secondary information available a t e a c l ~location u, the simple kriging systerrr of type (5.9):
knownstationary mean rn rnay be replaced by known varying means 7n;K(11),
n(11)
leading t o the simple kriging with varying local incans (SIilrn) estilnator:
C $"(II) ~ l i ( u ,- uir) = c,~(u-- n) o = I , . . ., ~ ( I I )
p=1
I t F 11(11) = X ( u ) -
where C ~ ( 1 1 )is the covariauce fn~rctionof tlre resid~~;tl
ni(lr), not t h a t of Z(u) itself.
Ilifferent estimatr:~o f Llre primary local illeati i r ~ ( o )c;m Ire tlscd, dcl>cIIdillg
on tlie secondary information available: Example
1. If the secoudary information relates to a categorical attrihute s wit11 Consider the esti~natiouof Cd co~~cenlrationsalong 1,he Nit:-SW Lra~is(:cl11si11g
K non-overlapping states s ~ the
, primary local mean can Be identified as secondary inforrnatiou either geology (categorical attribute) o r the lrlock
with tire mean of z-values within t l ~ ecategory sa prevailing a t u: OK estimates of Ni conccntratior~(continno~isattrihute), a s rlcfi~iedirr Fig-
ure 6.1. Figures 6.3 and fi.4 slrow the difTercut st,eljs of I h ! corrcspor~ding
k r i g i ~ ~appn,acl~os:
g
1. 'L'lre c;tlibration between prinlary aurl s c c o ~ ~ d a rdya t a ;~nloontslo de-
ternlining either the average Cd concentration for each rock type (coo-
rlitional ~ n c a u s )or tlre regression fimctiorr of Cd conccntrat.ions on Ni
block estimates (top left graphs of liignrrs 6.3 and 6.4, res~~cctively).
This calibration results either in a "staircase" trend cstilrratc (Fig-
lire 6.3, second row) or a more s ~ n o o t l ~ lvarying
y treud when derived
where n t = E=,
~(II,;sn) is the number of primary d a t a locations from the coutiul~o~ls Ni att.ributc (Figure 6.4, second row).
within category sb, and i(u,; sk) is the indicator (2.23) of category sn.
2. At each primary d a t u m location u,, tlre rcsidr~alvalue ~ ( I I , , )is com-
2. In tlre case of a secondary c o u t i u ~ ~ o nattribate
s y, the priwary local puted lry snbtracting tlie trend estimatt: ~IL;~((II,)f r o n ~tlrc prinrary
mean can Ire a funcl.ion (linear or not) of the secorrtlary attribute va111e I ; I I I I ( I ) e . ( I ) = ( I ) - ~ ( I I ) 'I'l~e. se~nivariogralnof
at u: residuals is t1m1 con~pntetland rnodeled. I'ignrcs 6.3 and 6.4 (top right
grapll) slrow 1.1~: a:ntiv;~riograorcornpu1,r:d from 259 residu;~li1at.a and
tlic inode1 fitted.
1 2 ClIAP'l'ER 6 ACCOUNTlNG FOR SECONDARY INIIORMATION
Semivariogram of residuals
Sernivatiogram of residuals ;egression
I5 l
0.0 C
0.0
-,
0.5 1.0
-
1.5
7
2.0
/
-- 10 20 30
Ni block value (pprn)
0 = 0.86
40 50
Distance h (km)
4 1 Trend component
Dislance (km)
+
SK estimates of residuals
SK estimates of residuals
-7--,
1 2 3 I 5
. -
6
Distance (km)
as opposed to m ( u ) = no(u) + li
( L ~ ( I Ifi(11)
) for kriging wit11 a lrcrid
model.
As with the 1C1' approach, I.11e two t~nkno'ivot.rrnrl coellicients no(11) and 'Che secondary variahle sliorild vary smoothly i l l spacc 1.0 avoid inst.:t-
Ijility of the KED systeni. 111 nit carly applicatio~~ of kriging wit11 en
al(u) are deemed constant witliiu llre search ncighhorlmod W(n) and are
extcrn;rl drift, lto~vever,M;rr&clml(1984) colisidered tlic ex;tmple of lo-
implicitly estimated through t,lie kriging syslem. Unlike l h e SIC approach of
c;il trends of ~iiineralgrades cot~lhllcdby faults hlocks. 'I'he ext~crriid
the previous section, the mean m(u) is not e s t i ~ ~ ~ a tLhrougl~
,cd a calibration
variahle w;rs the11 an indicator varia1,lc that, cli;il~gcdvalue only ;rcross
or regrcssiorr process prior to the kriging of I .
the hrrlls.
The KED estimator is
n(U) As discussed ill sectiol~5.4, the residual scmivariogra~rtyrl(lr) s l ~ o ~be
~ld
I ) = C Xy'(4) %(II,) (6.4) inferred fror11 pairs of z-values that are ~tnafl'ectcdor slightly affi~ctail
hy thc t.rrnd, id?., fro111dat,;t pairs such tlial y(u,,) y(u, +
11).
n=l
196 CIIAI"I'ISI1 6 . ACCOUNTING FOR SECONDARY INFORMAlTON 1 . EXllA US'I'IVE SECONDARY INFOIfA,fAll?ON
(i. 197
Kriging t l r r t r e n d a part,icol;rr case of the K T system (5.36) wit,t~IC = k' = 1 and f i ( u ) = y(u):
St is s o l r ~ e t i ~ luseful
~ e s to couipute and map the trend component m ( u ) that is
implicitly rlsed in tire expression of the I'iEI) estimator. T h e trend estimator
is writ,t,en
n(11)
lll;IEU(ll) = h(;1:1) Z(lln) (6.6)
n=1
where X;~/(II) is the weight assigned i,o datunr z(u,). 'I'lre kriging weights
arc ohtaiued by solviug a krigirig system identical l o lire KT syslern (5.33)
exr:ept that 1.11r.t.rtwd function f i (11) is rrow sr:t cqaal to the secondary varii~ble
?l(ll):
A zero slope n;(u) tncatts titat tile secon<l;~rydnlum y(u) does nol. influence
1 . 1 ~primary trend cstirnate a t u. As that slope incrcasr:s, the ilifluc~~ce of
llie secondary value hcco~nespreponderant. A map of the scaling fact.or
a;(u) allows one lo depict the local inlluetrct: of t l ~ secondary
r variable in tlie
estirnaliol~of tlie primary trend compor~er~t.
TIE estimator of tile i,rerid coefficient n l ( n ) is written 'I'lic ICED estimator (6.9) is tlrus sirrtilar to an SIC estin~at.orwith varying
local means dorived from ;l linear resraliltg of tile colocated ?/-datum:
,,=I
A r ( u ) [Z(ua) - m>rt(ue)]
198 CIlAPTEll G. ACCOUNTING POI1 SECONDARY 1Nl~'OIIMATION
.
,l h e two estitrtators differ by the defiuition of t.lre ireilcl compottc~~t,.
'Tltt: N M W trwrstxl. As in Vigrrrc 5.3, tlic verlic;d d;rshcd l i l m split. (.he t.raitsccf.
trend coefficients nf, and o; are derived once and indej~eodentlyof the kriging into six segnients within wlriclr t,hc same five primary data arc ttsed far esti-
system in the SKlm approach, whereas in the KEI) approacli the regression ~~r;tt,iolt:for examplc, ;dl trettd and (:(I estitrtatcs withill lire seg~llent.I 2.1 klii
coefficient^ a;(u) aud O ; ( I I ) are implicitly estitlratml t.hro1rg11I.lrn krigitrg sys- art: I x w d O I I Cd data n(. loc;rfions 111 to u 5 . AS ( I ~ S C I I S S ( : ~ lpr~~viot~sly, IIII. Ikrip;-
tem within each search ~trighborhoodW(u). ing system (6.8) is identical a t all locations where t.lrr: same neiglrhoring data
are involved in the eshirr~ation.'Therefore, the two KkX-cstituatcd tret~dco-
efficient,s n ; ( ~ ) and n ; ( ~ )are coltstant wit,llin each segnlent and cltaogc from
Example
one segulcnt to the next, depending on Clle fivc neigl~horingdata rcbait~erl;sce
the slope cstitnales (I~'igrrre6.5, 1)ottom griq~lr,solid litre). 'l'l~eslopc of t.lte
trend tttod~lis tnttll.iplied I>y I.w0 frotn t,lie first S ~ ~ C I I 10 L t l v IICXI. I K C ~ I I S I ~
tirr: larg? (:d c ~ ~ l c ~ n t r i r t ;it
i o un g , I , ~ D I I it g& C ~ O S C1.0~ Lh<!C~111Sl.ikrlL ~ 1 0 1 ~ 1
, Trend estimates of llie global trend rrtodel that is 1 1 s 4 i n siniplc krigitig wil,l~varying local
means (Irot.t.ot~igmplr, Iioriaor~l;rldasltcd lirte).
Botlr ICED and SKlrrl trend an11 Cd cstinlates arc fairly similar. 'Tltct
largest diffwcnccs occur for the first segment, 1 2 . 1 k n ~wlifm! , tire two KEl)
and SKlm slope estitiiates differ inore. As wil.11 krigiug with a I.rt!lrd modtd,
beyond Lire extreme right daturrr a t coorditratc 1.8ktn, krigittg with at, r:x-
ternal drift extrapolates the linear trrnd mo11r:l fitted 1.0 tlrc I;rst, fivc (:<I
values. Such e x t r a p o l a t i o ~woold
~ ha dartgemus if the last five vdtlcs slrowrd
an at.ypical relation b e t w c e ~Cd ~ data and Ni lllock ~ ~ t . i t n i i l esay, s , a iicgal.ive
Distance (km) correlation ca~iscdhy an outlier. Si~trpIekrigiug with varying I0ca1 nreit~ts
would be inore rollust in that il. cxtrapol;ites u relatiotr tliat is fidted i,o all
I Cd estimates
data along the transect.
1 . I<rigiug witl~ittstrai,a, 'l'lrc strrdy irrw is divided into two s1.ri~l.i~ I>its(:d
j Slope estimates a;(u) on tlrc geologic map ill I'igrrre 6.6 (left top graph). 'I'lrr, first, stral.!rrrr
includas Argovian rocks wit11 the s~~tallcst. proport.iolt of c o ~ t t ~ ; ~ n r i ~ ~ ; r t ~ : d
locations; the four other rocks form tire scco~tdstr:tt.utr~(l'igure 6.6, lcl'l.
hottom grapl~).Ordinary kriging is pcrfor~rrerlseparately wil,l~itrcaclr
strati1111wing a pooled withi~r-stratumsen~ivariograui;srt: right bolf.orrr
graph of Figure 6.6.
Quaternary
2.0
Poitlamdian 16
Sequanian 12
08
Kinmeiidgian
0.4
Argoviao
00
20
stratum 2 16
st<atum1 12
08
00
0 0
Table 6.1: SLal.istics fur t,rire and cstiri~atedconcent,ra- 6.2.1 The cokriging paradigm
tions of heavy metals a t 100 tcst lociil.iom; algorithms Chusirler the situation where priniary diit,a { r i ( n , , ) , n i = 1 , . . . , r r l } arc slrp-
are the reftwnce ordinary kriging (OK), krigiug within plementcd by secordary data re1;lted to (N,, - 1 ) c o n t i ~ ~ u oat,t.ril)~~l.es m zi,
strata (KWS), simple k r i g i q with Iocd means c o n - { z i ( o , , ) , r v i = 1 , . . . , 7 t i , i = 2 . . . , N,,), a t any, possibly diffcrcnt., locations.'
puted as a linear fui~ctionof Z n l~lockestimates (SKlrn), T h e 1i11e;we s t i ~ ~ i a t o(5.1) r is readily cxl.e~~ded to incorporate tlral, additio~inl
and kriging wit,li an external drift (KED) infornlation:
Mean
T r l w values
OK estimates
KWS esliruates
Sl(1111 cstin~atcs
wl~ereA,,, (11) is the rz.eight assignd to tlic primary datum 21 (II,,,) and A,,,(II),
KEI) esti~nates
i > 1, is the weiglil assigned lo the secondary datum z ~ ( I I , , , )'.r l ~ eexpccl.c:d
values ofl,lie ItVs Z l ( n ) a ~ Zi(u,,) ~ d are denoted m1(11) and nzi(n,,), respec-
Sld deuiatim
tively. 'I'ypically, o111ythe p r i ~ ~ ~ aud a r y sccor~dirrydata closest to llic locatioo
True v a l u e s
11 being cst.in~;tbedare rel.;~i~~erl, i.c., n,(u) is us~lallysmaller than ni. 'I'llc
OK estimates
~ I I I O U d
I ~ data
~ r c t a i n d and 1 . l s~ k <iftlic sc;ircl~~ ~ ( ; i g l ~ l i ( > rIl I~(oWo~n(11,
~ ! lh!
KWS estimates
tlie salne for all atlril~utcs.
S l i l ~ estimates
i~
All c o k r i g i ~ ~csti~nators
g arc but, varia~itsof cxprcssio~i(6.10). 'l'l~oy ;IS(:
KEI) esl.irrrales
;III required 1.0 be unbi;tst:d and 1,o n~irtin~izc 1,hr crror vnrinncc n ; < ( r ~ tlint ),
IS,
SKlm cstin~ates
KED ostimatcs
any two residual 11Vs Ri(11) and R j ( u
'I'ltc cross covariaocc bel~rvce~~ + h) is
,,
I he cokriging weigl~lstitat nli~linliaethe error viirian~r;ire ol~t,itincd1)y set-
+
ting to zero the ( ~ I ( I I ) irx(u)) partial first derivatives of tlrr quadratic
form (6.12):
S e v ~ : r a ls c x o ~ ~ r l n rvnriablcs
y
,,
I h e silnplr cokriging estimator (6.1 I) is readily extended l,o marc t l ~ a none
for cxalnplc, for (A',, --I) varialllcs,
s c c o ~ ~ d n rvarial~le;
y
( 1 1 ) - 7 , = x
?1,(11)
rrl=l
X ~ ( I I [%l(llo,)
) - 1121j
N" t~,(Il)
.+ ( I )( I )-I ] (6.15)
i d ,,,=I
Usir~gan approach similar to that for two attribl~tcs,our derives bile folk)wilIg
cokriging system of ( ~ ~ I ~2( I I1illt:;lr
)~) aluatior~s:
Matrix r ~ o t n t i o u
. .
,, (XI) Using ~ n i ~ l r ~i xr o t a t i o the
~ ~ ,silnple cokrigiug systeltr (6.13) is writt,en
i
~ ~
= G l ( l l < , , - 11) e, = I , . . . ,
&=I
x
n,(lIj
A;:"'(n) c21(llo2 - u p , ) +
&=I
x
,~?(ll)
Ai;;"(ll) C22(110z- 11,j2)
with
Ksc:ti AscK(r~) = kscK (6.17)
fLcc;rll ll1:11 1 I K cross corrrrlograni pij(l1) is delixted as Lhc rnlio (Aj(h)/(ci .uj),
x
N.
, s c ~ I ~ (=~ ~ ) [z;(II) - mi]' A ~ ~ ~ (+I WII
p'
i=l
I ) (&la)
ivl~ereu: is the stationary variance of t,lre ItF Z;(u). Cokriging systems
written in terms of correlograms or covariancrs yield different weights, yet
L I P rokriging cstin~at,orremains !.he same.
- -
'r, gcti (11) is the ni(11) x 1 vertor of
w11ere Z;(tl) = [ Z ~ ( I I ~. .).,, Z ; ( I I , ~ , ( ~ ~ ) ) ] Indecd, accout~littgfor the defiuition of t h cross corrclogra~rr,system (6.20)
cokriging wcightr nssignd 1.0 tlic zi-(iat,a a t locat.iolls II,,,, n; = 1 . . . , rti(tl), IICC~I~I~S
and nl, is t.I~en i ( u ) x 1 vector of stationary (cot~stant)n m w s m i . ' I h Nu
vectors of cokriging iwiglrts arc solutions of t l ~ cs y s t r n ~
3. 'The simple cokriging est,itnat.or (6.15) is an exact interpolator i r ~that Wl~crrnsthe pri~narydata weights arc the same for bolh systems, the weights
it, l t o t ~ the
~ s printnry data nt tlrcir locations: of the secondary data are rescaled by r a t h of staudard deviations. T h e
cmkrigirtg est.il~r;itor(6.1 1) is llren writtm
210 CHAPTER 6. ACCOUNTING FOR SECONDARY INE'ORMATfON
T h e standardized form of the SCK estimator is obtained by dividing lmtlr 'l'lts cokriging variants are now illustrated rrsing the one-dimc~tsio~taldata
ternis of the expression by 01: set with a prin~aryvariable (cadmiur~iconcentration) and a single secondary
variable (nickel conccntratiott). The iuforniation availahle for estimating (:(I
concet~t.rai.ionsalot~gthe NE-SW transect consist.s of Ll~efollowiug:
W e i g h t s of t h c ntcmls
Tlie simple cokrigiq e s h n a t o r ((i.11) c m he rewrit,tcu as
.
0" 0 .
50
I Ni data
-2e- . . . O
"
111 t,hr gencr:rl forttl of cokrigitig, exprrssio~t ( G I s ) , primary and secondary
s
g
20
10
0
o
. '
.PIS data locations
[lala localiotrs itred not coi~icidr.Olie particular casc is wltet~all variables are
equally sa~nplcrl,m d tlrr satr~cn(n) primary and srvmdary data locations
0
-
m
0
- o S datalocations are n:t.aitlcd in t,llc ~ ~ s t i i t t ; & x ~ :
I 2 3 e 5 6
Distance (km)
7'.
where Z ; C I i ( ~ ~ )= [Z.v;((o), . . . , %E;(u)] IS the vector of simple cokrig-
where C ( u , - 11~3)= [Cij(tlu - rrp)] is the N,, x A', matrix of ;int.o mid
cross covariairces l~r!l.weerr ;my two variahlcs Z, ;and Z, a t II,, ;rrld lql, a l ~ d
C(II,, - u) is t l ~ cA', x Nu ii~alrixof aut.o aud cross covarianccs l,st.wicu ;ruy S i ~ ~ r pkl roi g i n g versus s i m p l e cokriging
t,wo variables Xi ;ind Z, a t ti,, and 11. 'I'l~es i ~ ~ r pcokriging
lc s y s t c n ~(li.21) has
the s a ~ f :o r u ~as ~.II(. sitr~pli~
kriging systclri (5.10) c s c q ~ tt.ll;rt,
. t11,: iwlrics ;rrc
matrices irmtcad of scalars.
,,
I h c error variances associated will, t.lre N, s i ~ r ~ p cokriging
lc cstit~mtors
are the diagoi~irlelemel~tsof the nrat,rix:
A large rokrigiug syst.enr 1i1osl Ihc solvt!rl
It is t.licn worth looking a t tlrc hcncfit one rrlay gi~iu~ I Y X I I 1 . 1 1 ~ addibi~llil~
morlcling and coniputatio~ialdart ~recessitaLcdby cokriging.
? % c o w l r r . e l n d o u ~ r l a g r of r u b r y r r r y
'1'11c wkriging t,st,irn;it,or is f,l~wr~%ic;~Ily l x t t , < 1~ ~ ~ ~it,%
x w1r o~r \,t~ri:tuw
s ~ ~ is
In s o ~ ~sitr~at.io~rs,
ic t . 1 1 ~varial~lcsiuvolvcd i u cokrigilig are linearly d c ~ p c ~ r d i ~ ~ ~ t . . always smaller tliati or cq~~;rl t,o tllc r:rror v;lri;rnct: of kriging whic11 igi~orrs
I b r example, tlia zi-values irmy sum lh ;I constant 1); e.g., rcl' a I iv<: ' coiic?~i- all stwm(l;wy iuf~mrtatio~l:
~ oue. I<liowledg<,of any ( N , - 1)
tratious of geoclielnical elelrrr?uts s r l r ~to
attribute valot:s at, 11 allows one to dcd~icr,exactly tile valur of I . 1 1 ~r ~ l ~ ~ i l i n i ~ i g
altril~utexs I ) - ~:i_';' z,(u). I N s~iclra r ; ~ i,cw;rrr,
, t.It:tI.:
estin~alorof a st1111 s, Y(u) =
o f v a r i i d ~ l ~say,
li --.
111 t.11,: isohpic cast, ;irioI.ltrr adv;i~tt.agvof d r i g i r ~ gis t,lb;it, t,lw c o k r i g i ~ ~ g
L1=]
X t ( u ) , is ~ q u i r lt,0 t,lrc S U I I I
of the cokrigirrg estimators of the li compone~rt,sZ k ( u ) (Mathcrori, 1979):
+
provided the ( l i 1) cokrigiq cst,i~rialrmY;<;,<(u) and % : * ~ ~ 1are ~ ( ~built
~)
frollr t h p saxrrr set of zi-data. I"or examplo, Llrr varinl,lc Y ( u ) ilmy bc thc
dols) and h 1 r scco~ldaryd a t a v a l ~ ~ c(ope~r
s circles) all loc;rtcd orr a circle of
unit radius away from u. Simple cokriging weights are c o n i p ~ ~ t eusing tl tllc
fnllowilrg c o r e g i o ~ ~ n l i z a t irnodcl:
o~~
Both direct semiv;trii~gra~n irrodels arc boundcrl and of unit sill, licr~cetlic
sill of the cross scnrivariograrri rr~odelis tlic correlation cocficient i,ctween
prirnary a ~ sccor~dary
~ d variables; see relat.iorr (3.39).
Figure 6.9: Relative custrihation of secondary data (open cin:lrs) versus primary
data (black dots) lor the simple cokriging cstimatiou 01 the prim;~ryntlribtite at
location u. That contribution, nrrasrrcd by the ralio $(II) of secondary to pri-
mary cokriginy weights (ahsolute values), increases as lhe two variables arc better
correlated and as the relative nugget elfect on the primary scrnivnriogran~nrodcil
increases.
220 CIIAP'I'ER 6 ACCOTJN'l'lNG FOR SECONDARY INF'OIIRIATION 6 2 T l f E CORHIGING API'ROACN A*,'
(.I) S n i r ~ p l i e gdcnszly
Figure 6.11 (lop graph) sl~orvs;UI isotopic d;rt;~conligrlralioo wl~ereboth
primary and secondary variables are measured a t four locations one unit
distance away f r o u ~the location 11 to be rstitnated. Two lreterolopic d a t a
configurn1,ions are also considrrcd:
10
. , ~ ~~-~ ~ -
.~- ~.'~--.-
20 30
~.
40
Azimuth angle I1
Figure 6.12 slrows the silriple kriging (clashed line) and cokriging (solid line)
estim;ites of (:<I conccntraI,ioos along f.lrc Nl?-SW tratlsect for three sarrlplitrg
clc~lsitics:
Six pairs of colocated Cd and Ni data plus ten additional Ni data (nn-
ilersa~rtpledc ; m 2)
'The Irriging a1111 cnkriging est,imatcs are cssent.inlly the same in the isolopic
c a w . I)ilkrc~rc<,sbetwecl~cst,i~n;~ti,s
increase as sccol~dnrydata hecornc Inore
For a w r o ~ a i t ~ ~ umgle,
t ~ l r the ratio $(u) rlow excr:crls folu (U'ignrc 6.10, riglit ~ r o t I ~ sr i r l ; Wlmr ouly six Cd values ;ire avnilal,lc (utl-
hof.ton~graph). This li~rgeratio val~leindicates a scrcc~riltgof t l ~ eprimary dcrs;rtrll,lcd case 2), the simple krigillg esti1rr;rtes are close t,o tlrc stalio~lary
d a t a by the colocated sacoudirry i11for111al.ion.A s n ~ a l shift
l (0 = 6') of thc IIIC~~A I ~C
. COIIIIL for ~ ~ secondary licke el data allows a better resolt~tio~r
~ I1,he of
sccotdary data drastically reduces that scrcct~ingeffect. tile cokrigiug csl.inralcs.
222 CHAPTEII 6. ACCOUN'flNG FOR SECONUAflY INE'OllMATlON
Isotopic case
- I
Cd estimates (equally sampled case)
4
Cd=lO
- N i = 10
-8 ' m,
..
..
80 - SCK
....... SK
1 2 3 4 5 6
DislanCe (km)
Config 1 Config 2
0 0
- SCK
....... SK
i_~
.. .. --..-
1 2 3 I 5 6
Dislanco (km)
--
--
Isotopic
,'.
-
- -
. Conlig 1
Config 2 ,
- I I Cd estimates (undersampled case 2)
,:I E Cd = 6
-8 ' -
,'i
.
'N
,"
m
*
Ni- 16
.// c mi ......
3 0
- SCK
....... SK
I
00 02 04 06 06 10
Relat~venugget effect bo
Figure 6.12: 'The di&rcncc bellveeji siinplr cokriging (solid line) and simplc kriging
(dashed line) estimates of Cd concenlralioi~s incrcasrs as thc primary vnriable is
more tndersampled. Nickel data loc;rlions increase f n m 10 (upper grzqrh) to 16
.
(caws 1 and 2). Cd rlala locatioss decrease from 10 (two appcr graphs) l o fj (lower
>
6.2.3 Ordinary cokriging w l ~ r r et.he t.wo 1,ngr;mge parurrretcrs /L;"'"(II) and l~;"""(n) accoill~tfor tile
I.wo ~ ~ I L I > ~ ; ~ S P I I I I Cco~mt,riii~lt,s.
SS 'I'II(I l,i~~r:mgiitl~
1411) is i ~ l i ~ ~ i l l ~by
i z csflt~illg
d
tu zrro i1.s partial first derivatives with r<,spcct to thcr ( I I I ( I I )4-IL~(II) 2) +
(1;kt.a wcigl~t,s;wrl 1,agrnllgt: pararnet.ers:
Minirrlizatiolr of the error variance of type (6.12) ullder the two con-
str;tints (6.28) calls for the definition of a 1,;rgrangian L ( u ) similar to that
introdl~cetlfor ordinary kriging ill sect,ioll 5.3:
Siwilnrly, for L l r n c;we of t.wo or more secondary v;rrinl)lcs, t.hc ordilrary cok-
rigir~gestimator is writ,Lcn
CHAPTER 6. ACCOUNTYNG FOR SECONllAI1Y INFORMATION 6.2. T H E GOKRIGING APPROACH
226
Scmivariogranr n o t a t i o n
with = 1 for i = I and bil = 0 otlrerrvise. The cokrigiug variance is T h e cokriging systern (6.32) cat1 be expressed i n t e r ~ n sof dircct a n d cross
s~mivariograrr~s provided 1 . h ~cross covariance ~nodclsh c t r u w ~prin~ary
~ aud
secondary variables art: sy~r~rnctric, i.e., G j ( h ) = (;,,(ir) V i f j (recall
rliscr~ssionabout tlrc lag elli:cL i ~ seclion
r 3.2.3). Acco11111.ingfor thc r e l a t i o ~ ~ s
Cj,(h) = Cj,(O) - y,j(b), i , j = 1 , 2 , the ordinary cokriging systcrn i s wril,t.cn
Matrix notation
Using the 111aLria forrrrulirtiou i n h d n c c d in scclion (5.2.2, Lha ordinary cok-
riging system (6.29) is wrilten as
where 1; and 0; are n,(n) x 1 unit and nu11 vcctcirs, rcspdivnly. 'rhe gellet-
aliaation to the case N,, 2 3 is str:tigl~t,forward.
In the equally sampled case, !,he vector of ordinary cokrigiug rslima1,ors Il~tlikcortlin;try cokrigit~g,a s i l ~ ~ pcokrigiug
le sysI.i!n~ciw 1w cxpn:sscd
is written in terms of o l ~ l yauto and cross ~ovariances.
a(11)
ZbCK(ll) = [ L : ~ ~ , I ) ]Z[lxc,)
~
n=l
," 7'
where Z;)CK(~l)=: [ z $ ~(I,), . . . , LOCK(u)] IS t,he vector of c ~ k r i g i n ga -
timators. As for sirrrple cokr~ging,the N , x N,, matrices of cokrigi~rgweigl~ts,
LO""(\l)
a = [Az:;"(n)], arc: obtained by solving a single cokriging s y s t e m
It is good pr;~cl.icet,o rcscalc covarimre values sticti t,l~atclen~cotsol' llte
cov;rriatlce m i ~ t ~ iinxsystenl (6.33) arc of t.hc same order of i l ~ a g n i t t t d ~Tlic
.
st.att<l;~rdixrdibrt~tof (,lie ordinary cokrigitig csli~~l;lt.or
(6.27) is
,,
l h c corrcspouding cokriging weiglrl,~uz:"(ir) are ohtained hy solvi~tgt,lre
ordinary cokrigil~gsyslclt~expressed in tcrnis of corrclograms.
1,ikr siir~plccokriging wr:ights, O(:I< rvcigl11.sof starldardiaed and original
v;rriahlr~s;Ire dilti.rc~rt,althoogli l , l ~cokrigittg
r esl.iniatcs arc the same:
A s i n g l e l ~ t ~ l t i a s t ! d n e cs so n s t r a i n t
In s o l ~ ~c;~sr,s,
c priirmry and sccorlihry rl;rl.;t rcl;dr 1.0 the sallti: a1.triljllt.c.
I*'or exnnlplc, a f<,w prcrisc. 1:hor;ltory ~ ~ ~ r ; r s ~ t r e t ~ofl cp11
~ r l .or
s clay conl.~!~~l.
arc s t ~ p p l c t ~ ~ c n hy
l c dmore lrwilerous field data collcclcd using chcaper me:&-
suren~rlrt.devices. Mrasttretncnt errors are likely l,o l x l;rrger for field data,
m d f,l~eirsct~~iv;triogr;trtl is likely l,u havr a larger relalivc nugget, elkxt. 'Ib Like o r d i ~ t ; t rkrigiog,
~ ordinary cokriging ;ttnount,s lo:
z~cco~tnt for such a dilfercncc ill l l ~ rpnlt~crlisof spatial contik~nity,precise
I . estitttaliug tlir local pri~ttaryatid secondary ineans, my, m $ ~ l ~ ( uand
)
I;hor;~l,ory~~trasurc~tr~rt,l,s ancl less prccisc lirld dat;r arc weiglitcd diff(mwttly
~n:;';~(n) for the case N, = 2, a t e a d ~loci~1.ion11 using hotlt pritnary
t,lirot~gl~ cokriging. a ~ t dseconrlary <lath speciiic to t.li;tt neiglthorliood, l l ~ e ~ t
I'rovidcd bolli rncasurctnel~lprocesses arc unbiased and the two pritnary
atid secondary tircms rvitlritt the search rroigl~horltood bV(11) ;Ire dee~lted 2. applying the sir~iplecokriging estitnator (0.11) using these estimates of
q n a l , the lincar est.itnator (6.10) is r v r i t t e ~ ~ I.lrc incans rather titan tlic stj~t,ionaryI I I ~ R I I S7111 imd 1112:
, , , . . .
3
_
, , ~. . _ ~ ,
4
l , . _,
5
.. . , ~~.
6
1 2
Distance (km)
0
.. Y
-
----- OCK
....... SCK
Figure 6.13 (two top graphs) slrows the ordinary cokriging estimates of Cd
and Ni local mearts along the NE-SW transect. Like the single attribute case
in Figure 5.3 (rniddlc graph), both trrrtrd esI.iitl;rtcs h;rvc n s h i r r i ~ s csl~apr,,
each step correspondiug 1.0 r s t i ~ ~ ~ ahassdt c s 011 tlic sntllc twigltlmritrg prirnnry
and secondary data. ?'hc rltmber of sl,rps is, lio\vcvcr, hrger since 16 pritrxtry
data locations are here co~tsidercdinste;~dof only 10 primary da1.a locatior~s 1 Cd estimates
for Figure 5.3. I%of.lit r e l ~ deslininlcs follow tltc getter;d inrrc;~s<ri l l (:<I at111N i
corrcentratiol~salong tlrt: tr;ursert. 111 contrast, tilt: overall ~ ~ l c a t(Itorizot~lal
is
daslted line) overesti~natethe local mean in the low-valrrcd (left,) part of 1.l~:
transect and underestimate the local mean it! 1 . l ~ l~igl~-wdueil (right.) par( of
the trans&. This underestimation is Icss pn~lrout~ced lilr N i concenlritt.iolts.
Figure 6.13 (botl.om graph) shows the simple (daslmi linc) and ordinary
(solid line) cokriging c s t i ~ i ~ a lof
r ~(~!d
s co~~cel~lr;rl,iolts. Note tlw following:
Both tnterpolrrlors arc cxact
OCK cstitnitlks are smaller thau SCli rstilnat,cs i t t the lafl part of (,Ire
transect where C d local rricar~sare smaller tlr;tn the overall nreall.
Figure 6.13: Estimates Cd and Ni local n ~ c a s s ,and Cd concelktri~tionsilsillg
ordinary cokriging (solid line) and simple cokrigi~rg( d a h a d line).
c s l i t ~ r a t i oof
~ ~l . 1 ~C O I I I I I I ~Ioci~ln ~ e i uof
~ pri~~tiiryi111d I , ~ R I c ~s ~ c o ~ i d a r y
vari;thles w i t l ~ each i ~ ~ search neighborhood C V ( o ) . 'I'hlrs, local departures fro111
the ovcrall inems ;re d i l l accow~l,adfor as they arc in traditiotial o r d i ~ ~ a r y
cokriging.
1. So~i~ ofc tllc scco~r<laryd;tl,n w i g h t s arc negative, thereby increasing l i k e simple or ordinary cokriging, it is good pract.ice to solve the cok-
t l ~ rrisk o l gel.t,i~~g
utraccepf~al~le c s l , i ~ ~ ~ a tsee
e s ; rrl;~t,cdrlisc~lssion in riging s y s l . t ~( 6~. 3~7 ) in t.er~nsol c o r r e l u g r a ~ ~w~ls ~ et l ~~ c~variances o l primi~ry
srclion 5.8. and secondary variables d i l h by several orders of maguituile. 'I'lle cokrigi~ig
weights t h a t are provided ljy the a~krigitlgsyst,er~~s writkn in terms of covari-
anccs or a,rrclogra~lisitre not linearly r e l a k l , heuce thr r e s ~ ~ l t ,cokriging i~~g
< > s t . i ~ ~ ~ area t o rsliglrtly
s differmi. (Goovaerl,~,1997).
'I?) rcrluci. f.hc occurrcncc of 11eg;it.ivi. wnigl~lsand avoid artificially li~nit,ing
l.lltx i1111xtct(of s ~ w n d i r r ydalh, Isaaks and Srivastxva (1080, p. 4 1 6 ) proposed 6.2.5 Principal component kriging
rlsil~ga sirrglc co~~sl.ririt~f, of type ( 6 . 3 6 ) ;fin exaniple, for a single secondary
nttril>utc:
~,g:.~<
(11) = x
r>~(ll)
,.,=I
-,m1]
A:YJ<(xl) [Zj(n,, )
[i,=l
?21(11)
( I ) I AyK(ll) = 1
11, I
(6.46)
I3ot,l1prirn;rry ;rnd scco~rdaryvarial~lossl~ooldhe st.nu11;trdiaed to a zero
wllerl hlreir variances are significantly difirent. 'I'h
Itlean ;111rl rtt~itvari;r~~ce
cokriging weigbI,s would t,11r11lic solttlions of systeln (64G) exprcsscd in terms
of corrrhgr;tn~s.
Il~rlikel . l ~s i ~ ~ r p lr er k r i g i ~ ~syst,cln
g ( & I s ) , 1 . 1 ~colocatcd s i n ~ p l ecokrigilrg
systeln does not c;rll for the covirriauce h c t w e n secood;~rydata for 1111 > 0 ,
wl~iclrsllevi;tt.t.s the inli:rerlce anil ~rrodclingclfnrl (see s ~ ~ b s e q ~ discussio~~ ~rwt
rcl;tlt?d to I.lir M;rrliov trlodel).
llnlike full cokrigiug, colocat.ed cokrigiug calls for the iofcrance and modeling
m l y of 1 . h ~prirrmry covaria~~ce
function C l l ( h ) and l l ~ ecross covariance fnnc-
tiou (,'12(l~).'I'hr 111o0cIingdFort ran IIC f~~rt,llcr ; ~ l l ~ ~ v i ; ~hy
l , v t,l~c
~ l foIio\vi~rg
;i]>[~roxi~~r;il,iot~:
( '1 ( I ) G d 0 ) (;ll(ll)
Cll(0)
+
wlrere tho residual 11(11) is assumed ortl~ogonalto Z l ( u h ) , V 11. T h e linear
rcgrcssion (6.48) is not a requisite for model (6.47); rather equatior~(6.48)
sl ~011dbc read as the definition of the residual il(rr).
A sufficient but tmt necessary condit.ion for inodnl (6.47) to L~oldis tilt,
independence relation:
= J J z f ~ c {; Z ~ ( ~ I ) ~ Z =I ( I I ) J ~ ~ ( :,I I ;
2 ) . z1+~2r
1. (hloc;rl(xl c d r i g i n g ;~\widsi1~slh1~ilil~y
<,lt<Iiiry11;llil.
c a ~ ~ s cI)?
d highly ~ ~ I I I I ~SIX-
~ I I ~ ,
if the regression of & ( n ) o t ~Z l ( n ) is linear, as in rc~lxt,iorr(6.48), 4. It docs nol require nlodcling tlic cross covariance fnnction C12(11) as
long as the Markov apprositttation is rcasonahle.
3. 'l'he i t ~ f o r ~ n a t i oprovided
n by secondary data 1)eyond the colocat.ed sec-
ondary cl;tl.u~n~ ~ ( 1is1 )ig~wrcd.
r Tlic ilrfcrcnce of the residual covariancc r+:qllircd iiy kriging with an ex-
t.crual drill, is 1101st,raightforw;rrd. M o d c l i ~ ~dirrxt
g and crms scmivari-
ogr;ttils is st,raightfi~rwardt.lioug11dc~n;~ri~ling. It is rveu less delnanditrg
if Lhr: Markov al~proxi~rral.io~i (6.47) is appropriat,c.
i g r c I : Ordinary cokriging pstimatcs of Cd concentrations using the live Consider the situaliorr where interval-type data are collect.ed a t n' locations
closest Ni hlock astinrntes (full cokrigieg, dnshed linc) r x the single colocnted Ni ub. More elaborate measurements are conducted a t n << n' locations,
I h r k rstirnatc (rolocnlcd cokriCisg, solicl line).
6The primary attril,utc is denoted z in this section since no other continuuos ntlribute
is considered.
the estin~atedOCK value a t 4.75 ~ I I I(undersampled case 1) exceeds
Cd indicator data
4 1 t,he critical tl~ridlold(L~oriaontaldashed line), altlroogh the colocated
soft. & ~ t n ~i~~clicatrs
n that p;uticulnr location as safe, At that location,
the Iirrge itcighbori~~g 11nrd d;rt;r prevail o n the soft (possibly inaccu-
- threshold &=08 ppnr
rat?) coloc;ttccl dat11111.Indicat.or kriging d g o r i l h ~ n sint.roduced in sec-
. + location contaminated
- localionsate
t.ioti 7.4.2 will ;tllow both iiartl dath and ronst,rainI, i~rtcrvaldata t o be
1io11or1dprovidcil they are co~~sistenl.
I..
+ . . . . .
t
-
2
~ 7
+ + + + +
3
+ +
4
+
5
- 6
0 Accon~~t.il,g for indimtor data yields s ~ ~ ~ ; i lCd l e r eslimat.cs a t locations
t h a t are deenred safe tlrsn ordinary kriging usiug o ~ d y11ard data would.
Dislance (km) Uiscrepancios 1)ctwcrn kriging and cokriging esli~lrntcsincrease as hard
da1.a 1icc~l111~ l v i l2).
s p r s c r ( r u ~ d e r s a ~ ~ ~ pcase
Distance (km)
(6.56)
where pzy(11; z,) is tbc cross corrclograitr hctwrxn lrard and soft d ; h a t
threshold 2, Cd estimates (undersampledcase 1)
4 I
Eznmple
tising the co~iditiorialfreqrie~iciesgive11 in 'l'al>lc 2.5 (page I!)), t.l~cgeologic
profile i t 1 Figure 6.1 ( p ~ g cI % , niiddlc grztpl~)is ~ : o ~ ~ v c ri l t. ~~ d;I: h ~>rolilc01'
probabilities of not, excecdit~gtlrr critical t.lrresl~old for Cd concetitr;~tions
(Figure 618, top graplr). T h e largest proliahilit.y is observed oil iirgovim
rocks (1.25-2.75 kni), wlicrcas t,l~ereis a zenl prohal~ilityof not escccrling
the critical t~liresholdon Portlaridian rocks ( 4 . 2 ~ 4 . 8km). Ncxt, the 259 goo-
logic data s(u,) available across t,he study area are co~ivcrt.edinto soft <I;rla
(conditional prol~al~ilities) y ( u n ; rC).Figure 6.18 (secoiid row) slioms t,lw s t h -
dardiactl sen~ivariogramsof soh data m c l C:d colrcr:titr;~tioi~s.'l'hc corrrrln(.io~~ Cd estimates (undersampledcase 2)
between hard and soft (l;tt.;t is inl~clistnmgcr alo~lgtlic N L S W trausvct (10
locations) tlrarr over the study area (259 localioi~s). Rather tliait i~rocfclirrg
the experimental cross se~~iivariogram of 259 data values drpicting a negli-
gible liard-soft corrclation, for the sole purpose of this exairiple a sytttlretic ...........
cross scrnivnriogram irmdcl is 1)11ilt,as a spl~r:rical~ i ~ o dof e l raogc I .Z l i n i witlr
a sill corresponding t,o a correlntion coeflicieirt p z y ( 0 ; zi)of 0 . 4 'T11c litrear - OCK
....-.. OK
model of coregionalimtio~rbetween hard and soft d a h is &:pictcd by LIic solid ., _
,., ., .
4
_. / ... . ~ ,, .
1 2 3 5 6
line in Figure (i.18 (scconrl row). Uislanee IknO
Fig~rre6.18 (two Im~~totii griqilts) sliows tl~t,s t ~ i ~ i ~ d ~ ~cokrigitig
r d i w d (()(!I<)
estinrates (solid liite) of Crl concc~~trntions usitig the five closcst Cd d a t a and
the colocateif soft y-datuit~. 'I'lic d;~slrcd liue dcpicts ordinary kriging CS-
ti,rratt.s l l s i l r l r n n l v ill,. (!,I rl:ll:, I?,," i,,,tl> c:*,,,,,li,,,, ' l ' ~ n Q ; i i1, ~
<>P. ill~ 6 l'rl
I Cd data Cd data
I
OK CK8so CKhet CKcol K CKIFO C ~ h d CKCOI
Algorllhm Algorithm
(hkrigiug algoritlrtus are used l o cst,inlatc ~ l ~ e t nwit.11 l s widesprcsd c o n t ; u ~ ~ -
illat.io~l(Crl, ( A , t'b) and coh;dt a t 100 test locations. 'Lrhlc (i.2 gives, for
each metal, b l c set of st~condary~ncl,alsretained in cokrighg. Sccmdary ill-
-
c
a,
0.8
Cu data Cu data
for~rraliouis availnhle at. 259 prilnarY d ; h 1oc;llions (isot,opic cnsc) or a t 259 -- 1
pri~narydata locatiorls pills 100 lest Iocaliol~s(Ilelerotopic case). For h L h
s;unpling rlcnsil,ies, hllc 16 closr~sl.<lathlocat.iu~sof cad1 pri~tt:lry/sccolldary
2
.-0
0.4
Figurc fi.19: Ilank correlalion coefkients betwecn trnr metal co~~ccntratio~ts and
cokriging cstirrrntcs at 100 test locations (left column), and mean absohtc errors
(right colunm). 'Two cokrigiug estimators (ordinary cokriging, st,aadardizrd ordi-
nary cokriging) i d three data configurations (isotopic, heterotopic, and colocated)
.. , . , , , ., , ~ .,: ~:,,:,,'.
A c c o ~ c ~ ~ tfor
i ~ rbettcr
g s ; i ~ ~ ~ psecourlary
lcd 111e1,als(I~eterotopiccase) sig- primary n ~ c t a l .Rlisclassification fbr Cd and C11 is furlher reduced l)y r~siug
rrificantly increases the rank corrclatiori between true valr~csand est& a single nnbiasedness constririut (dashed line) rather than t,he t r a d i t i o ~ ~ a l
males, and rednces Lhe mean absolute error. 'Ikaditio~raland s t a ~ ~ ( l a r d - constraint,^ that call for the sccoiidary data weights to ~ I I I I Ito zero (solid
iaed ordinary cokriging esti~natorspcrfor~nequally. line). Retainir~gonly the colocated secondary data sligl~tlyreduces cokriging
performances for c a d ~ n i u min the sense that it increases the pcrce~rtageof
Retaining only the colocated secondary data (colocated cokriging) causes Iocatio~rsrnisclassified.
only a slight reduction of the cokriging performances.
Each test locatior~ is classified as coutanrinated or safe, d n p e ~ ~ d i ~~ r Ig I 6.2.9 Multivariate factorial krigitlg
whether the cokriging eslinrate exceeds or not the crit.ical threshold. Fig-
ure 6.20 shows the perce~rtagesof locatior~sthat arc wrongly declared safe As rner~dio~led ill secliorl 5.6, Jlrally physical processes related t,o geology or
or contaminat.ed using difl'erent esti~uittorsand data c o r ~ f i g ~ ~ m t i oAnss .with human act.ivities control the spiitial d i s t r i h ~ ~ t i of o t ~met.al concentr;tt~ionsover
(.he rank correlatio~lcoefficients and t l rrieau ~ alm)lut.e errors, cokriging iul- the study area. Ol~strrvedrrlat,ions L,etwccn cooccntrations of difl'erent illel-
proves over kriging only \ ~ I I ( I I I seccmilary mrtals arc betl,cr sa~llplcdth;ru (.he als n1igl11.tlwo be con~tectedwit11 the occurrence of ctrlrrnloll sources of soil
corrtaminat,ion. For exanrple, Figure 5.13 (middle gral~hs)sl~owcdtliat ilickel
and cobalt concentratior~shave c o ~ ~ r ~ nregional on features linked to the spa-
tial distribution of rock types. 'I'lre strong relation is also depicted by the
scat,t,rrgram of Ni and Co regional co~nponcntsin Figure 6.21 (right top
graph). 'L.lre two 11otto111s r : a t t c r g r ; ~ ~of~ ~Fsi g ~ ~ 6.21
r e s l ~ o ww a k c r relat.io~~s
k)r 1oc;ri ilnd il~icro-scdt!spabial co11rpo11c111s of i1ol.11inetals, which stlggcsls
40 Regional CnmpOnentS
1 Original concentrations 1
-, .
OK CKlso CKhei CKcol
Algorelhm
- 0
0 4 8 12 16 20
Cu data Pb data Cobalt (ppm)
I
4c
I-
3
rn
0 30
-3 -1 1 3
CKso CKhel CKcol CKiso CKnsl CKcol
Algnrilhm Algorithm Cobalt
Figure 6.20: l'roportiur~ of test locatioas that are wrongly declared safe or contam-
inated according to cokriging estimates; as in Figure 6.19, two cokriging rstin~alors
and three data configurnlions arc considered.
t11at dcstructuril~gprocesses operate over shorter distances. Micro-scale vari-
i~tioltsariw piirlly f r o ~ I~I rI ~ : R S I I ~ C I I ~errors
~ I I ~ that could he indeperrdel~tfrom
one metal to auol.lu~r.'l'lie original t~ickela d cobalt concollratior~sresull
ftour a cot~lhinationof all three different scales of variation, aud their scat-
t.r:rgram s l ~ o w in t ~ Figure 6.21 (left top grnpl~)drpict,s a rclat.iotr intermediate
I ~ e t , i v c cthose
~ ~ ol~serveda t local and regional scales.
'I'l~crcladiol~ljc~l,wce~~ col~altnud t~ickclis said lo be scnlr-rlcprirdrul Ix-
c a l m it, clrmgcs as a f~mctiouof the spatial scale considered. Such att.entio11
to scalr-tlrpcrrde~rre may enlianco a relatiou hetwecu varirhles tlral is 0111- Bccnt~set11c Ws Y;(n) arc, by consl,ruction, ~mltoallyortlrogonal, that cross
erwise blrtrrcd in an approacl~ where all different sources of variatiotr are covari;~nccreduces to
~ n i x e d 1cadi1,g
, to a hett,er i~rrdrmtandingof tile physical ~rudcrlyingrnccha-
nisnrs coulrollil~gspat,inl pattcrlls ((kmvarrts turd Wcl,sler, 1994). Mull,iv;tri-
a1.e factorial kriging, also called factorial krigiug analysis (Mat.lrcn)~~, 1982;
Wackernagd, 1988, 1905, p. 160-165), ;tIlorvs otle to a n d y a r relations l x - At 1111 = 0, tlrc value of each basic covariat~ceitrodel ~ ( 1 1 is ) 1. Thus, the
trvccn variables ;it the spatial scales detected and modeled frtrnr experiute~~tal coefficient liij is cil.l~erthe covariarrce a t 1111 = 0 helweetr spatial components
semivariograms. T h e teclt~~iilue has been applied in various areas, s u c l ~as gro- (case i # j) <lr the variance of the spatial component Z:(u) (case j = i).
clletnistry (Sandjivy, 1981; Dnrrrgaolt and Marcotlc, 1991; M'ackcr~~;rgda d I'br each covariance model c,(h), the coeficicnts b& can he arranged into
Sanguit~ett.i,11J93), soil scicnce (Wackcrliagcl el, al., 1988; Goovnerts, 1992, an N, x hrv coregioualization matrix Dl:
1994d; Coulard and Volta, ll)92), l~ydrogeology(Itoulrani and Wackcr~~agcl,
1090; (hovxcrts vt at., I993), ~uiniug(Sor~sx,1989), and in~xgcprocessi~~g
(Daly ct. a . , 1989).
J,ikc f;~cl,orialkrigiug, w l ~ i c lis~ l>esecl or, t.he linear trrodcl of regioualiza-
I . i m (4.27), ~uull.ivrrrial.cfaclorial kriging is b ; w ~ Ion Lhr: spccific litmrr luorlcl
of coregiot~i~lizat,ioi~ (4.35) f i t t d Lo tlre cxperiu~cntalat1t.o and cross covari- Accortnliug for relnlio~l(6.58), the nmtrix Dl is t l ~ cvaria~m-covariance ma-
ance f~loctious: trix of the Nu spatial cotnponcnt,s %~(IE).'Tlte linear correlatiotr between any
two spatial components G(u) a d $(u) is then ~rrcasoreilby the structural
correlation caeficieut defined as
I
Pij
_- 6!
ZJ
Under tlml. particular model, caclr ILF %;(II) can be int.erpreted as ;t liri-
ear con~hinatiot~ of it~dqm~dt:rttlZFs Yk(i~),r;:clt with zero I I I C ~ I I and basic Tltc tn;~t,rixof structural correlation cocflicicrlt,s is deuotcd Rr = [ p l j ] .
covariance f ~ ~ n c t i ocl(11):
~r +
In sectior~4.3.2, under second-order stat,iouarity, the ( L 1) coregional-
iant.ion i l r i ~ t ~ i cwere
r s slmwn 1.0 sum to the covariance firltcl~ionmatrix a t lag
zero:
C(O) = z:
L
I=O
DI (6.5'3)
Correlation m a t r ~ x M~cro-scale
E q r r a l i o ~(6.64)
~ is a rnere rescaling of tile loading q b ; hy the variallce XI of the
kt11 principal conlponenl, and the variance (T? of l.lte it11 original variable. 11
veclor can brl i l r a w u f r o n ~1111. origin iu 1,hr pl;i~te(0, 0) t,o each plotted point.
(pri,pk,i). T h e orientation of t , l d vector with respect to the two axes reflects
tlrc corrrlat,io~rbetrver~rthe vrrriahlc Z; and the two principal r o t n p r e u l s l'i
and &r. T h c I ~ n g l hOC l.lre vector, &m, nreasurcs tlrc perce~ltageof
tlrr variance n;/ esplairrcd by the two components, If the two compotre!tI,s
co~nplct.elyaccotltll for the vari;rrrce of X i , t.lle poiut would lie on ;r circle of
unit radius as ~ ~ I I W IillI Figure 6.22, lrel~cet,llc name czrcle a/ co~~elrrlzer~s.
't'he closer llje poiul, is lo [Ire cclll,cr, t.l~csl~tnllcrlltc proport,ion of variatlce ac-
coutrtrd for hy I;r atrd 1;,.'I'lrcrefore, one should avoid itrl.r~rprrlingrelations
11rt.wm.11vnriahles tltai, plot near t.he cmtcr.
Figure 6.22 (Iefl top graph) shows the circle of correlat.ions defined by tlw Local scale Regional scale
l ~ l , s15). Tllc circle allows one to distilrgrtish copper
first. two r o t t r ~ ~ o n r ~Orl,
a~l(llc;~d,whiclr are s l r o ~ ~ g lcorrelated
y ( p = 0.78), f r o n ~a. group of four
metals (Cd, Cr, Co, and Ni). T h e isolated position of zir~cexpresses its equal
correlatiou with variables of bollt groups (srr 'L'ahle 2.6, pagc 22).
Sinlilarly, circles of correlations call be usrd to display spatial ~ I I ~ I ~ B -
I;il.ious l,rl.wrcw vari;ibles ;is ilescribed by each cor~~giort;~lia~~tion t~ralrixB,.
'I'llr position of lhe ilh variable ill lhe circle dcli~terlby t,llt: pair ( < , I < , ) is
givrn hy the pair of corrclaliorl coeficie~ltsbetween the spatial cotnponent,
Zi and t h ~ two e rrgiol~;dizedfaclors. Lly a l d o g y with exprrssioll (6.64), the
correlation coefficic~rt1)etween the spatial amrponent %;' and t.he regionalized
factor I;' is cot~lpuletlas
Figure 6.23 slrows the nmps of llre first regiol~alize~lfactors a t the local
and regional scales. T h e map of the regional factor enipl~asiaesthe impact of
the geology st row^^ iu Figure 5.13 (page 168, top grap11) on i.he r r ~ n regiot~d
i~~
features of metal concentrations.
Assessment of Local
Uncertainty
] Location ui ] Location u i
N o t e t h a t these probatrility intervals arc i r ~ d e p e r ~ r l c n o tf any particular esti-
m a t e ~'(II) of l,he unknown value ~(II). 111(leed, uncertainty d r ~ p e r ~ dosi l the
inforlnatioll available ( ~ r ) , i ~ n OtI 1.l~.particular o l i t . i ~ ~ ~ i trlri ifiyc r i o n r ~ t a i ~ w d
t o define an f ! s t ~ i ~ x ~A t < lucid
~. discr~ssio~r of tltis i ~ ~ ~ p o r t inctlrodological
nnt
p r i o r i t y is give^^ i n Srivastava (1087a).
Each conditional probability distribution f u ~ ~ c t . P(u;
rrleasure o f ivcaluncertainty i r ~
i o ~ ~z l ( i ~ ) )pmvidcs n
ILlial i t relates t o n specific l o c a l i o ~u. ~ A series
00
k
-
0
obcontamin.: 0.70
1 2
---
3 4
Cd concentration (ppm)
5
, -
6 7
..-
8
0.0 I8-.-.-
0
..I
1 2
Prob contamin.: 0.70
3 4
,
5 6 7
"
8
with
'l'l~e I.ratrsform that allows ane to go I'ro~tr;r lili Z ( u ) wil.h ;my cilf Is'(:) Lo
a R.F Y(n) wit11 standard Ca~tssia~r cdf C(y) is dcpictcd by thr: arro\rrs i n
Figure 7.3 and is rvrithi!n as
turatiua ~!,/Jecl ;tnd associated to the maximum entropy property of t l ~ c 111the e a r t l ~scirnces, low eol,ropy patterns, s t ~ c as l ~ connected strings or
Cmssian l(1' model. M e e d when the cunnllative frequency p trnds to- ptclres of exlren~evalocs, o f t c ~currcspo~d ~ l,o iraaarrlous features that are
ward acre or one, y; i t c o , and the two-point crlf (7.10) tends toward worth specific i~ltent.ior~. For exa~nplr,strings of large per~neal,ilityvalues car1
the product phif the two marginal prohabilil.ies (independence case). represent leakage conduits ilaz;trdoos for :t nnclt:ar repository. Sinrilarly, cow
IIcncc, tire indicator corrclograni pl(11; y$>)tends toward zero, aud the ~tcctedclusters of large metal c o n c e ~ ~ t r a t i oalmvc ~ ~ s the tolr!rahle r n a x i r n r ~ ~ ~ ~
in<lic;ttor seniivariograni y r ( h ; y,,) tends t.ownrd its sill p(1 - y): arc critical for asscssii~gthe risk of soil pollut,io~~, Hence, and rtotrvitl~stalidirlg
its analytical sinlplicity, f.he n~olt,iGanssianIL1' model may be inappropriate
whe~lever the structural analysis or qlmlitative i n f o r ~ u a t i oindicates ~~ that
exbrernc values are spati;dly correlated. I.;vcrr in the ;tbscrrcc of information
;thou1 tllr: conllcr:t,ivity o f e x t r m ~ ~values,
r: blre C;anssinn rriotlcl is no1 corrscrva-
for any vcctor h . Fignre 7.5 slrows an exampleof tllc dcsbructuration ef- tivr in thc sense that, its K I ~ X ~ I I I ~eutropy
I~II charnctcr leads one t o underst.atc
fect for a C a r ~ s s i a111'
~'~Y(o) with aspl~ericalscniivariograrn model y(h) the p d e ~ ~ t i for
a l Irmarrl (G6111e~-Hernin11<:~ and Wen, 1994). 'The more Rex-
with a unit range: the sl;tndardiaed indicntor s a ~ ~ ~ i v a r i o g mreach
ms ible but, also rnorc d e r r ~ n r d i ~indicator
~g approacb introdt~ccdin section 7.3
their unit sill lasher for extrenie tlrresholrl values. allows one to account Tor spali;rl corrclaliou of extreme valnes.
lizample
Figure 7.6 (top graph) sl~owsthe experitue~ttalsenrivariogrnn~of (Y i t ~ r t r ~ a l
scores with the rnodel fitted. Given that model, G;rnssinn-bascil indicator
sernivariograrn rrlotiels are deduced usir~gexpression (7.11) for the nine drcilcs
of the sample distribotiorr. T h e exprrin~enlnlindicator scmivariogr;r~rrs(black
dots) are seen t o deviate from the Galtssinn-inditcd trlotlcls (solid line) lor
slnall thresl~oldval~tcs.A s alre;uly ~rol.iccrlirr I " i g ~ ~2.19
r e (1);rg~4 5 ) , si11;~11(X
concentrat~ionsare trlorc cunltectcd i l l space Illno h r g e conccntral.io~~s. ,1,hlls,
001- , ,
00 0.4 0.8 1.2 b G
Oislsnce lkml
Figure 7.6: Experirncatnl semivilriogram of C:d normal scores (top gr;tp$) and ex-
perinrental stimdardised indicator snrnivariugranw [or the ~ i n iler.ilc,s
e of the sample
r:anialative distribt~tion ' ~ I wsolid liws &,pirt I l w i ~ d i c ; ~ t nsct~~iv;iriogr~irt,s
r <I?-
, , * > .. ,- , . ? r~ ., ~,,:,. ~ :... ..., 3.7
2. A linear resc;~lingof ;t s~l~ootlrly
v;trying secondary variable d(11):
E'igurc 7.7 (top graphs) slrows the ai~~iplr: krigir~gc s t i n ~ n k?/;rs(t~)a111 vari-
nucc (~S,((11) C O I I ~ ~ I I ~ Cusing
I I the l~11C:d nortnal scorcs given in 'IkI11e 7.1
(p;+ge 269, third colu~nn).Acctrrdi~~g t,o relation (7.7), lhc Gattssian cot~di-
tionnl cdfs ;al. locatiorrs 11; to 11; are irlodclcd ;ts
278 C X A l " I E R 7 ASSESSAJEN'I' 01,' LOCAL UNGERTAIN'I'Y
3
* ------
.2 .1 0
.
,....,. .. . . .
1
Normal score
2 3
Liacnr cdf m d e l
A linear rrrodel is gr~rt:rally;rdopted for interpolation witlriti classes of thrtislr-
old values (zr-1, zr]. Such rr h e a r rnodel arnounts to assurnilig ir rlniform
distribution within tlml class, tli;tt is,
by t l ~ cuser. Convtmt~ly,a power model for a psit,ively skewed upper tail
could be
rvlicre tlic p;trar~lctr:r (tltc porrw) w is strictly positive, w > 0. 1)ilfcri~nl. where :I( is l.lrc largrst r - d a k v;tlur and z,,,,, is the rr~axi~iium
z-value fixed
i l t l . ~ ~ r ~ ~111mI~k
~ ~ ~ ~ arc> i ~ ~ r l hy varyitig w ( F i g ~ ~ r7.:)):
i l ~d>I,z~iii<~d t- I)y t,lin nscr.
I Hyperbolic model
1 Power model
Figure 7.10: Ryprrholic mod& for extrapolating cdf valucs bcyond the largest
lZigllre 7.9: I'owr nlodels ror inlcrpolating wil.l!ila-class cdl values. 'TI)? shape ~f tlrresliold unlun 2,; (uppcr t.:~il).'Tlw pararnetcr w rontrols how fast the positively
tlw dislrilwli~nis C O I ~ L ~ O I I C hy
. ~ th<.pnrmwler w : porifivc skewacss for w lrss than skewed cdf rnodcl reaches tlw appcr litnil value 1: the strr;rller is w , the longer is
1.0, tueiforw clistril~oliot~for w = 1 (linear modcl), i ~ r dnegativr ske~vrtcssf<,r w the tail of the distributio~~.
greater thau 1.11.
A weiglrt, w allows a haluuce hotrvtx~~ smoot.lr~msand closeness t,o the origiual 7.3.1 Indicator coding of information
i s t r i i n : as w dccrcascs, tlrc cdf nrotlc:l hecon~esit~crcasinglysnioothcr
but rlcvialw irlorc from tllcr origiuitl distribution 'l'lris particular smoothing 'J'l~eindic;ttor approach s h r t s wil,lr a sclrxtiot~of t l ~ anu~nbcrof tlrrcslrolds
;rlgorithnr rcqoires prior r l c t c r ~ ~ ~ i n a t iofo n1.l11: ruiui~r~ntn
aud nraxi~rnunz- and tlicir v a l ~ ~ c s'To . allcvi;rtc computation m d ~ I I ~ ~ ~ I efforts I C I : and reduce
val IICS. tlic occurrence of order relal.iou prolrlents (ses discussion iu section 7.3.4),
t,lw r~umbcrof tllrcslrolds shuuld rarely exceed 15. 011tlre other hand, the
uu~irhrrof t,l1r~~s1~01~1s S ~ I O I I ~110t
I I he s ~ m l l e rtlrati five to provide a reasonable
7.3 The Indicator Approach disr:rel.izal.ion of t l ~ clocal distrib~ilion. 'rhe set of I< tl~rcshold values is
lypicdly cl~osfvs~lrli(.hat l h r;rnge ofz-veli~rsis split iuto ( I < + 1) classes of
approximat,cly equal frcqucncy, r.g., the ninr! rleciles of Llrc saluple c~ll~rolative
o n I,IIv fdlowing guidclirrvs:
d i s t , t i l ~ ~ ~ l . inot,^^
,
-
, Cd data
- - - - - - - - - :z '5-- --
' - - ppm
z4=2.26
---.---
z3=13 8 ppm
I
,l ,l
[!I I
iProh
h i ) {%(110) <
{ ~ ( I I o5) 1.8812(1lo)
1.38/~(11,~)
&out wlretl~crt,he Cd
I (7.23)
e 1 ------a-
=-
z2=1.38 ppm
.. .-
0)
. . .P 1-0.80 ppm col~ccntrntionexceeds 1.38 al~rl1.88 ppm, rcspecl.ivrly.
8 0
. .. .. .. .. .
0
~
1 1 1 0
-~---
0 0 0 0 0
T.. . .~~.~
-r--
I 2 3 4 5 6
Distance (km)
0 1 2 3 4 5 6
Cd concentration (ppm)
zl=0.80 ppm
Figurc 7.14: Goding oi the profilc or two rock cxtt:gories illto lour profiles of local
prior prohabilitics or not exceeding Cd threshold values. 'The prior probsbilitics at
a location u belonging to rock category sc are interred from the carnulativc distri-
bution of Cd data belonging to that category (middle graph). The four threshold
valucs are depicted by the vertical d a d e d lines.
60 Ni data
- v,=50 pprn
0 .
________r__________---------.
23 .
1Calibral~onscattergram
As ~llentiol~ctl
for soft calegoricd i ~ i f ~ r ~ ~ r iLllc
~ t i~ondil.io~lal
o~r,
cies F'(ztlu1) call tic sn~oothedprior to being used or they call he
frquiw
-
E
,
I .I
. I
borrowed from anot,l~erbetter sampled field. -2 I
I
-1
ti..l
. 1
I
I
I
3. T h e set of I< local prior prol~;tl~ilil.irs
at are t,hen ident,ificd with -arz I
I .'I' . I
9 ,. ,
1. I... I
I
..'..,J.I..p'.'1:@;,+. .: . .
2
the Ii sanlple conditional frequencies E"(znlul) rorresponditq to the
3 I
prevailing class of v-values:
0
0
3
$0
.@44
.1
20
NI value (ppm)
.:.$
.
,
..
30
_
40
-1
I
50
C o l o c a t e d s o o r c o s of i n f o r m a t i o n
When asoft rl;r1111r1 1ocitI.iorl 11: coincides with a h r d d a t w r locatiol~I I ~ 1.11'
, .
set of hard indicat.or data i(11,:.. . zcl .
.., orevails over the colocatcd soft i ~ ~ d i c a t o r
\
&turn is a coltstraint, interval ( a , , b,,], the corresponding nrissing indicator 7.3.2 Updating into ccdf values
data are replaced by soft indicator data a t t.he corresponding tl~resholds,i.e., Indicator coding allows rliffmcnt types of inforniat,ioli ( I I ; anrl
~ soft data) to
tire Iw;d prior cdf (7.22) I~ecou~cs: Ije processed togrll~or,rq;rrrlless of their origins. 'l'l~cobjective is to evaluate
11 the set of Ii' ccdf ~ i d n r s
a t any 1oci~Li011 or posterior probabilities:
if b, 5 zi;
I ; z )= I'
Iz"(zr Isr) if zi; E (a,", b,] k = I , . . ., K (7.26) zkl(71))
Ig'(11; = Pro11 { Z ( u ) 5 z~.l(iz)] k = I,.
wlme the corlditionil~ginforntation (11)consists of l ~ a r i l a soft
. . , Ii
~ ~ ldata rehailled
(7.27)
prior iufor~nt~l.iolr consists of several prior cdfs of type (7.24) or (7.25). For
example, one may kuow that, the conceut.ration of the s e c o ~ d a r yvariahlc Ni is
v;rlnc<l w i t l i i ~(0,151
~ pp111;it location u j I d o ~ ~ g i ntog rock calegory 51. 'l'lius,
there arc two colacated vcctors of local prior probabilities a t uk. 'J'l~efirst I,'irsl., co~lsidcrt.he problem of esti~n;rhi~lg i,l~einrlicnt.or vduc i ( u ; a)a t any
vector cor~tainsthe prohabilitics for the Cd conce~ltrationto he smaller than location 11 using only lctrd indic;ttor data dcfined a t tile same tl~reslroldvalue
each of the four tl~resl~old values given that Ni concentration is valued witl~in r r . T h e linear rst.irnat,or (5.1) is expressed in terms of indicator RVs as
(0,151 pptn:
rrl~creX,(u; zr) is the w i g h t assigned t,o the indic;ttor da1.urn i(u,; q ) in-
terprelkd as a nx&aat,ion of the indicator itV I(11,; z t ) . As with kriging
'I'l~esecond vrctor consists of local prior probah~litirsderived from the cali- estimal.ors of the z-attribute values, two indicator krigi~rg(110 variants are
I~rationof the rock category .st: (listingr~islietl,depending on whetlrer t.lx indicator niean is considered con-
~ study area A.
st,ant. w i t l t i ~the
yz(11$; 2.2G)
y2(uj; 1.88) Sztr~plrindiculur k r i g i q
, \
15 { ~ ( I Iz,i ) ) = F ( z r ) , known V 11 E A
Different sources of soft information co~lldhe ranked ;rccordir~gto their ability ,,
to predict 11;rrd da1.a; see t,he discnssiou relaled t,o Figure 7.23 (page 316). I he linear estimator (7.28) is tlitis writtell as a linear cornhi~~ation
of ( n ( u ) + l )
T h c vector of local prior probabilitirs correspo~idir~g to the most accurate pieces of inforr~~atiori,the n ( u ) indicnl,or 1LVs 1(1r,;zk) anrl t l ~ acdf value
soft inforrni~tionwor~ldthen prevail over the other vectors. I,'(.a):
A s witli o r d i ~ ~ s kriging
ry of a cotrl.iituous variable, ordinitry l l i wibh l o r d
search neigl~horlioodamounts to:
A:~(II;:~) = 1 -- x
.,(ll)
o=1
A:(EI; rk)
Where thc r-value is kltown exacl,ly, tlie pust,erior cdf is I.hc i~nil.-steyprior
cdf (no r~pdatirig):
Ordinary i s r l i c n l o ~higrng
~
Ordiimry indicator krigitig (OK);~llowso w 1.0 account fur local ll~~ctoalions Whcre tile prior inforrnatiou is a wnsl,rait~tinterval ( ( I , , b,], only the missing
of Lllc indicator mean by li~riitingthc donmi11of sthtioriarit.y of th;tt lrrrarr to indicrlm values are updated by posterior probabilities:
the local neigl~l,orl~ood W(II):
Emtuple
Simplr: and ordii~aryindicator hriging ;ire used irloi~gt.lre NIC-SW I.rattsrcl 1.0
drtcrnrilrc ccdf val~iesa t the fonr threshold values zr= 0.80, 1.38, 1.88, and
2.26 ppm. Tlic information available consists of ten Cd concentrations a t
whcre the wciglits are given by a n ordinary kriging systcru of type (5.18): locat.iorts 111 to 1110 (bhck dots) pllrs the constraint int,rrval (1.0,2.0 ~ ) l x na] t
locatiot~11" (Figure 7.16, top grap11). For each tl~resl~old vdue, tlto ioilicaf,or
sr:lrrivariogram is inferred from dl dat.;~;rvailahlr? ovcr t.Itt! sl.udy m : a (1"ig-
tire 7.16, riiiddlc graphs). Estininlioi~is perfornicd cvery 50 111 lmiug 81. I:LICII
locatio~ilhc fivc closcst indicator data r e l a t d oljly to Lhr: t,l,r<dioldL L I~eiirg
considered. 'fhe statiormry irtr1ic;rtor n ~ c a o s(.'(zk) required by silnple IK are
estirnatcd from the cunii~lativt:1~ist.ograrnof the t,en Cd values.
Figure 7.16 (botto~iigraph) slrows bolli olK (solid line) a ~ l dslK (slliall
dashed line) cst.itnat,rs of the prohahilit,y of exccedi~lgthc critical Ll~r?sl~ol(l
rl = 0.8 pprrr c o ~ r i p ~ ~ as
t c dI - [ ' ( I ; ~ ( ) ) ] *l'lic
. 11orizon1,;j:ldaslied line
rl~yict,st . 1 1 ~nrargini~lprobability [ I - I,'(rl)] oC cxccccli~lgthr: I.l~resholdvalne
Cd data 11, cdin~;rl.cdby the proportion o l ( 2 v:rlurs great,t:r 1.lii~110.8 pp111:7/10 =
.Ue 0.7. Not,c t.lre following:
Ijotlr II< estitnators itre exact. The posterior probahilit,y of exceeding
t.llc critical tlircshold is zcro a t dath locations uz t,o 114, where the Cd
concentration is s~nallerthan 0.8 ppm, and that probability is 1 a t other
d;tl.a lomtio~rskliowu lo be col~taniinatcd,
' I h loc.;rl cst,iinnLioti of ihc indicator incan ivitl~incarh search ncigll-
I~orlmodyiclds ordinary IK estinrales that follow the data trends l ~ e t t e r
l.hn11 1 . 1 1 ~sin~plcII< cslilt~ates.'I'hc posterior prr,l,al~ililicsof cotltami-
tri~tioiiare s~rt;llleri n t l ~ elow-valued (left) p r t of tlir trimsect and are
1st threshold 0.8 ppm 2nd threshold 1.38 pprn larger in tlte higll-valnnd (right,) part.
1.21 1.21
Beyond locathn 116, orilinary It< yields a unit prrhl,ility of exceeding
the critic;tl i.l~rrsliold,which correspot~dst,o a zero runiulative prohabil-
ity [F(II;z ~ l ( r ~ ) ) ]It1
* . this part of t,ltc trausecl, all da1.a Iocatiotls within
t,l~csearch ileiglhorlrood lV(11) arc contan~i~ial.sd, which eol.ails that all
iudicalor d;rt:~i(11,; r l ) arc zero and so is llieir cstilnatcd rneari (7.31).
11)contrast, away froltr tllr: data (extml,ol:~tiot~ sitoation), the SIC csti-
nr:rlr approarllcs t,l~cinargiual prolmhility 0.7.
3rd threshold 1.88 ppm 4th threshold 2.26 ppm At locations ti5 and Q , ordir~aryII< yi<~lds t.lie following vcclors of posterior.
. .
. .
. . ~wol~al~ilil.ics:115: [O I I I], 110: [0.00.l(j 0.54 I O ] . 'I'lto cxarl.itude property of
I.he II< cst.i~~l;~t.or
clllails 1,11;tl, t,hc ccrlf ; ~ tlln
t ~ ; I ~ I I I loca1,iolt
II 115 is idcotical lo
t.he prior cdf (7.21) (110 updat.it~g).Similarly, tlic crdl ;tl, locatior~110 honors
the coirstminl inlerval inforn~ation(7.23); only the two missing indicator
0.01 , , , , 0.01 7 , , ,
values a t tlrrcslrolds q and 23 arc updated into tlir posl.crior probabilities
00 0.4 0.8 1.2 1.6 00 0.4 0.8 1.2 1.6 0.16 :rttrl 0.51, rcspccl.ively.
Distance (km) Oislance (km)
hidicator cokriging
] Posterior probability of exceeding 0.8 pprn Nfit,l~erIf< t&iinators (7.29) and (7.31) nrakt. fnll use of tlle infori~tation
availihlv i i r tluiit llwy ignore indica1,or d;&a ;it, tlircsholrls dilliwirt froill that
I x i r ~ gestinmbed. Iufnrnralion froill d l li' thresholds call be accout~terlfor
ltsing the cokriging formalisin introduced i t r seclioi~6 . 2 , 1"or example, the
ordinary indicator cokriging (olCK) estiinat,or of ccdf v;tlue a t threshold zko
would hc writtcll
1 ,
1 2
-.--
3
T~-
4
Distance (km)
5
- 6
Figure 7.16: Simple and ordirtary indicator kriging t:stirnntes of the probability ol wliere XZ:"(u;zk,,) is the weight assigrrcd to indicator daturt~i(u,,;zh) a t
excceding tlre critical thresldd value 0.8 ppnl. 'The information available cousists
location II,, . 'These cokrigirig wcig1tt.s are solutiotts of an ordinary cokriging
of ten Cd concentrations arid the constraint interval (1.6,2.0 ppm] at ua, plus the
indicator scmivariogram rr~odelcdfrom 259 indicator data.
system of type (6.32). screcns tlrc iufl~renceof colocated indicator virlt~esa t ollrer tlrrcslrol~ls
Zk # zko.
Emmple
Figure 7.17 sliows tlic tell o~~mirlirect,ional sta~rdardizc(lindicator rlircct i ~ n d
cross sclnivariogran~sfor the four t,l~rr:sboldCd values ZL= 0.80, 1.38, 1.88,
(7 3) and 2.26 p p ~ n . 1 ' 1 ~ : it,erat.ivc procedt~rt:&xril)cd i n I \ ~ I ~ ~ I\ I is ~ ~~~svrl
X
where C,(h; zb, 21.) is the cross covariance helweelr any two i~~dic;tt.or ltVs
to lit, R /incar t11ode1of cor~gio11aIiz;1t,io11, i ~ r c l u d i ~1~1g1 1 ~
hsic s~,~IIcIII~(~s:
+
I(u;zk) and l(11 11; zb,), and 6ik, = 1 for = 6 0 HIKI 6ib0 = 0 otherwisp.
As rrtentiorled it1 section 6.2.4, ordinary cokrigiug conslraiuts I.11atcall for
the secondary (1;tl.a weights to sum to zero t,etrd to lilttit arlilicially 1 . 1 1 ~intlw
cnce of these secondary data. 7'11esc constraints also increase t.lre occurrenct
of negative wciglrts lreoce the risk of getting unacceplahle estimates such as
negative posterior probaldit&s (Coovaert.~,1!194;1). 'I'hercfort!, one ~ ~ ~ i g l ~ t ,
consider rescaling each secondary indicator variable I ( o ; q ) to the I I I ~ ~ of I I
the primary indicator varidrle, then solving the ordinary indicalor cokriging
system (7.34) with the siriglc onbiirsedt~r~ss constrnint that calls fbr d l prijunry
a d secondary data wcigllls to sum to I .
Large rliff<wncrs hetween the itit its of ~iraasure~r~enl of z and i1.s indicator
Lrai~sfor~ns may c;msi: i~ist;~l~ilil.y prol,lt:~nsw l ~ solvit~g
e ~ ~ the cokriging system.
One sol~itiorrc o ~ ~ s i s of t s rq~l;icir~g !,Ire z-vitlr~esby 1,lrr:ir stanrlardized ranks
3:(110) = 1.(11~,)/11,W I I C ~ CI . ( I I ~ )t [ l , ~is] the r m k of t.br d a l u ~ ~~(11,) r ill
the smilrle r u ~ ~ ~ u l a t .distribntio~~
ivc (see section 2.1.2). 'I'lre irrdicator data
i(n,,;rt) ;w<. valtrcd i,il.ltrr 0 or I , wlwrcas i , l ~ r;r~lk-order r t.ra~rsfor~rrs
~(II,)
arc 1111ifcxru1y disl~riI~111,c~l I].
i l l [I),
' l ' l ~ r r colirigiug 01' t l ~ eindicator l.ransforn~i ( 1 1 ; ~ r) ~ s i i ~the g rank-order
I . r ; , ~ ~ s l i m~ ~( 1i 1 ):is n s r c o ~ ~ d ; ~v;~rial~lc
ry is n:fvrrr:,l 1.0 as probubility kriging
(Ismks, ISJM; J o u r ~ w l ,1!184l); St~Iliv:ii~, 1984, 1985). 'rhc prd)aldity kriging
(J'K) csti~it;~t,or is
p=1 "=I
+ l ~ F ( 1 1z;t ) = (,"l(lb - 11;Zt) ,,
= I , , . . ,n(1,)
1 3rd threshold 1.88 ppm ] 4th threshold 2.26 ppm ~(11) ?&(~rj
C x;"(ll; 21) (:xl(ll" - UP; zt) + C 1,,y(11; Zk) GX(1lU- U")
I"igurc 7.18: Ordioary kriging and cokriging isti~n.ztes01 the probability of ex-
ceeding [mr dillermt 1,hresbold valucs. Arrows iuclicntc inconsistent probnbilitirs
outside tltc irttcrval [!I, I ] produced by indicator cokrigiq. whcrc Cx(11) is the covariance frlnctioi~of the ~rniforrrlI W X ( u ) , and C:xr(h;z t )
is the cross cov:rri;mcc funct,iorr l~etwecnX ( u ) i111dI,h(: in<liratorl l F I ( u ; ZL).
302 I A 1 T 7. ASSIWMEN?' 01,' LOCAL UNCEICIXINTP 73 '17115 INDICATOR AI'I'ROACII 303
Tlie PI< estimator (7.35) uses ruore infor~uatiouthan the ordinary I K I Indicator and uniform data
estimator (7.31) hecause the rank of lhe datum ~(11,) in the sample crlf is
taken into a c c o r ~ ~ill i t additiol~to it.$ indicator of e x r c c d i ~ ~llle
g tlrrcsliold
value zn. T h e 1,r;zrle-offcost for this brtter condilioui~tgof l l ~ cposl.nrior ctlf
. .. .. . .
is the inference of tlie semivariogram y,y(h) and the I< cross se~rtivariogra~~rs 0
y x I ( h ; z t ) , and the n i o d e l i ~ ~ofg the coregiorraliaation between indicator a ~ r d O u 3 o Indicator
,-
unifornr transforms a t each thresl~olil.
l o reduce the proporlio~lof uegative cokrigiilg wciglits and i:r~l~ancet11f.
0
o Uniform
03 Indicator
1
a=1 a=1
T h e PK cst,inlator is ~ I I I I Srewrittell
Examyle
Figure 7.19 (top graph) shows the itniform t,ra~rsfor~ns (open circles) arid
indicator transforms for the t~liresholdvalue 0.8 ppm (closed circles) of ten
Cd conce~~lrations. T h e rank-order trausfor~nis Ixrswl 0 1 1 only tlic tiv Cd (lat,a
values, llence the two cxtrcnie uniforllr dida V ~ I I I C S arc 0.1 fur tlw s~il;dlesl(A1
concentraliou a1 113 and 1.0 for the largest concerllralioli a t un. fiotlr data
sets are comhirred using tlie I'K esti~rralor(7.37) wit.11 ;r sir~glcn~~bi;tsedness Figure 7.19: l'robability kriging estimates of the probahilily of excacdiag 11.8 ppar.
constraint. 'l'lic ser~~ivariograrnsofindicalor and unifor~ntr;ti~sfor~ns and tlicir Indicator and aniform tramfornrs are combined using ordinary cokriging wit11 a
cross semivariograrl~art! inferred from all data available over the stndy area single a~~biuedsess constraint and the scmivariogram ma~lelssl,owu i t , the swond
(Figure 7.19, irriddle graph). By constr~~cliou, i~ldicatorand t~nifortri(1at.a row. 1 lie d a s l d line depicts the ordinary krigillg cstintate tnsing only imlicalor
are negatively correlated since a zero indicator datru~rcorrcspo~~ds to a largc data. 'I'he arrow indicates two int:oasistent i~mb;rbilitiespro<lucrd lby probability
Cd concer~lratiori and, therefore, to a l~iglrrsnk in the salnple cumi11at.ive kriging.
distribution
Figure 7.19 (hottor~lgr;q)l~)shows l)rrtl~I'li (solid liur:) aucl d l < (il;~sl~~!d w a y , probabilit,y kriging c o r r d s for the loss of resolution c;t~~sc(l
by i l ~ e
.
line) estimates of the probability that C:d concentratio~lexceeds 0.8 p p m
Like the cokrigiug i n Figure 7.18, prol~aliilitykriging yields art inlerpo-
lation profilr more vxriablc in space, particularly in the rig111 part of the
use of a single tlrreshold in indic;rtor kriging.
Unlike ordinary cokriging, probability kriging p r o d ~ ~ c eo s~ l ytwo prob-
al~ilitiesoutside the interval [O, 11. 'I'lie use of 2% single secoi~daryvari-
transect. Indeed, the data ranks va111edin [0, 1) allow one to discrinr- able in tho PI< estin~atorlessens the screening effect between variables,
inate Cd concentrations wit11 sirrrilar indic;itor Lrausforn~s0 or I . In tliis thereby reducing the risk of getting neg;~livecokrigirig weights and in-
ronsistrnt nrohnl,ilit,irs.
Median IK is very f a 1 because it requires only one indicator semivari-
,I ,hough lrss d ~ t n a n d i u gll~arlindicator cokriging or probability kriging, iri- ograni (tuedian) to be nrodelcd artrl a single IK system to be solved a t each
dicator kriging st,ill c;~llsfor estirnat,itrg and modelirtg I< indicator scniivar- location u . However, such an approach calls for the particular coregionaliza-
iograms ;utd solviug K kriging s y s t c ~ ~at.~ c!acli
s location 11. 'I'l~e modeling tion model (7.38) that does not allow different shapes or anisotropy patterns
ilnd c ~ t n p u t a t i ~ tdl'ort
~ a l can he substanti;rlly alleviated if the two following for the I( indicator senrivariograt~ts.Wheu saniple indicator semivariograms
conrlit,iotm are jointly met: appear not to be proportional to each other, as in thc case of the Cd semi-
variograms in Figure 7.17 (page 209), the more flexible and still reasonably
I . 'l'ltc li iildica1,w RI's / ( u ; z k ) arc it~t,rinsicxl!y?orrdaLed; rlmll vX- f;rst, i~rdicittorkriging sl~ouldbe used.
pression (4.41). All the I i ( K + 1)/2 indicator direct and cross stitrri-
variogram tnodcls are 1,ltrm proportiorral to a c o r n m o ~ne~nivi~riogram
~ B l o c k v e r s u s c o m p o s i t e cccifs
111odcly,,,1(11):
Let r v ( u ) be the linear average value of an attribute z over a block V of any
specific dirn~,nsions:
2. All vectors of hard indicator dat,a retained in the estimation are COIII-
plete (eqnally sampled case); there arc no missing itidicator values snch where IVI is the measure (length, area, volume) of block V. T h e integral is, in
a s iinplied try constraint intervals of typc (7.22). practice, approxi~natrdby a discrete sum of z-values defined a t N points o:
discretizing the hlock V(u). For the level of information (n), the rtnccrtaitity
As sl~ownit] section 0.2.2, krigir~gand cokriging est,itnators are then ide~ttical; ahout the block value z v ( u ) is ntodcled by the "blor:k" posterior cdf:
fnr example, for the orrlinary (co)kriging c;~se:
<
with the hlock indicator 1tV defined as lv(u;z)=I if Z v ( u ) z a d equal l o
zmo otl~crwisc.
'rhe est,in~nl.or(7.39) is called tncdiat~indicator kriging (n1111) since the c o n - If hlock data zv(o,) wcre available, the posterior cdf (7.42) a t any thresh-
rnon model y,,,l(li) is usually inferred front the indicator semivariogratn a t old zr could be obtained by indicat.or kriging fro111block indicator data defined
(.he median t,l~resholdvnluc z~ = Y'(O.5) (Jourttel, 1984b). Indeed, me- as
d i a ~indicator
~ d a l ; ~i(u,; Z M )are evenly distributcid as O and 1 values, which
nsually rendars the experimental indicat,or swrlivnriograrn y^,(h;r,w) lxtter
drfined t,lian a t o t l ~ e rtbrrshold values.
Since the indical.or data confignrat~iottis the same for all thrcsl~oldvaloes, Unfort~i~ittely, block data rv(n,) do not usnally exist,, iwncc the Mock ccdC
the kriging weights XgK(n) do not depend on t,l~etl~reslioldbeing considered; nus st be n~odeledfrom the point-da1.a ~ ( 0 , )nlot~t:.
tlierefore, only one IK systenl needs to be solved a t each IocatLiot~u : Because l l ~ cindicator variable i(u; r ) is a noti-linear transform of the
original variable ~ ( I I )the
, block indicator i v ( u ; 2) is not a linear average of
point indicators i ( u ; z):
where the wciglrts are give11i ~ ya block OK system of type (5.44): In si~nplekrigiug, t,l~e111argina1prnl~al~ili~,y Y(:L.) docs IWI, d ~ p ~ nO I Id 1.11~:
location u and represents 1 . 1 1 ~glol~;rlprior i ~ ~ f o r ~ ~ ~~ 0i 1r1l1 1, 1i1o0 1~1 LO
~ iill 1111-
sampled locatiorrs under the decision of st;ttionarity. 'I'o account for t,lw soft
datum available a t each local,io~,,the n~arginalprab;hility is replact!d by the
soft prior probnl>ilityy(n; zk) ((:oovaerls and do~rrrrcl,1995). 'I'l~esimple IK
estimator is tlreu rewritlt:~~:
-
T h e average indicalor coviiriar~ceCl(u,, V(u); rk) is approxi~r~ated by the
aritlmetic average of the point-support indiciitor covariances C~(11,- 11:; za)
defined between u,, and auy of the N point,s 11: discrctizing the block V(r1):
where y(u,; zk) is identified with t l ~ es a ~ r ~ pconditional
lc frctpency F'(:klsr)
or I."(zklvl), depending on whcther the soft informat.ion relates to a cakgor-
ical or a contimtons attribute (recall sccI.ion 7.3.1). Tlrc ksiging 1vcigl11~sarc
obtained hy solving a simple II( syslein:
Distance h (km)
indicalor krigilrg yitrlds a unit prolxthility of cxuredit~g0.8 plrln i l l t.l~eright Distance (kml
part of tlic trmsect where d l data locatiar~snre contarr~itmted.Cali1,ralion of
geologic ir~for~iiatio~ri ~ ~ d i c a t eIrowevrr,
s, t.liat Cd coucent,ratiotl Itas a s~irirller 1 Ccdf values (zk=0.8ppm)
probabilily (76.5%) of exceeding that tl~reslroldon the rock category sz pre-
vailiug in this part of the transect,. Account,ing ibr this local prior infororatior~
reduces the probability of co111.amination. Si~uilarly,accounting for tllc non-
zero (33.9%) prior probability of contaniinatio~~ on Argovian rocks increases
t.hc prob;hility of contatnination in the lowvalued part of the transect.
Soft cukriging
l t a t l ~ e rthan asiog t,lic sort indicator data y(n; zk) as local indicator xnea~rs,
these data call be interpreted as a realization of a RF Y(u; zr) correlated
with the indicator ILF I(u;zr), Ilard and soft indicator data are the^^ c o n - Figure 7.20: Simple kriging with varying local means. 'I'l~ctrend component at
bitled using i~tdicatorcokrigitig where I(11; zk) and Y ( u ;r*) are the primary location u is identified with the probability of not exreedieg tltn critical threshold
of 0.8 ppm lor the rock type prevailing tltere.
7.3. THE INDICATOR APIJROAC?I 311
1 Posterior probability of exceeding 0.8 pprn autocovariance functions of the hard and soft indicator RFs, aird CIY(11; en)
.
is their cross covariance fimction. Note the following:
Only one r~l~biasedness
condition is needed if one can assume that the
primary and secondary indicator variables have tlre same mean witlrill
each searcl~t~eigl~borhood
W(II):
[ ( I * ( ) ) ] ~=
n,(uj
<?,=I
A;:y(ll; zk) 1(lla, ; zk j
. T h e cokriging estimator (7.44) can be readily extended to incor1mrate
several differel~tsoft indicator v;~rial~lcs.
+ 1
nd11)
I )( I ;z ) (7.44)
<I=l
where Azyti(u; zc) and A~:"(II; z k ) are the cokrigiug wciglrts of imrd and soft,
indicator data a t locations u,, and n:,>. Ezan~ple
Unlike tlre simple I l i estiinator (7.43), tlre soft information ireixl no longer Consider tlrat the ten Cd concentratiorrs along the NE-SW transect are s111i-
he exhausl,ive. Notc 1.11;it tllc: "soft" colirighg cslit~lnl.or(74.1) rrhilis wily plenrc~~tcrl by the five N i conccrtl.rat.ions slrown at. I,lw top of Figure 7 1 5 .
hard a11d soft ittdical.or daLa a t the tl~rrslioldz k 1)eilig ronsidcr<:d. A s rlis- A c c o ~ ~ ~ lfor
t i ~the
~ g c;~lilwatiot~pcrlim~rmli l l soi:t,ior~7.:1.1, 1 1 ; d ; U I ~soil iw
cussed in section 7.3.2, it is generally ltot worth accou~rt,ingfor irifornratiot~a t for~nationarc coded into local prior probabilities of not e x c t d i ~ t gthe critical
other thresholds when indicator vectors are roniplcte (eqr~nllysampled case). tlrreshold 0.8 pprn (Figure 7.22, top graph). 'l'he direct and cross indicator
T h e cokriging weights nre obtail~edby solvir~gthe following wdirixy mk- scmivariograms reqtiiretl 11y the cokriging syskrn (7.45) arc it~ferreilfrom d l
+ +
riging systerr~of ( n i ( n ) nz(u) 1) equations: data ~tvailahleover tlrc study area. 'l'he modcl of corcgio~rnlizat,iorrhetweo~r
hard and soft irrdicator data for thrcsl~old0.8 pprn is depicted 11y I.he solid
line i r ~Figure 7.22 (rniddle graph).
Figlire 7.22 ( h o t b r n graph) sliows the soft cokriging esti~uate(solid line)
of the prohahility th;tt Cd concentration exceeds 0.8 p p ~ n .T h e daslred h i e
depick the ordinary I K estilnate nsiug only the Lei1 hard indicator ditt,a. As
for tlrc example of Figure 7.21, accounting for soft information reduces the
~xob;tl~ility of co~tlhnti~lation in tl~c:high-valnerl (right) part of the transect.
c
-
10 ~
.. . 0
tlreu rewritten
6
2 0s
e
. __.
a
0 0 ~
I_
._
. -.
0
_ T _.
. / . -_
~ . . o . .
0
. _. / ~
o
_
Hard
son
1 2 3 d 5 6 wlrerr the cokriging wciglits arc solutions of the followingsystenl of (n,(11)+2)
Distance (km) eqo:tt.ions:
. .
If hard and soft iudicator rtrcans arc different, the soft indicator variable
1 Posterior probability of exceeding 0.8 ppm Y(u; z r ) must he rescaled so t h a t its mean equals the ilard irldicator mean
F(zx). T h e cokriging estimator (7.46) is tlren written
wl~crem y ( z r ) = ~,;{)I(II;zk)}.
Colocatcd indicator rokriging is faster than the full indic;rl,or cokrigir~g
a t d avoids i ~ ~ s l . a l ~ i lproblcnrs
ity caused ljy denscly sanlplcd soft i~lforrnation.
Moreover, the colocated cokriging system (7.47) does 11ot require the soft
I.'igrlrc 7.2'2: Soft c o k r i ~ i ~ crstitr~ate
g or the probnbility of eaccedinfi 0.8 p p m The , 11 = 0 where i l is m y ( z ~ ) [ -
autocov;rrinnrc I I I O ~ P I , e x c q ~ tat l m y (zi)].
snit informat,io,t consists of five prim prul~hilitirsor not erccrdittg t.l>cc-ritical
I1m:sliold as rlcrivrd frorn 1 1 ~ :calihratiort of Ni concentr;rlioas in Figure 7.15. llard
a d soft da1.a are combined using orrliuary cokriging a d the semivariugram models
slmwe in the saroad row. 'l'he clasl~ed linr drpicts the ordinilry krigirtg estimate Tire cokriging syslem (7.45) calls for the joint niodcling of two autocovariance
wing only Imr4 indicator d a t a functions and m e cross covnriance f u ~ ~ c l iam t each tlrresl~oldz r . Tlris mod-
eling of ilte Ilard-soft coregionalizatiorr rrlay be alleviated by a Markov-type
1rypol.l~rsisstating t.liat "a h u r l i~rdicatord a t u m a t u, i ( u ;zk), screens the
influence of any colocated soft indicator d a t a 9.' ? r ) on the estiniatior~of
the primary variable a t any other loc;&m u"', lhat is, wlrerr: ~ r l yis the I~IIIIIINY of Io~at.ioltsW I I C ~ C 110t,lr I l i < d illld d i t h ilrB
knowti. T h e besl, situation is when iii[')(zn)= 1, wlrialr rrrtmls t,llat the soft
Prob (Z(11') < zrl i(u; zt), ~ ( nz;t ) ] = Prob { Z ( u l ) 5 zk I i ( u ; zt)} infornlatioo y exactly predicts that t,lw valr~er(n,), with ~ ( I I , , ) zn, is 110<
V u , u', zx (7.48) greater Lh;m the tltreslrold valrle zk.
Convt:rsely, rni"(zx) is estimated by the arithmclic iivcrage of the soft
Building on this relation, Zlru and Jour~lel(1993) cstablislietl tlic f<rllo\virlgre-
indicalor rlirta a t locatio~rswhsrc i(n,; z t ) = 0:
latiolrs between indicator auto a l ~ dcross covariance functions a t any I.lrresl~old
Zk :
Here, tlre best situatiou is wlicn lii("(zk) = 0, wlticll indicates that tlre sofl.
infornraLinll exactly predicts that tlrc value ~ ( I I , , ) wit,ll
, z(o,) > zn, exceeds
the thresllold value2i.
where each cocllicient B(zt) is drfilted as the diffrrcnce hetween the trvo T h e dill'erenrc Il(zk) = %("(zt) - i;l'"'(zr) nrcasures the ;rbilit,y of the
conditional cxpert;tt,ions: -
soft infor~rr;rtiony 1.0 separat,e t,lw two citscs i ( n ; z r ) = I and i(11; zr) = 0. 111
other ~vorrls,H(zk) is an accuracy index for the soft informalio~l:
( t =7 ( z )-? ( z ) € [ I1 k = I , . . . , 11 A
( I ; : ) = ( I ; : ) = ' ( I ; : ) v I1
Under the Markov hypothesis (7.48), Lltc antocov;rriatlct: rriod~:lCy(l1; zn)
a t h > 0 and the cross ciwiriance niodel C ~ y ( l r ; z n )arc deductxl simply
by liltear rescalhg of lire lt?~rdautocovariallce ~ i ~ o d d . ( ; ~ (zk).
ll; 'I'l~~cfore,
modeling the hartl-soft coregionaliaation requires the nrodelingof only a siugle
covariance function per tllreslrold zk. The Bayesian updating of local prior A
Reln;Lrk:
The Markov sthteuicot "hard dala scrtxm t,he itifloeuce of any colocaled soft
data" s o ~ m d srrlislcadilrgly trivial. Indeed, if the soft datum refers to a volume
larger than the coloc;~tedhard datum, it lnay carry valunhlc additiotml infor-
mation. T h e resultittg covariance rr~odels(7.49) and (7.50) must be clrecked
against experimerital covariance fitnctions (see Irereafter ;rid Figure 7.24) Eminple
Consider the estimatioll of LI a t the tltrrslrold Cd v;rhtc zr = 0.8 pptir. '1'11~
L'slimaling the coeficieals B ( r r ) soft information consist,^ of either five different rock typcs st or nilre classes
T h e determination of coefficients B(zx) requires the estitnatioli of tlte two of Ni concentrations ( u - ~ur]. , A calibration like the orkc performed in sec-
conditional expectations mi"(zk) and rr~""(rr) a t each threshold z t . T h e tion 7.3.1 yields two sets of local prior probabilities: y1(11,; 0.8) = P'(O.Rlsz)
quantity m"'(rt) is estiruale~lby the aritluiretic average of t.he soft itrdiciitor arid I l z ( ~ ~ , v ; 0=
. 8 )I."(0.81ur). Figure 7.23 (t,op graphs) shows t.l~csrd.tcr-
data y(u,; z t ) , where i(u,; zk) = 1: grains of 259 Cd cotrct:ntralious versus coloc;itcd lrrcal prior prol)al~ilil.icsde-
rived from rock types (left graph) or Ni co~lcantrations(right gralil~). 111
3S~R
data weigllls ilrc exaclly E C ~to~ zero
I il the t w o tradilic,nnl nd~iasedz,cssc o w
slrnints of l w r (6.28) arc used in the c u k r i ~ i n gsyslem.
both cases, the 259 prior probabilities are split into two groups, depend-
Rock soft data Nl soft data ing 011 whetlrer the cnlocated Cd concentration exceeds the critical threshold
0.8 ppnl depickd by the vertical daslled line. 'She i~istogritnrsof each subset
of local prior proba1,ilities a t the bottom of Figure 7.23 show the following:
lo 8 1.0 , For Ni data (right column), the two lristogrntris have very dilferent
-
??
.- II
-
2
.-
--'
I
-.- sl~apcs.'The prior probi~l~ilities of 11o1exceeding 0.8 p p ~ nare large wheu
.-
n -"."
I
."..... n
m
I hhe actnal Cd co~irrnt,rat,ionis no greater than t.lmt tlrteshold. Con-
n
2 0 s 1 n 0s
I
-4.--. .. verscly, 1.l1cse prior prob;tbilities are s l ~ ~ awl ll m ~the itchlal Cd concen-
a a .A .
: ...-.
I
8
C
a
2
-
= 'I :
7.. +--- . .. . t.ration excecrls 0.8 pp111. 't'lic reasonably large 11-value (0.42) obtained
as t,lw clilT<,rcncebet wee^^ I.II~, Itrr.aris of l.l~eLwo disl.ril~ul.io~rs
reflccts the
.1.--. .
I
00 I . . ,
. l , ~ . ~ ~. , . ~ .
T ,~ ~
, . .
~ - . ~ ~T 7
~ - ~ . >
ability c $ Ni co~~c<~r~t.r;ti,ior~s Lo prrdirt wl~ct.lrar1,llr (:d c o ~ ~ c n ~ t m t i o n
0 1 2 3 4 5 6 0 1 2 3 6 5 6 cxrrwls 11.8 ~ ' ~ H I I .
Cd corrcentration (pprn) Cd concentration (ppm)
For rock types (left colurlm), the contrast betwccn tlrc two l~istograrnsof
prior probabilities is lcss apparent. 'She small U-valuc (0.12) indicates
j Cd 5 0.8 ppm t , l ~ ageology
t provides little infor~nalionon wl~el.l~er
the Cd concentra-
10 In1 Cd0.8ppm
00
Ition cxcrwls 0.8 pprll.
,.
h
rn1')(0.8pprn)=0.42 rn1"(0.8 pprn)=0.62
2 06
0,
02
0.0 05 1.0
....
1.5 0.0 0.5 1.0
. ~-
i5
Prior probabilily Pnor probability
,.
B(0.8 pprn) = 0.42 - 0.30
,. E(0.8 ppm) = 0.62 - 0.20
= 0.12 = 0.42
318 CHAPTER 7. ASSESSMEN'I' OF l.O(CA1, IlNCEl17'AIN'l'Y
2. Use that niodcl aud the B-value B(zr) to deduce the auto and crass
covariance models Cv(h;zx), C ~ y ( l rzr)
; tl~roughrelations (7.49) and
(7.50),
Hard-soft data (rock type) Hard-so&data (Ni)
I I
,>
l h e first ordcr r c l a t i o ~may
~ not l ~ csatisfied because the (co)kriging csti- Tlie difference between the t,wo IK estimates is thus a linear combination
mate is a lion-convex h e a r c o ~ n h i n a t i oof~ ~the conditionit~gdata, i.e., tlie of diffcrc~tccsbedweell 11< weights a t the two tlrresholds 2s and 27:
(co)kriging rveigl~t,~ can be negative (sce sccthn 5.8.1). Uccause the I< pro1,a-
bilities arc: not estimated jointly, Llie sccond condition may not he met either.
,,
Llie posterior cdf in Figure 7.26 s l ~ ~ w hol.11
s types of order relation (levialio~rs:
I 11e cliffrretrce (7.53) is thus zero, heltce tlrcre is no order relation de-
viation of type (7.52). In contrast, a sndden cl~aogci ~ two r consecutive
indicator senlivariogram models, say, from 2s to 27, leads to different
IK weiglits with an increasing risk of order relatiort problems.
I r r r p l e ~ r l c n t a t i o nt i p s
Inconsistent probabilities could be avoided by ir~iposingthe order relations
(7.51) and (7.52) as constraints in the kriging algorillrm. Such a solution is,
liowcvcr, cornpntalior~allyexpensive. Instead, the common practice is t o cor-
rect a posteriori for order relation deviations, see suhseqoenl discussion. Tlie
Figure 7.26: Examples ol order rclaliori pro1,lerns s11orr.n by ccdf vlzlues (black proportion and magnitude of these dcviations and the required corrections
dots) estinlated by an it~dicntorapproach. ' I h magnit,& of onler relation rlevia- can be rcdoced using the followirig implementation tips (see also Deutsch and
tions is, is practice, ~IIIICI, smallex titan i s this fic1,itioas exnmplr. .lournel, 1092n, p. 7 9 80):
322 CIlAl'TEli 7. ASSESShfENT OF LOCAL UNCERTAINTY 7.3. THE INI)ICA?'OH APl'ROAClf
z-variable
is r ~ ~ i n i ~ l i i a1111rlcr
c d the (Ii' + I) li~rcarconstraints:
.
v:i111vs [ ~ ' ( I I2kl(n))]y;
;
Reset all ccdf values t h t are
Oor 1:
i.:!,
('l'z~l~le 1.11ird ~ ~ I U I I I I I ) :
[0, I] to the closest hound,
l ~ r r t ,wit,l~i~l 'TO alleviat.e nol.ations, the corrected ccdf values arc l~creafterdenoted
[ F ( u ; ~kI(71))j*.
I
Upward Downward I Average
,Gl=/Ts
'l'hrcshold Faulty prob
0.05 I 0.05 1 0.05
2. 'l'hr same npproaclr is used to pcrfor~na downw;rrd correctiot~resriltiog
in a set of /i' ccdf values [l.'(n; rxl(n))j;' (Table 7.3, fourth column):
r H.eset all ccrlf vnlws thal. are rot wit,hin [0, I] 1.0 the closest t)ound,
0 or I :
ccdf values
7.3.5 111terpolati11g/extra~~01ating Ccdf model at ~i
Improved
I Cdf model Ccdf model at U;
3. Inference a i ~ di~iwlelingrcsourccs.
Except for median IK, the infixe~~ce ;ind i ~ ~ o d ~ leffort
i ~ r gi~~creases drit- I
,,
llic indicator approacll requires a preliminary coding of each piece of infor-
r i a vect,or of I i local prior prol~ahililirscorresponding to the I<
~ r ~ a l i ointo
sthtcs sk:
3 . 'I'hc series of L"' cdf models are rescaled linearly, srlcli that tire ccdf llifixeut types of local prior 'pdfs can he distinguished, ilel~endingon the
valucs at. l ~ l ~ r ~ ~ l r21-1
o l d sarid 11 art! 11011ored: n;tl.ur<~
of tl~c,l w n l inforn~at.ionnv;rilahlc:
A h;trd dalu~ir~ ( I I , , )is a precise ~rre;lsurcn~eitl
of tlic st,& si; a t location
11, ( n o uncertai~lty),l'he local prior prohahilities arc tl~rribinary (Bard)
ilrdicator data dcfinrd as
,"it,ll y
[ ( I k ) ) ] - [F(ll; ~ k ll(ll))]*
-
1 * z ) - I*,*(*,_,) 'I'hr local i~~fort~rat,iotttlray cot~sistof a zero prohal,ilil.y of occurrence of
ollc or lli(m! sf.ntes s t , ; for exanlpk, a parI.iclrlar land use is kliowli lo
Such a n interpolation a n i o ~ ~ r rtot susing tlre same i~itraclassdistril)utiol~~liodcl be ;il~sonti l l a giveti grologic e ~ ~ v i r o n ~ i ~Tehre~local
t , prior pdf is then
[I.'(r)]r,i,, a t all locations 11; in ot.lier words, the i ~ i t r a c l a sdistribuf.io~~
is non- an incr,nlplet~cvrxtor of liard indicator data:
co~lditiolml.A sirriilnr approncli allows one 1.0 increase llrc rt:solutiott of tlrr
lower and i~pp<:rtail classes ( z,,,in, 211 and (*I<,z,,,, 1.
A good alternative 1.0 t h pirccwise i~~t~t:rpolatio~~/cxtr:~~~olatio~i of the
sample cdf colmist,s of sirioot.liing aud rx1r:~polating the cdf usiug t.he ;tlgu-
rit,lirr~silitrotlnced in sectioi, 7 . 2 5 Ancill;~ryinfor~iiat,ioli(c.g.,calibration of a ~ ~ ~ t i n u o1,-data)
us may pro-
vide prior prohabilitics of occurrence for tile K states s k a t location ub.
7.3.6 Modeling uncertainty for categorical attributes 'I'lle set of local soft indicator data is t,l~e~r
defined as
Dislance (krs)
Meadow Tillage
,
Indicator vectors
1 2 3 4 5 6
Distance (km)
,Forest Pasture
. .. .. .,-i--;-i.
.. .
,
E
eg17os
1 <:. 0.8 ...+~~ *
c c 'l'he 111o<le1 of local u~~certaint,y
is typically post-processed to retrieve single
.-B vall,,~sS I I ~ ;,s
I
04 E 0.4
0.0 0.0~
0.0 0.4 0.8 1.2 1.6 0.0 0.4 0.8 1.2 1.6
Dialancs (km) Distance (km)
2. a11 estilni~tt:of the unknown value r ( u ) , whirl^ is op1,inral for a given
Meadow .Tillage crikriot~which need 1101hr 1c:tsl-sqwres, and
Probability i n t e r v a l
CHAPTER 7. I'i
AS'SESS
I
I
I
EN?' OF LOCAL UNCEItTAlM'1'
T h e probability that the unknown is valued within an interval (a, b], called
7.4. USING LOCAL UNCERTAINTY hfODEI,S
Local entropy
A nreasure of local uncertainty, not specific to any particrllar interval ( a , b], is
335
a prohability interval, is computed as the difference between ccdf valurs for provided by the entropy of t,hc local prohahility density function (Shannon,
thresholds b and a: 1948; Clrrislakos, 1900; Jour~rel;md I)aotsclr, 1093):
A 50% probabili1.y interval means that the unki~owr~ has eqrral probabilily to
lie inside or outside the interval (u, b]. where f(u; TI)) = DI'.(u;zl(n))/iJz is t,he c o ~ ~ d i t i o ~pdf, r a l and all zero pdf
Setting the upper boond b of l l ~ einterval to +oo provides 111sprobalrilily vall~rsare excluded from tlre integral.
of exceeding tlic tl~resholda: T h e boonded prlf wiI.11r ~ i a x i ~ r ~ r ~ r ~ r e nisl rthe
o p yu ~ ~ i f o rdistribut.iol~
rn (3.17):
all outcomes have the same probability of occurrence. T h e entropy, and hence
Prob {%(u)E ( a , +m]1(n)] = Prob {%(II)> nl(n)) uncertainty, decrea3es as the probability d i s t r i h t i o i ~focuses more toward a
= 1 - r ( u ; al(n)) (7.130) single value.
In practice, the range of variatiorr of z-values is discretized illto 1i non-
This latter probability is of particdar i~nportancefor environnienl;d applic,r- overlapping classes ( z r - ] , zk], and the corresponding Ii probability il~tervals
tions, where the focr~sis on the risk of exceeding regulatory lin~it,s. are conipoled as
Consider the l.wo ccdf ruodcls F ( I I ~rl(n))
; and F(u!,; zl(n)) in Figure 7.32:
is valued h e l w n ~1.5
The probability tlmt t l ~ cCd conce~~trat,ion ~ and
4 ppm, depicted between the two vertical dashed lines, is 11egligi1)lea t ,,
Illere are typically ~r~;trry
more rliscretiz;ttion valrles zk tl~alloriginal t,l~n%hold
u; in lhe low-valued part of tlre 1r;tnscct. values used for i~rdicat.orcoding.
T h e entropy of the coriditio~ralpdf a t Iocatiou 11 is t11c11c o ~ n p ~ ~ tas
eil
r The uncertainty is much larger a t uk, wit,h the U I I ~ I I ~ W Cd
I I COIICCII-
trat~ionhaving si~nilarprohahilities to he valued inside or o ~ ~ t s i dthe
e
interval (1.5,4 pprn].
.-C
zm 3
m
- - .- .-
.- .- .- . .- ,
. zero i f pr8(11) = 1 and p i ( o ) = 0 V k # k'
,I ,h e unknown is certainly valued in tlre interval (zrs-I, zr,]
i I i
.
e 0.5
i
n 05
I i ( I I I ~ I I ~ I I uI I~I~I cI Ie r t ~ a i n l y ) .
a i I i
I i I ~ r l i if pk(11) = 1/11' V k
00 -1 _-j,_ .._.n
A~ 0.0
!
- .I... . .
!
L-.,.?..,
Eaclt inlerval (zr-,, z r ] is equally likely to include lhe ~III~IIOWII
0 1 2 3 4 5 6 0 1 2 3 4 5 6 ( ~ n a a i m r ~nm~ ~ c r r t a i n t y ) .
Cd concentralion (ppm) Cd concentration (pprn)
Figure 7.32: (hsditioual cdl mod& provided by prol~nbilityk,ii;ittg a t locatiotcs
I 1 I 131 both c a w s , it linear int,erpolatiotl is prrlornwl i , e l w c w 1alxtl;tl.ed
bounds idmtiticd w i t h L l w 259 (:d cowentr;itiws r r v w llte s1.1~lyarvir. 'l'hi. did1cd
lia~.sdepict the prol,al,ility istvrval corrwposding to t l w inlerval (l.5,~i.O]pp111.
03
/ Cpdf model at u; 03
] Cpdf model at u>
linlike tltc entrq,y rtleastrre (7.6:1), llie varinrlcc is delined arotmd a specific
ccntral vall~c,the rncan of the conditional disLrihntion, and it depends on the
I< wit1iit1-class rncans i k .Beware that hot,lr t,hc ccdf near^ and its nl)per tail
Cd concentration (ppm) Cd concentrallon (ppm) mean can he very sensitive t.o t,lie clloicc of ill<: oxtrayolation model, as would
t,ltc c ~ t ~ d i t i o n variance
al ~ 1SIII)SC(IIICII~
~ ~ ( (see ) CX;LIII[IIC).
For highly asyr~~tnet,ric dist,riht~tions,a more rolinst rlrcasure of spread is
the interqoarLik range, dclinod as 1.l1edifference lietween t.lle tipper and lower
quar1,iles of t l ~ cdistril)utiol~:
/
a n c e , ellI,ropy, a n d interqnartile range) of t h e ccdf m o d e l s obt.ained a l o n g t h e
NIS-SW tr;irisect using prohahili1.y kriging. 'l'lle u p p e r tail is l i w a r l y int.erpo-
Tabulated bounds Tabulated bounds d s o p t i o n ) . At all locat.ions w i i h 1111:s a m r
......... ......... I;rt,ril l,etween t,ahulated h o ~ r ~ (first
-1_---,
- HYpecb. 0,=5.0
----.
Hyperb. 0=1.5
Hypecb. w=5.0
Hypsrb. w t . 5 rlatn configuration, 1,he o r d i n a r y k r i g i r ~ gvariance is t,l~as a n w wl~at,avcrLIla
~
r
-
-
1 2 3 4 5 6 I 2 3 4 5 6
Cd concentration (pprn) Cd concentration (pprn) Cd d a t a
zS0.74 z;=1 95
02=0.45 oZ=t46
Hrl=0.55 HR=O 82
'ii
IOFb0.37
00 00
0 1 2 3 4 5 6 0 1 2 3 4 5 6
Cd concentralion (pprn) Cd cancenlral~on
(ppm) '' OK variance 1 Conditional variance
0.3 i Hyperbolic ( ~ 5 . 0 ) 0.3 1 Hyperbolic (61~5.0) 10
r;=0 71
a2=020 3
- 02
He=O 55 B
IQR=O 37 2 0, I . , .. .
1 2
.
3
. _ .. . . . .
4
.
5
. ~..,
6
..
Distance (km)
00 00
0 1 2 3 4 5 6 0 1 2 3 4 5 6
1 lnterqualtile range
Cd concenlrat~on(pprn) Cd concentration (ppm) 10 I Local entropy
Hyperbolic ( ~ 1 . 5 ) Hyperbolic (10=1.5)
0.3
2;-2 62
$=18 5
H"=0.57 Hn=O 91
lQR=0.37
00
Figure 7.35: Four mcasures of local unccrtaiuty aI<,ng the N E ~ S Wtransact. As
Cd concsnlralian (pprn) opposed to t h e kriging variance, crdf statistics (conditional variance, eutropy, and
Cd cancentratlon (pprn)
interqnartilc range) account for d a t a values and indicate n larger uncertainty i t , the
right part oi the transcct where tlw range of f i l concentrrtions is laryer.
surrounding Cd c o ~ ~ c e ~ ~ l . r a l The
i o r ~three
s. other measures, wl~iclraccount for wlrm? zr m d Ek nri: dctined as in r q u i ~ t i o(7.64).
~ ~ '111~"1,-ol,timal" estilr~ate
d a t a vahrcs, indicate that the uncertainty ;!bout Cd c o ~ m n t r a t i o nis greater for. t l w loss frutctiun I,(.) is tlml tllc z.-v:rliw tlmt. n~ilrin~iars
llre cxpect.ed loss:
w l m e t.11~:range of data is l;trger, that is, in t l ~ ecmtral a ~ r dright parts of tlrc
transecl. (Figure 7.35). 'fhe condrast between t,lre two parts of the transect is
most apparent wheu using t.he interqoartile r a n g r U~~lilie inttq,olat,iou : ~ l g o r i t l ~introduced
r~~s in Cl~aptcrs5 and 6, liere the
d c l . ~ r ~ ~ ~ i r of
~ ia11
i t iq)ti113id
o ~ ~ e s l i ~ n i ~lt ier ~ c ~ i : dills 1 . ~ steps:
0
,,
l h e selcct.ion of a unique esLiu~ntcwithi~rthe rangc of possible r-values re-
quires an o p l i ~ ~ ~ a lcirtiyk r i o ~ ~One
. common crit.erion is the n ~ i u i ~ i ~ i a ; r of l,io~~
the irrq~act.nI.tac11erl tn the estiniation crror e ( u ) = z'(u) - r ( o ) that is likely
l o occur. Consider, for example, the estinratio~~ of a toxic conce~rt,r;ttion.Un-
dt!rest.in~;rtion of tllitt corlcentration (~tcgat,ivresti~riatior~ error) IIIZLY CRLISC ill
src i'igurr 7.46 (Iclt lop graph). T11c optintal csti~tl;rt,cis slmwn t,o be the
I~c;rlLhand Ic:;td tu i r ~ s u r n ~ clitirtrs
~ r c ;dlawsuits. Co~~vcrscly, ovcrest,in~al.ioi~
r~sp!cl.crlvitluc of l,l~r. ccdf at locatio~r11, also c;tlld t . 1 E-typi:
~ e s t i ~ ~ ~ : ~rccnll
l,e,
of the t,oxic c o ~ ~ c e n t . r a t i(positive
o~~ t . s t i ~ ~ r a l , error)
i o ~ ~ rrmy cause costly and
rd;~I,ior~(7.65):
unnecessary cleaning.
Eval~ratet,lle inipact or loss associated wit11 any error as a function I,(.) Kti
of that error, e.g., I,(e(u)) = [e(u)12. Given tlmt particular loss fttnct,ion, the 2:,(11) = z;<(U) Zk ' [ ~ ( I ~kl(71))
I; - 1 7 ( ~~k-11(71))]
~;
estimate z t ( o ) sl~orrldbr clrosen so as to r~lini~nioe the resulting loss; that, is, k=i
For c x ; ~ n ~ p l t,lw
c , E-t,ype e s t i r ~ ~ a taet I&, Z ~ ( I I ~depicl.er1
), by the vertical
dnslictl line in Figure 7.36 (right trip graph), is I .95 P P I ~ I .
I)etcr~r~ir~at~ion of tlrc a c l o d loss l,(z*(n) - ~ ( I I ) requires
) tlw aclonl v;tluc ICI,ypcaod (co)kriging ~ s t i m a t c sare usr~allydilfmenl, ;tltho~~gli holh are
~ ( I I ) wliicl~
, is u u k ~ ~ o wi lnl 1,r;tctice. flowcvcr, the uncertainty about, z(u) is opt.i~~m forl the Icnst-squares criterion, I'hcy differ in that t l ~ cccdf from which
111o11r:ladby the ccrlf I"(II; ~ [ ( n ) )which , is avail;hle. T h e idea is to use t,l~is t,lw IS-t,ypc cstirtrste derives depends OII data vxlucs. One exception occurs
model of u ~ ~ c e r t a i ~tor t ydatern~incthe expected loss: w11o11t.11~original r-vzlues are imrrnnlly distributed and the ccdf is modeled
using a rnulLiG;~ossia~~ approach. In i d ather situations, t,he advantage of the
1';-type estinr;~l,elies in the availability of a model of nncertainl.y much rickler
lhirn a 1 1 1 ~ 1krigirig
~ vnriarict: (~.ccallsectior~s7 1 . 1 and 7.4.1).
l3ec;tusc of' i.lx: s q r ~ ; ~ r iof~ ~
l.lw
g rwLi~~;,t,io~r
error ill oxpressior~(7.68), ex-
treme error values tend to have a grcat, impact on tlrc cxpect,ed loss. Thus,
t.hc K ~ . y p resLiul;,tc inay overly dcl~endon t l ~ :~r~orlelirig of t.l~eupper and
lower t.ails of t11c distr-ihtrtio~~ b'(11; rl(n)). For Llte positively skewed ccdf a t
location uk, t,lw i:-type r s t i n ~ a t cvaries froni 1.70 to 2.02 pp111,d e p e ~ ~ d i n011g
whether l.l~ew - p a r a ~ ~ ~ eof r hypcrlmlic tail 111odc1is set to 5 (short tail)
t c the
or I .5 (long t.ail) ('R~hle 7.4).
74 USING LOCAL UNCEKI'AIN'I'Y MODELS 343
00
0 1 2 3 4 5 5
Cd concentration (pprn)
Using a liucar fnnct.ion of the estimation error rather tbnrl (.he qlladratil:
Underestimation i
I
Owreslimation 0,3 1 CpPf model al u i
fnoct,ioo (7.08) allows one to rcdncc tile impact of t k i l rrlorlcls o ~ tltr
t expected
loss. For rxarnple, the loss call be triodeled as proportioual to the alrsolntc
d 1'l<,tl error:
cstilt~.
see Figure 7.36 (second row, icft, graph). 'l'he optir~balastir~iale(br this iilcalr
rthsolritr: deviation cril.crinn is thi. u~rrrliairof (.III. ccdl
0 1 2 3 4 5 6
Cd Concentration (ppm)
Z;=O 9-quanl~le
=3 86 pprn
I
I
0
.
1
. ,..~.~.7~.~,~
2 3 1 2
~ 3
r
4 5
k
6
h ~ ~
Eslimallon error (pprn) (pprn)
Cd conccl~trvl#on
Fignre 7.36: Poor loss functions and tltc corrrspatliq optimrrl cstinratcs r ; , ( n ; )
depicted by thc verticitl dnsl~rdlinr on the conditionid probability dislvibwtion.
(cost of ullllcccssnry cltming). Cotlvcrsely, rvlret~ predicting clcfir:iet~ciesin of tlrc ccdf dcpict,cd by the! vertic;tl dashed line itr Figurr 7.36 (right hot-
soil nutrient.s, t l ~ oittrpact of ~tlldcresti~llat,io~l
(~uiilue;rpplicat,io~rof correc- t.om graph). T h a t qtlantile rstimate can bc interpreted as t,lx threslrold
tive t,reaLil~ents)is likcly 1.0 be smaller t,lran that of overestir~tat,ion(risk of v i ~ l ~trle~ n has
t only a 10% probability of l~eingexccitdcil by tha ttttknown
rei.ardrd growtlr or pre~naturrdeath). 'I'hrrs, it is often critical t o d i s t i ~ ~ g u i s l ~ vnlne:
fro^^^ ovrrf:stin~at.i<~~~
tll~~lerrsLit~t;~t.iotr wllr~rirtodrling the i~llpactof est,irtration
errors.
'f'lre three typrs of loss fu~lctionsslmwu ill Figure 7.36 (lcfl colwnrr) allow
1. W] = wz (analytical) dcterrrlinatiorr of the opt.itlral d m a t e . lo ah-
a st,raiglrt.forrr.ar~l
,,
L Ire line;tr loss furlction (7.70) is then synllrtet,ric, and the optitnal esti- scnce of srlclr ;ill an;~lytic,alsolution, tltc expected loss cat, be cotnl~utedfor a
inale is t,hc 0.5-qtlmtilc, i.e., l l ~ cmediart previously discussed. series of z-vitltrcs, and t , h ~imc yieldi~lgthe strlallrst expected loss is retained
2. W l > wz as the optimal eslitrlate. Consider, for example, the followirrg asynlrrtetric
'1'111, impact of ovcrcsLi~~l;~tiot~ is larger 1,Itatr tllitl. of ~ ~ ~ r d ~ r e s t i t ~ lof
~rtioII loss hmcl,i~j~t,wllicl~severdy pet~alizcs~ t n d e r r s t i ~ r ~ a t ~ i m :
the same n~agttitt~dc. 'I'lli~s,I ) < 11.5, and the optirrral esl,irl~nt,c is smaller e(u) for e(u) 2 O (overestirnation) (7.71)
than the r~ledian(a conservative clroicc for detecthg deficiencies). Con-
sider Ll1e asyn~metricloss furrctio~~ in Figure 7.30 (t,hird row, lefl graph),
where wr = 0.9 a d w2 = 0.1. The optitrral rstinmte is the 0.1-quantile scc Figure 7.37 (left top gtapb). Krmwledgc of thr ccdf ~ilorlell;'(u;;z l ( d )
of the ccdf depictrd hy the vertical daslred title itr Figure 7.36 ( l l k l allows one to cot~rputet l ~ cexpect.ed loss for a series of Cd concentratio~is
row, riglrt gr;tph). Tlrat qunntile estimate can be iut,crpreterl as the rallgillg front 0 to 6 ppnr (Figme 7.37, holtonl graph). The oplilnal estilnate
threshold value that Itas a 900% probability of being exceeded by the is 2.4 ppm, wllich correspouds 1.0 t,he ~nittirr~~l~rlex~)ect,cd
loss.
~titkrrownvalue:
1. T l ~ ilnpact
c of the estimation error may vary from one area to another;
7'1111sa large esliruate go.l(n) indicates tlrat the onknowrr value is cer- for example, the underesti~natio~r of a toxic cot~ceritrat,ionis likcly t o
taiuly large a t location u. be more prcjudicial in residential areas than in industrial yards. One
3. Wl < wz sllould llretr define a loss f u ~ ~ c t i oLi l( e ( n ) ; ~ lth;tL
) cl~a~rges
locally, de-
T h e impact of overestimation is s ~ n a l l ethat1
~ that of onderestit~latiot~
of I j ~ n i l i on
~ ~ ~11etl1er
g o is in a l industrial or rtrsidentiitl area.
the saritt! t~ragnitude.'f'hus, p > 0.5, and the optilnal eslirrtate is larger 2 . 111 pmctice, LIE losses associated with both l.ypes o l estimation error
than the median ( a conservative choice for detectit~gpollutiorr). Con- may be difficult to evaluate precisely. IIowrver, the riser getrerally knows
sider the asytulnetric loss ftlliclion in Figure 7.36 (left hottorn graph), w11icl1 type of error would be most prejudicial in the situation a t hand.
wltere wl = 0.1 and wz = 0.9. The optirrml estimate is the 0.9-quant,ile Simple asyrnrnet,ric loss functions, such as rxpressions (7.70) or (7.71),
346 CllAP'l'1~li7 ASSESSMENT 01,' LOCAL IJNCl<lI?'AlN?.Y 7.4. USlNG LOCAL CINCElUXlNTY MODELS 347
10
1 Underestimation I
I
Overest~mat~on Ccdf model at u>
7.4.3 Decision making in t h e face o f uncertainty
Many ir~vestigationslead l o irrrportanl decisions, such its cleauing Iramrrloos
,----- areas or correcting for soil deficiencies. Decisions are most often made in the
face of uncertainty because concentrations in toxic or nutrient elenlet~ts;Ire
rarely known with certainty. Given n conditional cdf inodel, there are several
ways t,o account for such ~ ~ n c e r l a i r ~iut y decisio~i-uiaki~~g
proc(?ss.
i . 1
_.-
7 3
..._-. .
4 5 6
Exceeding u probability threshold
Cd concentratton (ppm) Consider the decision of clranirlg localions coritaminattvl by c a d l n i u r ~ ~11 .
straiglttfnrward approach consists of declirring contnmi~lat,edall locatio~ls
where the of cxcectling the tolt!rnl~lcrrraxilritlr~~ 0.8 ppin is 1;rrger
tlrari a given probability tbresl~old.There is no doubt. that locittio~rTI!,,wil,ll
a 95% probability of exceeding 0.8 ppm, is a prime i:;tnrlirlate for ren~edia-
t i o ~ I)ccisio~l
~. l ~ dirlicult. nt. loc;rtion XI;, wlticlr iias ;L 34%
rrlaki~rgis ~ n u c t~lore
grolmbility of exceeding the tolerable maxi~nunr.111stre11 it CIISC, tlrc clwicc of
a prob;~bililyt,lrrcslrold is subjective a d d e p e ~ ~ on d s political arrd social deci-
sions; for cx;itnple, ;r 34% proljal~ilityof contanrination 111;rylw u~raccel~table
for r e s i h t i ; t l amas, ycl b ~ l r d r l cfor i ~ ~ d u s l r iyttrds.
al
In ptacticc, rctncdi;tl ~rrcasuresare applied to an area or block V ( u ) , not lo
l._.l__._..r__ a si~iglcIocatiotl 11. Decision rnaki~rgcar1 t l i e ~procred ~ in ~rianydifferent ways,
the regulatory tlireshold applies to a block, say, ZY fnr
1 2 3 4 5 6
Cd estimate (ppm) delxmtling on \vI~et.l~er
a truckload-sized volu~nc,or t,o a single location, say, z!j for all anger-sanipled
.
size vol~ttt~c. Exiunples of diRerent qq>roacl~cs follow:
Model the ccdf a t N localio~ts11; discrrtizittg l.lie block L1(ll), and r c ~
trieve t t ~ cA' corresporld>g pmbabilit.ies of exceeding l.he crit,ical Lliresli-
old, 1 - [ ~ ( u ;zfl(n))]
; . 'rlict~,decide to clc;in the lilock i f the prolxk-
bility tl~rcsl~old y, is excccded by a given proportion of locations.
3. Wl~atevr!r (.heopl.i~l~ality
crilcriou reliti~~cd,
I.lw rcstrlti~~g ustir~l;~l.r:
ri,(r~)
honors the soft i~lformationavaild>leal, location 11. Consider, fr~rcx-
atnple, that the soft infor~lratio~t of a co~~st,raiut
cor~sist,~ in1erv;rl ( a , 61
for the t~nk~tnrvr~
value ~(11).The post,crior cdf at. II t . l ~ e tiikcs
~i the form
2. 'l%e risk ~ ( I I of
) wro~iglyclassifyit~ga location u as safe (false neg;ttive)
is
Note that the risk @(u)is not defit~cdwhere the risk n ( u ) is, and corrvcrsely. Figure 7.38: Median and 0.9-quantile cstiorates deduced itom the prohatrilily
Agairt, o w r x c s tlie difficr~ll.pmhlent of clroositrg a probabilit,y tlrresl~oldfor kriging ccdf models, and the corresponding ~nisclassificatioarisks: a is the risk of
r d i ~nisclassificutionrisk.
wrongly declaring tlmt a location is hazardous on tlie basis that the estimate &(u)
exceeds 0.8 ppm, whereas p is the risk of wrongly declaring that a location is safe
1"igrrre 7.38 (t,op graph) sllows l,l~cmedim ;tmI 0.9-qualrtik: estil~r:ti.csrc-
on tlic basis tlmt t.Lr mtimate i;(a) is smaller khan 0.8 p p m
usittg prol,;hility kriging. In Irot.11
t b h a d l'roni tlrc ccdf ~ ~ i o i l e ol>Lnit~ed
ls
cascs, a locatiott 11 is declared lrnzardous with s risk n ( u ) of f&e positiw i f
the cstinrate r:xcccds t,l~ecritical thrcsltold depicted 11y the lioriaontal rlasl~ed risks a ( u ) . For example, location 11; is classified as safe with a risk
line (Figure 7.38, middle graphs). Conversely, if the estimate a t u is s ~ t ~ a l l e r P ( o ; ) = 0.24 on the basis of the rr~ediaricst.i~nate(Figure 7.38, bottom
than 0.8 p p ~ nthat
, locatiorr is declared safe with a risk P ( u ) oSf;tlse negative left graph). T h e same l o c a t i o ~u;
~ is classified as hazardous with a risk
(Figure 7.38, Imtt.om graphs). Note the following: a(u',) = 0.76 on the basis of the 0.9-qtrantile estimate (Figure 7.38,
At ally part.icular locatiot~11, the magnitude of the rnisclassificatio~irisk rniddle right graph). In the latter case, all unsar~~pled locations are
dt!pends on t.lw c a l f inotlcl, trot. on f.11~particular csf.i~t~atc
(tnrdi;in or declared contan~inatedwith a large rnisclassification risk a in the low-
(I.!)-quani.ilc) rel.aincd. valued (Icft) part of the (ransect.
7.4.4 Simulation
Let f,'(u;:[(n)) he the cottditional distribution (ccdf) ~ n o d e l i ~t,lre ~ g uncer-
T h e renledialion cost is Irere nrotleled as a constaut value w z . For ex- tainty about the unknown r ( u ) , l t a t l ~ e rtlran deriving a sing11: estin~atkd
ample, the cleaning procedure atnoulrts to removing t h upper layer of value z'(u) from that ccdf, o w rnay draw from it a series of L sinrulatcd
soil, hence the cost is independe~ttof the actual ujucentration and W ? is v a h ~ r szll)(ll), I = I , . . ., L. Each value z(')(o) represents a possiljlc outconic
in units of moury only. An alternative consist.^ of modeling the remcdi- or realization of the 11V Z(11) modeling the uncertainty a t localion 11.
ation cost as proportional to the estimated colrtaminatiorr [z;(u) - z,], ?'lie "Monte-Carlo" sitnulatio~rpruceeds in two steps:
wliich calls for a prior estimate of the unknown concentration a t 11. 1. A series of 1, ittdepe~rdentmndorri trurnbers pi'), I = I , . . . , L, unifi~rlnly
'The conditional cdf ntodel F ( n ; rl(n)) allows one Lo dctcrnriue t,lre ex- distributed in [O, I], is drawn.
pected loss att;tcltrtl to llre two types of classification as 2. T h e 1t.h simulated value :il)(u) is idejrt,ifictl with the l,(')-~lon~~tilc
of tlte
ccdf (Figure 7.40):
z(l)(11) = ( I ;( ) l = 1, . . . , 1, (7.76)
Case: ctb = 0.5 1 Case: 1% = 2.5 TIE I, si~nttlatcdv;rlrrcs r(')(u) nrr dislril~rttedarrrirding t,o the co~tditional
Cdf. Ill<l?<Xl,
Il~alllicost Health cost
......
froto I,he defiitiI.iott (7.76),
Remediation cost Remediation cosl
..3 .. -~. ...... ,.-, = ~ ' r o t ) { ~ i '<) l.'(u;zl(n))]
1 7 3 4 1 6 O l Z 9 d i B
Cd concenlral#on(ppm) Cd concentralion ippm) since F(n;zI(n)) is inolmt.otric t~otrdecreasing,
= 1 q u ;Z[(?L))
p(') arc n~iikrrtrtlyrlisl.ril~t~bri~l
sittcf, 1.11,. r;rttrlo~titt~tt~rl)crs i ~ [(I,
t I].
'I'lris 11ropcrt.y of cctlf reprodncl,iou allows ottc l o approximate any nlo-
trro~rl.or qrta~rt.ilnof llte ct~lidil.ionaldislributiol~by tlte corresponding mo-
111cnLor qttmtilc of the Itis(.ogr;tnt of tttilny rc;tlia;~l,iol~s z ( ' ) ( r ~ ) Tltns,
. Monte-
. - Sale (:atlo sitt~rllatiot~ providcs an allerttnt.ive t,o the ;tl)l)roxirt~;ttiot~s of type (7.64)
1
. , . -.
7 3
.
.......
4
Contaminated
5
. ..
6
.
1
.-~T
2
....... Contaminated
.- --.~-.~
3
..... ..*-
1 5 6
o r (7.65) for ~ ~ t t t p r ~ thel h gconditiotial variance o r L t y p c eslitnale. Note
nistance ( k a ~ j Dirlanm (km)
t,lio~~gli t h a t dctcrmination of the qn;trrt,ile value (7.76) still rcqt~ircsi~rlerpo-
1;rt,io11and cxt.rapolatiot~from calculated ccdf v;iltlcs.
I Resulting classilication Resulting classilication Stochastic simulatiorr can b r rxtrttded to tlrr modcli~igof uncertainty
; h u t Llte oxil.puf. v;1111cof m y cotnplcs trattsfer fuoct,iott a t m y localiorr
Contaminated 11. (:o~tsirlcr,for rxamplc, l l ~ epriiblett~of modclit~gthe t~t~certairtty about
IJ t l loss~ associated with classifying t.ltc location 112 a s safc. I'igurc 7.11 (left
graph) shows 1l1c itist.ograttr of 1,000 s i t ~ ~ t ~ l n Cd-v;dnes
tetl dr;rwrt frorrt the ccdf
nlodcl rlcpiclcd in F'igurc 7.32 (page 334, right, graph). 'l'hc sirt~ulatedhis-
f.ogr;tlli is close 1.0 1.l1e condil.i~mnlpdf nrodcl s11mvtt it1 i.'igtlu! 7.33 (right
grlt), IS it S I I I I I Tht: 1,000 n:aliaations z(')(n2) can he fed into
tltc loss futrctio~t(7.72) t o yield a si~~tttlat,ed distributior~of costs a t loca-
i r e 7 . : 'Swo pairs ol loss l~triclionsn~odolthe loss associated with all ill- tiott uz, y ( l ) ( i ~ 2 )= I,~[z~')(IIz)],1 = 1 , . . . , I , (Figure 7.41, right graph). I11
f:mrect classificatioa of a location as safe (ill health) or coatarniaated (snneccssary
clea~~irtg).As the rr~~iedialirrncost wz increases, tltc expected loss att,ached to u~tdtte
cleaning irrcrrasrs relathrly t o the expeclcd ltralth costs, leading to more lor:ations
bving r-lassilird as safe. Simulated Cd values
0.3
1 Simulated costs
0 1 2 3 4 5 6
Cd concentration (ppm)
As with the nicasllre (7.77), t l ~ elocal elitropy equals zero a t ally datum
location (no o~~cert,ainty).At otlrer loc;iI,io~rs,t.hc local entropy is valt~erli l l
(0, 1111<].
T h e upper bound (Inli) is the lrraxirnu~nelltropy associatcd with
the uniforrn distribution, b1(11;sk/(n))]' = 1 / K , V k . 'l'l~us,;I st;~~rdardizcd
measure of the local entropy is
,,
l h e local entropy a t locations 11; it11011; is 0.28 and 0.87, rcspr:ctively. A g i ~ i l ~ ,
the uncertainty is larger a t 11;.
7.5 Performance Comparison
T h e risk of the C d , Co, or I'h cotrce~rt,ralionsexcecditrg the loleral)lc max- MG estimates olK estimates
imum within the study area is modeled using iliffcrerlt multiCaussiarr- and
indicator-based algorilltrns. Next, regions where remcclial measures sl~oold
he taken are ileli~reatedby applying classificat~ior~ criteria introduced in sct:-
tion 7.4 l o the local ccdf r~totlcls. (:lassi{ic;~i,ion rcsults art: compared \vil,lr
actual metal cortcetltratiolrs known at, test locations.
of the (:<I
2. T h e four otlrcr algorit,ltrlis rcquirc a prior i~rdicatortr;r~tsfrrr~r~
data. Wltere;~sordinary indic;it.or kriging (olK) uses o ~ d yindicator da1.a
defined a t the tlircshold being rstitnatcd, 0.8 ppl~r,ordinary indicator
cokriging (oICK) accolnrts for t,l~rer:additio~ralt,lrrcslrolds 1.38, 1.88,
and 2.26 ppm. 11lsle;rd of indicat,or t,ra~~sfortrrs a t thrcslrolds differe111,
from 0.8 pptir, prohal)ility kriging (I'U) rises orlifnr~lrl,raosfnrn~sof Cd Local prior probabilities SKlm estimates
data as s ~ c i m d a r yi n f o r m a t i o ~ ~ .
3 . T h e l;rsl i~~rlicaf.or
nlgorithlt~i~~ciirporntcs soft, ittfor~r~;rliotl r<.lat.cd to gc-
t,o t . 1 1 ~i~rdi~~;rl.or
ology i t 1 i~ddit,io~t dal.;~;at, l . l ~ r i ~ s l 0.8
~ d ~pp~tr.
l A wliI)r;k-
tion of rock ILypes allows w e t,o rrmvert 1 . l ~ :gwlogic trrap 01 L"ig11rc (i.6
(lefl top graplr) into a map of loc;il prior pn~l,;~ljilitir:sof cxceivlitrg
0.8 pprn, slrown i l l Figure 7.44 (lcft. Im! graph). 'I'l~~w:prcd~;~I~ilit~if~s
are tl~ivru s d 21s l ( m l IIIWUS itr t,Iw siwpit! lli <7sl.i~~~al.or
(7.43); SW I . h
rcsulti~rgprol,ahilit,y tlrap i l l Figure 7.44 (right. Irottom grnph).
360 CHA1"I'ER 7 ASSESSMENT OF LOCAL UNCE'KlAINTY
O p t i m a l e s t i r n a t o s and m e a s u r e s of local u n c e r t a i n t y
OK error variance Conditional variance
Figure 7.45 shows LIte ritaps of ordiuary krigiug Cd e s l i ~ r ~ a t e(I.op
s graph)
and four esI.i~~tates ~ olK-rclalerl ccdfs. Altltougl~boLh OK
retrieved f r o ~ rthe
and 1.:-type estiri~atcsare optimal for the lea&sqoares criterion, they are rot
identical. As mentioned in section 7.4.2, the two estimates differ in that the
ccdf from wlrich tlre E-type estimate derives is deper~ilerrton the d a t a v;ilnes.
' I h t differer~ceis frirther illustrated in Figure 7.46, which drows the maps of
the OK variance and three nreasures of spread of the conditional d i s t r i b u t i o ~ ~ s
Cd OK estimates
Figure 7.46: I.'our nrcasures of local micert.iiaty over t l ~ cstudy area: ordinary
kriging vnriaucr awl tllrce ccdf statistics (I-oaditioanl variirnce, entropy, and ia-
lkrqu;~rI.il~~
~;mgc).
D e l i ~ n i t a t i o of
i ~ continninated areas
Figures 7.47~~7.49 sliow three subdivisions of the stndy area into safe regions
and r e g i o ~ ~where
s re~nedialrneasrires s l ~ o d dbe taken. 111all t,hrce cases, t.l~e
conditional crlfs were cst;~l~lisl~ed 11si11gordinitry iodical.cx kriging. Tlw Lhrcc
criteria for classifying a location as corrta~~~inirtcd follow:
1 . T h e probability of cxcecding the tolcrablc n i i t x i n ~ u0.8
~ ~~~ ~ I isI liuger
I
than the marginal probability of cotrla~~ii~~al.ionr~stirnated in section 2.1.2 Risk n
(Fignn: 7.47).
60
C
.-0
5 so
.-
r
40
-0
."E 30
* 20
10
Classilication
0.0 0.2 0.4 0.6 0.8
Probability p Probability p
R c p r o d u c i n g nroi1c:l s t a t i s t i c s
Instead of a map of local best esli~rrat,es,slociiastic si~tnllatio~i gcueratcs a
1 2
I
3
4
L
5 6
L
m a p or a realization of z-val~~es, say, {z(')(n), rr E A) with 1 dcnoting the it11 2 3 4 5 6
realizalion, which r q ~ r o d u r e sstatistics deelncd most consr:q~letrti;d for t,he Cd concentration (ppm)
prohleni in h a n d Typical rcq~~isites for SIICII si111111at~~d
tnap art, as f o l l ~ ~ ~ v s :
1. Data values are lio~~orcd
a t tlieir locat,iur~s:
Mean 4370
Utilizing expression (7.72), one can ulorlel thr: cost ;tssociat~cilwit11 rlcclaring
safe a locatioll 11'.I as T
1"-
cost
Kriged map
fixed number ~ ( I I ' )of closest original data are retained no nrat,ter how many
prcvioosly simulated values are in the neighborhood of n'.
1. 'l'l~e s e y ~ ~ e n t i asi~rrulat,io~~
l algoritlrrn requires the deter~r~iuation of a
conditional cdf a t each location being sin~ulated.Two r ~ ~ a j oclasses r
Visiii71g srquerm
of seqoential simulatior~algorithms can be disti~rguished,depending on
In theory, the N nodes can he sirnulated in any seq~renceas long as all data
whether the series of conditional cdfs are determined using the multi-
and all prcvio~~sly siniulaled values are used in the dcterlrrinatiou of ccdfs.
Gaussian or the irrdicator f o r ~ r ~ a l i s introduced
~ns in the previous chap-
However, because ouly neighboring data are retained, nrtilicial continuity
ter.
may he gcneratcd along ;L d&rrninistic path visiting tlre N nodes. Hence a
2. S e q u e ~ ~ l i sirr~r~lation
al ensures tlrat dat,a are lronored a t bheir locatior~s ranclo~r~ sequence or path is reconrmendcd (Isaaks, 1990). An exception to
(cor~dil.ionalrc;~liaathos).I ~ ~ d c c ral t, any d n t u t ~l o~c a t i o ~u,,,
~ the sitw t.lris practicc is t.he ~ r ~ ~ ~ l L i ~ ) l c - g r i d s i t ~procedure
r u l a t i o ~descrihcrl
~ sul)sequenlly.
ulalrd value is drawn f r o ~ ra~zero-varial~cc,unit-step co~~dit.io~r;rl crlf Wltw g r w e r a t i ~ ~swtrral
g rc;dizat.ior~s,~ l c~o ~c ~ ~ p u t a t i ti111e
o ~ ~ acan
l be re-
with Incan equal to the z-drtturr~z(u,) it,self. If large rncasure~r~t~nt er- duced rot~sirler;~l~ly by keeping the same rando~tlpath for all realizations.
rors r e ~ ~ d qocstio~~alilc
cr the exnct matclring of data values, one slrould Irt<lecd, i . l x h' kriging systerns, O I K for eaclr node o;, nr:ed be solved o d y
allow the sirrn~latetlvalues Lo deviate somewl~atf r o ~ ndata a t their loca- once since thc N conditioning data conliguratiorrs remai~ithe sarne from one
tions. If the errors are rrormally distributed, the si~nulatedvalue could realizatiou to anotl~er.T h e trade-off cost is the risk of generating realizations
be draw11 from a Gaussian ccdf centered on tile d a t u ~ nvalue and with that are too similar. 'rl~erefore,it is brtt,er to use a different random path
a varial~ceequal to the error variance. for each realization.
1)efinc a r a i ~ d o r path
i ~ visiting each node of thc grid only once.
At each node n', determine tile parameters (mean and variar~cc) wliere tile error component l<(n) is independent of YiT(lr), I':{~~;(II))= O and
of tlie Gai~ssiarrccdf G(n1;gl(n)) using Sf< \villi t h ~ :normal scorc Var{b'(rl)) = o k T ( u ) . H.ecall the ISI' niodel Y(n) = ~ ( I I+) l i ( u ) , w l w c tile
srmivn.riograrn model yy(11). The condil.ioning information (71) trend component m ( n ) is rrlodelcd as a linear conlbi~lalioiiof ( I i + l ) f ~ ~ n c t i o n s
consists of a specified nnrnbcr n ( d ) of botll normal score c1at.a of tlie coordinates. Accounting for t l ~ oK T systcnr (5.26), one car1 s l ~ o wthat
~ ( I I < and
~ ) v:tl~lr:s2 / ( ' ) ( ~ ~
siinulatctl
i) a t previously visited grid nodes. the simulatiori rnoilel (8.8) does not, reproduce the residuitl covariance (:rr(ll)
u~ilessIi' = 0 (silnplr: krigiiig case):
Draw a si~nnlntcdvalue l / ( r ) ( ~ ~fro111
' ) 1.lmt ccdf, and i~ildit to t.lrc
d a t a set.
I'roceod to the next iiode along tlic r a ~ i d o mpath, and repeal. l,l~e
two previous steps.
Loop until all N nodes are simulated.
3. T h e final step consists of back-transforn~iogtlie sirriulated normal scores Consider now the situation rvlierc the unknown t r c ~ ~m i l( n ) is idenl.ified with
{ y ( ~ ~j ) =, 1 , . . . , N ) into simulated values for the original vari- its ISI' estiniate rnkT(n). Since m;iT(u) is know11 everywhere, sinnllation of
able, wliicli amounts to applying the inverse of the r~orlnalscore trans- Llie normal score variable Y anmunts to simr~latirigthe r e s i d ~ ~ aR(n) l llsillg
form (7.8) to the simulated y-values: simple kriging and the resirlnal covaria~lceGII(ll). T h e sinnllalion model (8.8)
becomes
w l w c the S1i weights AZK(u) arc provided by an SIC system of type (5.10).
+
Acco~mtingfor the relation nt;<T(~l) IZSf<(u)= YliT(u), O I I C can write thc
sin~rtlnt.io~l
n~otlel(8,s)) as
3 4 5 6
Distance (kmj
1 2 3 4 5 6
;IS in t l ~ :SIC sysleni (5.9). Distance (km)
1)clertitining a kriging cslinlale and a difrerent k r i g i q variauce would
require solving two systems a t each location 11, ltence it is 1101practical. 'I'lie 1 Simulated normal scores
solution adopted by program sgsim in GS1,1112.(1 is to allow tlrc user to ittpi~t
a conslant v;rriancc correction factor 11111ltip1yingall kriging variariccs (of
normal scores). 'fl~isfactor is to be detern~inctlby trial and error to ensure
that the variauce of all sitttolatcd norrnal scorrs for any particular renliatttiol~
is indeed I as required hy the theory.
A'olr
Actnally, rcproduct.ion of t h covariance model (.:y(h) does 11<1trequire the 1 2 3 4 5 6
succcssivc ccdf ntodcls to be Gmssian; Lhey can be of any type as long as Distance (km)
l l ~ c i rnleans and variances arc deter~rrinedhy sitnl>lekriging (.lournel, 1994a).
,>
I Iris result leads to an iniportant tl~eorelicalextensior~of t,he seqrlmt.ial s i n - Simulated Cd values
ulatiot~paradigtn wl~erebyoriginal I-athribtt1.e values are sin~nlat.eddirect,ly
wil.ltot~lm y prior n o r ~ n ascore
l transforn~.'I'lie algorithn is tlirn ctdled direct
1
scqucntinl sitnulation (dssiln), see Xu and Joornel (1994), Xu (1995b). 111the
absence of a normal score transform and hack-transform, there is, l ~ o ~ e v e r ,
no control on i . 1 ~histogranr of s i n d a t e d values. Reproduction of a target
histogram can be achieved by post-processi~lgthe dssirn realization using the
algorithms introduced in section 8.9.1 I. . .--
-
1 2 3 4 5 6
Distance (km)
Consider the conditional sininlation of Cd coriceritrations along the NE-SW Fignrc 8.4: Sequential Gaussian simulation. 'I'l~eten original Cd rlala are first
transect. Figure 8.4 sllows the main steps of the sGs approach: transforrnrd into ten nortnill score data, then conditional sirnulation is performed
i n t l w n o r m a l s m c c itllirrl row). IFisnllv. tltr simnlatcd sorn,al scores are back-
8.3. SEQUENTIAL GA USSSAN SIMIJLATION 385
CIIAP?%IL 8 ASSESSMENT O F SPATIAL UNCER'lAINl'Y
Figure 8.5: Graphical trartslorrrl of tcn Cd data into ten normal scores (top graph).
Using this correspondence in reverse, the 106 simulated normal scorrs drown in
Figure 8.4 (third row) are hack-transformed into sinlalaled Cd values. 'I'lw ten
original Cd data depicted by the large black dots are retrieved exactly.
1 T h e tell Cd data are first transfornicd into tell normal score data using
the t,ransfor~~lationTable 7.1 011 page 269 (Figure 8.4, t,op graphs).
2 . S e q ~ ~ e n l isimulation
al is then perforn~edin the norulal spact! r~siugt.lle
semivariograr~~rr~odel of Cd nornlalscores show^^ a t the top of Figure 7.6
(page 274). At each silnulaled node, the five closest Cd norriial scorc
data and the five closest, previously si~nr~laterl values arc retained in the
SK system (two-par(. search). Figure 8.4 ( t l ~ i r drow) shows the profile
of sirnulatcd normal scores.
8.3.2 Accounting for secondary information
3. Last, the s i n r ~ ~ l a t enorrr~al
d scorcs are lrack-transfor~ncdint,o si~nolal.etl
Cd vali~csusing expression (8.7) (Figure 8.4, botto~rrgmpll).
~ < I ~ I ~ Inor111a1scores {yI0)(&I;),j = I , . . . , N ]
6. I ~ : L ~ . ~ - I , ~ : ~L IIE I Ssi111111atcd
into si~rnrlatcilvalurs for (.he 1)rimary variable { r(1), 1 = 4 I ( ( 0( I I,~ ) ) ,
j = I , . . . , A'].
{zI( 1 ' ) (I%;),j = I , . . . , N ) , I' # I , are o l ~ t a i i ~ cby
Otlwr rcaliz;tl,io~~s d repeating
wl~ereI;?(.) is the i~rargir~al cdf of the variable %i the cntire process wit.ll a rlilfcrcnt rand0111 path for each realizatiot~.
'l'l~encxt step is t,o check wl~rtlrerthe auto and joint two-point. distrihr~.
Lions of t.l~en o r n ~ ascore
l RFs {YI(II),. . . , l ' ~ ~ ( 1 1J )are rcasotinbly normal. In C o l o c a t r t l r o k r i g i n g a n d thc, M a r k o v t i ~ o t l t ~ l
pract.ice, tlw Iji(:ii~~ssia~~ ~ISSIIIII~~~ isOchecIte~I
II (mly for rarh variihlc srpa-
rately 11si11g(IN. proccdrlre (lr~scriberlin secl.ion 7.2.3. 'f'l~e~ ~ ~ u l t i v a r i n t , ealgorit11111
s(k n g sense t,lrat a frill cokriging
is d e i ~ ~ ; r ~ ~ cinl i t,l~e
If the two-point distril)ut.ior~of each I I ~ ~ I I I :scoreI~ variable appears rea- systeru triost, be solvcd at i d i grid node bcing sirr~olal.ed.I ~ ~ r p l e ~ ~ ~ c of ~~tatio
sol~alrlynormal, a r~rultivari:rle r~~oltiplc-poinl Gaussian RI" 111odt4is then t.l~e;rlgoritl~mis alleviated by using coloc;ited cokriging a11i1the Markov-type
a d o p t c d Under that tnodel, t,lre ccdf p;lran~c:lcrs (mca~ra1111 variance) of ruodel i ~ r t h d ~ l c eillr l st:ctiot~6.2.6.
the nornial srrrre variablti Yl(11) are cqoal to t11e s i ~ n p l ecokrigir~gestimate ,l o reduce t.hr c o t ~ ~ p ~ r t a l i ot,imc
% ~ t a land avoid i11stabilit.y p r o b l m ~ scaused
y(51ZK(11) and correspoltdit~gsi~nplecokriging variauce (~(51)?~~(11))~ o l ~ t i i i ~ ~ e d by possibly highly r c d u n d a ~ ~sccondaryt infornration, ouly thc s e c o ~ ~ d a rdata y
from all rrormal score data yi(u,,). 'I'he ccdf is t,l~enmodeled as colocaterl wit11 the r d e beil~gs i r u ~ ~ l a t ccould d lie r e l a i d in the silnple
.
cokrigiug systeul. T h e trade-off ctlsts of such an approncl~are the followirrg:
'I'lwre is no control on reproduction of the correlation between priniary
and sccond;~ryvariables a t lags 1111 # 0.
. 'l'l~rsccondary vnriables must l ~ availal~lt:
n a t all si~nulatcdgrid nodes. If
this is uot the casc and the secondary i ~ l f m ~ ~ a tisi odense,
n ~ ~ o dvalues
al
of t,lw sccondary vnriahlcs could he ir~tr:rpolatrd or better simulated
prior 1.0 sin~uli~I.iol~
or I,he pri~naryvnri;tl)le.
Il~rliltrfull c o k r i g i ~ ~colocatcd
g, rokrigil~greqttires only tltc it~ferenceand
modcling o f t.lre pri~narycovariance function C y , y , ( h ) and the cross covari-
ance fi~ticl.iortsCy,,,:(h), i = 2 , . . . , N u ,defined as
3. Draw a simulated value y(ll)(ul) from t,hatccdf, and add it to the data
set. I'htcl~ cross stmivariogram rnodel yy,,y,(h) is t l ~ e rderived
~ as a mere linear
rescaling of tlto p r i l ~ ~ a rscmivariograrn
y model y,;,~, (11) Ily the correlation
I . Proceed to the next node along the raodo~rrpath, and repeat the two coolficient py,,lr.(0). One sl~ouldclteck wl~ether,the proporl.ionality rela-
previous steps. tion (8.10) actually applies to the experitnental cross semivariograms.
8.3. S E Q U E N T I A L GAUSSIAN SIMULA'flON
Example
Consider the conditioiriil sirl~r~latiou of Cd concentratiot~sa l o ~ ~the g NE-SW / Normal score data
transect using as exhar~sl,ivelysampled secontlary infor~n;it.ionthe Ni blocli
estiinatas s l ~ o w ~; r~ lhe
t hott.on~of Figure (Ll. 't'li<i it~tiltiv;~rini.~~s(ls
;dgurit.hnl e
proceeds as fdlovvs:
2. Sequent,i;rl s i i ~ ~ ~ ~ l is
i htm
l ~re npcrfornied i l l tlro i~orrlralspace. At e;u:l~
simulated ilode, tlre conditioning infor~rrationconsist.s of the live clos-
est nor~rralscore (><I data, the colocaterl n o r n ~ a lscore N i d a t u ~ r air4
~,
Llle five closes1 previously si111~11alerl normal scorcs. 'I'l~ecoloc;it~rds i u -
Cd normal scores Ni normal scores. Cd-Ni normal scores
ple cokriging systen~is solved u s i q llre direct and cross se~riivariograin
1.2
models inferred from all 269 ~rorrnalscores available over tile study area
(Fignre 8.6, second row). In this cxarl~ple,the Markov-lylw approxima-
E
tion (8.10) was not used I~ecauscthe experil~rel~l.;rl cross scrnivi~riograrn
between Cd and Ni norir~alscorcs is 11o1proportior~alLo the scmivari-
0.0 , -
ogranl of 131 normal scores. 0.0 0.4 0.8 1.2 1.6 0.0 0.4 08 1 2 1-6
0.0 0.4 0 8 1 2 1.6
Olslanrs (W")
Dislancs lkml DIsBnce lkml
3 . I.'inally, the siinulated norrual scorcs are li;ick-lrnr~sfor~trctlinto ~ I I I I I I -
lated Cd values (I,'ig~ire8.0, bottonl graplis).
Simulated normal scores
K r i g i u g will1 all oxterual drift ( K E D )
An alternal.ive i,o cthcal.ed cokrigi~gfor incorporating cxhar~slivelysa111plr4
secondary ioformatio~ris providcd by krigiirg wiI.11 an exl.erl~;ddrill. 'S'hc ircy
assnrn]~liorrl l r w is I.II;II., i l l 1.111: ilori~~;tl
sp;rcc, l.111, ILrc~tdo11.11v~ , r i r l ~ a variidh
ry
Yi is a linear f~irrctjonof t.he s~:cmrdaryvariilhk! V2, I P I I ~ C I I 111us1,
s~rroothlyin space:
l.I111r~[Ore viwy
__ 1 2
..,_-__
3 4
Distance (km)
., _
5
, ._~,
6
I Simulated Cd values
As n~entionedill srclion 6.1.3, it is critical to validate this assonrptioo either
from calibration data or by rising some physical rat,ionale.
Under the trend rirodel (8.1 I), the mean of tlre Caussial~ccdf xt ally grid
rtode 11 is identified with the KEL) csi,iinate:
8.3.3 Joint simulat.ion of ~nultiplevariables Proreed lo t,he liext ti& along 1,hc random pal.lt, and repeat the four
.
ptcvioos st,rps.
I . I)rfirlc it I~ierarcl~y
of variables, starting mit,h 1l1e mosl. itnportant or
better ;mt,wcorr~latedvariable % I .
s
/I
04 04
8.4.1 Accounting for a single attribute
00.. ._ _.,..
..-. ~.~ . -
00 04 08 1.2 16 (:onsider first the si~iiulatio~t at.trilrute z a t N grid rlodcs
of asingle cootinrlo~~s
uis,ance (k",)
u; conditional only 1.0the z-data {z(u,),a = I , . , , .
I si~nalationprocecds its follows:
S c r ~ ~ ~ c ~il~t ~i adli c a t o r
is
mL..
00 04
. _.
08 12 I B
.
00 04 08 12 $ 6
nir,anre j*,") D l l f m ~ sIkrn) Ulriancslln,,
20
Cd data 1. 1)eterminc the It' ccdf valnes [ F ( u l ;rk1(71))]. using ally one of the
16
i~idicat,orkrigirlg algoritlm~sint,roduced in section 7.3.2: simple
12
on
! or ordin;~ry indicator kriging, ntedian indicator kriging, indica-
t,or cokriging or probability kriging. 'Tl~ccouditioning infoma-
I
04 1 t,io~iconsists of indicat,or transfor~~rs
(and riuiforrlr transforms for
0.0 lxobability kriging) of ~lr~igllhorilig
origirral d a t a and previously
00 04 0 8 12 1 6 simnlatrd z-viil~~cs.
DiSlrnce (km)
Example
Consider the co~tditionalsirnulalion of Cd concentralio~rsalong the NE-SW
transect. Figure 8.8 depicts the niaitl steps of the seqoet~tialindicator ap-
proach, as follows:
1. T h e ten Cd data w e first t r a n s f o r ~ ~ ~into
e d ten vectors ofirtdicalor data
using the three tlireslrold values zh= 0.8, 1.38, and 2.26 ppm (Figure 8.8,
top graphs).
2. For each tl~rcsl~old value, t.hc indicator s e ~ ~ ~ i v a r i o g risi uinfcrred
i~ frou~
all data over the study area (Figure 8.8, third row).
3. Ccclfs arc deterll~inedusing ordinary irtdicator krigiug. At eacl~sin,- Indicator vectors
d a t e d location, the conditioning information col~sistsof the indicator
transforn~sof the five closest 1X data a d of the five closest previ-
onsly siri~ulalcdCil v a l ~ ~ o(two-part
s search). T l ~ eresolution of tllc t 1 1 1 1 01 to 1
discrete ccdf nlodul is increased by liuear it~terpolatiorlbetweeu t a b w l i t I t O l O O O
lated bounds obtained from the ten Cd d a t k Lower and upper t,ails arc
--~. .. .. .. .. .
0 1 1 10 00 00 0
rnodelerl using, respectively, a power ~rtorlcl(w = 2.5) and a ltypcrl~olic
model (w = 5).
1 2 3 d 5 6
Distance (km)
R e p r o d u c t i o n of nroclol s t a t i s t i r s
At each grid node, the indicator-based sininlat,ion can be viewed as a two-stcp ,Is1threshold 0.8 ppm 2nd threshold 1.38 ppm 3rd threshold 2.26 ppm
procedure:
1 . A simulated class-value is first a s s i g ~ ~ rtod the grid l~otleti', say, 11' t
(z&,, z ~ I ] .
2. A simulated z-value is tlmr d r a w from that class ( 3 , ; q . 1 usir~gSOIIIC
witlrin-class rlislribotion iuodel (e.g., a urtifortn d i s t r ~ l ~ o t i o ~ r ) .
Consequcnlly, indicator-basd algoritl~msguarant.ae (;q~prnxi~nittc)
tion of only the Ii class proporl.io~rsand corrcspor~dingi~~iliral.or
rcproduc-
scn~ivari- 5 , Simulated Cd values
ogralns yr(11;z k ) , uot r ( q ~ t o d ~ ~ c t of
. i othe
~ t cdf i u ~ dsi:~~~iviiriogra~n or 1 . 1 1 ~~ 1 1 -
tinnous z-valucs.
T h e actual ;ipproxinratiorr of one-point aud two-point z-st.;rt.istics by a
seqner~tialindicator realiaathn tlrus depends o n several factors, sucl~;ts t l ~ r
discrctizatioo level (nurr~brrof tl~resholds),the ir~forrua(,ionaccounbcd for
when perforniitrg indicator kriging, aud tlie i r ~ t e r p o l a t i o t r / c s t r ~ ~ ~ ~ ~mod-
~l;ttiorr
els used for increasing LIte resolut,ion of tlu: discrete ccrlf. i. . , , , --
7
I 2 3 a 5 6
Distance (km)
Figure 8.8: Sequential indicator sirnalatio~Alter coding the ten original (:d data
into indicator vectors, coeditioual simulation is performed usiiig ordinary indic;~tor
kriging and the three indicator scn,ivariograms slmwn i s tltr third row. 'I'lrc bottom
, , . . *.. . .. , 0 ,
Cd data
'J'l~cr m m a l scow b;~ck-tr;i~rsforrir, vvl~icliis all int.cgrirl pnrl of tlrc sGs algo-
ritlun, g c n i d l g lmds to ;I l>ctt.crrcproducLion of I,IK 2-cdf ~II;LIIdo indicator-
I,;~scd t w l m i q u ~ s .If t h i t t r t ~ l ~ r ~ d ~ ~isc t~.Ci oF I~I iI CI ~ B T S S S : I I ~ , the realizat~iot~s
can IN. p ~ ~ s t - ~ ~ o w snsit~g
s " c d I.hr ;rlgorit.llms inlrorlilccd l;ttcl. i l l srctiot~8.9.1.
Concer~lralion(ppm)
8.4.2 at'1 0 1 1
Accounting for secondary infor~n I 1 2 3 4
s2:Other rocks
5 6
Distance (km)
r,=O.BO ppnt
6
tairled using t . 1 ~hlarkov-Bayes algorit.lt~rl(solid line) and ordinary indicator Distance (km)
kriging not accor~l~ting for soft iuforrnal,iorr (dashed li~ie).'l'he satue rnrtdorir
path and series of rarrdolri nunrhers art: used i l l l,of.lt cases, so diffcrcoces bc-
twecu the two sitiiulated profiles orighnte ouly fro111tltc ;i~lditio~ial soft infor-
mation. Accoutrt,ing for geology yields s~nnllcrsimnl;rl.ed valucs on Argoviir~t
rocks, wlricli have sttrallcr probnbilit~icsof esce<:rli~~g tlic diff;:rcnt I.lircslrrrlds.
l e d rrodc A4 large
'l'lic cornputn6io~1aIcost of solvirtg ;it caclt s i ~ r ~ t ~ l agrid
and oftctt u ~ ~ s I . a indicat,or
lh cokrigirrg sysf.m~swit.11 (A! x N ) equat.ions
5. I'rorcrd 1.0 1111. next, node along t l ~ crand or^^ path, ;rnd repeat stkps
I 1.04.
~ ~~ . I I P standard normal RP
whcrc Cy(l1) is tlrc c o v a r i a ~ ~ fwr l ~ t c t i oof
Y(u), ClI is t,lw 7 1 x n ~ I i i ~ ~ - t o - d a t a c o ~ a matrix,
ria~ic~ Cz2 is the N x N
2. C:odc tlte sir~lolatcdvdue z f ) ( n l ) illto a vector of local prior proh-
rode-1.0-~torle covarinncc nratrix, and C 1 2 = C;l is t,lw data-to-node
abilities of t.y pe (7.25) for %2, [ l l a ( ~ ~I: ' ;z), . . . , l/a(nt;zriz)]. rw:triauw n~:ttrix.
triangular matrix: witlr the point-ccdf ~'(II;;zI(11)) being d e f i n ~ dit1 J poit~ts11; discr~tizingthe
block V ( u ) (recall section 7.3.2).
In the absence of hlock d;rl.a zv(u,) and c o r w s p o ~ ~ d ihlock
t ~ g sl.;ttist.ics Z I I I ~
I h c k indicat.or data, the 1,lock ccdf (8.13) can be numerically approxilni~terl
Clearly this step requires the dil~~ension
(n+ N ) t,o he stnall (lesser than hy the cninolativr distribution o f ~ n a n ysimulated block values r$'(u) (Isaaks,
10". 1'390; Cht~iez-Iicri~irr<lea, 19'31; I)r:utscl~ arlrl Jourrlrl, I!)!)Zn, 1,. 90; Clacken,
3. Generate a conditional realiaatiol~{y("(u;),j = 1 , . . . , N J as the linear 1'3'36):
combintttion:
<
\vitlr &')(,I; z ) = I if L(~)(II) Z, and Z C ~ Ootllerwise; A is l.he si~nulatiollilrea,
;tllfl A n A _ , , is the ilrlcrsection of area A wit11 its LranslaLioll by vector -11.
11)words, t,lic o ~ ~ c - ~ o and
i r r t two-point ccilfs of the s i ~ n ~ ~ l avalr~es
t r d ziO(u)
averaged over all possible locations 11 E A a p p r o x i ~ ~ ~Lire a t cstationary one-
point and two-poilit :-rdfs.
2 . A ~ ~ ~ ~ ~ - c r ~ ~ ~I,-fielrl
i l i t i rcaliaatio~~
o ~ l ; r l is t l m ~gcl~rriiterlin two-steps:
(I) I ( I ) = 1, . . . , N ] , arc g ~ ~ ~ c r a l11s-
ed
iug ~ i : ~ u c ~ i l (:;n~ssiar~
inl sillmlalio~~ arid t.1~: sc~~~ivariogr;rm of uliifo~lll
Lrarlsfor~iisshown in Figure 7.19 (p;~ge303, rniddlr graph), (2) l l ~ esirrr-
n l a t d y-values are then t r ; w s f o r ~ ~ ~into c d u n i f o r r ~y-values,
~ J =
p(')(u')
(:(11(1)(14)),j = I , . . . , N , wl~ereG(.) is thr: Gaussian cdf. 'The pfield
realiznt,iou is s l ~ ~ w i n ~Figurc
r 8.11 ( s e c o ~ ~row).
d
3. 'Tlli: sinlnl;ttrd p-values are used to s ; ~ m p l cthe couditional cdfs. For
e x a ~ r i ~ lthe
c , sin~ulat.t:d;-values a t l o c a t i o ~ ~11s; and 11; are, respectively,
0.46 and 1.7'3 ppm, c o r r e s p o ~ ~ d to i ~ ~si~riulnted
g l+val~les0.20 and 0.55
(Fignre 8.11, third row). 'l'l~e resulting sirrlulatrd profile is shown a t
R e p r o d u c t i o ~of~ the z-rlalkt values is ensured tlrmugh the ccdfs F ( u ;zI(n)). the bottom of Figure 8.11
Inilrrd, a t m y 11al~11ii I i ~ r a L i ou~e ~, t11c crdf is a imit-step f u ~ ~ c t i oidentifyi~ig
n
t l ~ cdatum value :(n,). 'Tlrns, wlmtcvcr the s i ~ r ~ ~ ~ 11-field l ; ~ t e dvalue JI(')(CI,)
R.cmark
a t that l o c n t i o ~ ~ ,
~ t i ; t l algorithm used in the previous examl~lcto generate
'rhe ~ ~ ! ~ u c ~Qaussinrr
tile p-field clot,s ~ m c;yjit.aliac
t on the fact that the pfickl is II~II-conditional.
C H A P T E R 8. ASSESSMENT OF SI'A'fIAL UNCERTAlNTY
Simulated Cd values
Final realization
After 398,000 swaps I 2 ,Realization semivarioqram I
O t l ~ c rrealizzttio~rs{ r ( " ) ( ~ ~ =
~ )1,, j. . . , A ' ) , I' f I , arc generated hy repealing
1 . h ~c~rtircprocess stnrti~lgfro111dilfcreut initial ilnages. 'I'ypically, the number
of nodes N is so largc and the sen~iv;uiogrnmis so little coustraiuing that
tlrcrc rxisl. Itlatry so111t.iu11st,o the ol~t,inrizatiouprobletl~. TIICreali'ations
dr:twu a r c sa~nlilcsfroin l.lm1 w t ~ f a ~ ~ ~ ~ r osol~~tiotm. xi,i~i~l~
Si~rn~laLcd nu~realingis roncrptrdly sinlple and ollers great flexi11ilit.y to
accounl for various constraints built into the obj,jecl.ive fw~ction. T h e Op-
Figure 8.12: The si~r~ulated ansealing algorithm. An initial random image (top tin~iaalionprocess, l~owevcr,relics 011 brute force, CPU-intct~sive,trial m d
graph) is gradually rnodilicd by swappiog pairs of values so to achieve reproduc- error to gracl~~iilly acl~ieverel)rod~~ctiou of 1I1e target stat.istics. Itrrplenreltla-
tion of the Cd sernivuriogran, model (bottom graph). lion Lips tlws play ;in essential role for the successfid applicatiou of sirnulatcd
anncirling; see the next section and ffcotsch and Journel (1992h), I)el~bsch
and Cockerl~arlt(1994).
412 CIIAP?%'H 8 ASSESSMENT OF SI'AA'I'IAI, UNCEIlY'RINTY 8.7. SIMULATED ANNEALING 413
8.7.2 Implementation tips Such a perturbation rncchanis~nallows one to keep ~lnchangedthe his-
togram of the initial image. Thus, there is no need 1.0 inc111derrpro-
There are rriauy possililc iri~plementatio~~s
of the general sini~~laterianueali~tg
ductioo of the histogram in the objective f~rnctionas long as llre injlial
paradigm. Variants differ in the way t,l~einitial image is generated and then
perturbed, in the corr~po~ienl.s that enter the ol?jcct.ive functinn, and i n t.lrc image already matches the target histogram.
type of decision rule and convergence crit.crion Llmt :art, ndoptccl. 2. Sclect ralldonrly ;r single locatiolr IIJ and n~odifyt,l~r: corrcspondi~~g z-
value z ( ~ _ I ) ( u ; )accortli~igto sonie niec11;roism; for cxample, the t,arget.
T h e i n i t i a l image l~istograniF ( z ) is sarrrpled anew for a different value:
Simu1att:d ; i n ~ ~ ~ ! artxlnires
l i ~ ~ g the prior d ~ ~ t e r ~ n i ~ ~~fa tan
i o rillilia1
~ i~mge,'
{zin)(u;), j = 1, . . . , N } . T l ~ a prior
t dcternlination shoold lxr such that,:
(2) .'ir~rzi~im.roq~~o~t~
irtodcl
1t.eproducl~i011of tlrc z-sc~nivariogra~r~
u ~ o d e l~ ( 1 1 )is geuerally linrited to a
specified nuurhrr S of lags, f h e measure of the dcviat,io~~ bel\vecn target and
current semivariogranr values is
wlrere $')(lr,; rk) is tlrc indicalor semivariogrnnr value a t lag 11, and t.lrreslrold
zr for tlrc roalizntio~~ a t the it11 perlnrhalior~.
S c y ~ c r l t i n il n d i c a t o r s i ~ ~ l u l a t i o n
Consider Ll~eS ~ I I I I Iof IA ~ Ispat,ial
Llrc I distril~~tlionof K niutnally exclusive
ralcgorics si co~rditioni~l to 1116: data set (.s(uCt),cv = 1, . . . , i t ) ; see sttbscqnent
d o l i t ~ i l i o ~Scqocnt,ial
~. indicatirr sirr~ttlationof cat,cgorical vxriahles follows a
pr~~cmlorc sit~iili~r
Lo l.lmt d~scrihcdi t t scct,io~~8.4 for co~~Linr~ous variables:
'li.nltsform tach cat.egoricnl datruti ~ ( I I , , )into a vcclor of li 11nrd indi-
cator dnt,a, d t ~ f i l ~ c;rsd
Sequential i ~ ~ d i c a l osin1111atiorr
r (sis) is used to s i ~ ~ u r l ; ~
tila
t o slr;tt.i;rl dist,ri-
b u t i o ~of~ the few lar~d~ ~ s over c s tire study area condit.io~ialto ilrc 50 rlnt,;~
shown al, the top of IVigure 7.42 (page 355). 'l'l~eco~~ditiooal pn)l~ai,ilit,ics;rrc
determined using ordinary indic;rior kriging aurl tlrv indicator s e ~ ~ ~ i v a r i o g r a ~ ~ i
rnodt:ls displayed a t the b o t l o n ~of Figure 7.30. At, each grid nodc, t l ~ cc o ~ ~ d i -
tioning i ~ ~ f o r r n a t i consists
o~r of Ll~e12 closest la110 usr data and the 12 closest
previor~slysinn~latedvalnes. Forest x Pasture Forest x Meadow Forest x Tillage
Figure 8.10 sl~owsone realizatio~~ (top graph) and tilt: correspouding dircct. 001 E, 00
g ...,' .., . ... .
and cross i ~ d i c a t o rsemivariogra~~rs. Note the following: s
d 04
?'he rcaliaatio~~
depnrls significantly from t l ~ et;rrgrt sanrple proportions X
e
(4, 58, 18, 20%). For correction of s i ~ c del)art~orc,
l~ s w sc,ctiorr 8 9 . 1 0
0 " , ,
00 0 08 $ 2 1.6
,,lb,o,lcC il",)
I
Realization semlvariograms Realization cdfs
I
where r j O ( n ) is tlw value corrected from tire original sirtrolated valne z(')(ll).
0.0 0.4 0.8 1.2 1.6 0 1 2 3 4 5 6 The set of corrected valnes { z ,01(n),51 g A ) identifies (in expected value) the
Distance (km) Cd concentration (ppm) target cdf:
Figt~rc8.17: kYucf.satio~w
it, tlx roproductioa of i.hr targct sc~nivtrriogrammodcl
and target cdl (solid liw) by the threc raaliz;rtions of Figure 8.2.
since t , l ~ uniforn~
c scorc l.'(')(Z(')(u)) is, by definition, nnifor~~lly
distribute(1.
428 GIIAP'I'FR 8 ASSESSMENT OF SI'A'l IAL UNC1<117'AIN?'Y
Realization sernivariograrns
As the locatio~j11 gets f a r t l m ;tway froin data Ioc;tt.ioos, the krigi~ig 12 1
variance &(II), and hence the i~itcnsityof the corrrcl.ion contnjllcd hy
the factor X(ti), increases.
r The ratio u,~(u)/u,,,,, is suialler tl1a11 1, so the rorrcctio~lf;rctor X(u)
increases as w decreases. Tllc parnlneter w allows a lmlance l)el.rvec~~ Lllc
two extrtxne cases of correctio~~
a t all non-data l o c a t i o ~ ~(case
s w = 0)
and no corrcctiotr (rase w = cu).
0.0 0.4 0.8 1.2 1.6
A sirriilar algorit.liu~has been developi:rl for imst-processing realiai~tionsof Distance (krn)
categorical variables, T l ~ eobjective is to reproduce $.lietargct proportion of
each category while lionoritig conclitioui~~g data without signilica~~t,
modifica-
Figure 8.18: Sequenlial indicator reulisation that departs from tlrc target cdf be-
tion of class-indicator sernivariograms. fore (Icft cohlrnn) and alter correction (right colame) uemg w = 2.0. 'The correction
Figure 8.18 (Icft top graph) shows a rcaliaatio~~ generated hy seqncntial improves the rt?produt:lion of the targct cdf w l d e preservi~~g
the sltape of the senli-
indicator si~nulnt,ionusing only five tl~rrsholdv;~lues.The (2-Q plot showti i ~ i "nr,",.~am
Allocate the node 11 to the cadegory s t associated with the smallest
, %
. value of t l ~ eobjective function
I'roceed t,o t l ~ c11ex1,tiode along the rantlotr~path.
l l ~ epost.-processing was stopped after 10 iterations wlren the proportion of
changes per iteration was found to be less tlran 1% of the total nunher of
grid nodes.
Figure 8.1'3 slmws the post-processed rr;rlizatio~~ and the correspo~~ding
13el.I.iv n:protlucthii of ~ r ~ o d sl:rtist.ics,
cl not lin~itcdto t.lre ~ r ~ ; a g i n ailistri-
l ! dircct and cross indicator semivariogr;mis. Sin~rdatedan~realingimproves
h u t i o ~ ~ran
, illso be ac11icvt:d 11y post,-processi~~g I.hr rcaIizal.ioti rviI.11 un;ic- the reproduction of the target global proportions (4, 58, 18, 20%) and indi-
i
cq)I.;hlti fl11ctoi8Lions i~singsir~~ulal,ed ari~~ealing i~~l,roducerl i t 1 sectiou 8.7. cator s c m i v a r i o g r a ~rnodals.
~~ Significaut, discrepancies still remain between
Act,ually, s i ~ r ~ u l : ~ tacndn e i ~ l i ~can
~ g be used urrt o111yto i ~ i ~ p r o vrcproiluc-
c
I ~ n o d c land realization cross sen~ivariogran~s, for example, between pasture
I
tion of currr:uL statistics b111also to inkpart IICW 1,ropcrtics to tlte re;~liz;tl,io~~, I and meadow. Such departures may he dun to an inconsistent target core-
such as ~ r ~ ~ ~ l l . i l ~ l rst.;ttislics
- ~ r o i i i t (I)t-r~bs<~l~
;tnd . l o ~ ~ r ~ 109211;
t r ~ l , Murray, 1!1!)2; gio~~alia:rLiollmodcl, Ikcall t11;tt the direct nnd cross senrivariograms were
Goovacrts, 1!106). i~iodeledinrlependcntly without cl~cckingfor the posit,ive semi-definiteness of
C o n s i h , for example, post-procvssing of t l ~ ,land use si~~jul;tfrd nlnp t h e full matrix of cross covitriance functions.
sl~owna t t l ~ etop of Is'igure 8.16 (page 425) wit11 trvo objectives: ( I ) 1.0 i n -
prove reprod~rctionof the I i target class proportions pr and direct sernivari- 8.9.2 Visualization of spatial uncertainty
ogriuns 7, (11; SI), and (2) to i ~ ~ ~ p r rcprotluction
ovc of i.11ecross s c ~ n i v a r i o g r a ~ r ~ s
-/r(ll; sn, sk,) not scco1111l.ct1 for 11y t l ~ eoriginal sequential i ~ ~ d i c a t ao lrg o r i t l ~ n ~ . T h e set of a l t ~ r n a t i v crealizations generated by sl.oc11astics i r ~ ~ u l a tprovides
io~~
,, it nlr:itsurc of uncr:rt;tinty ;rl)out the spatial distribulio~~ of attribute values.
J Irc ~:orrnspondingtwo ol>jeclivc fwctiou c o ~ ~ i p o ~a~r ce ~ ~ t s 1 'Ibdepict visually that uncertainty, several autl~ors(Sriv:~st,;rva,1994; Wang,
1904) have developed algorithms that slrow the rritliaatior~soue a t a time in
rapid succcssion, like the f r a n ~ e sof an ;u~in~al.ed carttmi, say, eight realiza-
lions per set:ond Like ;In animated cartoou, s~~cccsnive realizations must. be
siinilar enongh l,o allow t l ~ eeye to catch grai1n;tl clm~ges.Such a similarity
call be aclrievcd by ranking the realizations appropriately (Wang, 1994) or
by using a si~rrdationalgoritl~rr~ tlrat generates r e a l i z a t h ~ sthat are incre-
mentally diffrrent (Srivastava, 1994). The animated display of realizations
allows one to disti~tguisllareas that re~rrairlstable over all realizations (low
uncerLainty) from those where large fluct,~~ations occur betweell realizations
st and s n , a t lag 11, c a l c ~ ~ l a t (from
d tire realizatio~~
a t the it,h pcrt,urbaLivn (high u~~cectaint.y).
Ueca~iset l ~ crcalizalion s t ~ o win~Figure ~ 8. I F already r e p r o d ~ ~ c fairly
es well 'I'lic series of L realizatio~~s call also he posl-processed and the u~icertainly
global prtrporlions and indic;~torscmivariograms, it stronld not be conrplett4y i ~ ~ f o r i r ~ a tsumnrariaed
ion using different types of displays, as follows:
randonriacd hy ;~cceptingt,oo inmy u~~firvorablc perturbal.ior~sa t tlrt: 1 q i 1 1 - I . I'rol~abilit,ynkaps.
ning of the post-[)roct:ssi~~g.' ~ ~ I I I I s , t l ~ erei~liaitl.io~~
was post-processed using At tach si~nulatetlgrid node I I ~ ,the probabilily of exceeding a given
the following MAI' algoritl~rnthat retains t l ~ optvlurbation that diminisl~es Lhresl~oldz k is evaluated as the proportiorr of the I, simulated values
most thc value of t l ~ cobjective function: z(')(n$) that exceed that threshold. T h e rnap of sucl~probabilities is
1. Co11111rttc1h1: vnl~~r,
r ~ fI.lw objcctivr f r u ~ c l . i ofor
~ ~1.l1eillilia1 rralizat.ion referrcd to ;IS a probabilit,y nrap. For exa~uplc,Figurc 8.20 (top graph)
sl~ows1 . 1 ~pnhabili1.y ruap for cxceediug tlw critical Cd con cent ratio^^
2. I'hr n spt:cilii.d ~ ~ u ~ n of
b citt:ralions,
r follow tlicse steps: 0.8 11pr11. 'l'lral I I I ; ~was ~ deterrnit~etlfrom 100 sGs realizations. T h e
two low-v;rlimI zoncs correspoud to Argovian rocks where smaller Cd
L)t!fi~~c
a randor~ipat11 t l ~ a visits
t all IIOII-cor~dibiomctl
grid nodes. c o n c c n t r ; ~ l i o ~wcrc
~ s mtvisured.
At each node 11, consider all K possible calegories and conrpule
I.he values of tljr corresponding I< ot~jectivrfnnctior~s. 2. Quantile maps.
R a t l w than displaying tlrc probability of exceeding a particular threslr-
8.9. MISCELLANEOUS A S P E C T S O F SIhfULATION
432 C H A P T E R 8. ASSESSMENT OF SPATIAL U N C E R T A I N T Y
Tillage (5%)
Meadow (58%)
Pasture (19%)
Forest
Forest (18%) I 2
"
-
t
on
1 0 4
rn // 0.1-quantile map 0.9-quantile map
Meadow Tillage
area
3 . Maps of sprt,;~d.
120cal~ l i I l ' ~ ~ r ~I)I%W(X!II
: r ~ c ~ ~rmlizal~im~s
s rxn I><! dcyict<!<ll ~ yI I I ; I ~ I I ~S<IIII<,
~II~
Iucaslire of I h spretid of i.he distril>~~t,ir~ri of Ll~e1, s i ~ n l ~ l a t cv;rlues
d al
car11 s i ~ ~ t u l a t cgrid
d node; for exanrplc, t l ~ rlocal entropy, tlre varia~ice,
or t,he i ~ ~ t e r q o a r t irange
lr of the local ccdf, as defined in sect,ior~7.1.1
lC'or exatr~plc,Figure 8.20 (bottom rr~aps)sl~ows rrlaps of local clr-
tropy and inti:rquarl.ilc range det,crniincd from the 100 sGs realizatio~~s.
Uuth ii~erpsindicate grcaler certainty (liglrt,er grey) or1 Argovia~rrocks
where Cd concent~rationsare consistently snrall. T h e t i t i c e r t , a i ~ ~ist ~
greater where large and niedi~rmCd couceirtrations are i n t e y ~ n i ~ ~ gor lrd
in the west part of llir stndy area wlrrre data arc sp:rrse (see Figure 1.1,
page 5).
Mean: 4221
Std, dev.: 392
Chapter 9
3000 4000 5000 6000 3000 4000 5000 6000
cost cost Summary
Figure 8.22: T w o distributions of costs nssoci;tted wit11 a wrong dvrisiw to dccliirc
the study area safe with resprcl lo Cd. l'hrse distributions are obtained by post-
processi~~g 100 realizations of the spatial distribution of Cd values generated using
either the ser~rtentiitlGaussian or the seqecnti;,l indicatur simnlation algorit1,m. Chapters 2-8 covered the sequence typical of a geostatistical study, beginning
with exploratory data analysis, the11 quantitat,ive rnodeling of spatial conti~i~r-
,~ ity, prediction of attribute va111esa t unvisited locations, and, last, assess~~lent
l11e relative precisio~ircsullil~gf r m ~eacli si~irulal.ioni ~ l g o r i l l is
i ~r('i(dily
~~ of local and spatial r~rrcertai~~tyabout nnsan~pledvalues. 'l'liis liual clmpter
msessed by rr~cas~~rirrg the sprend of tile o t ~ t p u tdistribuliol~,which is oft.cn provides a synopsis of previous cllapters and points 0111 t.opics that i1est:rvf:
referred to ils the spnce of uscer-tnisly. 'I'l~cextt:ut of this space rlcpcnds or1 fnrt.11er attention.
the partLxdar algorithnr used to gol~eratetlru realiaatiol~s.For exarlrple, Fig-
ure 8.22 shows t l ~ eproha1)iliLy rlistribuliorrs of the cost. (8.2) c o ~ r ~ p u t efromd
d two different algorit,l~~irs:
two sets of 100 realiaatiol~sg e ~ ~ e r a t cby squc~~tiirl
Gaussian and seqrrential irrdicator sitii~~lation. l'rccision also depen~lson tlie
transfer f~mctionhciug applied 1.0 the realizations. h r t . 1 1 ~s a t t ~ edath st,(,, C11aracl.eriaation of any region requires a prior collection of data a t specific
d e l ~ n d i n gon t.11e transfer function and the specific respause value r c l . ; r i d , locations witlril~the region. Irr t l ~ i sImok, one consirlcred 1 . 1 ~ : s i t ~ ~ a t wl~c:r<rio~~
different algorithms can be best. Note tliat c o ~ ~ d i t i o ~ ito i u gmorc i~ifortri;tt.io~~ data lravc already hecr~collected alrd t.he main cl~alleugeis to i~rfcrstatis1,ics
docs not neccssi~rilyrcdllce Lhc space of rlllccrt;iilily. Ilrcorporitti~lgaddit,ioli;tl representative of the study area fro111data that iniglil have been prefcrel~tially
information t,liat conflicts rvit,l~erlrrettt. r l a t h ntay iucrrast: tho urmr~.t,;ri~ily ill sauipleil. 'I'he problem of designiltg si~rnpli~ig S C ~ I ~ I I I Chas
S not I)WII ;rddrrsscd
the response v a r i a t k lnformat,ion about 11ow geostatistirs can be used i n sarr~plirigdesign is give11
in Wehster and Oliver (1900, p. 272- 200).
'The ~~nrriber and spatial collfiguration of data are controlled by several
factors, sucli as tlie available teclr~rologyand resources, Llie accessibility of
some a r e a , t.11e need for s m ~ p l i n gmore densely arens dcen~edcril.ical, the
con~binat.iol~ of different rircas~~rement dcvices i~sed,ctc. T l ~ esnllipling str;rt.-
egy sl~ouldalways be clearly docrr~l~e~rted because it ir~;ryguidc or skcw the
results of thc explorat,ory data arlalysis.
Whcri dealing with attributes that vary over tiliic, such its p o h t a l l t
cor~centrations,spatial data sliould bc collected wit11i11a short time to avoid
rnixi~igspiif,ialand tenlporal fluctuations. If the study pursnrs dilli~rcrrtolijec-
tives tliat call for differeut sar~il,lil~gstrategies, tlrr s ; i ~ ~ ~ p l i t ~ g s r sl~ould
l l c ~ l ~bc
e
dcsigr~edto sirr~illtn~~eously aclrieve these objectives. For exaniple, the J u r a
data were collcctr:d using a c o ~ n b i ~ ~ a t ofi o n;L ilcstcd and regular s;t~npli~rg
sclicnics, which allowed r:l~arirctwiaation of sl~orl-raugevitri;il,ilily of r n r h l
r....: -..-... 1.:). :.I: ....:rill,, ,*., ,A. r...l>r"n,.r ,.., ,,,',,\".inr,.
E x p l o r a t o r y data :tnalysis under study. Suhscquent applicatiorrs, srtcl~as predictio~~ or assessn~entof 1111-
,, certainty, rely on t l ~ i sparticular mudel of spatial variability.
Ilrc sccond step c o ~ ~ s i sof
t s gett,ing familiar wit11 tlre data using descriptive As emplrmierd in Clraptrr 4, sen~ivariogra~r~ rrrodelitig is not simply an
f.ools, such as ii lociitiot~map, I~istograni,scattergram, or 11-scattergral~ras exercise in fitting curves to experimental values. Irt~portantdecisions rcgard-
introduced in Chapter 2. Exploratory data ar~alysismay indicate the exis- ing the ntnnber, type, and anisotropy of basic semivariogrir~rrmodels inust be
t.encr of several populations with sigr~ificatrtlydifferetrt features. 111 s u c l ~a nrade by the user rather tlral~left to some automatic-fittir~g algorithm. T h c
case, one slrould consider splitting the data. into more l~omogent~orls sul,sets user's expertise is particularly critical whenever sparse data and nreawremerrl
prior to statistical atmlysis. Such subdivision ]night not be possible becintse errors lead t,o noisy s;mrple seriiivariogranrs.
of t.he lack of data or the inability lo delineate the different popr~latior~s in Modeling a c o r e g i o ~ ~ a l i z a tisi omark
~ ~ difficrllt because the different direct
the field. and cross senrivariogratrls cantlot be ~rrodelerlindependel~llyof one another.
I'rior i~tfbrtuat'ionabout the range of attrilmte values, tlicir cross r e l a t i o ~ ~ s T h e corurnonly usetl linear model of coregionaliaatior~is reasorrably flexible
imd distrilmtions in space slrould be usctl rvitll t , l ~ rdata to build solnnlitry aud allows all easy check of the positive sern-definiteness condition. 'fire
st;tlislics, suclr as tnmtrs, varimcvs, correlat,io~~ coefficie~~ls,and srmivari- consistency constraints become very curnbersorrrc w l ~ e ~many i attribntes are
ogriuus. fk!w:~rc,Illat s;ul!plc sL;rlist,ics are ;~n):cl.edI)y spatial c l ~ ~ s t e arid
rs consi~lcreil;tllogcllrcr. For more than two ilttrihotes, interactive grapllical
c x t r c n ~ evalncs. If there is no stroug pliysical reason for discarrlitrg cxt.rerne progrmls it~clndingsen~i-ant.o~n,ztir fitting irlgoritlrms should bc considered
values or possibility to treal thcnr separately, their irrfluet~ceco~tldbe redr~ced (see Appendix A).
by all ;~ppropri;at.f!t~rans~ortn:tliorr of data or by using more rol)usl st;itistics.
When specific subareas arc preferentially santpled, the global reprcsetrtativ-
iby of sample statistics slronld be qoestiorred, and the declust,ering tecl~l~iqrrrs
inl.rodnced i l l scctio~r4.1.1 sltould he consi<lerc<l.
Locnl e s t i m a t i o n
' l i ~go l>eyol~d t.lw dat,;t ;rnd i4111:tt.ealtril)oI.c valucs at, r~nvisitcdloc;~tio~rs, :i In (3apt.cr 5, thret: v;triants of the li~mrrregression algorit11111(simple krig-
r ~ ~ o &o lf variability i l l s l m a I I I I I ~ Lbo csl:thlisl~ed.Our in~pcrfc:ctkr~orvledgeof i ~ t gordit~iiry
, krigitrg, ;tnd kriging wit,11a t r m d ) arc i~~t,rociuced for predicting
l ~ o wphysical processes oper;itc alrd interact over the st.udy area i~sn;illyyre- valtres of a sit~glcattribnle a t ut~sa~nplerl locatio~rs.Explicit ~nodeliugof local
cludcs any dctcrrnit~ist~ic i ~ r o d e l i ~wl~crcby
~g a siuglc and dcctr~rdexact, value trends is typically necded o~llywhen tlrc locatiorr lxilig estilnaled is ontside
is associ;iterl with tach ur~siirnplerll o c i t h n . As all alternative until dctcrtr~i~r- t . 1 1 ~gcograpltic range of da1.a ( e x t . r a l d a t i o ~situatiori).
~ 111 iutcrpolatio~rsitu-
islic nrotlels I m o m e avail;il)lc, l.hc randorrr fu~tctio~t (IZF) nod el is introtltlcetl ations, ordinary kriging ilnplicitly rescales the mean value within each local
in Chaptcr 3. 'f'l~crandom 111nct.ionallows nmlcling of spatial variability in scnrcl~neighborhood.
tllikf.it yields a t each n~~sctrr~plcd locatiot~a probability distribtttio~tof possible Block kriging illlows for ostitn;~tionof nttribut.~valncs liuertrly averaged
vnlues rol.l~crl l m i a siugle v;due. over supports (segrne~lt,area, voln~rre)IIIUCII larger than data s ~ ~ p p o r t 1%- s.
One i~uport.;r~rl aspect of the KI1 approach is tlie c o ~ i ~ of p tslalio~larity, ware that arith~nelicaverages of data are not the sarllr as physical averages
which dlows hferena: of statistics, s u c l ~as the mean, covari;wcc, or senti- if tilt: ;tttribrtt,t: does nol, avcragp linearly in space, for a x a ~ ~ ~per~ncnl>ility
ple
variogran~,by p o o l i ~ ~rlala g over areas dertr~edIrt~nlogelreotrs. It is worth
- or p11.
recalling t l ~ i stalionarity
t is r ~ o ta c1mracterist.i~of the pl~ysicalp h c ~ t o ~ r ~ e t ~ o n All interpolat,ion algorithnrs tend to smooth out local details of the spa-
under study, rather il is a u~odelirtgd e c i s i o ~tircded ~ to specify the piimnie- tial vnri;hility of the attribute, leading to overest,in~atiouof srnall values and
ters of the RF 111ode1uscd. Decause such statiorrarity cannot be proveu or underestinratio~~ of large ones. Such s t n o o t l ~ i ~depe~~cls
~g on the local data
~ e f n t c dfrom tlie data alone, lrowcver, the decisioir of stationarity, like any configtlration: high-frequency components are progressively filtered as tlre lo-
model drcisiou, can be d e c ~ ~ inappropriate ~ed if its conseqnences do not allow cation being estimated get,s f x t l ~ c raway from data locations. Maps of krigillg
one to reach the goals of the study. estimates appear artificially more variable in densely sampled areas than they
do where data are sparse. Building on that limitation, kriging analysis al-
lows one to lilhor one or more spat.id conrponcnts of the setr~ivariogralllor
S t r u c t u r a l an;rlysis covarial~cctnodel, resulting in n ~ n p sof low- or 11igl1-frerl~~e~~cy conrpo~ients
One major step in any gcoslalistical aualysis is to build a licit scmivariogra~r~ of spatial variability. Such maps can he used as exploratory kook to detect
(covariance) 111odc1that captures the relevatit spatial reatures of the attribute areas that dep;trt sul~stant,inllyfrom a regional backgronocl.
440 CIIAP'1'1<ft I) S U M M A f1Y
proportional t.o t.he nurnlier N(1ik) of pairs used ill the cst.i~natc
Algorithm
T h e least-squares fit of a linear model of coregionalizatiori is more dernanding
than in the univariate case. The:iiffic~$y lies in the coristraint that tire
matrices of esti~rratedcoefficients Dl = [b;] must be positive semi-definite.
T h e idea is to start with a set of arbitrary coregionalizatiorr n~atricesI31 and 6. Define tlic new index riumlm lo = l o + l (lois reset to zero if lo > I , ) , nnd
to nrodify one matrix a t a tirrie iteratively so as to minimize the criterion repeat steps 2 to 5 until the criterion WSS is smaller tlrm a I.hresliold
WSS under the constraint of positive semi-definiteness of that matrix. Tlte v;dtre specified by tlte user.
algorithm proceeds as follows:
B.l Acronyms
ccdf: conditional cutnolative distri1)ulion firnctiol~
df:
ctruiulativc dist,ril>t~l,ion
function
P C K : principal component kriging C ( I I , ~ I ' ) :non-stationary covariance between the KVs Z ( n ) and Z ( u f )
P-I' p l o t : ~ m o l ) a l ~ i l i t y - l ~ l - c ~ I ~plot
i~I~ilily C = C(O): variance cov;iri;utce 111a1,rix
L
(2-(2 p l o t : <lun~itilc-qn;r~itilc
plot
C(h) = b'cl(h): l i ~ ~ c model
ar of rcgionaliai~tion
R.F: random function 1 ~ 0
C,,(li): sl;~l,io~iiiry
cross covariaucc bet wee^^ l l ~ etwo It 1)s %,(u) ;uid %,(11)
RV: r a n d o ~ ~varinl)lc
r
for lag veclor 11
S C K : simple cokrigiug I.
Cij(Ir) = b ~ j c r ( l ~h) e: a r nlorlel o l coregior~alia;tt.i(~ti
s G s : scqut!~~lial
Gaussian sin~ulatio~r
k 0
B ( z ) : Markov-Uayes calil~r;itio~~
paran~t:ter
G ( u ; zl(n)): norl-statio~~ary
conditional cutld;itive distribution functio~tof
the standard normal RV Y(u) given the i n f o r o ~ a t i o(n)
~ ~ available, fur
rxnmplr, rcaliaahio~mof n othcr nrigl~boriogHVs 1;(11,,)
X,,(u): cokriging weight associated to zi-datom a t locatio~ru, for estima- p ( u ; $I): yrobal~ilityfor tile category s t to prevail a t lot.1' I,IOlL
' 11
tion of the attribute z a t location 11
p(11; sr/(n)): c.orrdiLional probability for the calegory s t t o prevail a1 localior!
XZK(u): simple kriging weight associaled to z-datum a t location 11, for u give11 the neigbhorirrg iitlbrn~atio~r (it)
estimation of the attribute z a t location 11. The same type of notation
applies to other algoritllms, for example, OK, KT. p t : global proportion of category st within the area A
X2CK(l~):siritple cokrigi~rgweight associated Lo zi-dato~na t Iocaliou u,,, p,: p-q~~atrtile distribution fuoctiou I"(r), q,, = I.'-'(p)
of tlic c1~1~111ittive
for estimation of the primary attribute zl a t location 11. 'The same type
of notation applies to other algoritlrnls, for example, OCK
X:g(u): OK weight associaled 1.0 z-datum a t location 11, for cstir~ratiu~t
of
the trend conrponent m o x ( u ) . The same type of notation applies to
KT arid ICED.
ll(11)
(I,
-+x : ~ ~
Zi(i1): ~ l m o n ~ p o s i l of
i o ~tile
~ n:siclual cotr~po~lnlrt
trrodel iuto
1) spatial cornponcnts Z1(ll) corresponding Lo 1,lrr ilecortrposition
m: stationary mean of the ILF Z ( u ) of the residual covariance C n ( h ) = El=, L
bicl(1i)
rn: vector of stationary ~ n e a l ~(dilrtensron
s Nu)
~ O K ( I I )Lagrange
: parameter for ordinary kriging a t location 11. T h e same p r ( h ; zn): statiouary indicator corrclograr~~ for lag vect.or ir m d t.lrreshold
type of notation applies to other algorithms, for example, KT and KED. zt of the R F Z ( u ) ; it is the correlograni of the binary indicator RF
l(11; z t ) .
pl(ll; zr, zn,): st;at.ionary i~idicalorcross corrclogr;ml k ~ lag r vcct.or h and
N(11): number of pitirs of data values available at lag vector 11 tlireslrolds rt nnd zt, of the 1l.F Z(i1); it is the cross mrrrlogratr~hrtwecn
t,he two illdicator 1Ws I(u; zr) and I(11; zk,).
N u : nutrlber of varia1)les Z,
P&(h;z;l, rjt,): sl;rt.iorr;try indicat,or cross corrclogra~iifor lag vector 11 and
n: number of data valr~rss(u,) or z(n,) available over tllr arc;, A tlrresliolds zit and zjrr of the R.Fs Zi(11) m i l %;(11); it is t.ltc cross
71(u): nurr~berof data values z(11,) used for estimation of the ;~ttribatcz a t correlograltt between the two i~rtlicatorllFs l ( u ; zit) and I ( n ; z,t,).
location u S(II): generic categorical RV a t location u , or a generic categorical 111' a t
location 11.
ni(11): number of data values zi(11,) used for esl.itl~atiol~
of the a t t r i l ~ n t er
a t focation 11 S ~ I I ( ; ) : spherical se~rrivariogratnniotlel of range a, a fimction of the scpa
ration vector ti
vi,(h): statioriary codispersion coefficient between vz~riahles&({I) and Z,(II+
h) separated by lag vector 11
B2 COMMON NO?'A??ON
~ ( I I , ) : s-(1ntu111value a t l o c a t i o ~11~, zit: ktlr threshold value for the continuous attribute ti
n Z x ( o ) :silnple kriging variance associated wit,l~t.lrr sinlplc kriging estiniatc ~'(11): an estimate of value z(u)
Z;lc(u) a t Ic)c;itio~r11. The sarne Lype of notation applies to other
a l g o r i l , l t ~ ~for OK and K T
~ s cxa~nplc,
, t i K ( u ) : sinlple kriging estimate of value z(u). T h e same type of notation
applies to other algoritlnns, for rxa~nple,01< and K T
z ~ & ( o ) : simple cokriging estimate of t.he prirnary attribute 21 a t location
u. T h e s a n e type of notation applies t,o other rnoltivariate algorithms,
for example, OCIZ.
z;;(u): I?-type rstirrrale of value z(u), obtained as an aritlrrnetic average of
n~oltiplesintulal.ed realizatioris z(')(u) of the Rl*' %(XI)
!=
; y-'(Y): inverse transform fnnctiol~y ( . ) relating two ltVs % and Y
~ or a generic contil~uousILF a t
Z(n): generic continuot~s1tV a t l o c a t i o ~11,
locitt.ion 11
,%
,; (11): sin~plekriging estimator of Z(II). 'l'he sanle type of notahion ap-
plirs t , ot,lm
~ algorith~ns,for cxanrplr, OK and K T .
Dubrule, 0. 1983. Two ntethods with dilferent objectives: Splines and krig- Gdmez-Herniudee, J . and A. G . Journel. 1993. Joint sequential simulation
ing. Malhernattcal Geology, 15(2):245-257. of ~nultiGaussianfields. In A. Soares, editor, Geostatistics Trdia '92,
Edwards, C. and D. I'enney. 1982. Calculus and Analytzcal Geometry. volume 1, pages 85-94. Kluwer Academic Publishers, Dordrecht.
Prentice-Hall, Englewood Cliffs, N.J. G6mez-llerndndez, J . and R.. M. Srivaslava. 1990. 1SIM:iD: An ANSI-(;
Englund, E. and A. Sparks. 1991. Geo-EAS 1 . L l Geostnt~stical1S1rvz- three-dirnensio~~al mult,iple indicator condilional simulation program.
ronmental Assessn~enlSoftware, User's Guide, EPA lieport # 60018- Con~pz~ters& Geosciences, 16(4):395-440.
91/008. U S . EPA, EMS Lab, Las Vegas, NV. Gdrnea-Iietnindea, J , and X. Wen. 1994. To he or not to be inultiGaus-
Farmer, C. 1988. The generation of slocllastic fields of reservoir pararn- siao? T h a t is the question. In Heport 7,Starrford Center for Reservoir
eters with specified geostatistical distributions. In S. Edwards and Forecasting, Stanford, CA.
P. R. King, editors, Malhe~nnlicsin Oil Production, pages 235 252. Goovacrts, 1'. 1992. Factorial kriging analysis: h 11sef11ltool for exploring
Clarendon Press, Oxford. the stroclure of multivariate spatial soil i n f o r ~ n a l i o ~Joart~nl
~. of Soil
Farmer, C. 199'2. Numerical rocks. 1111'. R. King, editor, The Mnfheaznlical Science, 43(4):597-~619.
Generation of Reseruoir Geology. Clarendon Press, Oxford. Goovaerts, 1'. 1993. Spatial ortl~ogor~ality
of the principal c o t n p o n e ~ ~ct os n -
FOEFI, (Swiss Ih(1er;il Office of Environn~cntand i,;~nrlst:;qx:). 1987. C b n - puled from coregionaliaed variables. Malhe~nntir:nlGeology, 25(3):281
n~entary011 the Ordinnnce llelatiny to Pollutnnts i s Soil (VSl3o of June 302.
9, 1986). FOEFL, Bern. Goovaerls, P. 1994a. Comparative perfortnancc of indicator algoritlrrns for
modeling conditional probability dist.ributiorr functions. Malhen~nlical
Froidevaux, R. 1990. Geostnlisticul lholbox Primer, ver.sion 1.50. E'SS In-
Geology, 26(3):389-411.
ternational, 'Froinex, Switzerland.
Froidevaux, R. 1993. I'robability field simnlatior~. 111 A . Soares, editor, Goovaerts, 1'. 1994b. Co~tiparisonof ColK, IK and rnlK pcrforrn;t~~ces ior
modeling conditional prolrabililics of categorical variables. In R. Dim-
Geoslalistzcs Triia '98, volurne 1, pages 73-84. Kluwer Academic Pob-
lisl~ers,Dordrecht. itrakopoulos, editor, Geostntistics for the Nerl Century, pages 18--29.
Klnwer, Dordrecht.
Galli, A., F. Gerdil-Neuillet, and C. Dadou. 1984. Factorial kriging anal-
ysis: A substitute to spectral analysis of magnetic data. In ( 3 . Vcrly, Goovnerts, P. 1 9 9 4 ~ 0
. 11a controversial method for modeling a corcgional-
izatiorr. Mnihe~nnticalGeology, 26(2):197~204.
M. David, A. G . .lournel, and A. M a r k l ~ a l ,editors, Geoslufislics for
Natural Resourccs Chnrnctertzalion, volu~ne2, pages 543- 557. Reidd, Goovaerts, 1'. 1994d. Study of sp&ial r e l a t i o ~ ~ s l ~ l)cl.rvcc~~
ips i.\vo scts of
Dordrecht. variables using multivariate geostatistics. Geodennn, 62:03--107.
Galli, A. a d C. Meuuier. 1987. Study o f a gas restwoir using the exterual Goovaert,~,1'. 1996. Stochastic sinn~lntionof categorical variables using a
drift method. In G. Mat,lteron and M. Ar~nstrong,editors, Geostatistzcnl classification algoritl~rr~
nnd sir~t~llated
anuealirlg. Mnlhe~z~atzcnl Ceol-
Case Studies, pages 105- 119. Ikidrl, Dordrcctit. ogy, 28(7):909-921.
Geman, S. and 11. Geman. 1984. Stochastic relaxatiou, Gibhs d i s t r i b ~ ~ l . i o ~ ~ s , Goovacrts, 1'. 1997. Ordinary cokriging revisikd. Mnthe~naticnlGeology,
and the Bayesian restoration of i~rrages.I E E E Trnnsnclions on Pnttern 29, in press.
A~rnlysisand Machine I~~telligence, PAMI-G(A):721-~741 Goovaerts, P, aud C. Chiang. 1993. 'I'emporal persistence of spatial patterns
Glacken, I. 1996. Change of szrpporl by direct condition01 block sirir~~latioa. for tnineralioablc nitrogen and selected soil properties. Soil Science
Master's thesis, Stanford University, St,anford, CA. Society of America Josmal, 57(2):372- 381.
G6111ez-Iiern&1dea, J . 1991. A Stochnslic Approach to llre Sinz~tlnl~oe of Goovacrts, 1'. and A. 1;. Joornel. 1'3% Int,cgralitrg soil map it~forrr~ation ill
Block Condzrcliuity Fields Conditioned lrpon Data Me(zsured a1 n Smnllcr modelling the sl,atiaI variation of c o ~ ~ t i n m usoil
s propertics IS?iropeun
Scale. Iloctoral dissertation, Stanford University, Sthnford, CA. J o u r ~ dof Soil Science, 40(3):3'37-414.
GSmez-llernindes, J . and E:. Cassiraga. 1994. Theory and practice of se- Goovacrts, 1'. and A. G. Journel. 1996. Accounting for local probalditics
quential simulal.ion. In M. Arrrrstrong and P. A. Ilowd, editors, Geo- in stochast,ic nrodeliug of facios data. S P E Jo~rrnal,1(1):21 2 9 .
slalistical Szwelnlioas. uaaes 111bl24. Kluwer A c i ~ ~ l ~ : ~l'111~lisIw1~
nir Goov;ierts, 1'. and 1'11. S o r r ~ ~ 1993.
t. St,udy of spatial and t c ~ n y ~ x nvaria-
l
HIDLIOGIIA I'H Y 471
A. Soi~res,editor, Geostatisttcs T r i i a '92, volume 2, pages 7 4 5 756. Jonrncl, A. G. 1!184a. Mad and conditional quarktile cstimat,ors. III G . Verly,
IUuwer Academic Publishers, Dordreclit. M. David, A. G . Jonrnel, and A. Martchal, editors, Geostatistics for
Goovaert,~,P., Pli. Sonnet,, and A. Navarre. 1903. h c t o r i a l kriging anal- Natural Resources Chaructcrizatiorr, volume 2, pages 261 2 7 0 . Heidel,
ysis of springwater cont,ents in the Dyle river basin, Belgium. W a t e r Dordrccht.
Ilcsourcrs llesenrch, 29(7):2115 2125. Journel, A. G. 1984b. T h e p l x r of non-parametric gcost,atistics. 111G. V d y ,
Goovaerts, P. and It. Wehster. 1994. S c a l e - d e p e ~ ~ d corre1;~tion
e~~t between M. David, A. G. dournel, and A. Mar&cbal,editors, Geostotislics for
topsoil coppcr and cobalt conccntr;~lionsin Scotland. European Journal Natural Resources Characterization, volunre 1, pages 307355. Reidel,
of Soil Scieitce, 15(1):79 95. I)ordrec.l~t..
Goolard, M. 1989. Infererice in a coregionaliaation t ~ ~ o d eInl . M. Armstrong, Jouroel, A. G. I986a. Geostatistics: Models and tools for the carth sciences.
editor, Gcosfatistics, volu111e 1, pages 397-408. Kluwer, Dordrrcht. Mathernal~calGeology, 18(1):119 1 4 0 .
Coulard, M. and M. Volls. 1092. Linear corcgio~~alisat,ion rnodel: Tools for .lournel, A . G . l98Fh. Constrained interpolation and qualitative inforrna-
tistirnation and choice of cross-variograln iniitrix. Mathematical Geol- tion. M ~ t h e n r a t i c a lGeology, 18(3):269-286.
ogy, 24(3):269-286.
Journel, A. G. 1087. Geostatistics for the environmental sciences, EPA
Grschyk, M. IW3. A j u s l e n ~ r s td'anr Cor4giosalisatio1~Statio1t~~ailr.
I)oc- projwt, no. cr 811893. Technical report, 1J.S. EI'A, EMS Lab, La8
I ~ h Natio~ii~le
ILurd diss~:rl,;itio~~, k Sup6ric11rcdm Mines dc Paris. Vcgas, NV.
Guardiar~o,F, and It. M. Srivastava. 1993. Multivariate geostatistics: Be- Journel, A. G. 1989. Fundamentals of Geostatistics i n Five Lessons. Volume
yond bivariate moments. 111 A. Soares, editor, Geostatisfics Trdia '92, 8 Short Course in Geology. American Geophysical Union, Washington,
voloine 1, pages 133 144. Klriwer Acadtmic Publishers, I~ordrecht. D.C.
IIaldorsm, 11. 11., 1'. J . I3rand, and C. J . Mncdoti;tld. 1988. Iloview of the Journel, A. C:. 1994a. Modeling uncerlni~lty:Some coriceptual thoughts. In
sl.ucl~;wlicimtorc of reservoirs. In S. 1Sdwards and 1'. I t . King, edi- It. Dirnitrakoponlos, editor, Ceoslatislics for the Next Century, pages
tors, Mnlhcnmtics i n Oil Production, pages 109 209. Cl;irendon Press, 30-43. Kluwer, Dordrcclrt.
Oxford
Journel, A. G. 1994b. Resampling from stochastic sirnt~latiorrs. E'nvirow
llaslett, J . , 11.. Dradley, 1'. S . Craig, G. Wills, and A. It. Unwin 1091. Dy-
mental and Ecological Statistics, 1:63R4.
11amic graphics for exploring spatial dal.a, with ;ipplicatiol~lo localiug
glol,;iI i111r1 loc;~la ~ ~ o r l ~ a l iAntericatt
rs. Strtfisficint~,45:234-212. Jourrrel, A . C. 1995. Frobahility fields: Another look and a proof. 111Report
8, S l a n f u ~ dCenter for Reservoir Forecasting, Stallford, CA.
Hudson, G . 1993. I'irigil~gt~ernpcralorein Scotland using the external drift
method. 111A. Soares, editor, Geostntistzcs 7i.iia '99, v o l u ~ ~
2,~pages
c Journel, A. G . and F. G. Alahert. 1988. Focusing on spatial connectivity
577 588. Kh~wcrAcadc~liicI'ul~lisl~r:rs,I)ordrecl~t. of extreme valnad attributes: Stochastic indicator models of reservoir
Isaaks, 1.:. 11. 1984 l<zsk CJunlified Mappings for llntnrdous Waslr Silcs: A I~et,erogeneities.SI'E paper # 18324.
Case S t u d y i n Distrzb~~tios-free
Gcostntistics. Mastcr's tl~csis,Stanford Journel, A. G . and C . V. Deutsch. 1893. Entropy and spatial disorder.
University, Stai~forrl,Ch. Mathematical Geology, 25(3):329~355.
Isaaks, E. H. 1!190. T h e Apphcation o f M o n t e C n d o hlelhods lo the Analysis Journel, A. G . and C. J. Huijhregts. 1978. Mining Geostatistics. Academic
oJSpatially Corr.eluted Datn. Iloctoral dissertal.ion, Stanford Il~~ivcrsity, Press, New York, 600 p.
Sthnford, CA. Jonrnel, A. G . and E. 11. Isaaks. 1984. Conditional indicator simulation:
Isaaks, E. 11, alrrl It. M. Srivastava. 1989. A n l s l l v d u c t i o ~to
~ Applied Application to a Saskatchewan uranium deposit. Malhematical Geology,
Geostatistics. Oxford Univcrsily Press, New York, 561 p. 16(7):685-718.
e l (;., 1980. Tlir logrlarrnal approach to predicling local dist.ribu-
J a r ~ r ~ ~A. Journel, A. G . and D. Posa. 1990. Characteristic behavior and order rela-
tions of selective mining unit grades. Mothematical Geology, 12(4):285- tions for indicator variograms. Mathematical Geology, 22(8):1011-1025.
303.
Jonmel, A. G. and S. E. Rao. 1996. Deriving conditional distributions from
Joornel, A. G. 1983. NOXI-parametrice s t i r ~ i a l i oof
~ ~spatial distributior~s. ordinary kriging. In Report 9, Stanford Center for Reservoir Forecast-
ilfnthe~naficalGeology, 15(3):445-468. ing, Stanford, CA.
BIBLIOGRAPHY 473
Jonrnel, A. G . and M. E. Itossi. 1989. When do we need a t r r t ~ dmodel in Myers, I>. I:;. 1991. Pseudo-cross variograrns, positive-definitc~~ess,
and cok-
kriging? Mathernalical Geology, 21(7):715 -739. riging. Malhemaficnl Geology, 23(6):805-816.
Journel, A. G. and W. X u . 1994. Posterior idcntific;tthn of Iiistogmms I'apritz, A. and 11. Iliihler. 1994. 'IP~nporalchange of spatially aut,ocor-
conditional to local data. Mnthe~naficalGeology, 26(3):323~359. rrlal.cd soil properties: Optimal esti~natiouby cokriging. Geuderinn,
25:1015102&
Krige, D. G. 1951. A Stafislica/ Approach to Some ilfiae V(~/trnlios.sand
I'apritz, A,, H . R. Kunsch, and R. Webstcr. 1993, O n tlie pscndo cross
Allied /'roblems at the Wzhuaterstand. Master's thesis, University of
v;iriogral~t.Mnthernalicnl Geology, 25(8): 1015-1026.
Witwatersrand.
l'osa, D. 1089. Conditioning of the stationary kriging rn;tlrices for sonrc
Luenberger, D. G . 1069. Opti~rtizntiosby Vector S p c e Alellrods. . l o l ~ tWilcy
well-knowt~covariance nrotlcls. Afathematzcnl Geolo!/y, 21(7):755 7 6 5 .
& Sons, New York.
Press, W . 11., 13. 1.' I.'lan~tery,S , A. 'ii:~tkolsky,m d W. 'T'. V?t,tcrling. 198ti.
Luster, G. It. 1985. I l o s Malerials for I'ortland C r ~ n e ~ r lApplzcnlioss
: Nuntericnl Recipes. Canrhriilge University I'rcss, New York.
of Condifiunnl Simulntion of Coregio~~alizafion 1)octoral dissertat.ion,
Stanford University, Startford, CA. ltipley, D. I). 1981. S p ~ t t aSlnfistics.
l John Wiley 81 Sons, New York, 252 1).
Ripley, D. D, 1987. Slochaslic Sinnrlatzo71. Jolltl Wilry 9s Sorts, New York,
Ma, Y. 2. and J . J . R.oyer. 1988. Local geostatistical filteriltg: Application
237 y .
to remote sensing. Sciences de la Terre, Sirie Informatiqnc, 27:17-36.
Iloult;u~i,S . and 11. Wackerltagel. 1990. M~tllivariitlog~ostaliaticalirppr11irc11
Mallet, J . L. 1080. 1ti.grrssion soos cot~t,raintesli~tiilircs: Applic;ttiol~an to space-tinre data analysis. Wnler Resources flesearch, 20(4):5855591.
codage dcs variables al4atoires. Ileut~ede Statisliyue Appliq~iie,28(1):57
68. Sau<\jivy,1,. 1984. 'T'l~efactorial krigit~gutl;rlysis ofregio~~;~liactl
data. Its n p
plication t o geocl~emicalprospecting. In G . Verly, M . David, A. (:. Jonr-
Marcotte, D. 1995. 1:onditional s i ~ n ~ t l a t rvitlt
i o ~ ~data sl~bjectlo measure- nel, and A. Mar&chal,(!&tors, Geoslnlislics for Nslarol Ilesoawes (:ltal~
tnent error: I'ost-sirnolation filtering wit11 modified factorial krigil~g. nclerirtrtioe, pages 559H71. Ikidel, Dordrcclrt.
Mathenzaliral Geology, 27(6):740762.
Sdgoret, S, and P. Iirlclion. 1990. 'I\igonomctric kriging: A new nietliod
Markchal, A. 1984. Kriging seismic data. in presence of Lttdts. In G.Verly, for rerl~ovingthe dirmial variatiot~from geomtgnetic data. .lonrit~rlof
M. David, A. G. Joornel, and A. Mar(rhal, editors, Geoslalistics for Geophysical Rescalrh, 95(13):21,383 21,397.
Natural Resources Charucterizntioe, pages 271294. Iteidel, Dordreclit.
Shannon, C. 1,:. 1048, A niatlmnatical tl~eoryof c o l ~ t r ~ ~ n t ~ i c ; &Uell
l i o ~Sys-
i.
Matheron, G . 1970. La ThCorie des Variables Higiunalisies ct ses Appli- tem Technical Jonrrral, 27379-623.
calions. Faxicole 5, Les Cahiers (In Centre de Morphologie M a l l ~ i n ~ ; t -
Silvcr~nan,H. W . 1986. Uenszty Estzmutioa forStn1zslic.s n u d I)ul~rAnalysis.
tique, Ecole des Mines de Paris, Fontainehleau, 212 11.
Chapman and Ilall, New York.
Matheron, G. 1979. Itecllerclie de simplification dans tin problime de cokrigeage Soares, A. 1002. Ceosl.nt.ist.icnl estitrratiotr of intrlti-plrase st.ruclrrres. hfdlt-
Internal note N-628, Centre de Giostatistiqtie, FontaincBlean. cn~nticalGeology, 24(2):149-160.
Matlreron, G. 1982. Pour one analyse krigeante de dor~tlirsrdgioliaIis&s. Sous;i, A . ,I. 1980. Geostalistical da1.a an;rlysis Arl applicabiotr to on: 1.ypol-
Internal note N-732, Centre dc Giosthtistiqrtc, I~'ontaineh1aa~1. ogy. 111 M . Arn~strong,editor, Geoslnlislics, volwnc 2, pagt.s 851 860.
Matheron, G . 1989. Esliimating and Choosing Springer-Verl;rg, Berlin, Klawer, I)ordreclrt,.
141 p. Srivastava, R. M. 1067a. Mitii~nr~rr~varia~iceor
iiraximumprolitability? ClM
Matlreron, G . , 11. Hencher, C . de i.'ortquet, A. Galli, D. Cuerillol, and Bnrllelza, 80(901):63-68.
C, Ravenne. 1987. Cot~ditiorlalsitnulation of Lltc gconletry of flovio- Srivastava, It. M. I087h. A N~II-ergodzcI'iumr~uorkf i r Vnrrogram und
deltaic reservoirs. S P E paper # 113753. Covarim~ceFunctiol~s. Master's thesis, Stanford liuiversity, Stanford,
Murray, C. 1992. Geoslntistical Applications in IJelroleurn Geology and Sed- CA.
imentary Geology. 1)octoral dissertation, Stanford University, Stanford, Srivast.ava, R. M. 1092. Ittw:rvoir cl~aracterizatio~i
wit11 prob;rl)ility ficld siltl-
CA. ~ t l a l i o In
~ ~SI'E
. Ann~rnlConference and I~xhibzlios,Wnslrington, I). C.,
Myers, D. E , 1982. Matrix for~riulatio~i
of co-kriging. Matherr~uticalGeology, rtul~lber24753, pages 927 938, Washington, D.C., Socie1.y af I'etrolen~it
,a,*, " " n ',,- Ih"i,,"e"s
Srivastava, R.. M. 1994. 'FIE visualization ofspatial uncertainty. In J . M. Yartrs I,arsen, editors, Qum~titntivcAt~alysisof Mineral and Etwgy Resources,
and R. 1,. Cl~arubcrs,editors, Slochastic Modelrng and Geostatistics. pages 390 409. Ikidel, Ilortlreclrt.
Principles, Methods, and Case Studies, volurr~e3 of AAPG Conrputer.
Applicatio~~s in Geology, pages 339-345. Wackernagcl, B. 1904. Cokriging versus kriging in regionalized multivariate
data analysis. Geoderma, 62:83~-92.
Srivastava, It. M. and 11. M. Parker. 1989. Robust measures of spatial con-
Wackernagel, 11. 1995. Multivariate Ceostatistics: An introduclion with ap-
tinuity. In M. Armstrong, editor, Geostatistics, pages 295-308. Kluwer,
plicatio~~s.Springer-Verlag, Berlin, 256 p.
Dordrechl.
Stein, A., M. IIoogerwerf, and J . Borlrna. 1988. Use of soil-map delineatiorrs M1xckern;tgel, H . , 1'. I'etitgas, and Y. Tonffnit. 1989. Ovcrvicw of methods
for coregionalization analysis. In M. Arnrsfrong, editor, Ceoslatistics,
to irrrprove (co)krigiug of point data on ttroisture deficits. Geodermn,
vohlme 1, pagcs 400-420. Kluwer, Dordrecht.
43:163~177.
Wackernagel, 11. and 11. S a ~ r g n i ~ ~ c 1993.
t t i . Gold j,rospccting with factorial
Sr~llivan,J. 1984. Conditional recovery esti~rralior~through probability krig-
cokriging ill the Limousin, France. In 3. C. Davis and U. C. Kerzfeld,
ing: Theory and practice. In G. Vcrly, M . David, A , G. Journel, and
editors, Co~npaterain Geology: 25 yeara of progress. Studies in Moth-
A. Mar&cl~al, editors, Geoslatistics for Natural Resorrrces Charnclerim-
emnticnl Geology, volume 5, pages 33-43. Kluwer, h r i l r e c h t .
lion, pagcs 365-384. Il.cidel, Dordnxl~t.
Wackcrnagel, II., 11. Webster, and M. A. Oliver. 1988. A geostalisti-
Sullivan, 1 . 1085. Nos-parametric Estirnatios of Spatial Uistrib~rliott.~.Doc-
ral inctl~od for segmenting nlultivariate sequcnccs of soil data. In
1,or;il (lissert;tt~ion,Star~fortlUnivrrsity, Sl.altford, CA.
11. 11. flock, edil.or, Cllnssijicatioa nad Itelated Mrthod of I h t n A W s i s ,
Suro-l'drea, V. 1 9 0 1 Generalion of a turbiditic reservoir: T h e 13oolean pages (id 1 6 5 0 . Klsevier North Ilollar~d,Aitrstertlarn.
alternative. In Report 4, Sla~lfordCenter for Rrsrruoir Forecasting,
Wat~g,I,. 1904. Atrin~atrdvisualization of multiple sirndated realizations
Stanford, CA.
and i ~ ~ s e s s m eof
r ~uncertairrty.
t In Report 7, Stanford Certtcr for Reser-
Sum-I'e'rez, V. 1993. Indicator principal component kriging: T h e nrult,ivari- voir [.bi!ecnsting, S t a ~ ~ f b rCA.
d,
at,e case. 111 A. Soares, editor, Geostatzstics P d i a '92, volnrnc I , pages
Mrc!lmter, I<..,0 . Atlei;%,and J.-f. I)rrbois. 1994. Coregionalizatiorr of trace
441- 454. liluwer Acaderuic I'oblislrers, 1)ordrecht.
n ~ e t a l si l l tlrr soil i l l tlle Swiss Jura. Etrropem~J o ~ r n a lof Soil Science,
Suro-Pbrez, V. and A. G . Journcl. 1991. ludicator principal cornponrnt, 45:205 218.
krigirrg. Mut/trn~nlimlGeology, 23(5):75!1 7 8 8 .
r , aud M , A. Oliver. 1900. Statislical Methods in Soil and Land
W ~ l ~ s t e11..
Tran, T. 1994. Improving variogram reproductiort on dense sitnulatiort grids. Ilesoarce Strrvey. Oxford University Press, New York, 316 p.
Computers t3 Geosciences, 20(7):1161--1168. Xiao, 11. 1085, A U e s c ~ t ~ t i oofn the l?el~aviorof Indicator Vur'iograms for
Val) Mcirvc~mc,M., K. Scl~elrlenr;ur,C . Bacrt, and G . Ilofu~an.1994. Quan- n llt~inrtnlr.A'orrr~al I l i s t r ~ b ~ ~ l i Maskr's
os. tl~mis,Sthrfortl Universily,
tification of soil textural fr;rclions of Bas-%;tire using soil nrap polygons Stanford, (:A.
nr~d/orpoint observat,ious. Geodwn~a,62:69 82. Xu, W. 1994. (huvex krigi~lg.111 llcl~or.l7, Slaefonl C e ~ l f efol.
r lZescruoir
Vcrly, G. 1986. Mtrlti(;;u~ssian kriging - A completc case study. 111 It. V. Ita- I~olr'cn.slnig,St;tnford, (:A.
I I I ~ I cdit,or,
I~, Al'COM Sympo-
I'lncerdiags of thc 19th ltrter~~~atiorrnl XII, W. 1095;~.St.ocll;rsl.ic rllrrtlcling of lit.l~of;~cies:
Altert~atives. In Reporl
szunt, pages 283 298, Littleton, CO. Society of Minit~gEngineers. 8, Slnnfolil C m l e r for Ileseruoir I.brecn,sling, Stanford, CA.
Verly, G . 1093. Sequential Gaussian co-sitr~ulation:A simulation trrelliod X u , W. 1095t1. Stocha,stic Modeliug of Reseruoir Lilhofacies and Pelrophys-
integraling several types of i~rformation.In A. Soares, editor, Ceostatis- ical Propcrttes. IIoctoraI dissertation, St.anford University, Stanford,
tics Trriia '92, volu~nn1, pages 543-554. Kluwer Academic Pr~hlinl~rrs, (:A.
1)ordrcrhl.
Xu, W . and A. (:. J o u r ~ ~ c l1!)93.
. Gtsiur: G;mssia~ltruncated simulation
Volta, M. and R. Wcbstcr. 1990, A comparison of kriging, cubic splines of litktofacies. 111 neyort 6, Stnnfo~~d Ceuter. l o r flesenmir Forecasfiag,
and classification for predicting soil properties from sample inforlnatio~~. Stanford, (:A.
Joumnl of ,Soil Science, 41(:1):473 490.
Xu, W , turd A. C . Jowncl. 1'394. 1)ssiui: A general scqllclltial sirnula-
Wackernagel, 11. 1988. Geostatistical techniques for interpreting rr~ult,ivari- ~ . Rqiorl 7, Stanford Center for Ilcsereoir Fwecaslieg,
t,iorr a l g o r i f h ~ Irr
ale spatial inforroation In C.I.'. Clrortg, A . G. Fahbri, and R. Sinding- Stanfor~l,CA.
476 BIBLIOGRAPHY