You are on page 1of 43

GEOSTATISTIK

Syahrul, S.Si., M.T


27 September 2019
Capaian Pembelajaran Mata Kuliah:

Setelah mengikuti mata kuliah ini, mahasiswa akan mampu menganalisis


metode pengolahan data spasial dan non–spasial dalam pendekatan
statistik dan mampu membuat model–model variogram, perhitungan
estimasi, kriging, co–kriging, dan simulasi dalam melakukan analisis data
eksplorasi.

Capaian Pembelajaran Sub Mata Kuliah:

1. Menjelaskan definisi ilmu geostatistik


2. Menjelaskan perbedaan dengan ilmu statistik dan geostatistik
Referensi
Utama :
Isaaks E.H, Srivastava R. M, 1989, An Introduction to Applied Geostatistics, Oxford
University Press, Inc.

Pendukung :
1. Armstrong, M., 1998, Basic Linear Geostatistics, 1st edition, Springer-Verlag, Berlin.
2. Journel, A.G., C. Huijbregts, 1997, Mining Geostatistics, 7th edition, Academic Press.
3. Remy, N., Boucher, A., Wu, J., 2009, Applied Geostatistics with SGeMS: A User’s
Guide, 1st edition, Cambridge University Press
Introduction to Geostatistics
CONTENT
• What is geostatistic?
• Application of spatial statistics
• Basic assumptions in spatial statistics
• Key concepts in geostatistics
• Exploratory data analysis (EDA) for non-spatial
statistics
• Spatial description
Geostatistics
• Geostatistics: branch of statistics that
deal with spatially correlated data
• Basic assumptions:
– Sample values are not independent
– Spatial dependency exists
• Goal of geostatistics:
– Spatial continuity model
– Use the model for estimation and/or
simulation of spatial data distribution
Geostatistics
• Geostatistics Term used by Hart (1952) - Application of
Statistics in a Geographic Context
• Matheron (1962, 1963) Used Term in a Geological
Context for Inferring Ore Reserves from Data Spatially
Distributed Within an Ore Body :
- Developed Theory of Regionalized Variables
– Formal Introduction of New Statistic - the
Semivariogram
– Used Kriging to Obtain Best Estimate of a Property (i.e.
Ore Grade) at Some Location in an Ore Deposit
– Built Theory on Practical Work of Krige (1951, 1960)
What is special about spatial data?
• Location of a sample → intrinsic part
of its definition
• All data sets → implicitly related by
their coordinates (models of spatial
structure)
• Data values may be related to their
coordinates → spatial trend
What is special about spatial data?

• Values at sample points can NOT be


assumed to be independent
• That is, there may be a spatial structure
to the data
– Classical statistics → independence
– Implications for sampling design
Key Concepts

• Spatial dependence: the value of a


variable at a point in space is related
to its value at nearby points
• Spatial structure: the nature of the
spatial relation
• Support of a sample: the physical
dimensions it represents
Geostatistic application
Reservoir Property Distribution Using the Available Well
Log, Core, and/or Seismic Data
Geostatistical
Analysis
Raw
Data

Selection of
Model
Appropriate
Estimation or
Stochastic Algorithm
Geostatistic application
• Quantify Uncertainty Using Multiple Geologically and
Statistically Valid Models
Individual
Reservoir
Simulation
Runs Are
Numbered

n
RESERVOIR
1 3
1 FLOW 6
2
SIMULATOR
3
5
4
4 2
5
6 OUTCOME
PROPERTY (PHI, K) (RECOVERY)
DISTRIBUTIONS
Limitations of geostatistics
Geostatistics does NOT :
• Create Data or Eliminate the Value of
Obtaining Additional Good Data
• Replace Sound Qualitative
Understanding and Expert Judgment
• Necessarily Save Time, At Least in the
Short Term.
• Work Well as a “Black Box”
The first law of geography was put forward by Tobler,
stating that everything is interconnected with one
another, but something close has more influence than
something far away (Anselin, 1988)
Types Spasial Data
 Point Data (Point Pattern Analysis)
Indicates the location in the form of a point, for
example in the form :
 Longitude dan latitude
 x and y
 Line Data (Geostatistical Data)
 Continuous spatial surface
 Area Data (Polygons or Lattice Data)
Shows the location in the form of area, such as a
country, district, city etc.
Point Data
Line Data
Data Area
Spatial Pattern
❑Form of Spasial Pattern

clustered

random uniform clustered


random

uniform clustered
Non-spatial statistics

Exploratory Data Analysis (EDA)

Exploratory Data Analysis (EDA) is an approach/philosophy


for data analysis that employs a variety of techniques
(mostly graphical)
Exploratory Data Analysis (EDA)
For example, multidimensional scaling is an EDA that uses
visual representations of distances or similarities between sets
of objects; It’s up to the user to interpret exactly what the
distances represent
Univariate description

Measure the characteristic of data population

• Mean
• Variance/standard deviation
• Histograms
• Spread/central tendency
• Skewness
Frequency Table
values:
2, 4, 1, 5, 2, 3, 6.9, 2, 5, 7, 2.1, 3.4, 4.2, 2.2, 2.9, 1.7, 3.5, 6.2

Cumulative
Interval Frequency Frequency
1 - 1.999 2 2
2 - 2.999 6 8
3 - 3.999 3 11
4 - 4.999 2 13
5 - 5.999 1 14
6 - 6.999 2 16
7 - 8.000 1 17
Histograms
6
5

frequency
4
3
2
1

0 1 2 3 4 5 6 7 8
data value
relative frequency

0.36
0.30
0.24
0.18
0.12
0.06
0 1 2 3 4 5 6 7 8

data value
Histograms
• Shape varies with Number of Bins
• Rule Of Thumb

Number of Bins = Number of Samples


Cumulative Distributions
cumulative

cumulative frequency
distribution
1.0
0.8
0.6
0.4
0.2

0 1 2 3 4 5 6 7 8
data value

• number of samples below bin maximum


• relative frequency below bin maximum
• probability of grade below bin maximum
Skewness Measurements
• How symmetrical is the distribution?
– Skewness
– Kurtosis

Skewness and kurtosis are more visible measures


for viewing data distribution graphically
Outliers
• Outlier values may be cut to reduce their
impact on arithmetic mean and estimation
g/t gAu %
2 20,000 1%
4 40,000 3%
3 30,000 2%
7 70,000 4%
90 900,000 58%
15 150,000 10%
10 100,000 6%
20 200,000 13%

• E.g. if Top-Cut = 20 a value > 20 is changed


to 20
• Derive Top-Cut from Probability Plot
De-Clustering
• Statistics relies on samples being random
and un-biased
• In mining, we are more interested in ore
than waste
• Usually in mining, more drillholes are
located in high-grade areas
• So sampling is inherently biased - more
samples in high grade areas
De-Clustering
• Overcome this bias with de-clustering
• Put (3D) grid over data
• Within each cell take sample closest to
centre (or average of all samples)
Use average of Use single
4 samples sample

⚫ Continue using this single value per cell


De-Clustering
• Any geological knowledge to split into
separate domains must be used

Only decluster if no geological separation


possible
Bivariate description
• Comparing two distributions
• Scatterplot
• Correlation
• Linear regression
Non-spatial statistics
Scatterplot

Regression line

 = 0.7

x
Non-spatial statistics
different values of correlation coefficient

(Picture taken from Dubrule, 2003)

You might also like