You are on page 1of 23

INTRODUCTION OF

STATISTICS

DW200705

Introduction

STATISTICS
The

Data it self

The

science which study about the


collection, organization, analysis, and made
conclusions of data in the numerical forms.

DW200705

Introduction

DATA QUALITY
Data

has an important role, as in the term GIGO


garbage in garbage out .
Geological hypothesis:
(a) Is there a significance difference in composition
between granite from two intrusion bodies?
(b) Is there a regular cyclicities properties in a
sequence of limestone thin layers?
What can we estimate the permeability of
sandstone layers from a well bore log data (wireline
logging)?
DW200705

Introduction

PRECISION and ACURATION


PRECISION: a measure is called precise if we repeat
it on a object will give the same value.
ACCURATION: a measure is called accurate if their
results close to a right given value.
There are a combination form between low/high
precision with low/high accuration .
DW200705

Introduction

PRECISION and ACCURATION


Precision
High

Low

Accuracy High 49, 50, 50, 52, 50,


49, 51, 50
(suppose
true value
Low 54, 55, 55, 57, 55,
is 50)
54, 56, 55

DW200705

Introduction

55, 47, 50, 52,


44, 53, 57, 47
60, 52, 55, 57,
49, 58, 62, 52

TYPES OF GEOLOGICAL DATA (1)


RATIO SCALE DATA
The ratio scale data is a common measure such as length,
weight of matter, it is a good and without problem data.
INTERVAL SCALE DATA
Zero point is not data termination, such as the
temperature in Celsius or Fahrenheit degrees, the zero
point not an end, however the Kelvins degree is a ratio
scale data .

DW200705

Introduction

TYPE OF GEOLOGICAL DATA (2)


CLOSED DATA
These data in the form of percentages or ppm (part per
million), or other form which indicates a proportion to a
fixed value. It must be careful in bivariates or multivariate
analyses.
DIRECTIONAL DATA
The data are expressed in the form of angle, for example
azimuth, it need special process because 0 o = 360 o

DW200705

Introduction

TYPE GEOLOGICAL DATA (3)


ORDINAL SCALE DATA
It is a poor quality data, interval scale is not regular and just shows the
rank position then it can not be used for added, subtracted or dividing
processes. For example the Mohss hardness scale, Richters earthquake
intensity scale that based on the degree of damages. Analyzing of ordinal
data generally use the non-parametric statistics which analyze the data
based on rank order.

DISCRETE DATA
All those above data has continue properties. The discrete data,
commonly in the form of integer, for examples: counting some objects
such as the number of fossils in 1 cm2 of outcrop.

DW200705

Introduction

TYPE OF GEOLOGICAL DATA (4)


NOMINAL OR CATEGORIAL DATA

These data are in the forms such as name, for example


minerals, fossils which must be converted to numerical
data. It was sometimes analysis in binary forms, e.q. if
there is a fossil we note as 1, if there is no fossil we note as
0. Analysis commonly made by multivariable method.

DW200705

Introduction

TYPE OF GEOLOGICAL DATA (5)


RATIO SCALE
1

10

INTERVAL SCALE

-3

-2

-1

DW200705

Introduction

10

TYPE OF GEOLOGICAL DATA (6)


CLOSED
0% 10
20

30

40

50

60

70

80

90

100%

ORDINAL

2 3

DW200705

7 8

Introduction

10 11

12

11

TYPE OF GEOLOGICAL DATA (7)


N
0o

360 o

360 = 0 .. ?

DIRECTIONAL
DW200705

Introduction

12

TYPE OF ANALYSIS (1)


UNIVARIATE METHODS
Each variable analyzed in isolation, the data can be
portrayed as a series of points along an appropriately
scaled line

x
0

x
5

DW200705

x x
10

x x x
15

20

Introduction

x x
25

13

TYPE OF ANALYSIS (2)


BIVARIATE
Y

METHODS

40
x

30
x

20

10

10

DW200705

15

Two variables analyzed together.


The two measurements made on
one object give coordinates of a
point in a two dimensional (2D)
space, and a data set can be
portrayed as a 2D scatter, for the
purpose of investigating the
relationship between the data
points and/ or the relationship
between the variables.

Introduction

14

TYPE OF ANALYSIS (3)


TIME SERIES
METHODS

X
4
3
2
1
0
1

10

15

20

25

Sequences of data in time


(or space) can be in various
forms. Some can be treated
as simple bivariate data,
with one variable happening
to be time. Often, the
situation is conceptualized
as a continuously varying
curve

Time

DW200705

Introduction

15

TYPE OF ANALYSIS (4)


20
30

10

Three (or four) variables


analysed together, two (or
three) of which are spatial
coordinates; grid
references or latitude/
longitude, with or without
altitude or depth. The other
variable is geological
measurement of interest,
and as regarded as varying
continuously over the area.

20

40

30

30

40

40

30

DW200705

SPATIAL
ANALYSIS

Introduction

16

TYPE OF ANALYSIS (5)


MULTIVARIATE ANALYSIS
General methods applicable to any number of variables
analyzed simultaneously, and usually applied to more
(often many more) then three variables. If these are m
variables, the data may be imagined as points in an mdimensional space. The prime objective is to reduce the
dimensionality so that the shape of the data scatter can be
viewed. Relationship between variable can also be
investigated.

DW200705

Introduction

17

Sample and Population


Population

SAMPLE

DW200705

Introduction

18

Sampling Strategies (1)

RANDOM
DW200705

REGULAR
Introduction

19

Sampling Strategies (2)

CLUSTERED
DW200705

UNIFORM
Introduction

20

Sampling Strategies (3)

TRAVERSE
DW200705

Introduction

21

THANK YOU

DW200705

Introduction

22

DW200705

Introduction

23

You might also like