Professional Documents
Culture Documents
Pete 360 Chapter 1
Pete 360 Chapter 1
1
POPULATIONS A N D SAMPLES
9
• The properties of a reservoir unit for which the geologist
or engineer is required to infer (or estimate or guess)
values can be considered a population
1
0
• Populations possess certain numerical
characteristics (such as the population mean)
which are known as parameters
1
1
• A sample is a subset of
elements drawn from
the population
1
3
• In summary one can estimate the population
parameter by statistical inference from a statistic
computed from some sample data
1
4
Sampling and Sampling techniques
1
5
• A random sample must be unbiased, so that each item
in the sample has the same chance of being chosen
as any other item in the sample
• Second, the random sample must be independent,
so that selecting one item from the population has
no influence on the selection of other items in the
population.
1
6
• Unbiased and independent sample gives a better
chance of understanding the true nature (distribution)
of the population as the sample size increases
1
0
Sampling Methods
1
1
• Sampling with replacement allows us the chance to
pick that same value again in our sample
1
2
Oilfield Applications to Sampling
1
3
• Similarly, the process of drilling wells in a
reservoir necessarily involves sampling without
replacement
1
4
• The sampling routine (also known as the drilling
program) is highly biased and dependent
1
5
MEASUREMENT SYSTEMS
1
6
Nominal Scale
1
7
Ordinal Scale
1
9
Ratio Scale
2
0
• Within the petroleum industry, reservoir properties are
measured along a continuum, but there are practical
limits for the measurements; I t would be hard to
c onceive of negative porosity, permeability, or
thickness, or of porosity greater than 100%.)
2
1
TRIALS, EVENTS, A N D PROBABILITY
2
2
Events can be classified by
relationship to one
there
another;
• Independent Events if the occurrence
event A has no bearing on
of
occurrence
the of event B, and vice
versa
• Dependent if the occurrence of event
influences the occurrence of event
A
B.
2
3
• Mutually Exclusive if the occurrence of either
event precludes the occurrence of the other
2
4
The measure of probability is scaled from 0 to 1,
where:
• 0 represents no c hance of occurrence, and
• 1 represents certainty that the event will
occur.
2
5
Discrete Probability
2
6
Coin Toss Experiment
2
7
Conditional Probability
2
8
• For example, suppose an experiment consists of
observing weather on a specific day
2
9
• In statistical notation, the conditional probability
that event A will occur given that event B has
occurred already is written as P(A|B)
3
0
• and we define the conditional probabilities of B given
A as follows:
3
1
Additive Law of Probability
3
2
• The probability of the union (A u B) is equal
to:
3
3
Multiplicative Law of Probability
3
4
Risk Analysis In Petroleum Play
3
5
The probability of a prospect’s exploration success is
a function of the individual probabilities
concerning;
• RESERVOIR,
• SEAL,
• SOURCE/MIGRATION,
• TRAP
• and T IMING
3
6
• Probabilities are assigned for each of the elements
from 0 (meaning impossible), to 1 (certain or very likely):
3
7
• P(SR) - The probability that there is a mature
hydrocarbon source rock in the vicinity of the structure
and that a migration path exists
3
8
Assuming these events are independent, then apply
the multiplicative law,
P(TR) x P(T)
3
9
As guideline Prospects risk can be classified
into the following
4
0
4
1
4
2
R A N D O M VARIABLES
4
3
• The coin toss is another experiment that produces
numbers randomly . In the case of a coin toss,
however, we need to designate a numerical value to
“heads” as 0 and “tails” as 1; then we c a n draw
randomly from the set 0,1.)
4
4
TWO CLASSES OF R A N D O M VARIABLES
4
5
Discrete Random Variables
4
6
• In most practical problems, discrete random
variables represent count (or enumerated) data,
such as point counts of minerals in a thin section
4
7
Frequency Tables and Histograms
4
8
• In a frequency distribution table interval of variation
of the data is divided into class intervals
4
9
• Data is conventionally ranked in ascending order, and
thus can be represented as a cumulative frequency
histogram, where the total number of values below
certain cutoffs are shown, rather than the total number
of values in each class
5
0
5
1
Continuous Random Variables
5
2
Probability Distributions Of The Discrete
Random Variable
The probability distribution of a discrete random
variable consists of the relative frequencies with which
a random variable takes each of its possible values
• Binomial
• Negative Binomial
• Poisson
• Hypergeometric
5
3
Binomial Probability Distribution
5
5
• let us look at the two-coin example.
• The sample points for this experiment with their
respective probabilities are given below
5
6
• Let y equal the number of heads observed.
5
7
• The Probability Distribution Function for y, where
y = Number of Heads
5
8
• Thus, for this experiment there is a 25% chance of
observing two heads from a single toss of the two
coins
5
9
• Because p(0) = ¼, the theoretical relative frequency
for y
= 0 is ¼; p(1) = ½, hence the theoretical
relative probability for y = 1 is ½, etc
6
0
• If you were to draw a sample from this population,
by throwing two balanced coins, say 100 times
your histogram would appear very similar
6
1
• A typical application of this in the oil industry is to
forecast the probability of success of a drilling program.
6
2
• The binomial distribution is appropriate when a
fixed number of wells will be drilled during an
exploratory program or during a single period
(budget cycle) for which the forecast is made.
6
3
The probability p that a wildcat well will discover gas or
oil is estimated
6
4
• In Maths terms;
6
5
• For example, suppose we want to find the probability of
success associated with a 5-well exploration program in
a virgin basin where the success ratio is anticipated to
be about 10%. What is the probability that the entire
exploration program will be a total failure, with no
discoveries?
6
6
Where:
• P = the probability of success
• r = the number of discovery wells
• p = anticipated success ratio
• n = the number of holes drilled in the exploration
program
6
7
• Discrete distribution giving the probability of making
n discoveries in a five-well drilling program when
the success ratio (probability of discovery) is 10%
6
8
Negative Binomial Probability Distribution
6
9
• The experiment c onsists of x repeated trials.
• Each trial can result in just two possible outcomes. We
call one of these outcomes a success and the other, a
failure.
• The probability of success, denoted by P, is the same
on every trial.
• The trials are independent; that is, the outcome on
one
trial does not affect the outcome on other trials.
• The experiment c ontinues until r successes are
observed, where r is specified in advance.
7
0
• The expanded form of the negative binomial equation
is
P = [(r + x -1)!/(r -1)!x!][(1 -p)x-r p r
Where:
P = the probability of success
r= the number of discovery
wells x = the number of holes
p = regional success ratio
7
1
Discrete distribution giving the cumulative probability that two discoveries will be
made by or before a specified hole is drilled, when the success ratio is 10%
7
2
Poisson Probability Distribution
7
3
• The probability that an event will occur in an interval
is proportional to the length of the interval, and
7
4
• The equation in this case
is
7
5
Hypergeometric Probability Distributions
7
6
• n: The number of items in the sample.
• x: The number of items in the sample that are classified
as successes.
7
7
• The probability of making x discoveries in a drilling
program consisting of n holes, when sampling from
a population of N prospects of which S are
believed to contain commercial reservoirs
P=(SC X . C n-x)/NC n
N-S
7
8
x = the number of discoveries
N = the number of prospects in the
population n= the number of holes drilled
S = the number commercial reservoirs
7
9
Discrete distribution giving the probability of n discoveries
in three holes drilled on ten prospects, when four of the
ten contain reservoirs
8
0
Frequency Distributions Of Continuous
Random Variables
• Probability density function can be represented
by a continuous curve
8
1
8
2
Normal Probability Distribution
8
3
Central Limit Theorem
8
4
The Central Limit Theorem consists of three
statements
8
6
• Formally, the Normal Probability Density
Function is represented by the following
expression
8
7
• Gaussian normal
distribution can be
described by the position
of its maximum which
corresponds to its mean
(µ
)
8
8
• 68.3% of all sample values fall within -1ó to +1ó from
the mean
8
9
Lognormal Distribution
9
1