Professional Documents
Culture Documents
Apuntes de Curso UMSA Petroleo PDF
Apuntes de Curso UMSA Petroleo PDF
Risk
Analysis
for the
Oil
Industry
A supplement to:
Risk Analysis:
Table of Contents
▲
Biography im Murtha, a registered petroleum
engineer, presents seminars and training
courses and advises clients in building
Involving Uncertainty - An @RISK
Tutorial for the Petroleum Industry. In
25 years of academic experience, he
probabilistic models in risk analysis and chaired a math department, taught petro-
decision making. He was elected to leum engineering, served as academic
Distinguished Membership in SPE in dean, and co-authored two texts in
1999, received the 1998 SPE Award in mathematics and statistics. Jim has a
Economics and Evaluation, and was Ph.D. in mathematics from the Uni-
1996-97 SPE Distinguished Lecturer in versity of Wisconsin, an MS in petroleum
Risk and Decision Analysis. Since 1992, and natural gas engineering from Penn
more than 2,500 professionals have taken State and a BS in mathematics from
his classes. He has published Decisions Marietta College. ◆
W
▲
2 Risk Analysis
Risk Analysis:
An Overview
A Guide To –
Risk Analysis
R
R isk and decision analysis was born in the middle
of the 20th century, about 50 years after some of
the necessary statistics became formalized.
Pearson defined standard deviation and skewness
in the late 1890s, and Galton introduced
percentiles in 1885.
The term Monte Carlo, as applied to
uncertainty analysis, was mentioned by Metropolis
ance to isolated departments within organizations.
Managers were notoriously unwilling to embrace
results that presented probability distributions for
reserves and net present value (NPV). Consultants
offering services and software vendors know too
well these levels of resistance.
Now, finally, there seems to be broader
acceptance of probabilistic methods, although as
and Ulam: the Journal of the American Statistical I write, my SPE Technical Interest Group digest
Association in 1940. D.B. Hertz published his contains strong negativism from traditionalists
classic Harvard Business Review article in 1964. A about probabilistic prices. Nonetheless, consider
couple of years later, Paul Newendorp began these items:
teaching classes on petroleum exploration • the three most recent and five out of the last
economics and risk analysis, out of which evolved seven recipients of the SPE Economics and
the first edition of his text in 1975, the same year Evaluation award have been strong proponents
as A.W. McCray and 2 years before R.E. Megill of risk analysis;
wrote their books on the subject. Ten years later, • whereas the index to the last edition of the
there was commercial software available to do Petroleum Engineers’ Handbook only had two
Monte Carlo simulation. references to “risk,” the forthcoming edition
During this 50-year period, decision analysis, will feature an entire chapter on the topic;
featuring decision trees, also came of age. Raiffa’s • my first paper on Monte Carlo simulation was
classic book appeared in 1968. By 1985, there presented at the Eastern Regional Meeting in
were several commercial applications of software 1987 and summarily rejected by the editorial
on the market. committees for not being of adequate general
These developments, in many ways, paralleled interest. (It was a case study dealing with the
the development of petroleum engineering, with Clinton formation, but the methods were
the basics appearing in the 1930s, mature texts like clearly generic and used popular material
Risk Analysis 3
Risk Analysis:
An Overview
4 Risk Analysis
Risk Analysis:
Central Limit Theorem
6 Risk Analysis
Risk Analysis:
Central Limit Theorem
biased compared with the averages over the entire How much difference will this make?
structure. If we put a grid over the area and place a This would be a good time to ask that important
well location in each grid block, then the two question,“Why bother?”What if this is correct: that
notions of average essentially coincide. we should be using narrower and more symmetric
shaped distributions for several of the factors in the
These distributions of averages are normal volumetric formula? Does it matter in the final
Here’s the point: the Central Limit Theorem guar- estimate for reserves or hydrocarbon in place? How
antees these distributions of average properties – net much difference could we expect?
pay, porosity and saturation – will tend to be normal. Let us consider porosity.When we examine the
Another consequence of the theorem is that these database of porosities from all wells (for example, the
distributions of averages are relatively narrow, for average log porosity for the completion interval or
example, they are less dispersed than the full from the layer interval in a multilayer system) as in
distributions of net pays or porosities or saturations Table 1, there are two possible extreme situations.At
from the wells, which might have been lognormal or one extreme, it is possible that each structure
some other shape. The correct distributions for exhibits a wide range of quality from fairway to
Monte Carlo analysis, however, are the narrower, flanks, but the average porosities for the various
normal-shaped ones. structures always fall between 10% and 14%. In this
One objection to the above argument (that we case, the range of all porosities could easily be several
should be using narrower, normal distributions) is that times as broad as the average porosities.That is, there
often we do not have ample information to estimate the are similarities among structures, but not much
average porosity or average saturation.This is true. None- homogeneity within a structure.
theless, one could argue that it still might be possible to On the other hand, it is possible that each structure
imagine what kind of range of porosities might exist is relatively homogeneous, but the different structures
from the best to the worst portions of the structure. are quite dissimilar in quality, with average porosities
Risk Analysis 7
Risk Analysis:
Central Limit Theorem
H
right way to judge whether the type of distribution another common formula is:
matters for an input variable is to compare what OOIP = 7,758Vb(NTG)So/Bo
happens to the output of the simulation when one Vb = Bulk rock volume
type is substituted for another. NTG = Net to gross ratio
Here,Vb would be the dominant factor, which
What about the output could be skewed right and modeled by a lognormal
of the simulation, OOIP? distribution, while the factors NTG, , So and Bo
Regardless of the shapes of the inputs to a would tend to be normally distributed, since they
volumetric model – be they skewed right, or left, or represent average properties over the bulk volume.
symmetric the – output will still be skewed right,
thus approximately lognormal. In fact, as is well Recovery factors
known, the Central Limit Theorem guarantees this. Recovery factors, which convert hydrocarbon in
The argument is straightforward: the log of a place to reserves or recoverable hydrocarbon, are also
product (of distributions) is a sum of the logs (of average values over the hydrocarbon pore volume.
distributions), which tends to be normal. Thus the The recovery efficiency may vary over the structure,
product, whose log is normal, satisfies the definition but when we multiply the OOIP by a number to get
of a lognormal distribution. recoverable oil, the assumption is that this value is an
average over the OOIP volume. As such, they too
Will the Area distribution would often be normally distributed. Additional
always be lognormal? complications arise, however, because of uncertainty
The traditional manner of describing area and about the range of driving mechanisms: will there be
treating it as a lognormal distribution is based on a water drive? Will gas injection or water injection be
prospects in a play. If we were to select at random effective? Some people model these aspects of
some structure in a play, then the appropriate uncertainty with discrete variables.
distribution likely would be a lognormal. Sometimes,
however, not even the Area parameter should be Summary
modeled by a lognormal distribution.Why? Suppose The Central Limit Theorem suggests that most of the
a particular prospect is identified from 3-D seismic. factors in a volumetric formula for hydrocarbons in
We have seen situations where the base case value of place will tend to have symmetric distributions and can
area or volume is regarded as a mode (most likely). be modeled as normal random variables.The main factor
When asked to reprocess and or reinterpret the data (area or volume) can be skewed left or right. Regardless
and provide relatively extreme upside (say P95) and of the shapes of these input distributions, the outputs of
downside (say P5) areas or volumes, the results are volumetric formulas, oil and gas in place and reserves,
skewed left: there is more departure from the mode tend to be skewed right or approximately lognormal.
toward the downside than the upside. Because the Because the conventional wisdom is to use lognormal
conventional lognormal distribution is only skewed distributions for all of the inputs,the above argument may
right, we must select another distribution type, such be controversial for the time being.The jury is still out.
as the triangular, beta, or gamma distribution. We could take a poll and see what users believe. Oh yes,
then we could use the Central Limit Theorem to analyze
Variations of the volumetric formula the sample and predict the overall opinion.
Among the numerous variations of the volumetric What goes around comes around.
formulas, there is usually only one factor that serves Stay tuned for other applications of the Central
the role of Area in the above argument. For instance, Limit Theorem. ◆
8 Risk Analysis
Risk Analysis:
Central Limit Theorem
10 Risk Analysis
Risk Analysis:
Central Limit Theorem
Risk Analysis 11
Risk Analysis:
Bayes’ Theorem
Bayes’ Theorem –
Pitfalls
D D o you know how to revise the probability of
success for a follow up well?
Consider two prospects, A and B, each having
a chance of success, P(A) and P(B). Sometimes
the prospects are independent in the sense that
the success of one has no bearing on the success
of the other. This would surely be the case if the
prospects were in different basins. Other times,
however, say when they share a common source
rock, the success of A would cause us to revise
Bayes’ Theorem
1. P(B|A)= P(A|B)*P(B)/P(A)
2. P(A) = P(A&B1) + P(A&B2) + … + P(A&Bn), where
B1, B2, …Bn are mutually exclusive and
exhaustive
We can rewrite part 2 if we use the fact that:
P(A&B) =
2’. P(A) =
P(A|B)*P(B)
P(A|B1)*P(B1) + P(A|B2)*P(B2) +
…+P(A|Bn)*P(Bn)
Part 1 says that we can calculate the conditional
the chance of success of B. Classic probability probability in one direction, provided we know the
theory provides us with the notation for the conditional probability in the other direction along
(conditional) probability of B given A, P(B|A), as with the two unconditional probabilities. It can be
well as the (joint) probability of both being derived from the definition of joint probability,
successful, P(A&B). which can be written backward and forward.
Our interest lies in the manner in which we P(B|A)P(A) = P(A&B) = P(B&A) = P(A|B)P(B)
revise our estimates. In particular, we will ask: Part 2 says that if A can happen in conjunction
• how much better can we make the chance of with one and only one of the Bi’s, then we can
B when A is successful? calculate the probability of A by summing the
That is, how large can P(B|A) be relative to various joint probabilities.
P(B); and There are numerous applications of Bayes’
• if we revise the chance of B upward when Theorem. Aside from the two drilling prospects
A is a success, how much can or should we mentioned above, one well-known situation is the
revise the chance of B downward when A is role Bayes’Theorem plays in estimating the value
a failure? of information, usually done with decision trees.
As we shall see, there are limits to these revisions, In that context, the revised probabilities
stemming from Bayes’Theorem. acknowledge the additional information, which
Bayes’ Theorem regulates the way two or might fall into the mutually exclusive and
more events depend on one another, using exhaustive categories of good news and bad news
conditional probability, P(A|B), and joint (and sometimes no news).
probability, P(A&B). It addresses independence Further, we can define P(~A) to be the
and partial dependence between pairs of events. probability that prospect A fails (or in a more general
The formal statement, shown here, has numerous context that event A does not occur). A rather
applications in the oil and gas industry. obvious fact is that
Risk Analysis 13
Risk Analysis:
Invoking Tools
Decision trees:
values are varied together).The traditional tornado
chart also is used to show how each perturbed
variable affects the tree value when all other values
are held fixed. This chart takes its name from the
shape it assumes when the influences of the
perturbed variables are stacked as lines or bars, with
the largest on top.
Trees, along with their cousins influence diagrams,
are particularly popular for framing problems and
Objectives, methodology, results reaching consensus. For small to moderate size
Decision trees select between competing alternative problems, the picture of the tree is an effective means
choices, finding the choice – and subsequent choices of communication.
along a path – which either maximizes value or One of the most important problem types
minimizes cost. solvable by trees is assessing the value of informa-
The output of a tree is a combination of the tion. In this case, one possible choice is to buy
optimal path and the expected value of that path. additional information (seismic interpretation, well
Thus, decision trees yield one number. The rules of test, logs, pilot floods). Solving the tree with and
solving the tree force acceptance of the path with the without this added-information branch and taking
highest expected value, regardless of its uncertainty. the difference between the two expected values
In theory, preference functions can be used in yields the value added by the information. If the
place of monetary values, but in practice, users information can be bought for less than its imputed
seldom go to this level. value, it is a good deal.
Decision trees allow limited sensitivity analysis,
in which the user perturbs either some of the Monte Carlo simulation:
assigned values or some of the assigned Objectives, methodology, results
probabilities, while monitoring the overall tree Monte Carlo models focus on one or more
value. Typically, the user varies one or two objective functions or outputs. Favorite outputs
parameters simultaneously and illustrates the results include reserves, total cost, total time and net
with graphs in the plane (how the tree value present value (NPV). Their respective inputs
changes when one assigned value is varied) or in include areal extent, net pay and porosity; line
space (how the tree value varies when two assigned item costs; activity times; and production
25.0%
big Recov
P0 P5 Mode P95 P100 200
25.0%
small Recov
Table 1. Defining parameters for inputs (Area, Pay, 100
Recovery) to Monte Carlo model 25.0%
big Area
2700
forecasts, price forecasts, capital forecasts and med Pay
50.0%
+
40
operating expense forecasts.
A Monte Carlo (MC) simulation is the process small Pay
25.0%
+
20
of creating a few thousand realizations of the model
99.0%
Success
by simultaneously sampling values from the input 0
50.0%
distributions.The results of such an MC simulation med Area
2000 +
typically include three items: a distribution for each
25.0%
designated output, a sensitivity chart listing the key small Area
1300 +
tree#1
variables ranked by their correlation with a targeted
1.0%
%
output, and various graphs and statistical summaries Failure
0
featuring the outputs.
Unlike decision trees, MC simulations do not Figure1. A decision tree begging to become a Monte Carlo model.
explicitly recommend a course of action or make a
decision. Sometimes, however, when there are obtained by following some path, multiplying each of
competing alternatives, an overlay chart is used to them by the corresponding probability, which is
display the corresponding cumulative distributions, obtained by taking the product of the branch
in order to compare their levels of uncertainty and probabilities, and summing these weighted values.
their various percentiles. In the tree, each parameter is discretized to only
three representative values, signifying small, medium,
Example: A tree that can be and large. Thus, area can take on the values 1,300,
converted to simulation model 2,000 or 2,700 acres. Of course in reality, area would
Sometimes a decision tree can be reconstructed as a be a continuous variable, taking on any value in
Monte Carlo simulation. This is especially true of between these three numbers. In fact, most people
trees with one root-decision node. The simulation would argue 1,300 is not an absolute minimum, and
would present the result as a distribution, whereas 2,700 is not an absolute maximum.They might be
the tree solution would only give the mean value of more like P10 and P90 or P5 and P95 estimates.We
the distribution.Take, for example, the tree in Figure can think of a small area being is some range, say
1, where the object is to estimate reserves. Note the from 1,000 to 1,500 with 1,300 being a suitable
pattern of a sequence of chance nodes without representative of that class. Similarly, 2,700 might
interspersed choices. This is a giveaway for represent the class from 2,500 to 3,000 acres. Each
conversion. Incidentally, a + sign signifies where a of these subranges of the entire range carries its own
node and its subsequent branches and nodes have probability of occurrence.
been collapsed. In this case, imagine copying the For simplicity, we have made all the triples of
node at “big area” and pasting it at the “med area” values symmetric (for example, 1,300, 2,000 and
and “small area” nodes.We shall convert this tree to 2,700 are equally spaced), but they could be
a simulation and compare the nature and extent of anything. For instance, area could have the values
information that these two models provide us. 1,300, 2,000 and 3,500 for small, medium and large.
In fact, the tree of Figure 1 is not a standard Likewise, we have assigned equal weights to the
decision tree. Instead of the convention of adding the small and large representatives, again for simplicity
values as one moves out along a path, this tree and ease of comparison.
multiplies them together to get barrels.The output is We have made another simplification: we assume
simply the expected value obtained by considering the all possible combinations are realizable. Sometimes,
possible combinations of area, pay and recovery the large value of area would be paired with three
Risk Analysis 15
Risk Analysis:
Invoking Tools
relatively large values of pay, in the between 0 and +1, for example, then during the
Distribution for Reserves/Sample/F6
belief these two parameters are simulation, realizations sampling larger values of area
dependent.This is simple to accom- would tend to be matched with larger values of pay.
0.100 modate in a tree. This technique is routine, simple to implement and
0.090 Meanv=11.96266
0.080 can use historical data when it is available to estimate
0.070
0.060
Building the corresponding the correlation coefficients.The resulting simulation in
0.050 Monte Carlo simulation model this case would feature more extreme values (both
0.040
0.030
Converting to an appropriate large and small) resulting in a larger standard deviation
0.020 Monte Carlo model requires and wider 90% confidence interval than the
0.010
0.000
finding a suitable distribution for uncorrelated case. Somewhat surprisingly, the mean
0 7 14 21 28 35
each of the three inputs – area, pay value will also increase, but not as much as the range.
5% 5%
4.85 20.88 and recovery. In light of the Introducing correlation and examining its effect
discussion above, we took the is a standard exercise in Monte Carlo classes. If
Figure 2. Histogram from simulation small values to be P5 and the big anything, the simulation handles these paired
values to be P95. We also selected relationships more easily than the tree, where the
triangular distributions is each case (which is how dependency is more hard-wired.
Percentage MMSTB we obtained our P0 and P100 values). The
5% 4.9 resulting distributions are shown in Table 1. Example: Comparing
10% 6.0 From the tree analysis,we can calculate the expected alternative mud systems
15% 6.8 value as well as finding the two extreme values (smallest Sometimes a problem can be described with a tree and
20% 7.5 and largest) and their respective probabilities: enhanced with a simulation.A case in point would be
25% 8.2 Expected value 12MMSTB a drilling cost estimate where there is a choice of mud
30% 8.9 Minimum value 2.6MMSTB, P(min) = 1/64 systems. Consider the following problem.
35% 9.5 Maximum value 32.4 MMSTB, P(max) = 1/64 For 4 years, you have been drilling in a field
40% 10.1 What we cannot determine from the tree analysis is where there is a danger of differential sticking. Of
45% 10.8 how likely the reserves would exceed 5 MMSTB, how the 20 wells drilled to date, six of them encountered
50% 11.4 likely t h ey wo u l d b e between 5 MMSTB and 15 stuck pipe. In five of those wells, the drillers fixed the
55% 11.9 MMSTB, how likely they would be less than 12 problem, spending from 4 to 18 days of rig time
60% 12.6 MMSTB, and so on. with an average of 9 days. In one well, they
65% 13.4 The histogram from the Monte Carlo analysis is eventually had to sidetrack at a marginal cost U.S.
70% 14.2 shown in Figure 2, and its corresponding percentiles $700,000 for materials and 12 days of rig time
75% 15.0 in Table 2.Thus, while the mean value coincides with beyond the attempt to free the stuck pipe, which
80% 16.0 the mean from the tree analysis (in part because of the had been 18 days. Average time to drill the wells is
85% 17.3 symmetry of the inputs and lack of correlation), we approaching 45 days.The rig rate is $60,000/day.
90% 19.0 learn much more about the range of possibilities: One option for the next well is to use oil based
95% 21.2 • 90% of the values lie between 4.9 and 20.9 mud, which would greatly reduce the chance of
MMSTB; stuck pipe (to 1/10) and the time to fix it (to 6 days),
Table 2. Percentiles for • there is about a 56% chance of finding less than but cost $70,000/day,
output (reserves, MMSTB)
from simulation 12 MMSTB (it is close to the median); Ordinarily, one would look to the decision tree
• there is only about a 20% chance of exceeding to compare these two alternatives. First, we calculate
16 MMSTB; and the expected value of drilling with the conventional
water based mud system for the three cases: no
Modifying the tree to account for problem, stuck but fixed and sidetrack.
dependency among inputs No problem cost: (45 days) * ($60,000) = $2,700,000
Consider the modified tree branches shown in Stuck and fixed cost: (45 + 9 days)*($60,000) = $3,240,000
Figure 3, where the user believes larger areas Sidetrack cost: (45 + 30)*($60,000)
correspond to larger net pays. Without going into + $700,000 = $5,200,000
detail, the corresponding Monte Carlo simulation The respective probabilities we assign are 14/20,
would handle this relationship by specifying a number 5/20 and 1/20.
between -1 and +1 to indicate the degree of Next, we estimate the costs and probabilities for
correlation between the inputs. If the correlation were the oil based mud system.
16 Risk Analysis
Risk Analysis:
Invoking Tools
Risk Analysis 17
Risk Analysis:
Invoking Tools
25.0%
big Pay
90 +
25.0%
70.0%
big Area No Problem
2700
2700
50.0%
med Pay +
60
Oil Mud TRUE Chance
small Pay
25.0% 0 2960
40 +
25.0%
25.0% Stuck/Fix
big Pay
80 + 3240
tree#1 5.0%
Stuck/Fix
Success
99.0%
0 med Pay
50.0%
+
Decision 5200
40
med Area
50.0% 2960
2000
25.0%
small Pay +
30
25.0%
big Pay
60 +
tree#1 95.0%
No Problem
3150
50.0%
med Pay +
25.0%
30 FALSE Chance
small Area Water Mud
1300
small Pay
25.0%
+
0 3171
15
5.0%
Failure
1.0%
% Stuck/Fix
0 3570
Figure 3. Tree incorporating dependence between area and pay Figure 4. Tree to compare oil based vs. water based mud systems
time for projects (up to a point, when project technical papers appear, many of these issues will
schedule software is more applicable), production be resolved.
forecasts, and all sorts of cashflows, including
those with fiscal terms. Decision trees must come Summary
down to something that compares alternative Table 2 summarizes some of the features of the
choices under uncertainty. two methods, illustrating their principle differ-
Distribution for CostWater/Sample/E22 ences. Don’t be surprised to find someone using a
Combining simulation tree solution to a problem you elect to solve with
1.600
1.400
E22: Meanv=2937.594 and trees simulation or vice versa. Do try to use common
1.200 E23: Meanv=3148.974 It is now possible to build a sense and do as much sensitivity analysis as you
Values in 10 -3
0.800
all of the branches from
0.600
0.400 chance nodes contain
0.200 probability distributions
0.000
2 3 4 5
rather than values.Then one
Values in Thousands can run a Monte Carlo
simulation, where each
Figure 5. Comparing oil and water based mud systems iteration is a new solution of
the tree. This technology is
Distribution for CostWater/Sample/E22 relatively new and relatively
untested. Although it holds
1.600
1.400
E22: Meanv=2833.63 promise, this combination
1.200 E23: Meanv=2826.767 requires the user to be
Values in 10 -3
0.800
0.600
and simulation. Some people
0.400 raise questions of logic
0.200 when we solve a tree pop-
0.000
2 3 4 5 ulated with values that are
Values in Thousands
not mean estimates, but
Figure 6. New cost comparison when oil based mud rather extreme possibilities
results in faster drilling. of the inputs. In time, when
18 Risk Analysis
Risk Analysis:
Correlation
and
Cov(1/n) (xi x)*(yi y)
x varx
19 Risk Analysis
Risk Analysis:
Correlation
Risk Analysis 20
Risk Analysis:
Correlation
(and the mean) to increase. It is far less common for the total cost to have a bigger range, but the mean
the correlations to have the opposite effect. cost is unaffected by correlation of any kind.
In more complex models, however, it is possible It should be noted that correlation can have a
for cor relations to serious impact on cost models in terms of contingency.
actually reduce un- Some people define contingency for the total project as
Positive correlation
certainty. One example the difference between some large percentile, say P85
120 we discovered several or P90,and a base value like P50 or the mean total cost.
100 years ago is a capital This contingency could easily grow substantially,
80 project involving con- though, because when we correlate line items, the
Pay
60 struction and operation mean (and to a large degree the P50) remains
40
of a gas-fired electric unchanged but the dispersion (typically) increases.
20
plant. The high positive
0
0 500 1,000 1,500 2,00 2,500 3,000 correlation between Estimating the correlation coefficient
Area price of natural gas (the Once you decide to include correlation in a model,
Figure 1. Correlation between Area and Pay
principal operating cost) you must find a way to estimate the coefficients.This
and price of electricity issue is similar to the more basic issue of what
(the principal factor in distribution to use for a given input variable. In both
Positive correlation revenue) will cause the cases, you can resort to one or more of the following:
0.24 output, NPV, to be far • empirical data;
0.22 more modest in range • experience; and
0.20 (on the order of 50%) • fundamental principles.
0.18
when the correlation is Sometimes we have adequate and appropriate field
0.16
0.14 included, making the data,in which case,we can calculate the (rank order) cor-
0.12 investment less risky. relation coefficient. Likewise, we can use curve-fitting
0.10 With a cost model, procedures (both popular Monte Carlo software pack-
0 20 40 60 80 100 120 adding cor relations ages have them) to generate the probability distribution.
Figure 2. Original data, r = 0.67 between line items is If we have no data or have appropriate data but
one of several refine- only a small sample, we should temper our auto-
ments that can increase matic processes with experience. What does this
R the range of total costs. kind of parameter usually look like: is it symmetric
13 Correlation coefficients or skewed, small or large? And for correlation, how
12
11 in this case are often do these two parameters usually relate to each other:
10
9 thought to be higher positively or negatively?
8
than those in reserve Once in a great while, we can appeal to that
6
models. For instance, higher power: fundamental principles. For example:
4
steel prices can influ- • “this parameter is a sum or average of other
2
ence numerous line parameters; thus it should be approximately
1 2 3 4 5 6 7 8 9 10 11 12 13 items, thereby resulting normal;”
Figure 3. Ranks of data r = 0.57 in a positive correlation • “this parameter is a product of other parameters
among the pairs. One – it should be skewed right;”
of our clients used a • “these two prices represent substitutable
Correlations for NPV
cost model with a matrix commodities, therefore they should be
Production1/D4 0.694 of risk factors vs. line positively correlated;” and
items. When the same • “these two parameters represent competing
Investment/D3 -0.528
risk factor influenced quantities and should be negatively correlated.”
Price1/D6 0.432 two or more line items, Remember, the correlation coefficient is limited
these items became to the range [-1,+1]. Anything in the range [-0.15,
Decline Rate/D5 -0.074
positively correlated. +0.15] could be noise and is unlikely to impact the
-1 -0.5 0 0.5 1
Positive correlation results. It is rare to have correlation coefficients
Correlation Coefficients (the usual type) among above 0.8 or below –0.8. So, make an educated
Figure 4. Sensitivity chart
line items will cause guess, and run the model with several alternate
21 Risk Analysis
Risk Analysis:
Correlation
Risk Analysis 22
Risk Analysis:
Risked Reserves
Beware of
Risked Reserves
“R R isked reserves” is a phrase we hear a lot these days.
It can have at least three meanings:
1. risked reserves might be the product of the
probability of success, P(S), and the mean value
of reserves in case of a discovery. In this case,
risked reserves is a single value;
2. risked reserves might be the probability
distribution obtained by scaling down all the
values by a factor of P(S); or
3. risked reserves might be a distribution with a
spike at 0 having probability P(S) and a reduced
probability distribution of the success case.
Take as an example Exploration Prospect A. It
has a 30% chance of success. If successful, then its
reserves can be characterized as in Figure 1, a
lognormal distribution with a mean of 200,000
STB (stock tank barrels) and a standard deviation
of 40,000 STB. Then:
• definition 1 yields the single number
0.3*200,000 = 60,000 STB;
• definition 2 yields a lognormal definition with
a mean of 60,000 and a standard deviation of
12,000 (See Figure 2); and
• definition 3 is the hybrid distribution shown in
Figure 3. By contrast, suppose another prospect,
B, has a 15% chance of success and a reserves
distribution with a mean of 400,000 STB and a
standard deviation of 200,000 STB.Then under
definition 1,B would yield the same risk reserves
as A, 0.15*400,000 = 60,000 STB. However, consider
Figure 2, which shows how B would be scaled
compared with A,with the same mean but larger
standard deviation.And Figure 4 shows how the
Figure 1.Lognormal distribution for Prospect A reserves original distributions compare.
Assigning these two prospects the same number
for the purpose of any sort of ranking could be
misleading. Prospect B is much riskier, both in the
sense that it has only half the probability of success
than does A, and also because even if it is a success, the
range of possible outcomes is much broader. In fact,
the P10, where P=Percentile, of Prospect B equals the
P50 of Prospect A.Thus, if you drilled several Prospect
A types,for fully half of your successes (on average),the
reserves would be less than the 10th percentile of one
prospect B.
The only thing equal about Prospects A and B is
that, in the long run, several prospects similar to
Prospect A would yield the same average reserves as
several other prospects like B. Even this is deceptive,
because the range of possible outcomes for several
Figure 2.Comparing the original distributions for A and B prospects like A is much different from the range of
23 Risk Analysis
Risk Analysis:
Risked Reserves
Running economics
What kind of economics these two prospects would
generate is another story. Prospects like A would
provide smaller discoveries more consistent in size.
They would require different development plans and
have different economies of scale than would
prospects like B.
So, does that mean we should run economics?
Well, yes, of course, but the question is with what Figure 3.Hybrid distribution for A showing spike at 0 for failure case
values of reserves do we run economics? Certainly not
with risked reserves according to definition 1,which is
not reality at all.We would never have a discovery with
60,000 STB. Our discoveries for A would range from
about 120,000 STB to 310,000 STB and for B from
about 180,000 STB to 780,000 STB (we are using the
P5 and P95 values of the distributions). So, surely, we
must run economics for very different cases.We could
take a few typical discovery sizes for A (or B), figure a
production schedule, assign some capital for wells and
facilities, sprinkle in some operating expenses and
calculate net present value (NPV) at 10% and IRR
(internal rate of return). My preference is not to run a
few typical economics cases and then average them.
Even if you have the percentiles correct for reserves,
why should you think those carry over to the same Figure 4. Original distributions for A and B
percentiles for NPV or IRR? Rather, I prefer to run
probabilistic economics. That is, build a cashflow ranking prospects. Moreover, the process would
model containing the reserves component as well as indicate the drivers of NPV and of reserves, leading
appropriate development plans. On each iteration, the to questions of management of risks.
field size and perhaps the sampled area might
determine a suitable development plan, which would Summary
generate capital (facilities and drilling schedule), The phrase risked reserves is ambiguous. Clarifying its
operating expense and production schedule – the meaning will help avoid miscommunication.
ingredients, along with prices, for cashflow. The Especially when comparing two prospects, one must
outputs would include distributions for NPV and recognize the range of possibilities inherent in any
IRR. Comparing the outputs for A and B would multiple-prospect program. Development plans must
allow us to answer questions like: be designed for real cases not for field sizes scaled
• what is the chance of making money with A or down by chance of success. Full-scale probabilistic
B? What is the probability that NPV>0? and economics requires the various components of the
• what is the chance of exceeding our hurdle rate model be connected properly to avoid creating
for IRR? inappropriate realizations.The benefits of probabilistic
The answers to these questions together with the cashflow models, however, are significant, allowing us
comparison of the reserves distributions would give to make informed decisions about the likelihood of
us much more information for decision-making or attaining specific goals. ◆
Risk Analysis 24