Professional Documents
Culture Documents
Summary. This paper presents a method of predicting lithology by statistical analysis of wireline log measurements with
calibration to a core lithology standard. Although an example of the technique applied to the Shublik formation of the Prudhoe Bay
area, North Slope, AK, is developed and presented, the method can be applied to any field where some core has been taken. The
Shublik, a complex mixture of seven clastic and carbonate rock types, presents a common problem: how can the rock lithology
best be identified ftom wireline logs? During recent work on the reservoir description of the Shublik, it became necessary to
answer this question.
A lithology standard must be available to calibrate against during development of a log model of lithology. For the Shublik, the
core available provided an excellent sample of the total formation. Using the statistical technique of discriminant analysis, we were .
able to evaluate a number of log models and to choose the most appropriate one. ThG final log model chosen for the Shublik
formation works quite well and correctly predicts lithology 75% of the time.
0.9
0.9 0.9
vary with lithology. In practice, we hope that each lithology w·ill M < 0.6363 sideritic mudrock
be clearly differentiated from the others by its distribution of M 0.6363 < M < 0. 7055 shale
values. · / 0. 7055 < M limestone.
Figs. la through lc display histograms of the distributions of M ·
within the sideritic mudrock, shale, and limestone lithologies, re- It is thus possible to use this division of the Maxis as a classifi-
spectively. It is immediately apparent that the distributions center cation rule to determine lithology from the iogs. If we apply the
around different points. Table 2 shows the means and standard devi- rule to the same data, we can compare the lithology classifications
ations for the three lithologies. Unless the standard deviations differ derived from the logs with the core classifications. The most con-
greatly, it is customary to use a pooled estimate of an assumed com- venient comparison is a simple cross-classification table that shows
mon standard deviation; for the example data, the estimate is 0.0412. the frequency with which each interval of rock is classified into
The pooled estimate is a weighted root mean square of the respec- every possible combination of core and log lithology. Table 3 shows
tive group's standard deviations.- See Ref. 4, Page 464, for details. these sample data.
Normal (Gaussian) distributions are completely determined· by
their means and standard deviations. It is thus possible to fit three Verification "~:echniques
normal distributions, one for each lithology, to the data by assum- A single measurement of how well the log model matches the data
ing that they have a common standard deviation and the respec- is the percent agreement, the proportion of all the data that are
SPE Formation Evaluation, December 1987 413
TABLE 4-CROSS-CLASSIFICATION USING M
TABLE 3-CROSS-CLASSIFICATION USING M WITH LITHOLOGY PROPORTIONS
Log Lithology Log Lithology
Sideritic Sideritic
Core Lithology Mud rock Shale Limestone Total Core Lithology Mud rock Sh;:~.le Limestone Total
·-·~.-. -- - -
Sideritic mudrock 102 14 0 116 Sideritic mudrock 75 41 0 116
Shale 64 2Q1 79 424 Shale 15 354 55 424
Lithology limestone 9 60 202 271 Lithology limestone 4 78 189 ' 271
Total 175 '355 281 811 Total 94 473 244 811
Percent agreement= 100 x [(102 + 281 + 202)/811] = 72.13%. Percent agreement= 100 x [(75 + 354 + 189)/811] = 76.20%.
matched correctly. In this single case, using only Mas a discrimi- Note that tpe interval of the Maxis for which shale is the log
nator, the percent agreement is 72. ~ 3% . Note that the percent agree- lithology is wider than in the case where lithology proportions were
ment is lower in the shales than in either the sideritic mudrocks disregarded. As a result, the revised log model will designate more
or limestones. Because the shales by themselves constitute more observations as shale, and fewer observations as either sideritic
than half of the data, the effect is to depress the overall percent muqrock or limestone. Table 4, the cross-classification table of core
agreement. c, with log lithologies, shows the re'sults. The percent agreement has
By accounting for the proportion each lithology constitutes of the risen to 76.20%.
total, we can improve the overall match by weighting each distri-
bution curve by the corresponding proportion. Our classification Use of Two Variables
rule is to assign a given observation to the category for which the The ability to discriminate can often be improved by the use of more
weighteq distribution function is largest. 1Fig. 2b shows a graph of than one discriminating variable, just as the use of several explana-
the proportion" weighted· distribution curves·. tory variables can often improve the predictive ability of a regres-
By solving the equations 1f Ift (M) = 1r2h (M) and 1r2f2 (M) = sion model. As the next step, consider the lithoporosity variable,
1r 3f3'(M), where 1r 1 ; 1r 2 , and 1r 3 represent the respective propor- N, 1 a function of neutron and bulk density logs that, again, should
tions of sideritic pmdrock, shale, and limestone (from the core~). vary only with lithology, and not with porosity. Proceeding as be-
we find the points that divide theM axis iota three regions. The fore, but using two variables, we begin with Figs. 3a through 3c,
resultant classification rule is which are crossplots or scatter diagrams showing the joint distri-
butions of M and N for each of the three example lithologies. It
may be observed that sideritic mudrock generally has low values
M<0.6052 sideritic muqrock of both M and N, limestone has high values of both variables, and
0.6052<M<0.7167 shale shale has intermediate values. Spedfically, the mean values of M
0.7167 <M limestone. are 0.6010 for sideritic mudrock, 0.6717 for sbale, and 0. 7394 for
0.90 . - - - - - - - - - - - - - - - - - - - - - - ,
a a
SIDERITIC
,.·
·., :,b~t;
•• •• :•.!!"•': •: •.
·.
0.50 L--------~----,------------1
I
0.25 N 0.65
0.90 r - - - : - - - - - - - - - - c - - - - - - - - - - _ _ _ , 0.50-+"'--:------=--=---<----------------i
b 0.25 N 0.65
0.90,---~-~----------------,----,
M
~ ....,.
. ~ .'::·• ::~.
~ ~-:··
··:· ..
.. ' M
SIDERITIC
MUDROCK
0.50-l-----------------------1
0.25 N 0.65
0.90 , - - - - - - - - - - - - - - - . - - - . , - - - . - - - . - - - - - ,
c
::.~·:· N
0.65
. . ."*-~'•r.•. . .
.. ·'.:~·~
.. ,..,.•"Pt.'
M
-. Fig. 4-Distribution functions of M and N as contour plots.
.
' .
0.50 L - - - - - - - - - - - - - - - - - - - - - i
0.25 N 0.65
Log Lithology
Sideritic
Core Lithology Mud rock Shale Limestone Total
Sideritic mudrock 101 13 2 116
Shale 17 320 87 424 0.90
Lithology limestone 7 57 207 271
Total 125 390 296 811
Percent agreement= [(100 x 101 + 320 + 207)/811] = 77.44%.
Log Lithology
Sideritic
Core Lithology Mud rock Shale Limestone Total
-- --
Sideritic mudrock 94 22 0 116 0.90
Shale 7 361 56 424
Lithology limestone 5 76 190 271
Total 106 459 246 811
Percent agreement= 100 x [(94 + 361 + 190)/811] = 79.53%.
Log Lithology
Phosphatic Phosphatic Sideritic
Core Lithology Sandstone Shale Limestone Siltstone Limestone Mud rock Mud rock Total
-- --
Sandstone 5 1 7 13
Shale 1 1'88 38 13 7 5 252
Limestone 1 27 297 9 15 4 2 355
Siltstone 4 24 16 61 105
Phosphatic limestone 11 1 84 14 110
Phosphatic mudrock 4 10 91 9 114
Sideritic mudrock 2 1 7 80 90
Total 11 242 363 95 109 I
123 96 1,039
Percent agreement= 77.57%
Analysis of Misprediction
'""~_-,····
It is instructive to investigate the reasons why 316 ft [96 m], or
25%, of the core in Group 2 wells did not match.
There are intervals within the Shublik in which lithology interbed-
Fig. &-Comparison plot of lithology probabilities and pre- ding occurs, and there are some lithologies that may grade into
dicted vs. actual lithology. others. Thus, it is of interest to study the misclassified feet to learn
what lithology probabilities were assigned for them.
Thin beds exist within the Shublik. Log resolution problems, core
nonrecovery intervals, and occasional log/core misalignment proq-
lems also create the possibility that the log response for a given M = lithoporosity function
point may match the core at a nearby interval. N ·= lithoporosity variable
Finally, certain lithologies are similar in properties and are easi- 1r 1,2 , 3 = proportions of sideritic mud rock, shale, and lime-
ly confused. In many cases, these confusions are innocuous. Lithol-
ogies that are occasionally confused include shale with siltstone, stone, respectively
and limestone with phosphatic limestone. In addition, as a result
of thin-bed effects and intergrading, limestone and shale and lime- Acknowledgments
stone and siltstone are sometimes confused. We thank B.E. Hunter, D.K. Davies, and D.B. Schafer for their
We distinguished eight possible explanations for mismatched in- invaluable help in the geologic portion of this work. We also thank
tervals: (1) the log-determined lithology agreed with the core li- Arco Alaska Inc., Exxon Co. USA, and Sohio Petroleum Co. for
thology either 1 ft [0.3 m] above or below the interval being their permission to publish this paper.
classified; (2) the log lithology matched the core 2ft [0.6 m] above
or below the interval in question; (3) the core lithology for the in-
terval was assigned a probability greater than 0.2, so the logs evi- References
denced some ambiguity concerning the classification; (4) the core 1. Burke, J.A., Campbell, R.L. Jr., and Schmidt, A.W.: "The Litho-
lithology was assigned a probability greater than 0.1; (5) a shale Porosity Crossplot,'' Trans:, SPWLA Annual Logging Symposium
was classified as a siltstone or vice versa; (6) a limestone was clas-. (1969) paper Y. ,
2. Clavier, L. and Rust, D.H.: "MID-PLOT: A New Lithology Tech-
sified as a phosphatic limestone or vice versa; (7) confusion be-
nique;" The Log Analyst (Nov.-Dec. 1976) 16-24.
tween shales and limestones; and (8) confusion between siltstones 3. Delfiner, P.C., Peyret, 0., and Serra, 0.: "Automatic Determination
and limestones. · of Lithology from Well Logs," SPEFE (Sept. 1987) 303-10.
Table 9 shows the proportion of the mismatched 316 ft [96 m] 4. Johnson, R.A. and Wichern, D. W.: Applied Multivariate Statistical
that fell into the categories defined by the eight reasons. Because Analysis, Prentice-Hall Inc., Englewood Cliffs, NJ (1982) 461-531.
more than one reason might serve as an explanation for the same 5. Anderson, T. W.: An Introduction to Multivariate Statistical Analysis,
mismatch, the percentages total more than 100. Table 10 presents John Wiley Publishing Co., New York City (1958) 126-53.
the information in a hierarchical fashion, showing the proportion · 6. Gnanadesikan, R.: Methods For Statistical Data Analysis of Multivariate
Observations, John Wiley Publishing Co., New York City (1977)
of mismatches attributable to Reason 1, then the additional mis-
82-103.
matches attributable to Reason 2, and so forth. 7. Boutemy, Y., Simond, R.G., and Clavier, C.: "Field Studies: A Prog-
It is readily seen from Table 9 that the first four reasons, reflect- ress Report On The Contribution of Logging," paper SPE 8178
ing resolution, bed boundary problems, possible core/log misalign- presented at the 1979 SPE Offshore Europe Conference, Aberdeen,
ment, etc., account for 95% of the mismatches, leaving only 8ft Sept.3-7.
[2 .4 m], or 2. 5%, to be explained by shale/ siltstone or shale/lime- 8. Haslett, J.: "Maximum Likelihood Discriminant Analysis On the Plane
stone confusions.· In the end, only 7ft [2.1 m], accounting for 2.2% Using a Markovian Model of Spatial Content,'' Pattern Recognition
(1985) 18, 287-96.
of the mismatches and 0. 6% of the validation data base, remained
unexplained. The overall conclusion again is that the log model is Appendix-Statistical Considerations
working extremely well.
In the work reported in this paper, except for the pedagogic exam-
ples, actual lithology proportions were used to obtain weighted clas-
Conclusions
sification probabilities. This procedure is justifiable in any situation
This paper demonstrates that with statistical discriminant analysis, in which the available core may be reasonably considered to be a
it is possible to find a log model that accurately predicts lithology random sample from the reservoir. In our case, because only wells
for a formation as lithologically complex as the. Shublik. The method whose cores completely penetrated the Shublik were used, we con-
requires only that a competent geologist describe a statistically rep- sidered this to be the best thing to do. Situations are conceivable
resentative amount of core. In the case of the Shublik, not only in which this proportion weighting would not be indicated. For ex-
was it possible to find a model for which the log lithology agreed ample, if a geologist examined core only until reaching a certain
with the core very well, but the overwhelming majority of the mis- quota of samples for each of several lithologies, rather than until
matched data could plausibly be explained as thin-bed effects, log completely describing available core; if a number of the wells had
resolution problems, and core/log misalignment problems. In sum- core only from the top or bottom of the formation; or if wells had
mary, discriminant analysis applied to log lithology determination been cored in some region of the field specifically for the purpose
is a promising technique for reservoir description. of obtaining data on what was considered a possible geologic anoma-
ly, then the existing data would be subject to sampling bias. In such
Nomenclature cases, the appropriate weights are not obvious and would have to
distribution curve functions for the three lithologies
/ 1,2 ,3 = be determined on a case-by-case basis.
An additional factor that might be used in a more complex weight-
thickness of zone, ft [m]
d =
ing scheme is that of transition probabilities. If deposition is con-
h = proportional height-in-zone variable sidered to be a first-order Markov process, then the lithology
ha = height above zone bottom, ft [m] probability at any depth depends on the actual lithology at the next
SPE Formation Evaluation, December 1987 417
lower depth. The transition probabilities (the probabilities that a tested by use of Bartlett's test, for example, but this test is known
given observation is of Lithology x when the next lower observa- to l:>e extremely unreliable in the absence of nonnormality of dis-
tion was of Lithology y) for all pairs (x,y) of lithologies can be eas- tribution, and we have chosen to rely on a simple comparison of
ily estimated. A method for incorporating this additional information intrapopulation (within lithology) standard deviations and correla-
into discriminant analysis has recently been given by Haslett. 8 This tion coefficients. More complicated kinds of discriminant analysis
method promises to be especially valuable in formations where (1) have been proposed, but software to perform them is not readily
there is a pronounced tendency for lithologies to persist from one available, and their properties are not as well known. In any event,
observation to the next or (2) the classic discriminant analysis shows a primary purpose of this paper is to suggest that a relatively sim-
evidence (through probability vs. depth plots) of considerable am- ple technique is available and deserves wider application.
biguity in' the d~termination of lithology. · One further topic should be mentiop.ed, although space limita-
Another matter that needs careful study in an actual application tions preclude an adequate discu~sion. Discriminant analysis clas-
is the metric in which the data are measured. Well-to-well varia- sifies each observation into some lithology, and the assigned
,tion in log response may suggest calibration offsets, 7 although the classification probabilities are predicated on the ass~mption that
usefulness of these in applications to uncored wells is slight. In many some of the stated lithologies are in fact correct. It may be, of co~rse,
cases, the distribution of the variables used may be nonnormal (non- that none of the assumed lithologies matches the log response, either
Gaussian). Although the nonnormality frequently does not damage because one or more of the logs is anomalous or because an un-
the inference from the analysis, in extreme cases it is worth con- cored well has penetrated a lithology different from any examined
sidering a transformed variable. In this study, the logarithm of the by the geologist. In this event, a statistic known as the Mahalano-
gamma ray, rather than the gamma ray itself, was used for this rea- bis distance may reveal the· situation. This statistic· will have ap-
son. The gamma ray was also calibrated, or normalized, by stan- proximately a x2 distribution, and thus observations that are
dardizing its response in known shale markers. The neutron logs incompatible with the known lithologies may be detected.
available included several tool types, which were also standard-
ized before use in the analysis. Sl Metric Conversion Factor
The discriminant analysis used in this study.js simple linear dis-
ft X 3.048* E-01 m
criminant analysis, which wefelt was preferable to more compli-
cated alternatives in the absence of indications from the data that
·conversion factor is exact. SPEFE
such were called for. Its mathematical derivation rests on the as-
sumption, among others, that the variances of the variables (actu- Original SPE manuscript received for review Sept. 22, 1985. Paper accepted for publica-
tion July 11, 1986. Revised manuscript received Dec. 8, 1986. Paper (SPE 14301) first
ally, their variance/covariance matrix) do not differ from one presented at the 1985 SPE Annual Technical Conference and Exhibition held in Las Ve-
population (lithology) to another. This assumption can be formally gas, Sept. 22-25.