Professional Documents
Culture Documents
*panggabeandimasandreas@gmail.com
Abstract. Machine learning in oil & gas can be used to improve the capabilities of this
increasingly competitive sector. One of the most noticeable effects of machine learning in an
industry that focuses on oil & gas is how it changes the discovery process. This can be proven
from the "MajuRoyal" oil field case study in finding potential reservoirs and hydrocarbons in
different structure compartments using two well data of Well-A and Well-B with different data
completeness conditions using the k-Nearest Neighbor (KNN) algorithm, one of the simplest
algorithms in machine learning, for electrofacies, lithology and hydrocarbon zones prediction.
Well-A with more complete data condition as training data and Well-B as test data.
Classification of electrofacies, lithology, hydrocarbon zones and their potential for training data
can be modeled. The KNN algorithm in training data is also analyzed quantitatively in creating
models for prediction and validation using a confusion matrix. The results of the validation of
the KNN with a good correlation on the training data prove that the KNN algorithm can be used
to predict the classification of electrofacies, lithology, and hydrocarbon zones of hydrocarbons
in the test data. The comparison of actual data and prediction data from KNN algorithm in the
training data, shows the average accuracy score above 0.8 for the model in the model
electrofacies, lithology and hydrocarbon zones. The KNN algorithm from the training data
applied to the test data shows good results. From the prediction results, qualitatively,
electrofacies in Well-A and Well-B can be correlated, for lithology prediction and the
hydrocarbon zone in Well-B shows a good correlation. As validation, carried out tests on the
hydrocarbon potential zones at Well-B, and the results showed very satisfactory results and
create new oil compartment on the "MajuRoyal" oil field.
1. Introduction
The "MajuRoyal" oil field (Figure 1A) production zone in the Early Miocene to Middle Miocene
intervals with a transitional depositional environment dominated by marine shales and open marine
sandstones [3]. Based on the success of oil production in this field, a re-evaluation study of abandoned
exploration wells has carried out to see the potential for hydrocarbons, especially those outside the
structure. This study uses two well data, namely Well-A and Well-B. Refers to (Figure 1A), the Well-
A and Well-B positions are 7 km away and separated by normal fault with NE-SW trend. Well-A is
located in the existing producing oil field, while Well-B is an abandoned well because it is in a different
compartment and has a lower structure than the existing oil field. Moreover, the limitation of the Well-
B data makes Well-B has no indication for showing hydrocarbon potential. Since Well-A has been
proven as a production well and Well-B have similar reservoir characteristics with Well-A, hence Well-
B deserves to be reviewed for its hydrocarbon potential by predicting the type of electrofacies, lithology
and hydrocarbon potential. KNN algorithm used in this study as a tool to predict the reservoir and
hydrocarbon potential of Well-B based on control data from Well-A. KNN or k-Nearest Neighbors is
one of classification methods from machine learning application. Machine learning is a machine that is
designed for learning, so it requires training and testing to find out the machine has learned as desired
by the user. Training data is the information used to train an algorithm, which consists of input data and
the corresponding expected output. The test data is the information used to see the performance of the
used algorithm, which only consists of input data [5]. Classification is the process of building the model
from the training set made up of database instances and associated class labels. The resulting model is
then used to predict the class label of the testing instances where the values of the predictor features are
known. KNN algorithm is a kind of supervised algorithm. This algorithm has a purpose for classifying
new objects based on attributes and training samples [1]. As shown in (Figure 1B), the new object
projected on the dimensional space that contains training data (points). The classification process using
the KNN algorithm will carry out by observing the closest point of the new object. Since machine
learning uses training and testing, it is necessary to know the accuracy level of the created model by
using a confusion matrix. A confusion matrix (Figure 1C) is used to visualize the performance of a
classifier [4]. Confusion matrix arranged from actual data, which is the corresponding expected output
from the data set, and predicted data that has performed.
4. Conclusions
The application of machine learning, especially the k-Nearest Neighbors (KNN) algorithm in predicting
electrofacies, lithology and HC Zone, was successful. This success was not only in sharpening the
interpretation of results based on well logs in Well-A as a training data set with an accuracy score above
0.8, but also in predicting electrofacies, lithology and the hydrocarbon zone in Well-B. Validated
potential hydrocarbon zone of Well-B can increase the prospect of oil accumulation in different
compartments of the "MajuRoyal" oil field.
Acknowledgments
The authors would like to acknowledge the government as host authority: Ministry of Energy & Mineral
Resources and Special Task Force for Upstream Oil & Gas Business Activities of the Republic of
Indonesia and Jabung JV partner : PETRONAS Carigali (Jabung) Ltd., PT.Pertamina Hulu Energi
Jabung, and PT.GPI Jabung Indonesia, for the permission to re-utilize and publish these datasets. We
would also like to thank to The Managements of PetroChina International Jabung Ltd. and the PIT
HAGI 45 for the support in publishing this paper.
References
[1] Bhavsar, Hetal and Amit Ganatra. (2012). A Comparative Study of Training Algorithms for
Supervised Machine Learning. International Journal of Soft Computing and Engineering
(IJSCE) 2(4): 74-81. https://doi.org/10.1.1.492.6088
[2] Emery, D & Myers, KJ, (eds) 1996. Sequence Stratigraphy. Blackwell Science Limited, 297pp.
www.sepmstrata.org.
[3] Ginger, David and Kevin Fielding. (2005). The Petroleum Systems and Future Potential of The
South Sumatra Basin. Proceedings of Indonesian Petroleum Association 39th Annual
Convention & Exhibition. https://doi.org/10.29118/ipa.2226.05.g.039
[4] Klein, Bernd. (2018). Machine Learning. Python Course. https://python-
course.eu/total_listing_machine_learning.pdf
[5] Smith, Daniel. (2019). What is AI Training Data?. Lion Bridge.
https://lionbridge.ai/articles/what-is-ai-training-data/
Figure 1. (A) "MajuRoyal" Oil Field, Top-Y Time Structure Map. (B) KNN or k-Nearest Neighbors
classification methods. (C) Confusion matrix to visualize the performance of a classifier.
Figure 2. (A) Well-A Dataset as Training Data. (B) General gamma ray response to grain size variations
and electrofacies log shapes relating to the sedimentary environment. (C) Well-B dataset as Test Data.
Figure 3. Workflow of the study
Figure 4. The results of processing in Well-A in the form of log predictions from the KNN algorithm
based on features in well log data, then prediction accuracy between log predictions and log
interpretations in the log definition column.
Figure 5. The results of processing in Well-B in the form of predicted electrofacies, lithology, and
hydrocarbon zones potential based on KNN algorithm and its prediction accuracy based on well test or
DST data (Green Square).
Figure 6. (A) The structural correlation between Well-A and Well-B based on the electrofacies and
lithology prediction result. There are several tested oil column (green) and potential hydrocarbon
column (orange). (B) The new hydrocarbon zone that increases the prospect of hydrocarbon
accumulation in the "MajuRoyal" Oil Field.