You are on page 1of 13

applied

sciences
Article
A Hybrid Neural Network Model for Predicting Bottomhole
Pressure in Managed Pressure Drilling
Zhaopeng Zhu 1 , Xianzhi Song 1,2, *, Rui Zhang 1 , Gensheng Li 1,2 , Liang Han 1 , Xiaoli Hu 1 , Dayu Li 1 ,
Donghan Yang 1 and Furong Qin 1

1 School of Petroleum Engineering, China University of Petroleum (Beijing), Beijing 102249, China;
zhuzp@cup.edu.cn (Z.Z.); tlzhangr@163.com (R.Z.); ligs@cup.edu.cn (G.L.);
2021210328@student.cup.edu.cn (L.H.); 2020210238@student.cup.edu.cn (X.H.);
2021211715@student.cup.edu.cn (D.L.); 18500986428@sohu.com (D.Y.); 2021215174@student.cup.edu.cn (F.Q.)
2 State Key Laboratory of Petroleum Resources and Prospecting, China University of Petroleum,
Beijing 102249, China
* Correspondence: songxz@cup.edu.cn; Tel.: +86-010-89732176

Featured Application: This article highlights the feasibility and advantages of using hybrid opti-
mization network models to predict bottomhole pressure under managed pressure drilling.

Abstract: Managed pressure drilling (MPD) is an essential technology for safe and efficient drilling
in deep high-temperature and high-pressure formations with narrow safety pressure windows.
However, the complex conditions in deep wells make the mechanism of multiphase flow in drilling
annulus complicated and increase the difficulty for accurate prediction of bottomhole pressure (BHP).
Recently, an increasing volume of research shows that intelligent technology is an efficient means
of accurately predicting BHP. However, few studies have focused on the temporal properties and
Citation: Zhu, Z.; Song, X.; Zhang, R.;
variation mechanism of BHP. In this paper, hybrid neural network prediction models based on the
Li, G.; Han, L.; Hu, X.; Li, D.; Yang, multi-branch parallel are established by combining the different advantages of back propagation (BP),
D.; Qin, F. A Hybrid Neural Network long short-term memory (LSTM), and a one-dimensional convolutional neural network (1DCNN)
Model for Predicting Bottomhole model. The results show that the relative error of the best model is about 70% lower than the optimal
Pressure in Managed Pressure single intelligent model. Preliminary experimental results reveal that the hybrid models combine
Drilling. Appl. Sci. 2022, 12, 6728. the advantages of different single models, which is more accurate and robust for extracting the
https://doi.org/10.3390/ temporal features of MWD. Finally, based on the trend analysis, the validity of the hybrid model
app12136728 is further verified. This study provides a reference for solving the problem of optimizing temporal
Academic Editor: Christian characteristics and guidance for fine pressure control in complex formations.
W. Dawson
Keywords: bottomhole pressure; temporal properties; hybrid neural networks
Received: 17 May 2022
Accepted: 22 June 2022
Published: 2 July 2022

Publisher’s Note: MDPI stays neutral 1. Introduction


with regard to jurisdictional claims in
Real-time monitoring of BHP is an important reference for the drilling program design,
published maps and institutional affil-
optimization of drilling parameters, and the dynamic regulation of production programs,
iations.
which can provide a basic guarantee for safe, efficient drilling and the development of oil
and gas [1]. For deep high-temperature and high-pressure drilling, accurate and efficient
calculation of BHP is more critical to ensure drilling safety. The traditional methods of
Copyright: © 2022 by the authors.
estimating BHP mainly include mechanistic model-based calculation and using gauge
Licensee MDPI, Basel, Switzerland. or well testing, but there are major economic and technical limitations in this method;
This article is an open access article although the mechanistic model calculation method [2–8] is less costly, it is difficult to
distributed under the terms and consider all complex environmental conditions simultaneously. Therefore, most of the
conditions of the Creative Commons methods can hardly process a large variety of datasets and only perform well under certain
Attribution (CC BY) license (https:// conditions.
creativecommons.org/licenses/by/ Recently, the introduction of intelligence technology has realized accurate and effi-
4.0/). cient prediction of BHP based on surface monitoring data and provides scientific basis

Appl. Sci. 2022, 12, 6728. https://doi.org/10.3390/app12136728 https://www.mdpi.com/journal/applsci


Appl. Sci. 2022, 12, 6728 2 of 13

and technical support for safe drilling and efficient production. Many researchers have
conducted exploratory research on intelligent prediction methods of BHP. One of the most
representative is the use of Artificial Neural Network (ANN) techniques. As early as
1995, Ternyik et al. [9] established a prediction model of BHP based on ANN according to
the three-phase flow law of oil, gas and water, and compared the field data to verify the
feasibility of the neural network model of BHP. Subsequently, several scholars have used
machine learning methods and built different types of ANN models to predict BHP, such
as Mohammadpoor et al. [10], Jahanandish et al. [11], Chen et al. [12], Spesivtsev et al. [13],
Mask et al. [14], Sami and Ibrahim [1], and Okoro et al. [15]. The prediction results from
these studies all indicate the following similarities:
• The ANN models have significantly improved the accuracy and efficiency of BHP
compared with traditional calculation modeling, overcome the limitations of empirical
and mechanistic models, and even replaced the function of downhole sensors.
• These models scarcely considered the temporal properties of datasets and variation
mechanism of BHP.
Furthermore, some researchers highlighted that the optimization algorithms also
played a significant role in improving the performance of the intelligent prediction model of
drilling BHP. Several studies on BHP prediction using optimization algorithms, for instance,
particle swarm optimization algorithm [16], gray wolves optimization algorithm [17], ant
swarm algorithm [18], and genetic algorithm [19–21], have revealed that optimization
algorithms combined with the traditional neural network models have higher precision
and stronger robustness. Compared with a single solvable model, this model combined
with the optimization algorithm can solve issues globally and is independent of the initial
conditions.
In fact, with the development of artificial intelligence, more and more researchers
have begun to explore the integration of intelligence and mechanisms to further improve
the performance of intelligent models such as data fusion, a combination of data and
mechanism, etc. In terms of BHP prediction, the following research has been conducted.
Fruhwirth et al. [22] integrated multiple flow parameters into comprehensive parameters
such as wellbore Reynolds number or dimensionless diameter, which also improved the
generalization ability of the model. Elzenary et al. [23] presented a principal component
analysis to reduce the number of input parameters, simplify the model structure, and
weaken the overfitting of the model. Al Shehri et al. [24] considered the mechanism relations
among input parameters, constructed neural network sub-modules, and integrated all sub-
modules into a functional neural network system, which improved the model’s mechanism
description. Adaptive fuzzy logic is another effective way to improve the performance
of neural networks [21]. The fusion of mechanism and data can improve the stability,
generalization, and accuracy of the model. Based on flow pattern recognition, Li et al. [25]
developed the neural network models of circulating pressure drop under different flow
patterns and supplemented the multiphase flow formula to realize the separate calculation
of BHP drop in different well sections, extending the applicable conditions of the intelligent
method. Gola et al. [26] proposed the grey box theory and innovated the combination
method of mechanisms and the intelligent model.
MPD data have both strong and weak time-series feature parameters, such as drilling
fluid rheological parameters, which play a key role [27], and the single intelligent prediction
models, such as BP, LSTM, and CNN, lack the ability to capture potential information,
while the parallel dual-branch networks can simultaneously capture the strong and weak
features of time-series data in order to mine more related information of time-series data,
thereby reducing information loss.
Based on the advantages of the single intelligent models to capture different charac-
teristics, hybrid optimization parallel models are established in this paper. This method
can fully identify the time-series characteristics of the data and effectively capture the
subtle changes behind the data. This research can be applied to optimally solve temporal
problems and provides a reference for precise pressure control in drilling.
Appl. Sci. 2022, 12, x FOR PEER REVIEW 3 of 17

can fully identify the time-series characteristics of the data and effectively capture the sub-
tle changes behind the data. This research can be applied to optimally solve temporal
Appl. Sci. 2022, 12, 6728 3 of 13
problems and provides a reference for precise pressure control in drilling.

2. Theory Background
2. Theory Background
2.1. BP Neural Network
2.1. BP Neural Network
The BP neural network in Figure 1 is a multilayer feedforward neural network
The BP neural network in Figure 1 is a multilayer feedforward neural network trained
trained according to error backward propagation algorithm [28]. It is good at capturing
according to error backward propagation algorithm [28]. It is good at capturing the potential
the potential relationship between input and output. With its arbitrarily complex network
relationship between input and output. With its arbitrarily complex network structure and
structure and powerful nonlinear mapping ability, it is one of the most widely used neural
powerful nonlinear mapping ability, it is one of the most widely used neural networks.
networks. However, the BP neural network can only accept the input of a fixed dimension,
However, the BP neural network can only accept the input of a fixed dimension, thus
thus generating the output of a fixed dimension, and the completion process is point-to-
generating the output of a fixed dimension, and the completion process is point-to-point
point mapping, unable to consider the temporal properties of BHP and unable to analyze
mapping, unable to consider the temporal properties of BHP and unable to analyze the
the relationship in the contextual information of the time series data.
relationship in the contextual information of the time series data.

Figure1.1.Schematic
Figure Schematicdiagram
diagramofofBP BPneural
neuralnetwork
networkwith
withoneonehidden
hiddenlayer
layerwhere
wherexnxnisisthe
thecurrent
current
input;
input; i is the input layer; h is the hidden layer; wih and wjk are the weight matrix of the hiddenlayer;
i is the input layer; h is the hidden layer; w ih and w jk are the weight matrix of the hidden layer;
ooisisthe
theoutput
outputlayer
layerand
andyyn nisisthe
thecurrent
currentoutput.
output.
2.2. LSTM Neural Network
2.2. LSTM Neural Network
Different from feedforward neural networks, recurrent neural network is a basic
Different
multilayer from neural
feedback feedforward
network.neural networks,
In the network,recurrent neural
each neuron andnetwork
its outputis signal,
a basic
as the input signal feedback to other neurons and LSTM in Figure 2, is a special kind ofas
multilayer feedback neural network. In the network, each neuron and its output signal,
the inputneural
recurrent signalnetwork
feedback[29];
to other
on theneurons
basis ofand
the LSTM
originalinloop
Figure 2, is a it
structure special kind of
introduces fourre-
current neural network [29]; on the basis of the original loop structure it introduces
interactive layers, including the cell state, input gate, forget gate, and the output gate. The four
interactive
network canlayers,
be used including the cell state,
more efficiently input long-term
to extract gate, forgetcorrelations
gate, and the inoutput gate.
sequence The
data.
network can
Specifically, thebecellular
used more
state efficiently to extract
acts as a pathway forlong-term
informationcorrelations in sequence
to travel through data.
sequence
Specifically, the cellular state acts as a pathway for information to travel through
connections, and gate structures are trained to learn which information to save or forget, sequence
connections,
which makes and gate
it able tostructures are trained
forget useless to learn
information which
and learninformation
correlations to between
save or forget,
data
Appl. Sci. 2022, 12, x FOR PEER REVIEW 4 of 17
points that are far away from each other in sequence. Therefore, the LSTM between
which makes it able to forget useless information and learn correlations constitutesdataa
points
new that are
method for far away afrom
building neweach
modelother in sequence.
for this Therefore,problem.
type of prediction the LSTM constitutes a
new method for building a new model for this type of prediction problem.

Self-circulatingstructure
Figure2.2. Self-circulating
Figure structurein
inLSTM
LSTM where
where xxtt isis the
the current
current input;
input; h ht−1
t−1isisthe
theoutput
outputatatthe
the
previousmoment;
previous moment; hhtt isisthe
the current
current output;
output; Ct−1
t−1isisthe
theprevious
previouscell cellstate;
state;CCt tisisthe
thecurrent
currentcell
cellstate;
state;fft t
isisthe
theforget
forgetgate gateneuron
neuronininLSTM
LSTMTheTheoutput
outputof ofthethecell;
cell;ititisisthe
theoutput
outputofofthe theinput
inputgate
gateneuron
neuronin in
LSTM;
LSTM; C e tt isisthe
thenew
newcandidate
candidatevalues
valuesfor
forcell
cellstatus;
status; OOtt is the output of of the
the output
outputgategateneuron
neuronin in
LSTM;σσisisthe
LSTM; thesigmoid
sigmoidactivation
activationfunction
functionand
andtanh
tanhisisthe
thetanhtanhactivation
activationfunction.
function.

2.3. 1DCNN
The biggest difference between CNN and other neural networks lies in its special
network structure: a convolution layer and a pooling layer, which effectively reduces the
Appl.
Appl.Sci.
Sci.2022,
2022,12,
12,x 6728
FOR PEER REVIEW 5 of
4 of1713

2.3.
2.3.1DCNN
1DCNN
The
Thebiggest
biggestdifference
differencebetween
betweenCNN CNNand andother
otherneural
neuralnetworks
networkslies liesininitsitsspecial
special
network
network structure: a convolution layer and a pooling layer, which effectivelyreduces
structure: a convolution layer and a pooling layer, which effectively reducesthe the
complexity
complexityofof the original
the original network,
network, reduces
reducesa large number
a large number of parameters
of parameters and makes
and makes for-
ward
forward
Appl. Sci. 2022, 12, x FOR PEER REVIEW
propagation
propagation moremore efficient. The convolution
efficient. The convolution kernelkernel
of theof convolution
the convolution layer 6cap-layer
of 17
tures the local
captures the time
local feature
time featureof the ofdatathebydata
constantly sliding the
by constantly window,
sliding and the sliding
the window, and the
convolution kernel shares
sliding convolution kernelthe weight.
shares The different
the weight. featuresfeatures
The different extracted from multiple
extracted from multiplecon-
volution kernels
convolution are reduced
kernels are reducedthrough the pooling
through layerlayer
the pooling for dimensionality
for dimensionality reduction.
reduction.Fi-
constructing
nally, local a BP-LSTM
features are hybrid
combined model
to form as shown
global in Figure
features 4
throughto input
Finally, local features are combined to form global features through a full connection layer.a full the sequential
connection data
layer.
Ininto
Inthisthe
this LSTM
paper,
paper, network
1DCNN
1DCNN and
ininFigurethe3non-sequential
Figure 3isisadopted
adoptedaccordingdata into
according the
totothe BP network,
thesequence
sequence length to give
length andfull
and play
feature
feature
to the
location advantages
locationofofthe
thedata of the
datameasured two
measuredwhilemodels and
whiledrilling. achieve
drilling.This a higher
Thisstructure precision
structuremodel prediction
modelisisrelatively
relativelysimpleof the
simplewithmod-
with
els.
less
lessweight
weightinput.
input.
network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
the advantages of the two models and achieve a higher precision prediction of the models.
network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
Thestructure
Figure3.3.The
Figure structureofof1DCNN.
1DCNN.
the advantages of the two models and achieve a higher precision prediction of the models.
network
2.4.Parallel
Parallel has limited
Network generalization ability for temporal multi-factor problems, and
2.4. Network ofofBPBPandand LSTM
LSTM
the LSTM network
FromSections
Sections2.1 requires
2.1andand2.2, a certain
2.2,the
theBP length
BPandandLSTM of sequence
LSTM networks as training
eachhave
have samples. Therefore,
theirdefects.
defects. The
the From
prediction results of these two models alone networks
do not have each
a high their
generalization The
ability.
BP network has limited generalization ability for temporal multi-factor problems, and
BP network has
Considering thelimited generalization
characteristics of theability for temporal
two models, this papermulti-factor problems,
adopts samples.
the method and the
of con-
the LSTM network requires a certain length of sequence as training Therefore,
LSTM network
structing requires
a BP-LSTM a certain length of sequence as training samples. Therefore, the
the prediction results hybrid
of thesemodel as shown
two models in Figure
alone do not 4have to input
a high thegeneralization
sequential data into
ability.
prediction
the LSTM results
network of and
these two
the models alonedata
non-sequential do into
not havethe BPa high generalization
network, to give full ability.
play to
Considering the characteristics of the two models, this paper adopts the method of con-
Considering
the advantagesthe ofcharacteristics
the two models of the
and two models,
achieve a this paper
higher precisionadopts the method
prediction of the of con-
models.
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to the
the LSTM network and the non-sequential data into the BP network, to give full play to
advantages of the two models and achieve a higher precision prediction of the models.
the advantages of the two models and achieve a higher precision prediction of the mod-
els.network has limited generalization ability for temporal multi-factor problems, and the
LSTM network requires a certain length of sequence as training samples. Therefore, the
prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
the advantages of the two models and achieve a higher precision prediction of the models.
network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
the advantages of the two models and achieve a higher precision prediction of the models.
Figure 4. BP-LSTM parallel network structure.
network has limited generalization ability for temporal multi-factor problems, and
the
2.5.LSTM network
Parallel Networkrequires
of 1DCNN a certain
and LSTM length of sequence as training samples. Therefore,
2.5. Parallel Network of 1DCNN and LSTM
the prediction results of these two models
The 1DCNN branch network mainly captures alone dolocal not have a highinformation
correlation generalization ability.
through the
The
Considering 1DCNN branch network mainly captures local correlation information through
convolution the characteristics
kernel, but the size of of the
theconvolution
two models, this limits
kernel paperthe adopts
contextualthe method
relationship of
the convolution kernel, but the size of the convolution kernel limits the contextual rela-
tionship of a sequence captured by CNN, while the LSTM branch network can make up
for this shortcoming by extracting sequential features. Therefore, a parallel network based
on CNN and LSTM, shown in Figure 5, is proposed in this paper, which can significantly
improve the effectiveness and accuracy of the BHP prediction model while extracting fea-
Appl. Sci. 2022, 12, 6728 5 of 13

of a sequence captured by CNN, while the LSTM branch network can make up for this
shortcoming by extracting sequential features. Therefore, a parallel network based on
CNN and LSTM, shown in Figure 5, is proposed in this paper, which can significantly
Appl. Sci. 2022, 12, x FOR PEER REVIEW 7 of 17
improve the effectiveness and accuracy of the BHP prediction model while extracting
feature advantages from the network.

CNN-LSTM parallel network structure.


Figure 5. CNN-LSTM

3. Dataset and Methodology


3. Dataset and Methodology
Appl. Sci. 2022, 12, x FOR PEER REVIEW Based on the various network models proposed above, combined with the data
Based on the various network models proposed above, combined with the data8prep- of 17
preparation and model testing sessions, the complete BHP prediction process of this paper
aration and model testing sessions, the complete BHP prediction process of this paper can
can be obtained as shown in Figure 6.
be obtained as shownnetwork has limited generalization ability for temporal multi-factor
problems, and the LSTM network requires a certain length of sequence as training sam-
ples. Therefore, the prediction results of these two models alone do not have a high gen-
eralization ability. Considering the characteristics of the two models, this paper adopts
the method of constructing a BP-LSTM hybrid model as shown in Figure 4 to input the
sequential data into the LSTM network and the non-sequential data into the BP network,
to give full play to the advantages of the two models and achieve a higher precision pre-
diction of the models.
network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
the advantages of the two models and achieve a higher precision prediction of the models.
network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
the advantages of the two models and achieve a higher precision prediction of the models.
network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
Figure 6.
Figure Process of
6. Process of BHP
BHP output
output forecasting.
forecasting.
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the
3.1. LSTM network and the non-sequential data into the BP network, to give full play to
Data Pre-Processing
3.1.
the Data Pre-Processing
advantages
3.1.1. Data of the two models and achieve a higher precision prediction of the models.
Description
3.1.1.network
Data Description
In this paper,limited
has generalization
the basic data used toability for test
train and temporal multi-factor
the intelligent problems,
model and
are 130,000
the LSTM
In BHP
sets of network
this paper, requires a
the basic from
measurements certain
data used length of sequence
to monitoring,
surface train and testwith as training
the aintelligent
maximum samples.
model Therefore,
are 130,000
measured depth
the prediction
sets results of these
of BHP measurements from two models
surface alone do not
monitoring, have
with a high generalization
a maximum ability.
measured depth of
Considering
6705 m and the themaximum
characteristics of the
vertical twoof
depth models,
4942 m.this paperthe
Firstly, adopts the key
missing method of con-
drilling pa-
structing were
rameters a BP-LSTM
removed. hybrid model
At the sameastime,
shown in Figure that
considering 4 to input the sequential
the rheological data into
parameters of
the LSTM
drilling network
fluid have aand the non-sequential
similar data which
single distribution, into the BP network,
plays a pivotalto give
role in full
the play to
perfor-
the advantages of the two models and achieve a higher precision prediction of the models.
Appl. Sci. 2022, 12, 6728 6 of 13

of 6705 m and the maximum vertical depth of 4942 m. Firstly, the missing key drilling
parameters were removed. At the same time, considering that the rheological parame-
ters of drilling fluid have a similar single distribution, which plays a pivotal role in the
performance of the model and will amplify the influence of subtle noise or lead to the
high linearity of the model and reduce the stability of the model, this paper selectively
supplements the performance parameters of drilling fluid, including funnel viscosity, s,
sand content, %, and drilling fluid density, g/cm3 . In addition, to check the validity of the
data, empirical correlations and mechanistic models were used to estimate BHP, and the
outliers were removed. Finally, 100,000 data points of pressure while drilling were used to
train and test the model. The characteristics of BHP after cleaning are shown in Table 1.

Table 1. Data description of BHP.

Evaluation Standard Value Evaluation Standard Value


Count 60,000 25% (Mpa) 58.26
Mean (Mpa) 58.61 50% (Mpa) 58.70
Std (Mpa) 0.40 75% (Mpa) 58.70
Min (Mpa) 57.37 Max (Mpa) 59.58

3.1.2. Feature Selection and Data Normalization


In the process of drilling, BHP is related to many factors. Through correlation analysis,
parameters that have a greater impact on BHP are selected, small or irrelevant variables are
removed, input parameter dimensions are reduced, and the influence of noise is enlarged
by the distance correlation coefficient due to the simultaneous introduction of two or more
parameters with high similarity being avoided, as follows:

 √ dCov2n (X,Y) dCov2n (X)dCov2n (Y) > 0
2 dCov2n (X)dCov2n (Y)
dCorn (X, Y) = (1)
 0 dCov2 (X)dCov2 (Y) = 0
n n

where dCor2n (X, Y) is the distance correlation coefficient of X and Y, dCov2n (X, Y) is the
distance covariance of X and Y, dCov2n (X) is the standard deviation of the distance of X,
and dCov2n (Y) is the standard deviation of the distance of Y.
Parameters with a correlation greater than 0.7 were screened through calculation, and
the following 12 input parameters were finally determined, as shown in Table 2, including
6 temporal parameters: well depth, rotary speed, riser pressure, inlet flow rate, outlet flow
rate, total pool volume, and 6 non-timing parameters: drilling fluid density, funnel viscosity,
fixed vertical depth, back-pressure pump flow rate, outlet density, and sand content.

Table 2. Input and output parameters of the model.

Input Parameters Output Parameter


Well depth, Rotary speed,
Riser pressure, Inlet flow rate,
Temporal parameters
Outlet flow rate, Total pool
volume
Fixed vertical depth, Drilling BHP
fluid density,
Non-temporal parameters Funnel viscosity,
Back-pressure pump flow rate,
Outlet density, Sand content

The 100,000 filtered data points were divided into train sets and test sets. The train set
of 80% (80,000 data points) was used to train the models, and the remaining dataset of 20%
(20,000 data points)was used as a test set to test the prediction capabilities of the developed
models.
Appl. Sci. 2022, 12, 6728 7 of 13

Data normalization can speed up the gradient descent and the optimal solution for
the characteristics between different dimensions, in a certain value of the comparability,
and avoid excessive numerical problems in the process of training. Therefore, before
model training, we need to normalize the processing of all input parameters and make all
parameter mapping at the interval [0, 1], as shown in the following equation:

x − xmin
x= (2)
xmax − xmin
e

where ex is the normalized value of x, xmax and xmin are the maximum and minimum values
of the variable x.

3.1.3. Multivariate Multi-Step Sequence Sample Construction


Given that the MPD data collected in the field have a certain time series correlation,
namely the data at the current well depth will have a certain influence on the lower well
section, the prediction of BHP can be treated as a multi-variable sequential prediction
problem, and the BHP at a certain depth can be predicted through the historical BHP.
Therefore, prior to training the model, it is necessary to prioritize the collection time
of the original data set and convert it into supervised learning samples. Since there are
multiple characteristic variables in managed pressure drilling data at each time point, and
the predicted label has only one variable BHP, the other 12 characteristic variables can be
deleted at the last time point when constructing sequence samples, so as to reduce data
dimension and improve model operation efficiency. The fixed sequence of the unit length
specified by the sliding window is used to calculate the indicators in the fixed sequence.
The window size selected is 10, namely the sequence data of the first 10 s are used to predict
the wellbore pressure of the next 10 s. The partial samples of the established multivariate
sequence data are shown in Table 3.

Table 3. Sample of multivariable sequence data.

Var6(t−1) var7(t−1) var8(t−1) var9(t−1) ... var13(t−1)


0.00038 0.957212 0.849603 0.009718 ... 0.799
0.00042 0.957229 0.879854 0.018351 ... 0.799
0.00046 0.957242 0.867551 0.016718 ... 0.799
0.00050 0.957263 0.855436 0.026837 ... 0.799
0.00058 0.95728 0.881510 0.020017 ... 0.799

3.2. Evaluation Indicators Selection


To evaluate the developed models and their predictive performances, in this paper,
mean relative error (MRE), root mean square error (RMSE), and mean absolute error (MAE)
are selected as the evaluation indicators of the network model. At the same time, this
experiment adds additional model running time to the evaluation index to compare the
model running efficiency.
m yi − ypre

1
MRE = ∑ (3)
m i=1 yi
v
u
u1 m 2
RMSE = t ∑ (yi − ypre ) (4)
m i=1

1 m
m i∑
MAE = − y (5)

yi pre
=1
where yi is the observed BHP values and ypre is the calculated BHP values, which are
predicted by the built models.
Appl. Sci. 2022, 12, 6728 8 of 13

3.3. Model Establishment


3.3.1. Single Intelligence Models
In this study, we developed single intelligent prediction models with pytorch based
on BP, LSTM, and 1DCNN. In order to make the models comparable and keep the model
complexity consistent, all network parameters are set similarly. A sample input to the BP
neural network is a one-dimensional vector, including 130 feature variables, and 13 features
under each time step. A sample input of the LSTM network is a 10 × 13 matrix, which
represents a time window of 10 and each time step contains 13 features. The 1DCNN
network input sample is a 10 × 13 two-dimensional vector. The network contains11three
Appl. Sci. 2022, 12, x FOR PEER REVIEW of 17
one-dimensional convolution layers (Conv1D) and one Flatten layer (Flatten), where the
convolution kernel is 3, 3, and 2 in sequence, and the dropout probability is 0.2. The
structure parameters of each network model are shown in Table 4.
Table 4. The structure parameters of BP, LSTM and 1DCNN network model.

Parameter Table 4. The structure


BP parameters of BP, LSTM
LSTMand 1DCNN network model.
1DCNN
Model parameter 3249 3953 3769
Parameter BP LSTM 1DCNN
Time steps / 10 10
Model parameter 3249 3953 3769
Batch size 600 600 600
Time steps / 10 10
Learning
Batch sizerate 0.001
600 0.001 600 0.001600
Optimizer
Learning rate Adam
0.001 Adam0.001 Adam 0.001
Optimizer
Activation function Adam
Relu ReluAdam ReluAdam
Activation
Hidden function
layer neurons/ Relu Relu Relu
Hidden layer neurons/ Dense1: 8; Dense2: 8 LSTM: 24; Dense: 8 Conv1: 8; Conv2: 16; Conv3: 8
Channels Dense1: 8; Dense2: 8 LSTM: 24; Dense: 8 Conv1: 8; Conv2: 16; Conv3: 8
Channels

3.3.2. Hybrid Parallel Intelligence Models


3.3.2. Hybrid Parallel Intelligence Models
In this paper, based on the characteristics of the data, seven strong temporal and six
weakIntemporal
this paper, based on thefeature
(non-temporal) characteristics
parametersof are
the filtered
data, seven strong and
by analysis, temporal and
the strong
six weak temporal (non-temporal) feature parameters are filtered by analysis, and the
temporal feature parameters are input into the LSTM network and the non-temporal fea-
strong temporal feature parameters are input into the LSTM network and the non-temporal
ture parameters are input into the BP and CNN networks. Finally, the features are fully
feature parameters are input into the BP and CNN networks. Finally, the features are fully
connected and fused to the output, so that the hybrid parallel network models of BP-LSTM
connected and fused to the output, so that the hybrid parallel network models of BP-LSTM
and CNN-LSTM are established, as shown in Figures 7 and 8. It should be noted that the
and CNN-LSTM are established, as shown in Figures 7 and 8. It should be noted that the
hybrid network model is based on the fusion of the output features of a single intelligent
hybrid network model is based on the fusion of the output features of a single intelligent
model, so the network parameters of each branch remain the same as in Section 3.3.1.
model, so the network parameters of each branch remain the same as in Section 3.3.1.

Figure7.7. BP-LSTM
Figure BP-LSTMparallel
parallelnetwork
networkinternal
internaldetailed
detailedstructure.
structure.

network has limited generalization ability for temporal multi-factor problems, and
the LSTM network requires a certain length of sequence as training samples. Therefore,
the prediction results of these two models alone do not have a high generalization ability.
Considering the characteristics of the two models, this paper adopts the method of con-
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data into
the LSTM network and the non-sequential data into the BP network, to give full play to
the advantages of the two models and achieve a higher precision prediction of the models.
network has limited generalization ability for temporal multi-factor problems, an
the LSTM network requires a certain length of sequence as training samples. Therefo
the prediction results of these two models alone do not have a high generalization abili
Considering the characteristics of the two models, this paper adopts the method of co
structing a BP-LSTM hybrid model as shown in Figure 4 to input the sequential data in
Appl. Sci. 2022, 12, 6728 the LSTM network and the non-sequential data into the BP network, to 9giveof 13 full play
the advantages of the two models and achieve a higher precision prediction of the mode

Figure 8. CNN-LSTM
Figureparallel networkparallel
8. CNN-LSTM internal detailed
network structure.
internal detailed structure.

4. Results and Discussion


4. Results and Discussion
In this section,In wethis
compare
section,and we analyze
compare the andperformance of the singleofbottomhole
analyze the performance the single bottomho
pressure intelligent
pressure intelligent prediction model and hybrid parallel intelligentmodel
prediction model and hybrid parallel intelligent prediction on model
prediction
the test set, findthe
thetest
advantages of different models to capture features, and finally
set, find the advantages of different models to capture features, and finally sele select
and evaluate the andbest intelligent
evaluate prediction
the best model.
intelligent prediction model.
Observing the prediction results as shown
Observing the prediction results in Table 5 reveals
as shown that5 the
in Table single
reveals thatintelligent
the single intellige
models all exhibit similar volatility during the phase when the managed
models all exhibit similar volatility during the phase when the managed pressure drilling
pressure drilli
is maintained constant. The best relative performance is the LSTM model
is maintained constant. The best relative performance is the LSTM model with a relatiwith a relative
average error ofaverage
0.148%,error
whileofthe relative
0.148%, average
while errors of
the relative BP anderrors
average 1DCNN of BPareand0.156%
1DCNN and are 0.156
0.270%, respectively, and the
and 0.270%, stability ofand
respectively, thethe
models performs
stability poorly.performs
of the models Specifically,
poorly. theSpecifically,
BP t
model captures BP themodel
features of the the
captures time-series
features data
of themore evenlydata
time-series and more
is good at dealing
evenly and iswith
good at deali
the point-to-pointwithmapping relationship
the point-to-point without
mapping considering
relationship the contextual
without considering information
the contextual info
of the time-seriesmation
data.of1DCNN
the time-series
shows strongdata. 1DCNN
response shows strong response
fluctuations fluctuations
in the abrupt change in the abru
change phasedrilling,
phase of pressure-controlled of pressure-controlled drilling, and
and the model generates the model
a larger featuregenerates
map because a larger featu
of the relatively map because
small number of theofrelatively
channels,small whichnumber
leadsoftochannels, which leads toof
a large computation a large
the compu
tion ofso
convolution process, theit convolution
takes up more process,
memory so itand
takestakes
up more
longermemory
to run.and takes longer to run.
Further analysis shows that the prediction accuracy and stability of the two-hybr
parallel networks
Table 5. Experimental evaluation have significantly improved when compared with the single networ
results.
From Figure 9d,e, the excellent performance of the hybrid model in trend response an
Evaluation wave response is apparent. The CNN-LSTM parallel model has the best performance w
BP LSTM 1DCNN BP-LSTM CNN-LSTM
Indicator an MAPE of 0.031% and the BP-LSTM has an MAPE of 0.040%, which are 78% and 72
Run time (s) lower than the optimal
30.042 39.034single intelligent
160.711model LSTM, 45.089respectively. In the CNN-LST
200.640
MAPE (%) parallel model, the feature
0.156 0.148 parameters with weak temporal
0.270 0.040 information 0.031are input into t
MAE CNN0.092branch network 0.087
and the feature 0.158
parameters with 0.023strong temporal 0.018 information a
RMSE input0.117
into the LSTM branch 0.106 network,0.255 which makes up 0.049
for the shortcoming0.047 of the 1DCN
model with abnormal fluctuations at the pressure inflection point, making the predicti
result more
Further analysis shows stable.
that In
the the BP-LSTM accuracy
prediction parallel model, the feature
and stability parameters
of the two-hybrid with weak tem
parallel networks poral information
have significantly are input into the
improved BP branch
when comparednetwork
withand the the feature
single parameters wi
network.
From Figure 9d,e, strong
thetemporal
excellentinformation
performance areofinput into themodel
the hybrid LSTM in branch
trendnetwork,
responsewhich and makes u
wave response is apparent. The CNN-LSTM parallel model has the best performance with
an MAPE of 0.031% and the BP-LSTM has an MAPE of 0.040%, which are 78% and 72%
lower than the optimal single intelligent model LSTM, respectively. In the CNN-LSTM
parallel model, the feature parameters with weak temporal information are input into the
CNN branch network and the feature parameters with strong temporal information are
input into the LSTM branch network, which makes up for the shortcoming of the 1DCNN
model with abnormal fluctuations at the pressure inflection point, making the prediction
result more stable. In the BP-LSTM parallel model, the feature parameters with weak
temporal information are input into the BP branch network and the feature parameters
with strong temporal information are input into the LSTM branch network, which makes
Table 5. Experimental evaluation results.

CNN-
Evaluation Indicator BP LSTM 1DCNN BP-LSTM
LSTM
Appl. Sci. 2022, 12, 6728 Run time (s) 30.042 39.034 160.711 45.08910 of 13 200.640
MAPE (%) 0.156 0.148 0.270 0.040 0.031
MAE 0.092 0.087 0.158 0.023 0.018
RMSE
up for the volatile shortcomings 0.117 intelligent
of the single 0.106
models 0.255 0.049 in the 0.047
of BP and LSTM
pressure stabilization stage.

(a) (b)

(c) (d)

(e) (f)

Figure 9. Comparison and evaluation results of real and predicted BHP. (a) The prediction results
of BP network; (b) The prediction results of LSTM network; (c) The prediction results of 1DCNN;
(d) The prediction results of BP-LSTM parallel network; (e) The prediction results of CNN-LSTM
parallel network; (f) Performance evaluation results of various models.

On the other hand, compared with the CNN-LSTM model, the BP-LSTM model still
has local outliers at the pressure changes, and the fluctuation is more intense at the sudden
change. Similar to 1DCNN, the CNN-LSTM parallel model has the longest running time,
which is close to four to five times the running time of other models, which has also become
one of the important factors to be considered in field practical applications.
Overall, these results indicate that the advantages of effectively fusing the extracted
features of the model can further improve the accuracy and stability of the prediction model.
The BP network extracts the feature information of the data with uniformity and is good at
parallel network; (f) Performance evaluation results of various models.

Overall, these results indicate that the advantages of effectively fusing the extracted
features of the model can further improve the accuracy and stability of the prediction
Appl. Sci. 2022, 12, 6728 model. The BP network extracts the feature information of the data with uniformity and
11 of 13
is good at dealing with the point-to-point mapping relationship. The CNN network can
accurately capture the subtle information between the data but finds it difficult to deal
with thewith
dealing feature
the sequences withmapping
point-to-point large fluctuations. TheThe
relationship. LSTMCNN is good
networkat considering the
can accurately
dependencies before and after the sequence. However, it finds it difficult to extract infor-
capture the subtle information between the data but finds it difficult to deal with the feature
mation with
sequences weak
with temporal
large information.
fluctuations. Therefore,
The LSTM optimizing
is good the hybrid
at considering model struc-
the dependencies
ture according to the actual feature of the field data is the key to improving
before and after the sequence. However, it finds it difficult to extract information the perfor-
with
mance of the model.
weak temporal information. Therefore, optimizing the hybrid model structure according to
the actual feature of the field data is the key to improving the performance of the model.
5. Trend Analysis
5. Trend Analysis
Overall trend analysis can systematically assess the strength of the proposed model
and make
Overall sure of how
trend effectively
analysis the proposed
can systematically model
assess the captures
strength of the physical
the proposed phenomenon
model and
[16]. In
make thisofwork,
sure synthetic datasets
how effectively the proposedweremodel
created, wherethe
captures in the dataset
physical only one input
phenomenon [16].
parameter
In this work,well depth datasets
synthetic was variedwerefrom its minimum
created, where in the to maximum
dataset only values while
one input all other
parameter
parameters
well depth was were kept from
varied constant at their average
its minimum to maximumvalues.values
Figurewhile10 shows the parameters
all other effect of in-
were kept constant at their average values. Figure 10 shows the effect
creasing well depth with three different drill fluid densities of 1.1, 1.15, and 1.2 g/cm of increasing well
3. It

depth 3
can bewith
seenthree different
that in drill fluid
the horizontal densitiesas
direction, ofthe
1.1,well
1.15,depth
and 1.2 g/cm . It can
is increasing, thebeBHP
seenhas
thata
in the horizontal
slowly increasingdirection, as the well
trend. Vertically, thedepth
biggeris increasing,
drilling fluidthedensity
BHP has a slowly
makes increasing
a higher BHP.
trend.
At the Vertically,
same time,the bigger drilling
observing fluid
the Y-axis, it density
is foundmakes a higher
that when BHP.depth
the well At thereaches
same time,
5608
observing the Y-axis,
m, the drilling it is found
fluid density that when
is changed the1.1
from well todepth
1.2 g/cmreaches
3, and5608 m, the
the BHP drilling fluid
increases from
density is 58.474
changed fromwhich
1.1 to proves
1.2 g/cm 3
58.388 to MPa, the, and the BHPofincreases
robustness the model. fromThis
58.388 to 58.474
trend can also MPa,be
which proves the robustness of the model. This trend can also
further demonstrated by the momentum equation. To sum up, the trend analysis shows be further demonstrated by
the
thatmomentum
the CNN-LSTM equation. To sum
hybrid model up,can
the capture
trend analysis showsphysics
the correct that thebehind
CNN-LSTM
the data hybrid
and
model can capture
has a certain the correct physics behind the data and has a certain stability.
stability.

Figure 10. Effect


Figure 10. Effect of
of changing
changing well
well depth
depth on
on BHP
BHP with
with different
different drilling
drillingfluid
fluiddensity.
density.
6. Conclusions
6. Conclusions
The contribution of this paper is to predict the BHP accurately and optimize the
The contribution of this paper is to predict the BHP accurately and optimize the pro-
processing of temporal data in managed pressure drilling.
cessing of temporal data in managed pressure drilling.
The BP model captures time-series data with uniformity, the LSTM model can effec-
The BP model captures time-series data with uniformity, the LSTM model can effec-
tively extract the contextual information of the time-series data, and the CNN model pays
tively extract the
more attention to contextual information
capturing the of theof
subtle changes time-series data,hybrid
the data. The and the CNN model
optimization pays
model
more attention to capturing the subtle changes of the data. The hybrid optimization
that combines the advantages of a single intelligent model can significantly reduce the model
that combines
predicted theimprove
outliers, advantages of a single
the accuracy andintelligent model
stability, and canthe
reduce significantly reduceerror
average relative the
by about 70%.
The CNN-LSTM model has a strong ability to extract the time-series characteristics
of the pressure-while-drilling data and can capture the subtle changes, with an average
relative error of 0.031%. However, due to the influence of the size of the convolution kernel,
the feature map formed after convolution is large, which reduces the efficiency of model
solving, and the running time is about 4.5 times higher than that of the BP-LSTM parallel
model. Through trend analysis, the effectiveness and robustness of the CNN-LSTM model
is demonstrated.
Appl. Sci. 2022, 12, 6728 12 of 13

This research can further predict BHP accurately by using a hybrid model method and
create a more robust model to deal with complex situations in the field. In the future, we
hope to intelligently select the appropriate model according to the time-series characteristics
of the data and explore interpretable methods to correctly capture the physical meaning
behind the MWD data.

Author Contributions: Conceptualization, Z.Z. and X.S.; methodology, Z.Z.; software, R.Z.; valida-
tion, Z.Z., X.S. and R.Z.; formal analysis, Z.Z.; investigation, Z.Z.; resources, X.S.; data curation, R.Z.;
writing—original draft preparation, R.Z.; writing—review and editing, Z.Z., L.H., X.H., D.L. and
D.Y.; visualization, R.Z. and F.Q.; supervision, Z.Z.; project administration, Z.Z.; funding acquisition,
X.S and G.L. All authors have read and agreed to the published version of the manuscript.
Funding: This research was funded by China National Petroleum Corporation Limited–China
University of Petroleum (Beijing) strategic cooperation science and Technology special project, grant
number ZLZX2020-03, National Science Foundation for Distinguished Young Scholars, grant number
52125401, Science Foundation project of China University of Petroleum (Beijing), grant number
2462022SZBH002 and National Key RESEARCH and Development Program of China, “Key Scientific
Issues of Transformative Technologies”, grant number 2019YFA0708300.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: The data are not publicly available due to involve the information of
Chinese oil fields and need to be kept confidential.
Acknowledgments: The author would like to thank the academic salon of the High-Pressure Water
Jet Drilling and Completion Laboratory of China University of Petroleum (Beijing), and Wang Tianyu
for giving constructive suggestions and ideas, which helped improve the quality of the paper.
Conflicts of Interest: The authors declare no conflict of interest.

References
1. Sami, N.A.; Ibrahim, D.S. Forecasting multiphase flowing bottom-hole pressure of vertical oil wells using three machine learning
techniques. Pet. Res. 2021, 6, 417–422. [CrossRef]
2. Aziz, K.; Govier, G.W. Pressure Drop In Wells Producing Oil And Gas. J. Can. Pet. Technol. 1972, 11, 30940. [CrossRef]
3. Chokshi, R.N.; Schmidt, Z.; Doty, D.R. Experimental Study and the Development of a Mechanistic Model for Two-Phase Flow
Through Vertical Tubing. In Proceedings of the SPE Western Regional Meeting, Anchorage, AK, USA, 22–24 May 1996.
4. Ansari, A.M.; Sylvester, N.D.; Sarica, C.; Shoham, O.; Brill, J.P. A Comprehensive Mechanistic Model for Upward Two-Phase Flow
in Wellbores. SPE Prod. Facil. 1994, 9, 143–151. [CrossRef]
5. Duns, H., Jr.; Ros, N.C.J. Vertical flow of gas and liquid mixtures in wells. In Proceedings of the 6th World Petroleum Congress,
Frankfurt am Main, Germany, 19–26 June 1963.
6. Gomez, L.E.; Shoham, O.; Schmidt, Z.; Chokshi, R.N.; Northug, T. Unified Mechanistic Model for Steady-State Two-Phase Flow:
Horizontal to Vertical Upward Flow. SPE J. 2000, 5, 339–350. [CrossRef]
7. Hagedorn, A.R.; Brown, K.E. Experimental Study of Pressure Gradients Occurring During Continuous Two-Phase Flow in
Small-Diameter Vertical Conduits. J. Pet. Technol. 1965, 17, 475–484. [CrossRef]
8. Orkiszewski, J. Predicting Two-Phase Pressure Drops in Vertical Pipe. J. Pet. Technol. 1967, 19, 829–838. [CrossRef]
9. Ternyik, J., IV; Bilgesu, H.I.; Mohaghegh, S.; Rose, D.M. Virtual Measurement in Pipes: Part 1-Flowing Bottom Hole Pressure
Under Multi-Phase Flow and Inclined Wellbore Conditions. In Proceedings of the SPE Eastern Regional Meeting, Morgantown,
West Virginia, 17–21 September 1995.
10. Mohammadpoor, M.; Shahbazi, K.; Torabi, F.; Qazvini, A. A New Methodology for Prediction of Bottomhole Flowing Pressure
in Vertical Multiphase Flow in Iranian Oil Fields Using Artificial Neural Networks (ANNs). In Proceedings of the SPE Latin
American and Caribbean Petroleum Engineering Conference, Lima, Peru, 1–3 December 2010.
11. Jahanandish, I.; Salimifard, B.; Jalalifar, H. Predicting bottomhole pressure in vertical multiphase flowing wells using artificial
neural networks. J. Pet. Sci. Eng. 2011, 75, 336–342. [CrossRef]
12. Chen, W.; Di, Q.; Ye, F.; Zhang, J.; Wang, W. Flowing bottomhole pressure prediction for gas wells based on support vector
machine and random samples selection. Int. J. Hydrogen Energy 2017, 42, 18333–18342. [CrossRef]
13. Spesivtsev, P.; Sinkov, K.; Sofronov, I.; Zimina, A.; Umnov, A.; Yarullin, R.; Vetrov, D. Predictive model for bottomhole pressure
based on machine learning. J. Pet. Sci. Eng. 2018, 166, 825–841. [CrossRef]
14. Mask, G.; Wu, X.; Ling, K. An improved model for gas-liquid flow pattern prediction based on machine learning. J. Pet. Sci. Eng.
2019, 183, 106370. [CrossRef]
Appl. Sci. 2022, 12, 6728 13 of 13

15. Okoro, E.E.; Obomanu, T.; Sanni, S.E.; Olatunji, D.I.; Igbinedion, P. Application of artificial intelligence in predicting the dynamics
of bottom hole pressure for under-balanced drilling: Extra tree compared with feed forward neural network model. Petroleum
2021, 8, 227–236. [CrossRef]
16. Tariq, Z.; Mahmoud, M.; Abdulraheem, A. Real-time prognosis of flowing bottom-hole pressure in a vertical well for a multiphase
flow using computational intelligence techniques. J. Pet. Explor. Prod. Technol. 2019, 10, 1411–1428. [CrossRef]
17. Nait Amar, M.; Zeraibi, N.; Redouane, K. Bottom hole pressure estimation using hybridization neural networks and grey wolves
optimization. Petroleum 2018, 4, 419–429. [CrossRef]
18. Irani, R.; Nasimi, R. Application of artificial bee colony-based neural network in bottom hole pressure prediction in underbalanced
drilling. J. Pet. Sci. Eng. 2011, 78, 6–12. [CrossRef]
19. Ashena, R.; Moghadasi, J. Bottom hole pressure estimation using evolved neural networks by real coded ant colony optimization
and genetic algorithm. J. Pet. Sci. Eng. 2011, 77, 375–385. [CrossRef]
20. Barati-Harooni, A.; Najafi-Marghmaleki, A.; Tatar, A.; Arabloo, M.; Phung, L.T.K.; Lee, M.; Bahadori, A. Prediction of frictional
pressure loss for multiphase flow in inclined annuli during Underbalanced Drilling operations. Nat. Gas Ind. B 2016, 3, 275–282.
[CrossRef]
21. Liang, H.; Wei, Q.; Lu, D.; Li, Z. Application of GA-BP neural network algorithm in killing well control system. Neural Comput.
Appl. 2021, 33, 949–960. [CrossRef]
22. Fruhwirth, R.K.; Thonhauser, G.; Mathis, W. Hybrid Simulation Using Neural Networks To Predict Drilling Hydraulics in Real
Time. In Proceedings of the SPE Annual Technical Conference and Exhibition, San Antonio, TX, USA, 24–27 September 2006.
23. Elzenary, M.; Elkatatny, S.; Abdelgawad, K.Z.; Abdulraheem, A.; Mahmoud, M.; Al-Shehri, D. New Technology to Evaluate
Equivalent Circulating Density While Drilling Using Artificial Intelligence. In Proceedings of the SPE Kingdom of Saudi Arabia
Annual Technical Symposium and Exhibition, Dammam, Saudi Arabia, 23–26 April 2018.
24. Al Shehri, F.H.; Gryzlov, A.; Al Tayyar, T.; Arsalan, M. Utilizing Machine Learning Methods to Estimate Flowing Bottom-Hole
Pressure in Unconventional Gas Condensate Tight Sand Fractured Wells in Saudi Arabia. In Proceedings of the SPE Russian
Petroleum Technology Conference, Virtual, 26–29 October 2020.
25. Li, X.; Miskimins, J.L.; Hoffman, B.T. A Combined Bottom-hole Pressure Calculation Procedure Using Multiphase Correlations
and Artificial Neural Network Models. In Proceedings of the SPE Annual Technical Conference and Exhibition, Amsterdam, The
Netherlands, 27–29 October 2014.
26. Gola, G.; Nybø, R.; Sui, D.; Roverso, D. Improving Management and Control of Drilling Operations with Artificial Intelligence. In
Proceedings of the SPE Intelligent Energy International, Utrecht, The Netherlands, 27–29 March 2012.
27. Wiśniowski, R.; Skrzypaszek, K.; Małachowski, T. Selection of a Suitable Rheological Model for Drilling Fluid Using Applied
Numerical Methods. Energies 2020, 13, 3192. [CrossRef]
28. Sheu, B.J.; Choi, J. Back-Propagation Neural Networks. In Neural Information Processing and VLSI; Springer: Boston, MA, USA,
1995; pp. 277–296. [CrossRef]
29. Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to Forget: Continual Prediction with LSTM. Neural Comput. 2000, 12, 2451–2471.
[CrossRef] [PubMed]

You might also like