Development and Application of A Machine Learning Based Multi-Objective Optimization Workflow For CO2-EOR Projects

Fuel 264 (2020) 116758
Contents lists available at ScienceDirect
Fuel
journal homepage: www.elsevier.com/locate/fuel
Full Length Article
Development and application of a machine learning based multi-objective T

optimization workflow for CO2-EOR projects
Junyu You , William Ampomah, Qian Sun
⁎
Petroleum Recovery Research Center, New Mexico Tech, Socorro 87801, USA
ARTICLE INFO ABSTRACT
Keywords: Carbon dioxide-Enhanced Oil Recovery (CO2-EOR) is known as one of techniques for hydrocarbon production
Carbon dioxide sequestration improvement as wells as an important candidate to reduce greenhouse gas emissions. Thus, an ideal develop-
CO2-EOR ment strategy for a CO2-EOR project would consider multiple objectives including to maximize oil recovery, CO2
Multi-objective optimization storage volume and project economic outcomes. This work proposes a robust computational framework that
Artificial neural network
couples artificial neural network (ANN) and multi-objective optimizers to optimize the aforementioned objec-
tives in CO2-EOR processes simultaneously. Expert ANN systems are trained and employed as surrogate models
of the high-fidelity compositional simulator in the optimization workflow. The robustness of the development
optimization protocol is confirmed via a synthetic injection-pattern-base case study. Afterward a field im-
plementation to Morrow-B formation to optimize the tertiary recovery stage of the field development is dis-
cussed. This work compares the optimum solution found using an aggregate objective function and the solution
repository (Pareto front) generated by the multi-objective optimization process. The comparison indicates the
existence of potential multi-solutions satisfying certain criteria in a CO2-EOR project designing, which cannot be
found using traditional weighted sum method. The optimization results provide significant insight into the
decision-making process of CO2-EOR project when multiple objective functions are considered.
1. Introduction [13–15].
This paper uses an extensive data set collected by The Southwest
In the oil and gas industry, primary and secondary recovery Regional Partnership on Carbon Sequestration (SWP) at the field site of
methods typically can produce on the average about one-third of the their Development Phase project [16,17]. One primary purpose of this
original oil in place (OOIP), while enhanced oil recovery (EOR) tech- project is to investigate the feasibility of safely and permanently storing
niques can boost oil recovery to 40–60% of OOIP [1]. Common EOR over one million metric tons of anthropogenic CO2 in a partially de-
methods include gas flooding [2], chemical EOR [3] and thermal EOR pleted reservoirs [11,18–20]. The SWP’s project focuses on an active
[4,5]. Recently, gas injection EOR gained extensive attention due to its water alternating CO2 (CO2-WAG) project underway at Farnsworth
wide application in shale reservoirs [6]. In US, the existence of natural Field (FWU) in Ochiltree County, Texas [19]. The work in this area has
CO2 resources makes it possible to use CO2 as one of the main injection yielded rich set of geological, geochemical and petrophysical data from
agents to enhance hydrocarbon recovery [7]. Reported studies show FWU unit [21,22], providing significant fundamental input to the nu-
that CO2 –EOR brings huge potential in improving oil production from merical reservoir simulation models used to history match the historical
shale reservoirs in North America [6,8] and some mature oil fields [9]. data, forecast field performance, optimize the development strategy,
A successful CO2-EOR project would take advantage of the injected CO2 and investigate the trapping mechanisms and track the long-term fate of
to produce residual oil leftover from the primary and secondary re- injected CO2 [16,23,24].
covery stages, and safely store large volume of CO2 gas for a long time Although the field operator may have a different strategy, the ideal
period. Thus CO2-EOR is also considered as one important approach to development strategy concerned by SWP scientists would seek to
reduce global greenhouse gas emissions [10–12]. CO2 has been utilized maximize the hydrocarbon recovery, CO2 storage volume and project
as a miscible/immiscible injectant in numerous enhanced oil recovery economic outcomes simultaneously. And this task can be the same for
(EOR) projects in the US and it has been observed that much of the CO2-EOR projects with similar concerns. Various methods have been
injected CO2 remains underground during the life of the project used to deal with problems that have multiple objectives to optimize;
⁎
Corresponding author.
E-mail address: junyu.you@student.nmt.edu (J. You).
https://doi.org/10.1016/j.fuel.2019.116758
Received 21 October 2019; Received in revised form 22 November 2019; Accepted 25 November 2019
0016-2361/ © 2019 Elsevier Ltd. All rights reserved.
J. You, et al. Fuel 264 (2020) 116758
methods include goal programming, multiple objective particle swarm [42] and more. Utilization of those machine learning algorithms greatly
optimization (MOPSO) [25–27], multi-objective genetic algorithm increases the computational efficacy of dealing with those complex
(MOGA) [28], non-dominated sorting genetic algorithm (NSGA II) [29], problems.
etc. In this paper, we present a comprehensive framework coupled with
More specifically to CO2-EOR projects, critical literature reviews an ANN-based proxy model developed to evaluate the performance of
show that multi-objective optimization approaches are broadly applied CO2-EOR project and CO2 storage capacity in the FWU field. Field and
to assist project design and evaluation of project performance. Forooghi injection-pattern scale reservoir models are used to generate datasets
et al. [30] designed a methodology to optimize the performance of a that serve as the training knowledge base of the proxy model. Three
CO2-EOR project in a North Sea Chalk reservoir. Leach et al. [31] de- objectives functions including cumulative oil production, CO2 seques-
veloped a theoretical framework to analyze the dynamic co-optimiza- tration volume and project net present value (NPV) are considered in
tion of oil production and CO2 sequestration in a CO2-EOR project. the optimization workflow. The MOPSO algorithm is employed to
Menad and Noureddine [32] introduce a methodology to optimize CO2- generate the Pareto front considering all three objectives. Once the
WAG processes using evolutionary algorithms. optimization framework is structured, we first apply it to optimize an
One approach to the multi-objective problem is to transfer multiple injection pattern scale problem around a specific well and then extend
objective functions into an aggregated objective, which is calculated the workflow to the entire field. The results indicate that the proposed
using the weighted summation of the sub-objectives. This approach is optimization protocol is competent to establish the solution repository
also known as Weighted Sum Method (WSM), and it has been employed that contains various dominating solutions. More importantly, we show
to co-optimize the CO2-WAG project at FWU, where oil production and comprehensive comparison between the Pareto front solution and the
CO2 storage are used as objective functions [9,33]. The two objectives result using WSM, which provides constraints and reduces uncertainties
were combined into a single objective function by assigning the same involved by the multi-objective optimization process.
weights. And genetic algorithm is employed to optimize the aggregated
objective function. However, it is difficult to assign reasonable weights
for individual objectives. More importantly, different weight combi- 2. Methodology
nations would yield various results, which introduces strong un-
certainties and bias to the optimization process. Notably, the global In this section, we will discuss the major procedures to structure the
optimization algorithms typically go through an iterative searching proposed optimization workflow, which includes the followings steps:
procedure that requires assessing the objective functions multiple
times, a computationally expensive process. In field scale optimization 1. Data preparation: preparing the dataset used to training the proxy
processes, machine-learning proxies are broadly employed to act as model using artificial neuron network. The training database is
surrogate models of high-fidelity numerical models to reduce the prepared by sending a certain number of batch runs to the original
computational overhead. This is because running a proxy once takes numerical simulator.
fraction of seconds while a high-fidelity numerical model would take 2. ANN architecture optimization: to find the optimal architecture that
minutes to hours to run one simulation depending on the size of the yields the smallest error or satisfies the error limit. The proxy model
model. trained by optimal architecture is picked as the optimal proxy to be
Literature surveys show extensive successful developments of ma- used in the following work.
chine learning models to act as proxy models of high-fidelity numerical 3. Coupling the ANN model with MOPSO: the developed proxy model
models using regression [9,34], artificial neural network (ANN) [35,36] is utilized to generate the Pareto front of multi-objectives in co-
and support vector machine models [37,38]. Machine-learning based operation with the MOPSO algorithm.
proxy models help with reservoir characterization [39], history
matching [40], uncertainty quantification [41], reservoir simulation Fig. 1 briefly depicts the workflow chart of the research work.
Fig. 1. Procedures of optimization framework.
2
J. You, et al. Fuel 264 (2020) 116758
water cycle, gas cycle, producer bottomhole pressure (BHP) and water
injection rate are were considered as the inputs of the proxy model. In a
CO2-WAG project, WAG optimization and CO2 slug size are significant
factors to determine the project performance [43]. The duration of
water/gas cycles, water and gas injection rates need to be carefully
designed to have good project performance. In this work, the total gas
injection rate is controlled by the total compressor capacity of the field,
which equals 20 MM scf/day [9]. To ensure more CO2 is injected and
then stored underground, the gas injection rate is directly set to be this
Fig. 2. Schematic diagram of a 5-spot injection pattern. Four producers are value. For WAG wells, the upper limitation of injection pressure is to set
placed at corners and one injector is placed at the center of the pattern. to be 5000 psi, since an injection pressure higher than 5000 psi may
frack the formation rock due to previous published work [9]. For the
2.1. Reservoir modeling and training data preparation producers, a higher producer bottomhole pressure typically will lead to
more oil production; however, more CO2 will be produced at the same
The proxy model is trained using synthetic data generated from time, which reducing CO2 storage amount. Consequently, the producer
high-fidelity numerical simulation models. Therefore, a certain volume bottomhole pressure was selected as another control variable and used
of batch simulation runs is sent to prepare the knowledge base to de- in the optimization workflow. This work doesn’t account for changing
velop the ANN models. the number of wells and well placements, since due to the operator’s
The developed optimization protocol is firstly validated by applying developing plan, all water injection wells will be converted into WAG
it to a synthetic injection-pattern-base case study. The injection pattern wells to achieve a higher oil recovery and storing more CO2 under-
model focuses on the 5-spot pattern near Well 13-10A at Farnsworth ground [9,33]. Table 1summerizes the grid blocks used in each case.
field (Fig. 2). The reservoir model was established to simulate one- In a CO2-WAG project, water is first injected to pressurize the re-
eighth of the injection pattern. After a rigorous grid dimension sensi- servoir to reach the minimum miscible pressure (MMP) and enhance
tivity analysis, we imposed a 60 × 60 Cartesian grid system to dis- macroscopic sweep efficiency, then CO2 is used to improve microscopic
cretize the system with a grid size of 26.7 ft × 26.7 ft. The reservoir displacement efficiency by forming a miscible front with reservoir
properties were extracted from the area surrounding Well 13-A. This fluids. Table 2 lists the fluid components model used in each case.
injection pattern based model is used to predict CO2-WAG performance The output parameters are cumulative oil production, CO2 storage
for 10 years. For the injection pattern scale application, 820 simulations volume and the project NPV.
runs were performed to extract the training data. Compared with the A simplified economic model is utilized in this study to estimate the
field-scale reservoir model, the computational overhead of the synthetic project NPV that serves as the third objective. The project NPV is cal-
model is small; hence a shorter time is needed to prepare the training culated using Eq. (1):
dataset. The validation process is to test whether the proposed opti- n
Qoi × ro Ci
mization framework can work appropriately. A quick preparation of NPV =
(1 + p)i (1 + p)i (1)
dataset helps to test the workflow more efficiently. i=1
Afterward, the validated workflow is used to co-optimize the multi- where Qoi is the oil production at i-th year, bbl; ro is the selling price of
objective of an active CO2-EOR project within FWU. For the field-scale oil, $/bbl; p is discount rate; Ci is the operational cost for i-th year, $.
implementation, a numerical reservoir model of the FWU field (Fig. 3, The operational cost incorporated in this economic model accounts for
[21,22]) is employed. This compositional reservoir model incorporated the cost of water injection, CO2 injection and recycling. It should be
extensive geological, geophysical and operational data, and it has been noticed that tax credit for carbon capture projects is not considered in
successfully history matched using field historical data collected from this equation, which bring some limitation to this economic model. The
55 years of primary and secondary production and 8-years of tertiary policy of tax credit in each state of US can be different depending on
CO2-WAG implementation. A total of 53 wells (21 injectors and 32 various project types. Moreover, the sparse public information re-
producers, placed in a 5-spot pattern) are included in this field reservoir garding tax credit makes it is even more difficult to select a correct
model. The model is used to predict CO2 storage amount, cumulative oil value of tax credit and use it in this economic model. However, the use
production and project NPV in the following 20 years. For the field of this NPV model is to test whether the developed optimization fra-
scale application, we employed 597 cases as the training knowledge mework can work properly, which is proofed to be good enough to
base. estimate the project NPV under current project development require-
As visualized in Fig. 4, four project design parameters including the ments. The economic model can be easily modified to evaluate the
Fig. 3. Top view of the FWU numerical reservoir model. The left figure shows the initial oil saturation distribution in the top layer of the reservoir, the right figure
shows initial water saturation distribution in the top layer of the reservoir.
3
J. You, et al. Fuel 264 (2020) 116758
Fig. 4. Structure of training dataset, the training

targets include cumulative oil production (green
square), CO2 storage amount (red square) and
project NPV (yellow square) at the end of the
forecasting period. Training inputs are four para-
meters represented by blue squares. (For inter-
pretation of the references to colour in this figure
legend, the reader is referred to the web version of
this article.)
Table 1 with 10 neurons in the first layer, 20 neurons in the second layer and 30
Summary of grids used in pattern base case model and field model. neurons in the last layer is expressed as {10, 20, 30}. The number of
X Y Z Total blocks Total blocks in active
values in the brace represents the number of hidden layers used in every
architecture, and the values represent the number of neurons in each
Pattern base case 60 60 1 3600 1830 hidden layer. Table 3 summarizes the components and their possible
Field case 88 82 10 72,160 14,754 values or selections.
As displayed in Fig. 6, the entire dataset is divided into categories
for training, validation, and blind testing with ratios of 80%, 10% and
Table 2
10%, respectively.
Fluid components model for each case.
For the injection pattern base case in this work, there are 820 sets of
Fluid component (mole fraction) Field case model Patten base case model data. 80% (656 sets of data) are used to train the neural network, while
10% (82 sets of data) were used validation, and the remaining 10%
C1 38.5% 36.6%
C2 3.9% 3.9% used for blind testing. For the field case, the number of these three
C3 2.5% 2.5% groups of data was 477, 60 and 60. The performance of the trained
C4 2.0% 2.0% neural network was determined by finding the absolute relative error of
C5 1.8% 1.8% the blind test data.
C6 2.8% 2.9%
C7-38 33.5% 34.6%
The absolute relative error of blind test data (E ) of one training is
C38-70 15.1% 15.6% calculated based on Eq. (2):
m
1 |Tk Yk |
E=
effect of tax credit if there is more solid information obtained regarding m k=1
Tk (2)
tax credit in future work.
where Tk is the observed target value of the k-th output, Yk is the pre-
dicted value of k-th output, and m is the number of outputs (training
2.2. Development of ANN proxy targets) of each experiment.
In this work, each architecture picked from the architecture pool
An artificial neural network is composed of three types of layers: was trained 10 times repeatedly. For each training run, the training
one input layer, one (or more) hidden layers and one output layer data, validation data and test data were randomly selected from all
(Fig. 5). The input information is transferred to hidden layers via training data sets following the ratio of 80%, %10 and 10%, to ensure
weighted connections. The information is processed in neurons in every training run is based on different training data. The alternation of
hidden layers through the linear transformation function (activation test data in every training run avoided overfitting of trained neuron
function) and then transferred to the output layer. Connections between network, and guaranteed the trained network could be tested in a
hidden layers and from hidden layer to output layer are controlled by broader range of unused data.
weight vectors. Finally, the framework will determine the average absolute relative
In our work, in order to find the architecture yielding the best error of these 10 times training runs (E ), expressed by Eq. (3):
prediction performance, a pool of many ANN models containing dif-
10
ferent ANN architectures is generated [44,45]. Three components are s=1
Es
E =
varied to form the different architectures in this work: the number of 10 (3)
neurons in each hidden layer, the number of hidden layers and acti-
vation functions applied to each hidden layer. Each model is trained where Es is the relative absolute error of s-th training run. E helps to
and assessed by extensive blind testing applications. The goal is to se- test the performance of a certain neural network architecture in a more
lect the model with the best testing performance. general way and to avoid fortuitous good or bad performance results.
The number of hidden layers used in this work is pre-set to be se- The developed framework will finally find the optimal architecture that
lected from 1 to 5. For each hidden layer, the possible number of has the smallest average absolute relative error (E ) among all archi-
neurons is assigned to be selected from a close set of 1 to 50 due to the tectures in the architecture pool.
small number of the training targets used in training data. Possible
options for the activation function used in this work include the hy- 2.3. Multi-objective particle swarm optimization (MOPSO) and Pareto front
perbolic tangent sigmoid transfer function (tansig) and the log-sigmoid
transfer function (logsig). The framework will select the architecture The PSO is incorporated with Pareto dominance to handle problems
having the lowest prediction error or the one satisfying a given error with multiple objectives to be optimized. The multi-objective PSO op-
limit. For simplicity, a format is defined to represent possible combi- timizer is referred to as MOPSO.
nations. For example, an architecture containing three hidden layers, Pareto dominance is defined as [27]:
4
J. You, et al. Fuel 264 (2020) 116758
Fig. 5. Schematic diagram of the multi-layer neural network.
Table 3 f1(x2) > f1(x1) and f2(x2) > f2(x1). When comparing the solutions on
Values and options for components used in neural network archi- the Pareto front, i.e. x1 and x3, f1(x3) > f1(x1) while f2(x3) < f2(x1).
tecture. Thus all solutions on Pareto front are nondominated solutions.
Item Possible selections The algorithm of MOPSO is well demonstrated in reference[27]. The
framework developed in our work incorporates MOPSO and feedfor-
Number of neurons 1–50 ward neural networks to co-optimize cumulative oil production and
Number of hidden layers 1–5
CO2 storage in a CO2-WAG project.
Activation functions tansig, logsig
3. Workflow validation
A vector u = (u1, u2, , uk ) is said to dominate v = (v1, v2, , vk )
(denoted by u v ) if u is partially less than v, which is expressed as Eq. The developed optimization framework is firstly utilized to optimize
_ various objectives in an injection-pattern-base case study (Fig. 2) as the
(4):
following strategy: the framework was initially used to co-optimize two
i {1, 2, , k }: ui vi i {1, 2, , k }: ui < vi (4) objectives (cumulative oil production and CO2 storage), and subse-
quently extended to optimize three objectives (cumulative oil produc-
Pareto Optimum Set is the collection including the solution vectors
tion, CO2 storage and NPV) simultaneously. Such an implementation
(Eq. (5)):
strategy is to examine the capability of optimization workflow on
P = {x | x' : f (x ') f (x )} handling two and three objectives respectively.
_ (5) A summarization of all architectures is shown in Table A-1 in
Appendix A. Table A-2 summarizes the blind test results for all four
where is a set of vectors which include all the feasible solutions in the
cases that were evaluated, while Table A-3 provides more information
search domain; f ( x ) = [f1 ( x ), f2 ( x ), , fm ( x )] is a vector including m
regarding error for each case. Fig. A-1 displays the training dataset used
objective functions involved in the optimization process. The Pareto
for proxy development for all case studies.
front is defined as the corresponding set of objective function of the
Pareto Optimum Set (Eq. (6)):
3.1. Two objectives
PF = {f ( x ) = [f1 ( x ), f2 ( x ), , fm ( x )]|x P } (6)
The developed framework is firstly applied to optimize cumulative
In MOPSO, the external repository (REP) (or archive) is used to oil production and CO2 storage. It should be noticed that for two-ob-
store a historical record of the nondominated solutions. When the REP jective case studies of both injection pattern base model and field
is full, the nondominated solutions found in a new generation will be model, training output data contains only oil production and CO2 se-
compared with solutions already stored in the repository. Dominated questration amount. As discussed in Section 2.2, the workflow selected
solutions in the REP must be eliminated and replaced by the dominating a best one from a pool of network architectures, and the optimal ar-
solutions in the new generation. A sample Pareto front with two ob- chitecture found is {9, 9, 8}, which represents three hidden layers are
jectives is demonstrated in Fig. 7. contained in this architecture. The average blind test error of this ar-
As shown in Fig. 7, all solutions in the red rectangular area are chitecture is 0.8%. Fig. 8 displays the histogram of blind test errors. As
dominated by solution x1. For example, x2 is the dominated solution shown, 97.6% of all blind test data has an error of less than 2% as
since both two objective values of x2 are larger than those of x1: compared with the simulation results.
5
J. You, et al. Fuel 264 (2020) 116758
Fig. 6. Schematic diagram of the division of training dataset into three groups: 80% of the training data are used to train proxy, 10% of the data are used for
validation and the other 10% are used as blind tests.
Among the blind tests, the highest error was 6.5%, the lowest one
was 0.01%, and the median was 0.7%.
From this work, we conclude that the trained proxy model using
optimal architecture selected by the framework is robust enough and
capable to make predictions matching high fidelity simulation results.
Then, the trained proxy is used to generate Pareto front of these two
objectives (f1 and f2) with the MOPSO algorithm, as shown by the blue
line in Fig. 9. It is observed from the Pareto front that the range of f1
(cumulative oil production) in the Pareto front is 1.74 × 105 to
3.59 × 105 bbls, and the range of f2 (CO2 storage) in the Pareto front is
1.53 × 104 to 1.87 × 104 metric tons.
To validate the generated Pareto front, 5000 experiments were
conducted by using the trained proxy. Input parameters of these 5000
experiments are randomly picked from their operational range, which
Fig. 7. A sample of Pareto front with two objectives.
are enumerated in Table 4. The results of these random experiments are
Fig. 8. Histogram of blind test results of two objectives in the injection pattern Fig. 9. Pareto front of two objectives and the comparison between Pareto front
base case. and 5000 random experiments in the injection pattern base case.
6
J. You, et al. Fuel 264 (2020) 116758
Table 4 3.2. Three objectives

Range of training input parameters.
Parameters Unit Min Max After successfully using the framework to optimize a two-objective
problem, the framework is applied to find the optimal solutions for
Water injection period day 100 200 three objectives; cumulative oil production, CO2 storage and project
Gas injection period day 200 365
NPV, at the same time. The optimal architecture found is {12, 18}, and
Producer BHP psi 1000 4000
Water injection rate stb/day 1000 3500
the average error is 0.7%. High, medium, and low error are 5.5%, 0.6%,
and 0.02%, respectively. As shown in Fig. 11, 84.6% of all blind tests
have an error of lower than 1%, and 98.5% have errors less than 2%.
Details of architecture and test results are shown in recorded in Table A-
1 and Table A-2 in Appendix A. The three selected data with highest,
median and lowest error are listed in Table A-3.
The generated Pareto front of the three objectives is displayed in
Fig. 12. Again, an additional 5000 experiments were also performed to
compare with the Pareto front (Fig. 12). The Pareto front indicates that
f1 has a range from 1.59 × 105 to 3.62 × 105 bbls, f2 (CO2 storage) has
a range from 1.52 × 104 to 1.87 × 104 metric tons and the range of f3
(NPV) is from 0.99 × 107 to 2.05 × 107 US dollars.
Fig. 12 shows that this optimization framework can successfully be
applied to problems with three objectives
4. Field application
Fig. 10. Comparison between Pareto front and the simulation results in the The application of the developed optimization framework in a
injection pattern base case. synthetic injection pattern base model proves that this framework is
capable of simultaneously co-optimizing multiple objectives. Thus it is
finally utilized to optimize the oil production, CO2 storage and NPV of
the CO2-EOR project that is currently operational at the FWU field
depicted in Fig. 9. As shown, solutions of all those 5000 experiments are (Fig. 3). Similarly to what is done in the injection pattern base case, the
dominated by the Pareto front, proving its validity. framework is first applied to co-optimize two objectives and then used
For further validation of the Pareto front, several solutions were to optimize the three objectives.
randomly picked from the Pareto front and their input parameters in-
vestigated and given back to the numerical model of the injection
pattern base model to conduct simulation. Fig. 10 shows the compar- 4.1. Two objectives
ison between the simulation results and Pareto front solutions Fig. 11.
As shown, these simulation results had similar values with some The optimal architecture selected is {15, 30}, yielding an average
solutions found on the generated Pareto front, proving that solutions blind test error of 1.4%. The histogram (Fig. 13) shows that 75.8% of
found on the Pareto front match well with high-fidelity numerical si- the total blind tests have an error of less than 2%. The three selected
mulation results and that the difference between them is within ac- cases are listed in Table A-3, the worst case is found to have the highest
ceptable margins of error. Each of those nondominated solutions on the error of 3.9%.
Pareto front is not worse than any other solution. It is difficult to use the The generated Pareto front of two objectives and results of 5000
numerical model coupled with other optimization algorithms to find random experiments are shown in Fig. 14. The Pareto front indicates
those nondominated solutions due to the huge computation required; that f1 has a range from 1.37 × 107 to 1.64 × 107 bbls, f2 (CO2 storage)
however, with the help of the proxy model, the computational overhead ranges from 1.54 × 106 to 2.17 × 106 metric tons.
of structuring a high-resolution Pareto front is reduced tremendously. Comparing the two Pareto fronts of two-objective in both pattern
base case (Fig. 9) and that of field case (Fig. 14), it is found that the
Pareto front of pattern base case contains two regions: a relatively flat
top (with oil production ranges from 1.74 × 105 bbl to about 3.5 × 105
bbl) and a sharp decline region (with oil production ranges from
3.5 × 105 bbl to about 3.59 × 105 bbl). The sharp decline means the
solutions in this region are approaching the physical limits of the ori-
ginal pattern case model, indicating that not much more incremental oil
recovery can be enhanced by sacrificing the CO2 storage amount. A
similar trend is also found in the Pareto front of field case model. Al-
though not obvious, the right side of Pareto front (Fig. 14) also exhibits
a sharp drop on the curve.
For further validation, six solutions were randomly picked from the
Pareto front. Similarly to the injection pattern base case, the input
parameters of those six solutions were put into the numerical model for
simulation. The simulation results are shown in Fig. 15. The comparison
between the simulation and Pareto front validates the Pareto front and
implies that the Pareto front is capable of capturing the trend line of
how f1 changes with f2. The difference between the simulation results
Fig. 11. Histogram of blind tests results of three objectives in the injection and Pareto front is within an acceptable error margin.
pattern base case.
7
J. You, et al. Fuel 264 (2020) 116758
Fig. 12. Pareto front of three objectives and the comparison between Pareto front and 5000 random experiments in the injection pattern base case.
Fig. 13. Histogram of blind test results of two objectives in field case. Fig. 15. Comparison between Pareto front and the simulation results in field
case.
Fig. 16. Histogram of blind tests results of three objectives in field case.
Fig. 14. Pareto front of two objectives and the comparison between Pareto
front and 5000 random experiments in field case.
8
J. You, et al. Fuel 264 (2020) 116758
Fig. 17. Pareto front of three objectives and the comparison between Pareto front and 5000 random experiments in field case.
10.5 10.3
10.3 10.25
10.1 10.2
f1 + f2 + f3
f1 + f2 + f3
9.9 10.15
9.7 10.1
Aggregate equation Aggregate equation
9.5 10.05 Pareto front
Pareto front
9.3 10
0 50 100 150 200 0 10 20 30 40 50
Pareto front particle ID Pareto front particle ID
(a) (b)
Fig. 18. Comparison between the optimal solution found using the aggregate equation coupled with PSO. Plot (b) is zoom in the overlapped part (red circled part) of
plot (a). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Table 5 4.2. Three objectives

Comparison between optimal solutions found using
aggregate equation incorporating PSO and solutions In this case, NPV is added as the third objective. The optimal ar-
found by Pareto front. chitecture is found to be {8, 5, 19}, and the average blind test error is
Case # Value of ft 1%.
As shown in Fig. 16 and Table A-3, 93% of the blind tests have an
PSO 10.265 error of less than 2% and the worst case’s error is 4.2%.
Case 1 10.265
The generated Pareto front is showed in Fig. 17. The Pareto front
Case 5 10.265
Case 13 10.268 indicates that f1 has a range from 1.36 × 107 to 1.64 × 107 bbls, f2
Case 32 10.266 (CO2 storage) has a range from 1.64 × 106 to 2.19 × 106 metric tons
Case 33 10.266 and range of f3 (NPV) is from 5.79 × 108 to 6.99 × 108 US dollars.
Case 34 10.265 When comparing the Pareto front of three objectives and that of two
Case 35 10.266
Case 36 10.267
objectives for this field case, it seems there is not much on the Pareto
Case 37 10.264 front solution values of f1 and f2. One reason is that, in this work, the
Optimal: Case 11 10.279 NPV is calculated by two terms: oil production and operation cost (see
Eq. (1)), and the operation cost is mainly dependent on the CO2 storage
9
J. You, et al. Fuel 264 (2020) 116758
amount. In the field case, the magnitude of oil production is ten times of 1. The ANN model exhibits its strength in adapting to the complex data
CO2 storage’s magnitude, thus the NPV’s value is mainly dependent on structure associated with CO2-WAG projects. The ANN architecture
the value of cumulative oil production. Consequently, when MOPSO optimization protocol helps the proxy model improve prediction
algorithm is finding nondominated solutions, it is highly possible that a performance.
nondominated value of f3 will be found with a nondominated value of f1 2. Employing an ANN model plays a vital role in the proposed work-
at the same time. Therefore, adding NPV as the third objective will not flow to help the MOPSO establish a high-resolution Pareto front
change significantly values of f1 and f2 of the solutions on the Pareto since MOPSO alone requires extensive computational overheads.
front. 3. The outcomes of the optimization study show promising results that
successfully establish the solution repository considering multiple
5. Comparison between weighted sum method and Pareto front objective functions. Such results are verified by presenting the
Pareto fronts and the dominated solutions.
In this section, we will compare the optimal solutions found by the 4. The Pareto front solutions exhibit a contradictory relationship be-
Pareto front against that using WSM. Then, an optimization algorithm is tween oil recovery and CO2 storage volumes, meaning that field
applied, incorporated with an aggregate equation, to find the optimal operators need to make a decision to balance the trade-off benefits.
solution. For comparison purposes, the aggregate equation is also uti- 5. The comparison between the aggregate equation and Pareto front
lized in the field case to find the optimal solution for these three ob- methods shows that the Pareto front is a more robust method to
jective functions. The aggregate equation is expressed as Eq. (7): solve problems with multiple opposite objectives. The use of the
Pareto front solution addresses the issue of the existence of non-
ft = f1 × 10 7 + f2 × 10 6 + f3 × 10 8
(7)
unique solutions.
where ft is the combined objective function of f1, f2 and f3, its unit is 107 6. The economic model used in this work is mainly to demonstrate the
barrels oil + 106 metric tons CO2 storage + 108 US dollars. feasibility of the prosed optimization workflow. It can be easily
Particle swarm optimization is used in cooperation with ft to find modified when more solid information obtained regarding tax credit
the best solution. The optimal value of ft is 10.265. The values of ft of in future work.
all solutions on the Pareto front are calculated and plotted in Fig. 18. It
is found that the highest value of ft found among all Pareto front so-
CRediT authorship contribution statement
lutions is 10.279, which is even higher than the optimal solution found
by using the aggregate equation.
Junyu You: Writing - original draft, Software, Validation. William
When two objective function values are similar or equal, decision
Ampomah: Conceptualization, Supervision, Methodology. Qian Sun:
making will be dependent on other constraints such as operational
Conceptualization, Writing - review & editing.
parameters. Because using an aggregate equation can only find one
unique solution, it will not provide more choices for engineers to design
development strategies to achieve the optimal objective value; how- Declaration of Competing Interest
ever, a Pareto front will optimize different objectives simultaneously
and find all existing possibilities of optimal objective values, hence The authors declare that they have no known competing financial
providing more flexibility on development strategies design. interests or personal relationships that could have appeared to influ-
When investigating the comparison between these two methods, it ence the work reported in this paper.
is found that, for the ft value found using the aggregate equation
(10.265), there exist multiple solutions with the same or similar value
Acknowledgments
of ft found in the Pareto front. Table 5 summarizes those solutions with
a ft value that is equal or close to 10.265.
Funding for this project is provided by the U.S. Department of
Table 5 proves that, even for the same or similar value of ft , using an
Energy’s (DOE) National Energy Technology Laboratory (NETL)
aggregate equation can only find one unique solution, while a Pareto
through the Southwest Regional Partnership on Carbon Sequestration
front will be able to give all possible solutions with the same or ap-
(SWP) under Award No. DE-FC26-05NT42591.
proximate ft value. Considering that in many real problems with dif-
ferent objectives, there always exists more than one optimal solution,
using the Pareto front will help to find all those existing optimal solu- Disclaimer
tions and help engineers to pick the one satisfying other constraints.
This also indicates that the Pareto front method is more robust than This report was prepared as an account of work sponsored by an
using aggregate equations to solve an optimization problem with mul- agency of the United States Government. Neither the United States
tiple contradictory objectives. Government nor any agency thereof, nor any of their employees, makes
any warranty, express or implied, or assumes any legal liability or re-
6. Conclusion sponsibility for the accuracy, completeness, or usefulness of any in-
formation, apparatus, product, or process disclosed, or represents that
In this paper, we presented a robust optimization framework cou- its use would not infringe privately owned rights. Reference herein to
pling multi-layer neural networks and the MOPSO algorithm that co- any specific commercial product, process, or service by trade name,
optimizes oil recovery, CO2 storage, and project NPV in a CO2-WAG trademark, manufacturer, or otherwise does not necessarily constitute
project at the Farnsworth field in Texas, USA. The validity of the fra- or imply its endorsement, recommendation, or favoring by the United
mework is confirmed by testing it on an injection pattern and a field- States Government or any agency thereof. The views and opinions of
scale problem. authors expressed herein do not necessarily state or reflect those of the
The following conclusive statements can be drawn from this work: United States Government or any agency thereof.
10
J. You, et al. Fuel 264 (2020) 116758
Appendix A
Table A-1 summarizes the setups of optimal architectures found in each case. Specifically, the “tansig” means hyperbolic tangent sigmoid transfer
function, and “logsig” means log-sigmoid transfer function.
Table A-1
Setup of optimal architectures.
Case name Number of each layer Activation function Average error
Pattern case: 2 objectives {9, 9, 8} tansig-tansig-tansig 0.839%
Pattern case: 3 objectives {12, 28} logsig-tansig 0.676%
Field case: 2 objectives {15, 30} tansig-logsig 1.410%
Field case: 3 objectives {8, 5, 19} logsig-logsig-logsig 1.014%
Table A-2 summarizes blind test results for all four cases studied.
Table A-2
Summary of blind test results.
Two objectives Three objectives
Error Frequency Cumulative % Error Frequency Cumulative %
Pattern case < =0.5% 219 26.71% < =0.5% 327 39.88%
0.5% < and < =1.0% 325 66.34% 0.5% < and < =1.0% 367 84.63%
1.0% < and < =1.5% 215 92.56% 1.0% < and < =1.5% 89 95.49%
1.5% < and < =2.0% 41 97.56% 1.5% < and < =2.0% 25 98.54%
2.0% < and < =2.5% 9 98.66% 2.0% < and < =2.5% 6 99.27%
2.5% < and < =3.0% 3 99.02% 2.5% < and < =3.0% 2 99.51%
3.0% < and < =3.5% 6 99.76% 3.0% < and < =3.5% 0 99.51%
3.5% < and < =4.0% 1 99.88% 3.5% < and < =4.0% 0 99.51%
4.0% < and < =4.5% 0 99.88% 4.0% < and < =4.5% 1 99.63%
4.5% < and < =5.0% 0 99.88% 4.5% < and < =5.0% 0 99.63%
> 5% 1 100.00% > 5% 3 100.00%
Field case < =0.5% 91 15.17% < =0.5% 122 20.33%

0.5% < and < =1.0% 123 35.67% 0.5% < and < =1.0% 214 56.00%
1.0% < and < =1.5% 133 57.83% 1.0% < and < =1.5% 142 79.67%
1.5% < and < =2.0% 108 75.83% 1.5% < and < =2.0% 80 93.00%
2.0% < and < =2.5% 77 88.67% 2.0% < and < =2.5% 30 98.00%
2.5% < and < =3.0% 41 95.50% 2.5% < and < =3.0% 9 99.50%
3.0% < and < =3.5% 22 99.17% 3.0% < and < =3.5% 1 99.67%
3.5% < and < =4.0% 5 100.00% 3.5% < and < =4.0% 1 99.83%
4.0% < and < =4.5% 0 100.00% 4.0% < and < =4.5% 1 100.00%
4.5% < and < =5.0% 0 100.00% 4.5% < and < =5.0% 0 100.00%
> 5% 0 100.00% > 5% 0 100.00%
Table A-3 lists the 3 blind tests data with the highest, the median and the lowest error are selected from all blind tests.
Table A-3
Three blind test data selected with the highest, median and the lowest error in all case studies.
Case name Highest error Median error Lowest error Average error
Pattern case: 2 objectives 6.506% 0.740% 0.006% 0.839%

Pattern case: 3 objectives 5.546% 0.577% 0.023% 0.676%
Field case: 2 objectives 3.871% 1.273% 0.020% 1.410%
Field case: 3 objectives 4.236% 0.890% 0.042% 1.014%
11
J. You, et al. Fuel 264 (2020) 116758
Fig. A-1 shows all training dataset points used to train the proxy model for each case study.
Pareto front and Training dataset Pareto front and Training dataset
2.5 2.5
2.3 Pareto front 2.3
f2: CO2 storage, ×104 Metric ton

f2: CO2 storage, ×106 Metric ton
2.1 2.1
Training data
1.9 1.9
1.7 1.7
1.5 1.5
1.3 1.3
1.1 1.1
Pareto Front
0.9 0.9
0.7 0.7 Training data
0.5 0.5
1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 0.5 1 1.5 2 2.5 3 3.5 4
f1: Cumulative oil production, × 107 bbl f1: Cumulative oil production, × 105 bbl
(a) (b)
(c) (d)
Fig. A-1. Pareto front vs training dataset for all case studies: (a) field case with 2 objectives considered, (b) pattern base case with 2 objectives considered, (c) field
case with 3 objectives considered and (a) pattern base case with 3 objectives considered.
References oil recovery (CO2-EOR) via thickeners, foams, and gels–a detailed literature review
of 40 years of research. Contract DE-FE0004003. Activity 2012;4003(01).
[14] Dai Z, et al. An integrated framework for optimizing CO2 sequestration and en-
[1] Alagorni AH, Yaacob ZB, Nour AH. An overview of oil production stages: enhanced hanced oil recovery. Environ Sci Technol Lett 2013;1(1):49–54.
oil recovery techniques and nitrogen injection. Int J Environ Sci Develop [15] White MD, et al. Numerical simulation of carbon dioxide injection in the western
2015;6(9):693–701. section of the Farnsworth Unit. Energy Procedia 2014;63:7891–912. https://doi.
[2] Alharthy N, et al. Enhanced oil recovery in liquid-rich shale reservoirs: laboratory to org/10.1016/j.egypro.2014.11.825.
field. SPE Reservoir Eval Eng 2018;21(01):137–59. https://doi.org/10.2118/ [16] Balch R, McPherson B. Integrating enhanced oil recovery and carbon capture and
175034-PA. storage projects: a case study at farnsworth field, Texas. SPE Western Regional
[3] Raffa P, Broekhuis AA, Picchioni F. Polymeric surfactants for enhanced oil recovery: Meeting. Society of Petroleum Engineers; 2016. DOI: https://doi.org/10.2118/
a review. J Petrol Sci Eng 2016;145:723–33. https://doi.org/10.1016/j.petrol.2016. 180408-MS.
07.007. [17] Litynski J, et al. US Department of Energy’s regional carbon sequestration part-
[4] You J, Rahnema H. Numerical modeling of multiphase steam flow in wellbore. J nership program: overview. Energy Procedia 2009;1(1):3959–67. https://doi.org/
Petrol Sci Eng 2018;164:259–77. https://doi.org/10.1016/j.petrol.2018.01.070. 10.1016/j.egypro.2009.02.200.
[5] You J, Rahnema H, McMillan MD. Numerical modeling of unsteady-state wellbore [18] Ahmmed B, et al. Chemical effects of carbon dioxide sequestration in the Upper
heat transmission. J Nat Gas Sci Eng 2016;34:1062–76. Morrow Sandstone in the Farnsworth, Texas, hydrocarbon unit. Environ Geosci
[6] Sheng JJ. Enhanced oil recovery in shale reservoirs by gas injection. J Nat Gas Sci 2016;23(2):81–93. https://doi.org/10.1306/eg.09031515006.
Eng 2015;22:252–9. https://doi.org/10.1016/j.jngse.2014.12.002. [19] Ampomah W, et al. Farnsworth field CO 2-EOR project: performance case history.
[7] Manrique EJ, et al. EOR: current status and opportunities. SPE improved oil re- SPE improved oil recovery conference Society of Petroleum Engineers; 2016.
covery symposium. Society of Petroleum Engineers; 2010. DOI: https://doi.org/ https://doi.org/10.2118/179528-MS.
10.2118/130113-MS. [20] Pan F, et al. Uncertainty analysis of carbon sequestration in an active CO 2-EOR
[8] Yu W, et al. CO2 injection for enhanced oil recovery in Bakken tight oil reservoirs. field. Int J Greenhouse Gas Control 2016;51:18–28. https://doi.org/10.1016/j.
Fuel 2015;159:354–63. https://doi.org/10.1016/j.fuel.2015.06.092. ijggc.2016.04.010.
[9] Ampomah W, et al. Optimum design of CO2 storage and oil recovery under geo- [21] Kutsienyo EJ. et al. Evaluation of CO-EOR Performance and Storage Mechanisms in
logical uncertainty. Appl Energy 2017;195:80–92. https://doi.org/10.1016/j. an Active Partially Depleted Oil Reservoir. in SPE Europec featured at 81st EAGE
apenergy.2017.03.017. Conference and Exhibition. 2019. Society of Petroleum Engineers DOI: https://doi.
[10] Faltinson JE, Gunter B. Net CO2 stored in North American EOR projects. J Can Pet org/10.2118/195534-MS.
Technol 2013;50(7):55–60. https://doi.org/10.2118/137730-PA. [22] You J, et al. Assessment of enhanced oil recovery and CO storage capacity using
[11] Dai Z., et al., Uncertainty quantification for CO2 sequestration and enhanced oil machine learning and optimization framework. SPE Europec featured at 81st EAGE
recovery. arXiv preprint arXiv:1411.4900; 2014. Conference and Exhibition. Society of Petroleum Engineers; 2019. DOI: https://
[12] Tapia JFD, et al. Optimal CO2 allocation and scheduling in enhanced oil recovery doi.org/10.2118/195490-MS.
(EOR) operations. Appl Energy 2016;184:337–45. https://doi.org/10.1016/j. [23] Soltanian MR, et al. Simulating the Cranfield geological carbon sequestration project
apenergy.2016.09.093. with high-resolution static models and an accurate equation of state. Int J Greenhouse
[13] Enick R, Olsen D. Mobility and conformance control for carbon dioxide enhanced
12
J. You, et al. Fuel 264 (2020) 116758
Gas Control 2016;54:282–96. https://doi.org/10.1016/j.ijggc.2016.10.002. Engineers; 2015. DOI: https://doi.org/10.2118/174055-MS.

[24] Ampomah W, et al. Performance assessment of CO2-enhanced oil recovery and [35] Priezzhev I, Scollard A, Lu Z. Regional production prediction technology based on
storage in the Morrow reservoir. Geomech Geophys Geo-Energy Geo-Resour gravity and magnetic data from the Eagle Ford formation. Texas, USA, Denver SEG,
2017;3(3):245–63. https://doi.org/10.1007/s40948-017-0059-1. 2014.
[25] Christie M, et al. Use of multi-objective algorithms in history matching of a real [36] Kim Y, et al. Prediction of storage efficiency on CO2 sequestration in deep saline
field. SPE reservoir simulation symposium. Society of Petroleum Engineers; 2013. aquifers using artificial neural network. Appl Energy 2017;185:916–28. https://doi.
DOI: https://doi.org/10.2118/163580-MS. org/10.1016/j.apenergy.2016.10.012.
[26] Aliyev E, Durlofsky LJ. Multilevel field-development optimization using a sequence [37] El-Sebakhy EA, et al. Support vector machines framework for predicting the PVT
of upscaled models. SPE Reservoir Simulation Symposium. Society of Petroleum properties of crude oil systems. SPE Middle East Oil and Gas Show and Conference.
Engineers; 2015. DOI: https://doi.org/10.2118/173198-MS. Society of Petroleum Engineers; 2007. DOI: https://doi.org/10.2118/105698-MS.
[27] Coello CAC, Pulido GT, Lechuga MS. Handling multiple objectives with particle [38] El-Sebakhy EA. Forecasting PVT properties of crude oil systems based on support
swarm optimization. IEEE Trans Evol Comput 2004;8(3):256–79. vector machines modeling scheme. J Petrol Sci Eng 2009;64(1–4):25–34. https://
[28] Han Y, Park C, Kang JM. Estimation of future production performance based on doi.org/10.1016/j.petrol.2008.12.006.
multi-objective history matching in a waterflooding project. SPE EUROPEC/EAGE [39] Ramgulam A, Ertekin T, Flemings PB. An artificial neural network utility for the
annual conference and exhibition. Society of Petroleum Engineers; 2010. DOI: optimization of history matching process. Latin American & Caribbean Petroleum
https://doi.org/10.2118/130500-MS. Engineering Conference. Society of Petroleum Engineers; 2007. DOI: https://
[29] Niri ME, Lumley DE. A multi-objective optimization method for creating reservoir doi.org/10.2118/107468-MS.
models that simultaneously match seismic and geologic data. SEG Technical [40] Negash B, et al. History matching using proxy modeling and multiobjective opti-
Program Expanded Abstracts. Society of Exploration Geophysicists; 2014. p. mizations. ICIPEG 2016. Springer; 2017. p. 3–16.
2477–81. [41] Zhang Y, Sahinidis NV. Uncertainty quantification in CO2 sequestration using
[30] Forooghi A, Hamouda A, Eilertsen T. Co-optimization of CO2 EOR and sequestra- surrogate models from polynomial chaos expansion. Ind Eng Chem Res
tion in a North Sea chalk reservoir. , in EAGE Reservoir Charac-terization and 2012;52(9):3121–32. https://doi.org/10.1021/ie300856p.
Simulation Conference; 2009: Abu Dhabi, UAE. [42] Queipo NV, et al. Surrogate modeling-based optimization for the integration of
[31] Leach A, Mason CF, van‘t Veld K. Co-optimization of enhanced oil recovery and static and dynamic data into a reservoir description. J Petroleum Sci Eng
carbon sequestration. Resour Energy Econ 2011;33(4):893–912. https://doi.org/ 2002;35(3–4):167–81. https://doi.org/10.1016/S0920-4105(02)00238-3.
10.1016/j.reseneeco.2010.11.002. [43] Jarrell PM, et al. Practical aspects of CO2 flooding Vol. 22 Richardson, TX: Society
[32] Menad NA, Noureddine Z. An efficient methodology for multi-objective optimiza- of Petroleum Engineers; 2002.
tion of water alternating CO2 EOR process. J Taiwan Inst Chem Eng [44] Sun Q, Ertekin T. Structuring an artificial intelligence based decision making tool
2019;99:154–65. for cyclic steam stimulation processes. J Petroleum Sci Eng 2017;154:564–75.
[33] Ampomah W, et al. Co-optimization of CO2-EOR and storage processes in mature [45] Sun Q, Ertekin T. The development of artificial-neural-network-based universal
oil reservoirs. Greenhouse Gases Sci Technol 2017;7(1):128–42. https://doi.org/10. proxies to study steam assisted gravity drainage (SAGD) and cyclic steam stimu-
1002/ghg.1618. lation (CSS) processes. SPE Western Regional Meeting. Society of Petroleum
[34] He J, et al. Improved proxy for history matching using proxy-for-data approach and Engineers; 2015. DOI: https://doi.org/10.2118/174074-MS.
reduced order modeling. SPE Western Regional Meeting. Society of Petroleum
13

Development and Application of A Machine Learning Based Multi-Objective Optimization Workflow For CO2-EOR Projects

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Development and Application of A Machine Learning Based Multi-Objective Optimization Workflow For CO2-EOR Projects

Uploaded by

Copyright:

Available Formats

Fuel 264 (2020) 116758

Contents lists available at ScienceDirect

Full Length Article

Development and application of a machine learning based multi-objective T

ARTICLE INFO ABSTRACT

Fig. 1. Procedures of optimization framework.

Fig. 4. Structure of training dataset, the training

Fig. 5. Schematic diagram of the multi-layer neural network.

Table 4 3.2. Three objectives

Table 5 4.2. Three objectives

Error Frequency Cumulative % Error Frequency Cumulative %

Field case < =0.5% 91 15.17% < =0.5% 122 20.33%

Pattern case: 2 objectives 6.506% 0.740% 0.006% 0.839%

f2: CO2 storage, ×104 Metric ton

Gas Control 2016;54:282–96. https://doi.org/10.1016/j.ijggc.2016.10.002. Engineers; 2015. DOI: https://doi.org/10.2118/174055-MS.

You might also like