You are on page 1of 12

Made available for a limited time for personal research and study only License.

pubs.acs.org/JPCC Perspective

Machine Learning Assisted Understanding and Discovery of CO2


Reduction Reaction Electrocatalyst
Erhai Hu, Chuntai Liu, Wei Zhang,* and Qingyu Yan*

Cite This: J. Phys. Chem. C 2023, 127, 882−893 Read Online

ACCESS Metrics & More Article Recommendations


See https://pubs.acs.org/sharingguidelines for options on how to legitimately share published articles.

ABSTRACT: Electrochemical CO2 reduction reaction (CO2RR)


Downloaded via INDIAN INST OF SCIENCE on February 21, 2023 at 16:09:53 (UTC).

is an important process which is a potential way to recycle


excessive CO2 in the atmosphere. Although the electrocatalyst is
the key toward efficient CO2RR, the progress of discovering
effective catalysts is lagging with current methods. Because of the
cost and time efficiency of the modern machine learning (ML)
algorithm, an increasing number of researchers have applied ML to
accelerate the screening of suitable catalysts and to deepen our
understanding in the mechanism. Hence, we reviewed recent
applications of ML in the research of CO2RR by the types of
electrocatalyst. An introduction on the general methodology and a
discussion on the pros and cons for such applications are included.

1. INTRODUCTION In this paper, we intend to summarize recent progresses of


applying ML in the research of CO2RR electrocatalysis. We
It has become a consensus to reduce carbon dioxide (CO2)
will first introduce generic methodologies of ML-assisted
levels in the atmosphere, the major reason for climate change,
catalyst discovery, followed by current progress in the
to control global warming within a relatively safe level.1
understanding and discovery of different types of catalysts.
Electrochemical CO2 reduction reaction (CO2RR), as one of Finally, we will discuss the challenges presented and possible
the carbon recycling methods, is promising in this application future endeavors in this area.
because of its ability to turn CO2 into alternative fuels such as
CO, CH4, hydrocarbons, and alcohols. A suitable electro-
catalyst is essential in terms of lowering the energy barrier, 2. GENERAL METHODOLOGY
enhancing the catalytic rate, and facilitating high product In general, ML can be either supervised or unsupervised. The
selectivity for this reaction. Extensive efforts have been made in former assumes the availability of a labeled data set and aims to
catalyst development for electrochemical CO2 reduction.2 For train an ML model to approximate the mapping from input
example, transition metals3 and their alloys,4 single atom features to output, the labels. The latter does not require
catalysts (SAC),5 metal oxides,6 MXene,7 and nanostructured labeled data and is designed to learn the patterns without
catalysts8 have been shown to be effective for CO2RR. ground truth. So far, the popular way of ML-based electro-
However, due to the large search space of catalysts, limited catalyst screening is the former and labeled data set plays a key
progress has been made compared to the time invested. In role. Specifically, the supervised ML process starts with training
recent years, the development of ab initio methods like density a model with a data set with a number of data samples, where
functional theory (DFT) has helped researchers screen new each sample consists of an input feature vector and the label of
catalysts at a faster speed, and it is able to filter out most of the the corresponding output. The model, once trained, can take in
nonpromising options without further experimentation. an input, and produces the modeled, or estimated output.
Furthermore, theoretical computation can help us understand Naturally, the accuracy of this model can be measured based
the mechanism better by computing energies for intermediate on the gap between the derived output and actual/observed
and pathways. However, current method still requires a
considerably large amount of time and computational power Received: November 28, 2022
to maintain a satisfiable performance.9 Recently, Machine Revised: December 13, 2022
Learning (ML) emerges as a promising method to accelerate Published: January 8, 2023
this process. Given a sufficiently large training data set, a well-
trained ML model could predict the properties (e.g., energy) of
a system with relatively high accuracy at a much faster speed.10

© 2023 American Chemical Society https://doi.org/10.1021/acs.jpcc.2c08343


882 J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Figure 1. Key elements with examples of ML application in CO2RR (top) and general process of ML during application (bottom).

Figure 2. Schematic of input features matrix and feature selection according to feature importance ranking.

target value.11 In the following subsections, we will introduce applied simple NN to predict the adsorption energy of
the key elements in ML (Figure 1), specifically in the intermediates such as CO.16−18
application of CO2RR. As the amount of data in this area continued to grow, deeper
2.1. Feature Engineering. As the first step of an ML and more complex NN, such as Graph Neural Networks
process, the input features will be fed into an ML model. They (GNN), is receiving increasing attentions. In contrast to
will be usually in vector or matrix forms for a certain sample traditional ML where input parameters are relatively
and a whole data set respectively (Figure 2). Features selected independent to each other without capturing much structure
need to well represent the characteristics of the input to enable and relational information, GNN realizes the exploration and
efficient training. Typical features describing an electrocatalyst utilization of atomic structures of the catalysts. The atoms and
include geometric features (e.g., atomic radii of compositional the bonds in such structure can be represented a graph with
elements and coordination numbers) and electronic features nodes and edges.20 For instance, Back et al. use a Crystal
(e.g., electronegativity, polarizability, valence electron num- Graph Convolutional Neural Network (CGCNN) to predict
ber). Feature selection is commonly performed before training adsorption energy of CO and H based on the initial structure
the actual model to eliminate redundant features. This may and connectivity distance of adsorbate and catalyst (Figure 3).
prevent overfitting and increase accuracy by reducing noise. This model achieved a mean absolute error (MAE) of 0.15 eV
Many a feature selection algorithm has been implemented by respect to the DFT-calculated energy for CO and H
measuring the feature importance or statistic correlations adsorption energy prediction.19 This is considerably accurate
between features.12 compare to other methods (Table 1). By taking atomic
2.2. ML Algorithm. An ML algorithm is the core of a interactions into consideration, the modeling power of GNN
model, which decides the way to process data and learn based models substantially increased, but the required amount
features. A few popular ML algorithms have been applied in of data is also much more. On the contrary, simpler ML
this field, such as random forest regression (RFR), support algorithms may perform better given a small data set.
vector regression (SVR), and gradient boosting regression 2.3. Target Properties. As the last step of the process, the
(GBR).13−15 Neural networks (NN) give another common trained model will output a predicted value as the target. The
type of ML algorithm which has more flexibility with target property needs to be decided at the very beginning. For
information exchange between neurons. Multiple studies most cases, adsorption or binding energy of certain
883 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

recent reviews focused on the methodologies and functions of


ML in such research27,28 yet pay much attention to the
difference between application on different types of catalysts.
In this paper, we fill in the gap by introducing the latest
advances according to the type of catalyst.
3.1. Metal. Pure metal electrodes are among the first to be
used as catalysts in electrochemical CO2RR. Representatively,
Hori et al. reported the result of electrochemical CO2RR on a
batch of metal electrodes (e.g., Cu, Pb, Zn, and Ni) and
classified them into four groups based on their major products
(e.g., HCOO−, CO, H2, and >2e−). Because of the limited
number of metal elements, pure metal catalysts can be
thoroughly screened individually with a moderate cost.
However, understanding of this complex reaction process is
Figure 3. Schematic illustration of CGCNN architecture with atomic still lacking, such as effect of adsorption sites and interaction
structure input, convolutional layers, fully connected layers, and
between adsorbates. These effects could be well studied with
binding energy output.19 Reprinted with permission from ref 19.
Copyright 2019 American Chemical Society. ML.
Cu with controlled structure and shape has shown superior
intermediates in CO2RR will be the target because of its direct catalytic efficiency toward products beyond CO.29 Chen et al.
relationship to catalytic activity. With this relationship, Tang et use ML to study the active sites on Au nanoparticles (AuNP)
al. quantitatively analyzed the reaction thermodynamics of responsible to reduce CO2 to CO efficiently.30 First, over 10K
different pathways in CO2RR to construct a selectivity map surface sites are identified on AuNP simulated through
using only CO and OH binding energy to the catalyst (Figure molecular dynamic. A two-layer NN with 39 local geometric
4).21 Their derived rules conform well with experiment results features as input was constructed to predict the adsorption
from literature. A ground truth target value will be used during energy of CO and HOCO (ΔECO and ΔEHOCO), two critical
training with the corresponding input features. This target intermediates, on these sites.31 With comparatively small data
value can normally be generated by either DFT computation set with about 1000 data samples generated from DFT
or High-Throughput (HT) experimentation. calculations, an accurate ML model is derived for predicting
2.4. Existing Data Sets. There are some existing data sets ΔECO and ΔEHOCO. Such model is applied to analyze a much
and databases storing the features and computed/tested larger scale 10K sites for AuNP and 3K sites for dealloyed
ground truth value for ML training. Two well-known and AuNP (Figure 5a), with the prediction results generated
relatively comprehensive material data sets are Materials almost instantly. Based on the common understanding that the
Project 2 3 and Open Quantum Materials Database CO2RR activity will increase with lower ΔECO and higher
(OQMD).24 A few other databases specialized in electro- ΔEHOCO,32 active sites, which mainly consist of stepped or
catalysis have also been established recently. One of them is defect surface, could be accurately identified based on the
the Catalysis-Hub database, with reaction and activation prediction results (Figure 5b). The better catalytic perform-
energies for more than 100 K chemical reactions on catalytic ance demonstrated experimentally on dealloyed AuNP33 has
surfaces collected from near a hundred publications. The data been attributed to the increasing portion of StepUnder111 and
set offers useful features including displaying atomic structure SurfaceDefect sites comparing to pure Au.
and detailed computation settings, data analysis with Like most of the research, the aforementioned study only
interactive plots, and real-time energy prediction with a built- considers the energy of individual adsorbates. Supplementary
in ML model.25 Similarly, another database named Open adsorbates have also been demonstrated to influence the
Catalyst Project (OCP) computed 260 million adsorption catalysis process, but this effect has not been thoroughly
energy exhaustively from a set of adsorbate−catalyst slabs. The studied yet. For example, coelectrolysis of CO2 and O2 on
slabs are generated with randomly chosen adsorbates and type copper is able to boost the production of oxygenates and
of bulk material, as well as surface and position where the hydrocarbons to up to 216 times faster. This was attributed to
adsorption happened. This data set is customized for training the introduction of hydroxyl group on copper, which will lower
and producing ML models, which can be used to screen new the energy barrier of CO2RR.34 Hence, the same group
electrocatalytic materials.26 conducted an ML based investigation, focusing on 17
supplementary adsorbates and 289 adsorbate pairs.35 With
3. RECENT APPLICATIONS OF ML ON CO2RR these pairs, the energy of two key steps for C2+ and CH4
ELECTROCATALYST production,31 *CO dimerization (ΔGCO dimerization) and hydro-
Different types of catalyst have been used for CO2RR, and the genation (ΔGCO hydrogenation), are predicted using DFT and ML.
most popular ones include transition metals,3 metal alloys with Extreme Gradient Boosting Regressor (XGBR) was used to
two or more elements,4 and single metal catalysts.5 Some predict the energy with 6 pairwise features input, including the

Table 1. Example of ML Studies of CO Adsorbate Energy Predictions

data set size ML algorithm catalyst test accuracy


Back et al.19 20771 CGCNN intermetallic MAE: 0.15 eV
Chen et al.13 171 extreme GBR SAC RMSE: 0.1652 eV
Li et al.17 1032 NN bimetallic RMSE: 0.22 eV

884 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Figure 4. Above the potential axis, selectivity maps of CO2RR under different potentials are shown based on binding energy of CO and OH (GCO*
and GOH*). Below the potential axis, experiment results of Hori22 are shown in the map which are fairly consistent with theoretical predictions.21
Reprinted with permission from ref 21. Copyright 2020 Elsevier B.V.

Figure 5. (a) Predicted ΔECO and ΔEHOCO of all sites identified on dealloyed AuNP. (b) Visualization of a value of atoms in dealloyed AuNP,
where a = ΔECO − 1.4423ΔEHOCO. A larger a value indicates better catalytic activity.30 Reprinted with permission from ref 30. Copyright 2019
American Chemical Society.

difference and the average of the valence electron numbers and promote C2+ production by lowering the ΔGCO dimerization, and
covalent radiuses etc. The difference between predicted energy almost all adsorbates will reduce ΔGCO hydrogenation to promote
and that of pristine Cu (100) was plotted as heatmaps (Figure CH4 production.
6, parts b and c). It shows that adsorbates with more unpaired During liquid phase synthesis of catalyst, additives are
valence electrons, such as CH, N, CH2, and NH, are likely to usually introduced to develop a preferable morphology for
885 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Figure 6. (a) Example structure of *CO dimerization on Cu (100) with the presence of a supplementary adsorbate pair O/O. Orange, gray, red,
and white spheres represent copper, carbon, oxygen, and hydrogen atoms, respectively. (b) Heatmap of the difference between predicted ΔGCO
35
dimerization and (c) ΔGCO hydrogenation of different adsorbate pairs against that of pristine Cu(100). Reprinted with permission from ref 35. Copyright
2021 American Chemical Society.

efficient CO2RR,36−38 but understanding toward this process is in huge search spaces of many real-world applications, is a
still lacking. Hence, Guo et al. conducted a series of ML guided promising technology to accelerate this process substantially.
experiments to study the effect of different additives in Cu Ulissi et al. predict adsorption energy of CO (ΔECO) on
synthesis on the catalytic performance.39 In three rounds of NiGa bimetallics sites exhaustively using NN, where an NN
learning, they go through a wide screening of different types of model was trained iteratively.43 Adsorption configurations of
additives, adding more data points and input variables, and high certainty are assumed to be trustful. Those with
studying artificial-designed additives with a combination of significant uncertainty are selected automatically for further
features (Figure 7a). In general, faraday efficiency (FE) and refinement or verification using DFT, which produces ground
truth value of the configurations to enhance the training data
set for another round of ML training (Figure 8a). With much
slower DFT being only used for uncertain configurations
instead of all, this proposed process using ML reduces 90% of
the DFT workload compared to traditional non-ML
approaches. Besides, an interesting finding from this work is
that Ni active sites surrounded by Ga atoms demonstrate the
optimal ΔECO like behavior of stepped sites.
In parallel to the above-described effort, Tran and Ulissi
conducted active learning study to find CO2RR and HER
catalyst with ideal CO and H adsorption energy through an
iterative process of DFT and ML (Figure 8b).10 A unique
feature of this study is that only DFT data were included in the
final data set and ML is only used to guide the surface selection
to be calculated using DFT. Four descriptors describing
geometric and electronic features of elements coordinated the
adsorbate were selected as input, and Tree-based Pipeline
Figure 7. (a) Scheme of three round of learning. Feature importance Optimization Tool (TPOT)44 was used to choose the best ML
of (b) FE-CO and (c) FE-C2+ prediction in the first round of model. In the end, a data set with 42,785 DFT calculations
learning.39 Reprinted with permission from ref 39. Copyright 2021 (ΔECO and ΔEH) was obtained. Based on these data, Cu−Al
American Chemical Society. bimetallics was identified as a new catalyst to have optimal
ΔECO and ΔEH, which can be seen in the two-dimensional
current density (j) of CO2RR products are the output of ML volcano plots (Figure 9, parts a and b).
models. It was predicted that Sn is most significant for FE-CO Subsequently, the team managed to design and fabricate the
and aliphatic OH group on the organic additive is most Cu−Al and test its CO2RR activity.45 The best Cu−Al ratio
significant for FE-C2+ (Figure 6, parts b and c). Subsequent was found via ML prediction of ΔECO on Cu−Al bimetallics
experiments reveal that the high FE for organic additives and with different Al concentrations. Optimal morphology and
Sn is due to the Cu2O phases induced during synthesis, which electrolysis conditions are also studied in different experiments.
has been reported for its high catalytic activity.40,41 As a result, a dealloyed nanoporous Cu−Al on PTFE catalyst
3.2. Alloy. Alloy represents an important group of catalyst was successfully developed with 2-fold increment in C2H2
in CO2RR. It is possible to tune the electronic properties selectivity comparing to pure Cu (Figure 9c).
through the interaction between different metal components.42 Besides direct analysis on the DFT-produced data set, this
The huge search space of alloying elements and composition data set was also used to train ML models, such as the
choices impose great challenges particularly for new alloy representative aforementioned CGCNN.19 Recently the
catalyst discovery. ML, proven to be able to search efficiently CGCNN model has been improved by using the unrelaxed
886 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Figure 8. Scheme of iterative ML and DFT optimization process. (a) Prediction of ΔECO on NiGa bimetallics43 Reprinted with permission from ref
43. Copyright 2017 American Chemical Society. (b) Selection of catalyst with optimal ΔECO and ΔEH to be included in DFT calculation.10
Reprinted with permission from ref 10. Copyright 2018 Springer Nature.

Figure 9. Two dimensional plots of (a) activity for CO2RR and (b) selectivity toward CO2RR according to ΔECO and ΔEH. (c) FE of gaseous
products on dealloyed Cu−Al at different current density compared with nanoporous Cu and pure Cu.45 Reprinted with permission from ref 45.
Copyright 2020 Springer Nature.

bulk structure of catalyst as input only.46 During training, over 20% improvement in MAE of ΔGCO compared to its
binding site atoms are obtained from relaxed adsorbate-catalyst benchmark CGCNN.
slab according to a distance criterion. Those atoms are then High entropy alloy (HEA) represents an emerging class of
labeled on an unrelaxed bare catalyst slab to be used as input alloy yet is chemically stable. HEA has unique catalytic
for training (Figure 10a). During new catalyst screening, all properties by combining different metals. Hence, tuning the
possible adsorption sites will be identified through the graph composition and concentration is crucial for an optimized
conversion of the initial structure, and these labeled site catalytic activity.47 Pedersen et al. studied two set of HEAs,
structures could be used to predict the adsorption energy CoCuGaNiZn and AgAuCuPdPt, by predicting ΔECO and ΔEH
(Figure 10b). Besides this representation, the author also using ML.48 To simplify the possibilities of surface adsorptions,
added an attention module and used an ensemble of five surface microstructures with only categorical composition
models trained together to further increase the accuracy and features (Figure 11a) was constructed as input of a Gaussian
stability. In the end, the LS-CGCNN-ens model could achieve process regression (GPR), one of the traditional ML
887 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Figure 10. (a) Labeled site representation generated during training and (b) the modeling process during application.46 Reprinted with permission
from ref 46. Copyright 2020 American Chemical Society.

the corresponding ML models can also shed lights to similar


applications with enhanced generalizability of the deliverables.
Two rounds of screening were conducted from all adsorbates
considered to the most important adsorbates considered. In
the end, 34 HEAs are identified as highly active catalysts
toward methanol generation by screening high CO and H2CO
adsorption energy and relatively low O adsorption energy
(Figure 11b).49
3.3. Single and Dual Atom Catalysts. Single atom
catalyst (SAC) is an emerging class of catalyst with isolated
metal atoms anchored on various supports.50 It possesses
unique catalytic properties because of the quantum size effect
and high utilization of active sites.51 As it was only developed
recently, extensive studies are necessary to be carried out on
SACs with different metal atoms, support, and coordination
environment, where ML could be involved.
Zhou et al. attempt to discover new SAC with different
transition metal (TM) atoms and different nonmetal
coordinated environments using ML.13 Simple electronic and
geometric features are considered to describe the SACs and
filtered according to the Pearson correlation coefficient. For
Figure 11. (a) Microstructure representation of on top (left), fcc- example, combinations of electronegativity and covalent radius
hollow (middle), and hcp-hollow (right) adsorption sites.48 The of metal atoms and coordinated atoms are included in the final
white, green, blue, and red spheres represent the adsorbate, adsorbing 14 features. Using these features, XGBR selected by TPOT
site, closest surface neighbors, and closest subsurface neighbors, achieved high modeling fidelity with R2 around 0.9 for ΔGCO
respectively. Reprinted with permission from ref 48. Copyright 2020 prediction, given only about a hundred training data samples
American Chemical Society. (b) Contour plot for adsorption energies used. The author also computed ΔGH and compared it with
of CO, CH2O and O.49 Selective region is marked by the red square. ΔGCO, to find the best catalyst with high CO2RR activity and
Reprinted with permission from ref 49. Copyright 2022 American
Chemical Society.
low HER activity. As a result, 94 SACs out of 1000 candidates
with potentially high CO2RR efficiency were proposed (Figure
12b).
algorithms. Selectivity toward CO2RR/CORR compared to Dual-atom catalyst (DAC) is an extension of SACs and have
HER are studied by analyzing predicted ΔECO and ΔEH. Pure more freedom to adjust the electronic structure of the
Cu was confirmed to be the optimal catalyst and locally adsorption site.52 Wan et al. studied dual-metal-sites catalysts
optimal compositions of the HEAs can be found according to (DMSCs), which has one TM atom attached to the other one
the relation. (Figure 12c), using an analogous method to the previously
Roy et al. also studied the selectivity of CuCoNiZnMg and mentioned work. GPR is shown to be effective without obvious
CuCoNiZnSn HEAs using GPR and kernel ridge regression reliance on a large-scale data set, which comes at a huge cost of
(KRR), another traditional ML algorithm, with a similar resource and time. Twenty features describing two TM atoms
microstructure input. Compared to their predecessor, elec- and their interaction with the coordinated ring are included.
tronic and geometrical features such as metallic radius and Unlike many other studies, limiting potential (UL) was selected
dipole polarizability are newly introduced into the input. Those as the descriptor for CO2RR, where a smaller absolute value of
features are more universal and thus the insights gained from UL indicates a lower energy barrier for CO2RR, thus better
888 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Figure 12. (a) Example of an SAC structure.14 (b) Prediction heat map of ΔGCO on various SACs.13 Blue squares indicate optimal ΔGCO for
CO2RR. (c) Example of a DMSC structure. d) Prediction heat map of limiting potential of various DMSCs. In parts a and c, white, gray, blue,
orange, and green spheres represent H, C, N, TM 1, and TM 2 atoms, respectively. Parts a and c are reprinted with permission from ref 14.
Copyright 2021 American Chemical Society. Parts b and d are reprinted with permission from ref 13. Copyright 2020 American Chemical Society.

Figure 13. (a) Representative RMSE of train set, test set, and test set with CGL. Distribution of ΔECO for SAAs with (b) p-block and (c) d-block
alloying atoms. Ideal range of ΔECO for CO2RR is indicated by dashed lines.15 Reprinted with permission from ref 15. Copyright 2022 Creative
Commons https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.

CO2RR activity. According to the heat map (Figure 11d), relaxed using HT DFT. The data were then split into source
DMSCs located at the top left possess a higher absolute value and target data sets to train a GBR ML model. Initially,
of UL, which may be attributed to the similar electronegativities alloying elements in target data are not included in source data
of two TM atoms. In contrast, DMSCs with Ag−Mo and Ag− to test the model’s generalizability. Considering the high
Co as TM atoms with the lowest absolute value of UL were testing error, the model was further trained by adding 10 data
considered as high-performance catalysts. samples from target data set to source data set, which is
Single atom alloy (SAA) is another type of SAC with referred as cross group learning (CGL). Such CGL turns to be
alloying metal atoms dispersed across the host metal surface.53 helpful for boosting the performance and managed to improve
Wang et al. used DFT to study Cu-based SAA catalysts and the performance of target data by over 50% (Figure 13a)
developed an ML model for accurate prediction of ΔECO.15 A without requiring much information on target data.
total of 2669 SAA configurations with different Cu facets, The author then analyzed ΔECO computed for each SAA
alloying elements, and adsorption sites were identified and configuration with DFT data. For example, SAA with d-block
889 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

alloying elements shows a wider distribution of ΔECO (Figure (e) Some commonly used ML models for CO2RR are black-
13c), which implies a greater tunability for different catalytic box in nature.54 Interpretable ML can be helpful to
reactions. Optimal configurations were selected according to understand the model mechanisms and build up trust
the ideal ΔECO range (Figure 13, parts b and c) and were and confidence for practical deployment.
further filtered by their ΔEH to find promising SAA catalysts.
Last but not least, ML is more of a supporting technology to
4. OUTLOOK assist users. Domain expertise is a must for research
In this Perspective, we introduced the general methodology of understanding and results analysis, and naturally a prerequisite
applying ML in CO2RR research. Regardless of the application,
one needs to prepare a sufficiently large data set, select of making full use of this powerful technology for CO2RR
informative features, train a suitable ML model, and predict a electrocatalyst.
certain target property. Then, recent research using ML in the
understanding and discovery of pure metal, intermetallic, and
SAC catalyst was demonstrated and discussed.
Though opportunities and advantages brought by ML are
■ AUTHOR INFORMATION
Corresponding Authors
obvious and the research domain looks promising, drawbacks Wei Zhang − Infocomm Technology Cluster, Singapore
and challenges of this method still exist inevitably. Powerful Institute of Technology (SIT), Singapore 567739, Singapore;
ML algorithms often demand significant training data. For Email: wei.zhang@singaporetech.edu.sg
example, complex models like GNN will outperform traditional Qingyu Yan − Energy Research Institute @ NTU, Singapore
ML models only based on the availability of large-scale data 637141, Singapore; School of Material Science and
sets. Generating data samples for ML can be costly, e.g., HT Engineering, Nanyang Technological University, Singapore
experiments and DFT with significant overhead for time, 639798, Singapore; Institute of Materials Research and
computation, and so on. In this regards, establishment of a Engineering, A*STAR, Singapore 138634, Singapore;
collaborative database, like Catalysis-Hub,25 could solve this orcid.org/0000-0003-0317-3225; Email: alexyan@
problem to some extent. Further effort is necessary, e.g., ntu.edu.sg
rigorously and systematically discussing the standardization of
descriptors and methods. In the case of training ML to Authors
approximate DFT, i.e., using DFT to generate training data set, Erhai Hu − Energy Research Institute @ NTU, Singapore
gaps are there between DFT-derived results and the results 637141, Singapore; School of Material Science and
from a real-world implementation. Even ML can accurately Engineering, Nanyang Technological University, Singapore
approximate DFT with nearly neglectable error, and the ML 639798, Singapore; orcid.org/0000-0003-4545-1828
prediction results may not align with real-world experimental Chuntai Liu − Key Laboratory of Materials Processing and
results very well. Hence, the combination of ML and DFT is Mold, Ministry of Education, Zhengzhou University,
more appropriate to guide preliminary screening, while more Zhengzhou 450002, China; orcid.org/0000-0001-9751-
formal and practical verification, discovery, and analysis require 6270
HT or manual experiment. Complete contact information is available at:
Besides the inherent challenges of computational studies, https://pubs.acs.org/10.1021/acs.jpcc.2c08343
there are some other inadequacies, but they can be resolved
through future endeavors: Notes
(a) As discussed, most of the studies focused on adsorption The authors declare no competing financial interest.
energy of CO and H. This could only determine the Biographies
activity of CO2RR, but the selectivity or dominating
reaction pathways are not distinguishable. Researchers
shall also focus on individual products or pathways and
intermediates beyond CO.
b) More types of catalysts need to be considered with ML-
based approaches. Currently, catalysts studied using ML
are mostly intermetallic and some SACs. Other effective
CO2RR catalysts like metal oxides,6 MXene,7 and
nanostructured catalysts8 shall be studied using ML as
well.
(c) Researchers need to standardize the evaluation of ML
performance with common evaluation metrics and
benchmarking data set. This can enable the research
community to gain qualitative comparison and under-
standing of different proposed approaches and accord-
ingly streamline the optimization process. Erhai Hu completed his Bachelor of Engineering degree in Materials
(d) Practicality for the synthesis of the discovered catalysts Engineering at Nanyang Technological University, Singapore. He is
need to be examined, either before the prediction or in
now a Ph.D. student in the Energy Research Institute @ NTU (ERI@
the post analysis. For example, synthesis of alloys with
certain compositions and some types of SACs may be N). His research interests include electrocatalysis, energy storage
impossible or very difficult. materials, and machine learning.

890 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Electrochemical Society, Singapore Section. He is a fellow of the


Royal Society of Chemistry since 2018. He has published >300 papers
(with a total citation of >33000 and an h-index of 98) on the research
areas: battery development and thermoelectric materials and electro-
catalytic processes for energy conversion.

■ ACKNOWLEDGMENTS
Q.Y. acknowledges the funding support from Singapore MOE
AcRF Tier 1 under Grant No. 2020-T1-001-031. The authors
would also like to acknowledge the 111 Project (D18023)
from Zhengzhou University for their support for this work.

Chuntai Liu is currently a Professor in the School of Material Science


■ REFERENCES
(1) IPCC. Global Warming of 1.5°C: IPCC Special Report on impacts
and Engineering at Zhengzhou University. He also serves as deputy of global warming of 1.5°C above pre-industrial levels in context of
director of the Key Laboratory of Materials Processing and Mold of strengthening response to climate change, sustainable development, and
the Ministry of Education. His research areas include nanocomposites, efforts to eradicate poverty; Cambridge University Press: 2022.
polymer processing, and intelligent materials. (2) Jin, S.; Hao, Z.; Zhang, K.; Yan, Z.; Chen, J. Advances and
Challenges for the Electrochemical Reduction of CO 2 to CO: From
Fundamentals to Industrialization. Angew. Chem. Int. Ed 2021, 60
(38), 20627−20648.
(3) Nitopi, S.; Bertheussen, E.; Scott, S. B.; Liu, X.; Engstfeld, A. K.;
Horch, S.; Seger, B.; Stephens, I. E. L.; Chan, K.; Hahn, C.; et al.
Progress and Perspectives of Electrochemical CO2 Reduction on
Copper in Aqueous Electrolyte. Chem. Rev. 2019, 119 (12), 7610−
7672.
(4) He, J.; Johnson, N. J. J.; Huang, A.; Berlinguette, C. P.
Electrocatalytic Alloys for CO2 Reduction. ChemSusChem 2018, 11
(1), 48−57.
(5) Li, M.; Wang, H.; Luo, W.; Sherrell, P. C.; Chen, J.; Yang, J.
Heterogeneous Single-Atom Catalysts for Electrochemical CO2
Reduction Reaction. Adv. Mater. 2020, 32 (34), 2001848.
(6) Zhao, G.; Huang, X.; Wang, X.; Wang, X. Progress in catalyst
exploration for heterogeneous CO2 reduction and utilization: a
critical review. J. Mater. Chem. A 2017, 5 (41), 21625−21649.
Wei Zhang is currently an Assistant Professor with the Information
(7) Heo, J.; Her, N.; Jang, M.; Park, C. M.; Son, A.; Han, J.; Yoon, Y.
and Communications Technology Cluster, Singapore Institute of Photocatalytic and electrocatalytic reduction of CO2 by MXene-based
Technology (SIT). Dr Zhang serves as an Associate Editor for the nanomaterials: A review. Critical Reviews in Environmental Science and
IEEE Internet of Things Journal (IoT-J) as well as the Area Chair, Technology 2022, 1−22.
Track Chair, Review Committee Member, and TPC member of top (8) Zhang, L.; Zhao, Z.-J.; Gong, J. Nanostructured Materials for
conferences such as IJCAI, GLOBECOM, and ICME. His research Heterogeneous Electrocatalytic CO2 Reduction and their Related
interests lie in utilizing artificial intelligence (AI) for smart cities, Reaction Mechanisms. Angew. Chem. Int. Ed 2017, 56 (38), 11326−
especially energy related applications, toward improved user 11353.
experience and enhanced sustainability. (9) Schleder, G. R.; Padilha, A. C. M.; Acosta, C. M.; Costa, M.;
Fazzio, A. From DFT to machine learning: recent approaches to
materials science−a review. J. Phys. Mater. 2019, 2 (3), 032001.
(10) Tran, K.; Ulissi, Z. W. Active learning across intermetallics to
guide discovery of electrocatalysts for CO2 reduction and H2
evolution. Nat. Catal 2018, 1 (9), 696−703.
(11) Gero, J. S.; Sudweeks, F. Artificial Intelligence in Design’96;
Springer Science & Business Media: 2012.
(12) Li, J.; Cheng, K.; Wang, S.; Morstatter, F.; Trevino, R. P.; Tang,
J.; Liu, H. Feature selection: A data perspective. ACM computing
surveys (CSUR) 2018, 50 (6), 1−45.
(13) Chen, A.; Zhang, X.; Chen, L.; Yao, S.; Zhou, Z. A Machine
Learning Model on Simple Features for CO 2 Reduction Electro-
catalysts. J. Phys. Chem. C 2020, 124 (41), 22471−22478.
(14) Wan, X.; Zhang, Z.; Niu, H.; Yin, Y.; Kuai, C.; Wang, J.; Shao,
C.; Guo, Y. Machine-Learning-Accelerated Catalytic Activity
Predictions of Transition Metal Phthalocyanine Dual-Metal-Site
Catalysts for CO 2 Reduction. J. Phys. Chem. Lett. 2021, 12 (26),
Qingyu Yan is currently a Professor in the School of Materials 6111−6118.
Science and Engineering at Nanyang Technology University. He (15) Wang, D.; Cao, R.; Hao, S.; Liang, C.; Chen, G.; Chen, P.; Li,
joined the School of Materials Science and Engineering of Nanyang Y.; Zou, X. Accelerated prediction of Cu-based single-atom alloy
Technological University as an assistant professor in 2008 and catalysts for CO2 reduction by machine learning. Green Energy &
became a professor in 2018. He is currently the Chair of the Environment 2021.

891 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

(16) Ma, X.; Li, Z.; Achenie, L. E. K.; Xin, H. Machine-Learning- electrochemical reduction of CO2 via co-electrolysis. Nat. Commun.
Augmented Chemisorption Model for CO 2 Electroreduction Catalyst 2020, 11 (1), 3844.
Screening. J. Phys. Chem. Lett. 2015, 6 (18), 3528−3533. (35) Wu, D.; Zhang, J.; Cheng, M.-J.; Lu, Q.; Zhang, H. Machine
(17) Li, Z.; Wang, S.; Chin, W. S.; Achenie, L. E.; Xin, H. High- Learning Investigation of Supplementary Adsorbate Influence on
throughput screening of bimetallic catalysts enabled by machine Copper for Enhanced Electrochemical CO2 Reduction Performance.
learning. J. Mater. Chem. A 2017, 5 (46), 24131−24138. J. Phys. Chem. C 2021, 125 (28), 15363−15372.
(18) Li, Z.; Ma, X.; Xin, H. Feature engineering of machine-learning (36) Huan, T. N.; Simon, P.; Rousse, G.; Génois, I.; Artero, V.;
chemisorption models for catalyst design. Catal. Today 2017, 280, Fontecave, M. Porous dendritic copper: an electrocatalyst for highly
232−238. selective CO2 reduction to formate in water/ionic liquid electrolyte.
(19) Back, S.; Yoon, J.; Tian, N.; Zhong, W.; Tran, K.; Ulissi, Z. W. Chemical Science 2017, 8 (1), 742−747.
Convolutional Neural Network of Atomic Surface Structures To (37) Han, Z.; Kortlever, R.; Chen, H.-Y.; Peters, J. C.; Agapie, T.
Predict Binding Energies for High-Throughput Screening of Catalysts. CO2 Reduction Selective for C ≥ 2 Products on Polycrystalline
J. Phys. Chem. Lett. 2019, 10 (15), 4401−4408. Copper with N-Substituted Pyridinium Additives. ACS Central Science
(20) Xie, T.; Grossman, J. C. Crystal Graph Convolutional Neural 2017, 3 (8), 853−859.
Networks for an Accurate and Interpretable Prediction of Material (38) Hoang, T. T. H.; Ma, S.; Gold, J. I.; Kenis, P. J. A.; Gewirth, A.
Properties. Phys. Rev. Lett. 2018, 120 (14), 145301. A. Nanoporous Copper Films by Additive-Controlled Electro-
(21) Tang, M. T.; Peng, H.; Lamoureux, P. S.; Bajdich, M.; Abild- deposition: CO2 Reduction Catalysis. ACS Catal. 2017, 7 (5),
Pedersen, F. From electricity to fuels: descriptors for C1 selectivity in 3313−3321.
electrochemical CO2 reduction. Appl. Catal. B: Environmental 2020, (39) Guo, Y.; He, X.; Su, Y.; Dai, Y.; Xie, M.; Yang, S.; Chen, J.;
279, 119384. Wang, K.; Zhou, D.; Wang, C. Machine-Learning-Guided Discovery
(22) Hori, Y. i. Electrochemical CO 2 reduction on metal electrodes. and Optimization of Additives in Preparing Cu Catalysts for CO2
Modern aspects of electrochemistry 2008, 42, 89−189. Reduction. J. Am. Chem. Soc. 2021, 143 (15), 5755−5762.
(23) Jain, A.; Ong, S. P.; Hautier, G.; Chen, W.; Richards, W. D.; (40) Yi, J.-D.; Xie, R.; Xie, Z.-L.; Chai, G.-L.; Liu, T.-F.; Chen, R.-P.;
Dacek, S.; Cholia, S.; Gunter, D.; Skinner, D.; Ceder, G.; et al. Huang, Y.-B.; Cao, R. Highly Selective CO2 Electroreduction to CH4
Commentary: The Materials Project: A materials genome approach to by In Situ Generated Cu2O Single-Type Sites on a Conductive MOF:
accelerating materials innovation. APL Materials 2013, 1 (1), 011002. Stabilizing Key Intermediates with Hydrogen Bonding. Angew. Chem.
(24) Saal, J. E.; Kirklin, S.; Aykol, M.; Meredig, B.; Wolverton, C. Int. Ed 2020, 59 (52), 23641−23648.
Materials design and discovery with high-throughput density (41) Arán-Ais, R. M.; Scholten, F.; Kunze, S.; Rizo, R.; Roldan
functional theory: the open quantum materials database (OQMD). Cuenya, B. The role of in situ generated morphological motifs and
Jom 2013, 65 (11), 1501−1509. Cu(i) species in C2+ product selectivity during CO2 pulsed
(25) Winther, K. T.; Hoffmann, M. J.; Boes, J. R.; Mamun, O.; electroreduction. Nature Energy 2020, 5 (4), 317−325.
Bajdich, M.; Bligaard, T. Catalysis-Hub. org, an open electronic (42) Li, Z.; Wu, R.; Zhao, L.; Li, P.; Wei, X.; Wang, J.; Chen, J. S.;
structure database for surface reactions. Scientific data 2019, 6 (1), 1− Zhang, T. Metal-support interactions in designing noble metal-based
10. catalysts for electrochemical CO2 reduction: Recent advances and
(26) Chanussot, L.; Das, A.; Goyal, S.; Lavril, T.; Shuaibi, M.; future perspectives. Nano Research 2021, 14 (11), 3795−3809.
Riviere, M.; Tran, K.; Heras-Domingo, J.; Ho, C.; Hu, W.; et al. Open (43) Ulissi, Z. W.; Tang, M. T.; Xiao, J.; Liu, X.; Torelli, D. A.;
catalyst 2020 (OC20) dataset and community challenges. ACS Catal. Karamad, M.; Cummins, K.; Hahn, C.; Lewis, N. S.; Jaramillo, T. F.;
2021, 11 (10), 6059−6072. et al. Machine-Learning Methods Enable Exhaustive Searches for
(27) Zhang, N.; Yang, B.; Liu, K.; Li, H.; Chen, G.; Qiu, X.; Li, W.; Active Bimetallic Facets and Reveal Active Site Motifs for CO2
Hu, J.; Fu, J.; Jiang, Y.; et al. Machine Learning in Screening High Reduction. ACS Catal. 2017, 7 (10), 6600−6608.
Performance Electrocatalysts for CO2 Reduction. Small Methods (44) Le, T. T.; Fu, W.; Moore, J. H. Scaling tree-based automated
2021, 5 (11), 2100987. machine learning to biomedical big data with a feature set selector.
(28) Sun, Z.; Yin, H.; Liu, K.; Cheng, S.; Li, G. K.; Kawi, S.; Zhao, Bioinformatics 2020, 36 (1), 250−256.
H.; Jia, G.; Yin, Z. Machine learning accelerated calculation and (45) Zhong, M.; Tran, K.; Min, Y.; Wang, C.; Wang, Z.; Dinh, C.-T.;
design of electrocatalysts for CO2 reduction. SmartMat 2022, 3 (1), De Luna, P.; Yu, Z.; Rasouli, A. S.; Brodersen, P.; et al. Accelerated
68−83. discovery of CO2 electrocatalysts using active machine learning.
(29) Zhu, S.; Delmo, E. P.; Li, T.; Qin, X.; Tian, J.; Zhang, L.; Shao, Nature 2020, 581 (7807), 178−183.
M. Recent Advances in Catalyst Structure and Composition (46) Gu, G. H.; Noh, J.; Kim, S.; Back, S.; Ulissi, Z.; Jung, Y.
Engineering Strategies for Regulating CO2 Electrochemical Reduc- Practical deep-learning representation for fast heterogeneous catalyst
tion. Adv. Mater. 2021, 33 (50), 2005484. screening. J. Phys. Chem. Lett. 2020, 11 (9), 3185−3191.
(30) Chen, Y.; Huang, Y.; Cheng, T.; Goddard, W. A. Identifying (47) Wang, X.; Guo, W.; Fu, Y. High-entropy alloys: emerging
Active Sites for CO2 Reduction on Dealloyed Gold Surfaces by materials for advanced functional applications. J. Mater. Chem. A
Combining Machine Learning with Multiscale Simulations. J. Am. 2021, 9 (2), 663−701.
Chem. Soc. 2019, 141 (29), 11651−11657. (48) Pedersen, J. K.; Batchelor, T. A. A.; Bagger, A.; Rossmeisl, J.
(31) Birdja, Y. Y.; Pérez-Gallent, E.; Figueiredo, M. C.; Göttle, A. J.; High-Entropy Alloys as Catalysts for the CO2 and CO Reduction
Calle-Vallejo, F.; Koper, M. T. M. Advances and challenges in Reactions. ACS Catal. 2020, 10 (3), 2169−2176.
understanding the electrocatalytic conversion of carbon dioxide to (49) Roy, D.; Mandal, S. C.; Pathak, B. Machine Learning Assisted
fuels. Nature Energy 2019, 4 (9), 732−745. Exploration of High Entropy Alloy-Based Catalysts for Selective CO2
(32) Cheng, T.; Huang, Y.; Xiao, H.; Goddard, W. A. Predicted Reduction to Methanol. J. Phys. Chem. Lett. 2022, 13 (25), 5991−
Structures of the Active Sites Responsible for the Improved Reduction 6002.
of Carbon Dioxide by Gold Nanoparticles. J. Phys. Chem. Lett. 2017, 8 (50) Wang, Y.; Su, H.; He, Y.; Li, L.; Zhu, S.; Shen, H.; Xie, P.; Fu,
(14), 3317−3320. X.; Zhou, G.; Feng, C.; et al. Advanced Electrocatalysts with Single-
(33) Sun, K.; Cheng, T.; Wu, L.; Hu, Y.; Zhou, J.; Maclennan, A.; Metal-Atom Active Sites. Chem. Rev. 2020, 120 (21), 12217−12314.
Jiang, Z.; Gao, Y.; Goddard, W. A.; Wang, Z. Ultrahigh Mass Activity (51) Yang, X.-F.; Wang, A.; Qiao, B.; Li, J.; Liu, J.; Zhang, T. Single-
for Carbon Dioxide Reduction Enabled by Gold−Iron Core−Shell Atom Catalysts: A New Frontier in Heterogeneous Catalysis. Acc.
Nanoparticles. J. Am. Chem. Soc. 2017, 139 (44), 15608−15611. Chem. Res. 2013, 46 (8), 1740−1748.
(34) He, M.; Li, C.; Zhang, H.; Chang, X.; Chen, J. G.; Goddard, W. (52) Zhang, W.; Chao, Y.; Zhang, W.; Zhou, J.; Lv, F.; Wang, K.;
A.; Cheng, M.-j.; Xu, B.; Lu, Q. Oxygen induced promotion of Lin, F.; Luo, H.; Li, J.; Tong, M.; et al. Emerging Dual-Atomic-Site

892 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893
The Journal of Physical Chemistry C pubs.acs.org/JPCC Perspective

Catalysts for Efficient Energy Catalysis. Adv. Mater. 2021, 33 (36),


2102576.
(53) Hannagan, R. T.; Giannakakis, G.; Flytzani-Stephanopoulos,
M.; Sykes, E. C. H. Single-Atom Alloy Catalysis. Chem. Rev. 2020, 120
(21), 12044−12088.
(54) Esterhuizen, J. A.; Goldsmith, B. R.; Linic, S. Interpretable
machine learning for knowledge generation in heterogeneous
catalysis. Nat. Catal 2022, 5 (3), 175−184.

Recommended by ACS
Electrochemical Synthesis of Urea: Co-reduction of Nitric
Oxide and Carbon Monoxide
Hao Wan, Alexander Bagger, et al.
JANUARY 19, 2023
ACS CATALYSIS READ

Materials Screening by the Descriptor Gmax(η): The Free-


Energy Span Model in Electrocatalysis
Samad Razzaq and Kai S. Exner
JANUARY 16, 2023
ACS CATALYSIS READ

Trends in CO2 Reduction on Transition Metal


Dichalcogenide Edges
Pernille D. Pedersen, Heine A. Hansen, et al.
JANUARY 31, 2023
ACS CATALYSIS READ

Dual-Cation-Coordinated CoFe-Layered Double-Hydroxide


Nanosheets Using the Pulsed Laser Ablation Technique for
Efficient Electrochemical Water Splitting: Mechanistic Sc...
Shreyanka Shankar Naik, Myong Yong Choi, et al.
JANUARY 10, 2023
ACS CATALYSIS READ

Get More Suggestions >

893 https://doi.org/10.1021/acs.jpcc.2c08343
J. Phys. Chem. C 2023, 127, 882−893

You might also like