You are on page 1of 9

Materials & Design 210 (2021) 110037

Contents lists available at ScienceDirect

Materials & Design


journal homepage: www.elsevier.com/locate/matdes

Machine learning assisted empirical formula augmentation


Bin Xiong a, Xinpeng Zhao a, Yunfeng Hu a, Haiyou Huang a,b,⇑, Yang Liu c, Yanjing Su a,b,d
a
Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, China
b
Beijing Key Laboratory of Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
c
Institute of Physics, Chinese Academy of Sciences, Beijing 100190, China
d
Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China

h i g h l i g h t s g r a p h i c a l a b s t r a c t

 We have proposed an augmentation


strategy for empirical formula based
on machine learning mothed.
 A novel empirical formula of Ms as
the function of compositions was
fitted based on an augmented dataset
combined experimental data with
predicted data.
 Compared with previous empirical
formula, the accuracy and robustness
of the novel empirical formula we
proposed was significantly improved
without additional experimental cost.
 This strategy offers a recipe to build
empirical formula based on a small
experimental dataset.

a r t i c l e i n f o a b s t r a c t

Article history: Tuning the martensite transformation temperature through composition design has become an impor-
Received 27 May 2021 tant way to broaden the applicable temperature range of shape memory alloys (SMAs). The empirical for-
Revised 23 July 2021 mula based on traditional statistics is a key reference for composition design. Due to the lack of
Accepted 8 August 2021
experimental data, a large deviation may exist among the prediction results from the empirical formulas
Available online 10 August 2021
obtained by different data sources. In present work, we proposed an augmentation strategy of empirical
formula based on a machine learning method to build the relationship between martensite transforma-
Keywords:
tion start temperature (Ms) and compositions in Cu-Al-based SMA system. A series of ML models were
Shape memory alloy
Machine learning
established by physical and chemical features and a Gaussian radial basis kernel function support vector
Empirical formula machine (SVR.rbf) model was screened out based on mathematical and domain knowledge criteria. An
Transformation temperature augmented empirical formula of Ms as the function of compositions was fitted based on the abundant
Composition design augmented dataset combined experimental data with predicted data by the SVR.rbf model. Compared
with previous empirical formula fitted by small experimental dataset, the accuracy and robustness of
the augmented empirical formula was significantly improved without additional experimental cost.
This strategy offers a recipe to build empirical formula based on a small experimental dataset.
Ó 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/).

⇑ Corresponding author.
E-mail address: huanghy@mater.ustb.edu.cn (H. Huang).

https://doi.org/10.1016/j.matdes.2021.110037
0264-1275/Ó 2021 The Authors. Published by Elsevier Ltd.
This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

1. Introduction have concluded several empirical formulas from their respectively


experimental studies in TiNi-based and Cu-based SMAs. As a sam-
Concluding empirical law from experimental or calculational ple, the formulas of martensitic start transformation temperature
data is an important method of scientific research [1]. The accuracy (Ms) as a function of the composition in Cu-Al-Mn SMAs [21–23]
and applicability of the empirical law is strongly dependent on the were summarized in Table 1 and illustrated in Fig. 1. Fig. 1 indi-
quantity and quality of the data sample. In practical investigation, cated that these empirical formulas obtained by different research-
we often face low-quality dataset such as insufficient data, ers show significant differences in the composition space.
severely skewed distribution and low mergeable data due to col- In general, an accurate empirical formula needs to be fitted with
lecting from different experimentations with various processing sufficient experimental data. However, due to the limitation of
parameters. A formula fitted by these low-quality data will show experimental cost, the experimental data used by researchers for
a low accuracy. Improving the quantity or quality of data sample formula fitting is often only a few to a dozen. Collecting more
under a consistent experimental condition is a feasible way to experimental data from other researchers to increase the total
solve the problem, but it also means high experimental costs. amount of data may be a simple and feasible solution. However,
With the development of materials information, machine learn- the mergeability of the data from different researchers is often
ing (ML) is emerging as a powerful tool for dealing with material hard to determine easily.
data problems [2]. In addition to dealing with big data, ML has Compared to the traditional data fitting method, the ML model
been successful in small data problems [3]. Data Augmentation is can not only perform statistical fitting, but also build physically
an effective technique in ML which helps in adding value to base meaningful relationships by mining hidden information beneath
dataset by adding the gathered information from various sources the data, which is becoming an important and powerful tool for
to improve the quantity and the quality of data. Data Augmenta- material discovery and design, even building emerging materials
tion is one of the most important processes that makes the data intelligence ecosystems [24–28].
more informational. In the present work, we proposed an empirical formula aug-
Shape memory alloys (SMAs) is a kind of important functional mentation strategy based on ML method, as schematically shown
material that has been widely used in sensors, actuators, environ- in Fig. 2. Based on published literatures, an experimental dataset
mental protection refrigeration and energy conversion equipment including Ms and compositions (Ci) of Cu-Al-based SMAs was sum-
due to its unique shape memory effect and superelasticity [4–7]. marized. Two novel empirical formulas were obtained in our work.
The martensitic transformation temperature that determines the The empirical formula N1 was fitted directly based on the summa-
temperature window of SMAs in their applications, is one of the rized experimental dataset, which was in contrast to the aug-
most important material parameters in material design [8–10]. mented empirical formula N2. Along the other way, a ML
The martensitic transformation temperatures have strong com- surrogate model Ms = f (pi) was built by the use of physical and
position sensitivity [11–17]. Therefore, it is a convenient way to chemical features (pi) of the elements as this is known to be closely
control the martensitic transformation temperatures by composi- related to the transformation temperature. It should be noted that
tion design, which also becomes an important way to broaden during the ML model selection process, both the mathematical and
the application temperature range of SMAs and to develop physical (domain-knowledge) criteria were employed to optimize
ultralow-temperature or high-temperature SMAs. Consequently, the best model. Consequently, the optimum ML model was applied
obtaining the relationship between the martensitic transformation to generate amount of predicted data, which could augment the
temperatures and the compositions of SMAs is a key premise to experimental data to product an augmented dataset. Then, the
accurate composition design. empirical formula N2 was fitted by using the augmented dataset.
Among lots of SMAs systems, the Ni-Ti-based SMAs are the At last, the robustness of the empirical formulas was also
most practical systems because of their excellent superelasticity discussed.
and shape memory effect, good ductility and good fatigue resis-
tance [18]. Cu-based SMAs, especially Cu-Al-Mn SMAs, on the 2. Experimental dataset
other hand, are another promising system because they have high
superelasticity (>10%) [19], high isothermal entropy change (~20 J/ The experimental dataset was built based on the SMAs dataset
kgK), wide operation temperature window (>100 ℃) [20] as well from MGEData platform [29]. Our dataset used in this work only
as are easy to manufacture and cheaper cost compared to Ni-Ti- selected experimental data from literatures and our group. Consid-
based SMAs. Moreover, Cu-Al-Mn SMAs also show a wide appliable ering the martensitic transformation temperatures of SMAs is not
temperature window (4.2℃~450℃) [20] which can be adjusted by only affected by the composition content, but also by processing
composition design, which makes it very necessary to obtain an technology, heat treatment conditions, and aging treatment signif-
accurate relationship between the transformation temperatures icantly [30–36]. Therefore, during experimental data collecting, the
and compositions for their industry application. Many researches processing status and condition were attention to mark on the

Table 1
Empirical formulas of martensitic transformation temperature as a function of composition of Cu-Al-Mn SMAs.

No. Formula Number of experimental data Sample state Range of compositions Ref.
1 Ms(℃) = 1045.1–18.6Al-72.5Mn(at.%) 7 hot extrusion and heat Al: 16.7 ~ 19.3 at.% [21]
treatment @850℃ Mn:11.1 ~ 12.3 at.%
2 Ms(℃) = 2023.9–75.9Al-46.8Mn(at.%) 12 as-cast Al: 21.3 ~ 22.5 at.% [22]
Mn:5.1 ~ 7.5 at.%
3 Ms(℃) = 1616.9–42.1Al-78.0Mn(at.%) 7 as-cast Al: 17.1 ~ 18.1 at.% [23]
Mn:11.4 ~ 13.3 at.%
N1 Ms(℃) = 1292–40.62Al-55.86Mn(at.%) 71 as-cast Al: 16.39~28.30 This work
Mn: 0~11.90
N2 Ms(℃) = 1040–33.30Al-42.90Mn(at.%) 9143* —— Al: 16.39~28.30 This work
Mn: 0~11.90

* Note: it is composed of 9072 prediction data + 71 experimental data.

2
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

The root mean square error (RMSE) score obtained by the leave-
one-out method was used to evaluate the model according to Eq.
(1)
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1 Xm  2
RMSE ¼ yi  ybi ð1Þ
m i1

where m is the number of the data, yi is the true value and ybi the
predicted value. There was 71 data in our experimental dataset.
According to the leave-one-out method, there would be 71 division
methods, and 71 RMSE scores would be obtained after machine
model training. The average value of these 71 RMSE scores was
used to balance the training effect of the model. The leave-one-
out method can obtain as much effective information as possible
from limited data, so that samples can be learned from multiple
aspects and avoid falling into local extremes.
Fig. 1. Fitting surface diagram of three empirical formulas in component space. In the process of model training, the Grid Search method was
used to optimize the hyperparameters. After partitioning the data
using the K-CV method, use the partitioned training set to train
the model and use the test set for testing to obtain the model score
labels of the data. In the process of data screening and cleaning, we under the set of hyperparameters. Train the model and test for
fixed the processing conditions of the data source to the cast sam- each set of hyperparameters in turn, and finally compare each
ples that have not been rolled or drawn because we must avoid the set of hyperparameter combination scores, and output the set of
dimensional disaster caused by the variability of processing hyperparameters with the best score as the best parameter.
parameters. Additionally, the data with the same component hav- We adopted a combined grid search strategy of coarse adjust-
ing different Ms due to suffering different heat treatment processes ment in a large range and fine adjustment in a small range to find
are also not selected into our dataset. After the data screening, we the best parameters, and the model evaluation also returns the
tried to minimize the influence of the processing on the martensite RMSE of the training set and the test set after the ten-fold cross-
transformation temperature to ensure that the intrinsic relation- validation scoring for model evaluation. The corresponding search
ship between the martensite transformation temperature and the range and step length were determined according to the hyperpa-
composition can be found out. In the end, a total of 71 data from rameter characteristics of different machine learning models.
13 literatures are selected to build the experimental dataset.
4. Results and discussion

3. Machine learning models 4.1. Empirical formula N1

Several well-known machine learning models including least As shown in Fig. 1, the existing empirical formulas from differ-
absolute shrinkage and selection operator (Lasso), linear regression ent researchers show significant numerical deviation in the com-
(Lin), polynomial regression (Poly), gradient boosting regression position space, but all of them were linearly fitted, which implied
(GBR), random forest (RF), ridge regression (Ridge) and Gaussian that a linear relationship between martensitic transformation tem-
radial basis kernel function support vector machine (SVR.rbf) were peratures and the content of alloying element. This empirical
employed to produce machine learning models. domain knowledge was also supported by lots of researches in

Fig. 2. A schematic of the empirical formula fitting strategies based on experimental data directly (Route I) and augmented data assisted by ML method.

3
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

other SMAs including Cu-based and Fe-based SMAs based on Table 2


experimental data and thermodynamical model [37–40]. The physical and chemical features used in ML modeling.

Followed above empirical domain knowledge, at first, we lin- Name Description Ref.
early fitted the 71 experimental data using the least square Cu The atomic percentage of Copper
method. The obtained linear formula N1 was shown in Eq.(2) Al The atomic percentage of Aluminum
Mn The atomic percentage of Manganese
Ms( C)=1292  40.62Al  55.86 Mn(at.%) ð2Þ Arc Clementi’s atomic radii [35]
En Pauling electronegativity [36]
In Fig. 3(a) and (b), the four empirical formulas were compared by Ven Valence electron numbers [37]
the deviation between prediction values and true values and RMSE, Mr Metallic radius [38]
respectively. In the deviation graph (Fig. 3(a)), the more concen- Dor Waber-Cromer’s pseudopotential radii [39]
trated the scattered point distribution is near the diagonal line, the Anum Atomic number
Mass Atomic mass
more consistent the predicted value and the experimental measure-
ment value are. However, the smaller RMSE is, the more accurate the
prediction is. It is obvious that formula N1 has more accuracy than
the previous three formulas due to using far more data. tics in SMAs, and provide a relatively simple physical basis or
In addition, according to the distribution of scattered points, the structure–property relationship of phase transition temperature
forecast trend of the empirical formula in the component space can [21]. All of them have been tabulated in Table 2. The features
also be understood. The scattered points of the empirical formula except for the atomic percentage of Cu, Al, and Mn were also called
N1 obtained based on the as-cast data set are concentrated on both initial features.
sides of the diagonal. Except for individual data, the prediction The objects of our experimental dataset are shape memory
effect of Ms value in the range of 150 ~ 200℃ is better, and the alloys, not single elements. The unique mapping relationship
predicted value of Ms in the range of 200 ~ 350℃ is about 70℃ between the component space and the compound feature is con-
lower than the actual measured value. The scattered points of structed by the following five calculation formulas, including: X1
the previous three formulas are mostly distributed on the lower was calculated based on the linear mixing rule; X2 was calculated
right side of the diagonal, indicating that the predicted values based on the reciprocal mixing rule; CX was obtained based on
given by the three formulas tend to be higher than the true values. the ratio of each feature X1 to the atomic number of each alloy;
XD and Xd were the deviation and discrepancy, respectively. In each
calculation formula, f Cu , f Al and f Mn represent the atomic percent-
4.2. Empirical formula N2
age. X Cu , X Al and X Mn represent the corresponding features of each
In the above strategy, it is obvious that data augmentation can element, and Z Cu , Z Al and Z Mn represent the atomic number of each
significantly improve the accuracy of empirical formula. Can we element.
break through the limitations of experimental data and further
X 1 ¼ f Cu X Cu þ f A lX Al þ f Mn X Mn ð3Þ
improve the usability of the formula? Therefore, we employ
machine learning model based on physical and chemical descrip-  1
tors that are also related to the transformation temperatures of X 2 ¼ f Cu =X Cu þ f Al =X Al þ f Mn =X Mn ð4Þ
SMAs.
In previous researches, Frenzel et al.[41] indicated that the geo-
f Cu X Cu þ f A lX Al þ f Mn X Mn
metric change during phase transition and the inherent instability CX ¼ ð5Þ
of the parent phase are two key factors to control the phase tran- f Cu Z Cu þ f A lZ Al þ f Mn Z Mn
sition temperature, and the latter is related to the electronic struc- rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
 2  2  2
ture. Sampath et al. [31] evidenced that the addition of alloying
XD f Cu X Cu  X 1 þ f Al X Al  X 1 þ f Mn X Mn  X 1 ð6Þ
elements (aluminum, iron, zinc, silicon) would reduce the EV/a
ratio, thereby reducing the Ms. Therefore, the features we used vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
u !2 !2 !2ffi
include: Clementi’s atomic radii (Arc) [42], Pauling electronegativ- u X Cu X Al X Mn
ity (En) [43], metal radius (Mr) [44], valence electron number (Ven)
t
X d f cu 1  þ f Al 1  þ f Mn 1  ð7Þ
X1 X1 X1
[45], and Waber-Cromer pseudopotential radius (Dor) [46]. These
features represent the electronic, bonding and radius characteris-

Fig. 3. The predicted values as a function of the experimental values (a) and RMSE (b) comparison of four empirical formulas fitted by experimental data.

4
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

The physical features were calculated by the above five calcula- Table 3
tion formulas, and the three composition features were combined Feature selection result based on Pearson correlation coefficient.

to obtain the feature pool with a total of 37 descriptors. No. Feature name
The candidate feature (pi) is the input of the property (Ms) 1 En
required by the machine learning model to predict, that is, Ms = f 2 Dor
(pi). For a given target attribute Ms, there will be a corresponding 3 Mass
combination of pi to ensure that the trained machine learning 4 Arc-2
5 En-D
model has better performance. When faced with a comprehensive 6 Mass-D
coverage of pi, through feature selection, ‘‘excellent” features are 7 Mr-D
selected from the candidate features to achieve the effect of reduc- 8 Mr-d
ing dimensionality and improving model performance. Due to the 9 Mass-d
small number of features, we chose a simpler feature selection
route.
At the first step, a correlation analysis was employed to perform
feature selection to reduce the computation time and improve content of alloying element means the Ms values should distribute
model robustness by removing the redundant features. Fig. 4 indi- on a plane in the composition space. Therefore, we not only exam-
cates the Pearson correlation coefficient map between different ine the RMSE scores of the exhaustive feature subsets, but also pay
features. Those with a correlation coefficient>0.95 were considered attention to whether the model is more consistent with the ‘‘one
as highly correlated. After the analysis of Pearson’s correlation plane” criterion for the component spatial prediction data, which
coefficient, a total of 9 features including En, Dor, Mass, Arc-2, is called ‘‘RMSE + one plane” criterion method.
En-D, Mass-D, Mr-D, Mr-d and Mass-d were screened and listed In this criterion method, on the one hand, the average value of
in Table 3. the test set RMSE score of the exhaustive feature subsets using
Then, the RMSE scores of each feature subset were compared by leave-one-out method were called RMSE_math. On the other hand,
the method of feature exhaustion and the one with the highest a SVR.rbf model built by a feature subset was used to predict the
score was used for model training. All machine learning models pre-divided component space, and then linearly fit the prediction
performed feature exhaustion evaluation and the results are data as a function of the Al and Mn atomic percentages. The RMSE
shown in Fig. 5. of the linear fitting to the prediction data was called RMSE_phys as
Fig. 5 shows the deviation between the predicted values and the evaluation indicator for ‘‘one plane” criterion. It should be
experimental values and the RMSE scores of the seven machine noted that all physical and chemical features (37 descriptors) were
learning models. Among them, the SVR.rbf model has the best per- used to train the SVR.rbf model in order to avoid the omission of
formance, with a test RMSE of 53. feature subset under the multiply criterion condition.
In the process of feature exhaustion, we found that the RMSE A virtual composition space was built according to composition
scores of lots of feature subsets are similar. Considering the exper- range of experimental data, that is Cu: 66.68 ~ 76.21, Al:
imental data are imperfect and possible overfitting problem, only 16.39 ~ 28.30, Mn: 0 ~ 11.90. The virtual composition space was
relying on mathematical criterion may not be enough. In the previ- divided into 9143 alloy composition points, in which the Ms values
ous review and discussion, we have concluded an important of 9072 alloy compositions were unknown. At first, the top 40
domain knowledge that is a linear relationship between marten- models according to RMSE_math criterion were selected and their
sitic transformation temperatures and the content of alloying ele- RMSE_math scores illustrated as the red curve in Fig. 6. And then,
ments. Therefore, we can evaluate the SVR.rbf models using the 40 models were evaluated by RMSE_phys criterion and their
combined criterion with mathematical and physical (domain RMSE_phys scores were marked green curve in Fig. 6.
knowledge) criteria. In the Cu-Al-Mn system, the linear relation- Fig. 6 indicates that the RMSE_math scores of the first to fortieth
ship between martensitic transformation temperatures and the SVR.rbf models are in the range of 52 ~ 59, but the fluctuation of
the RMSE_phys scores are very significant. The maximum and
the minimum values are 89 and 27 respectively, which implies that
the various models with the similar mathematical scores may have
a significant deviation of the data distribution in the prediction
space. In order to clarify this issue, we use the 25th model and
the 26th model in Fig. 6 as examples. The 25th model with two fea-
tures (Dor, cVen) shows low scores both of RMSE_math (57) and
RMSE_phys (26). The RMSE_math score difference between the
25th model and the 26th model (cVen, En-2) is only 0.11 and they
also have one common feature (cVen), but the RMSE_phys score
difference between them is 62.75.
The distribution of Ms values predicted by the 25th and 26th
models. It can be seen in Fig. 7(a) the contour distribution is rela-
tively smooth, and Ms values show a monotonous change trend
with the variation of the Al and Mn contents. On the contrary,
the Ms values in Fig. 7(b) show two local low temperature areas,
which is not consistent with actual experience.
For the ML modeling, when the amount of training data is small
or there is large error in training data, several local optimal solu-
tions could be obtained. All of them may have excellent RMSE
scores, but do not necessarily mean the real physical lawn. There-
fore, the introduction of domain knowledge (physical) criterion can
effectively help us to screen out the ML model with physical
Fig. 4. The Pearson correlation map of the 37 features. meaning.
5
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

Fig. 5. The predicted values as a function of the experimental values (a)-(g) evaluated by the leave-one-out method and the RMSE scores (h) for seven ML models.

6
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

which touches the inherent mechanism of the property in this


material system. Therefore, the obtained ML model can break
through the limitation of formula fitting based on single statistical
method, such as the constraint of component space.
Taking this advantage, more experimental data from other Cu-
Al-based SMAs were supplements collected from 19 published lit-
eratures [31,47–65]. The supplementary experimental dataset
included 77 data from 16 alloys and involved a total of 16 ele-
ments. If we use these 148 data (71 data for Cu-Al-Mn SMAs and
77 data for other Cu-Al-based SMAs) to fit a formula or build a
composition feature-based ML model of Ms as the function of com-
positions, a serious dimensional disaster will occur due to a huge
ratio of the number of features and the amount of data. However,
the physical feature-based ML model allows to expand the number
of elements by increasing the dimensions of prediction compo-
nents space, rather than increasing the model feature variables.
Based on the ML model of Cu-Al-Mn alloys obtained above, the
77 supplementary experimental data was used to fit the coefficient
of each element in linear formula. The obtained empirical formula
for multicomponent Cu-Al-based SMAs is as follows:
Fig. 6. RMSE_math and RMSE_phys scores of the top 40 SVR.rbf models ranked by
RMSE_math.
Ms( C) = 1040  33.30Al  42.90Mn  107.73Cr
Finally, a SVR.rbf model with two physical features of Dor and  190:26Ni  138:28Si  61:07Zn þ 240:87Ti  327:59Nb
cVen was trained and used to predict the Ms values in the virtual  1349:83Be  167:1Mg  187:08Fe þ 119:3Vðat:%Þ
composition space. An augmented linear empirical formula N2 of ð9Þ
Ms as the function of composition was fitted by the combined
dataset including predicted Ms values and experimental data, as It should be clarified that the coefficient fitting for each element
shown as follows was conducted one by one. In addition, the multicomponent
empirical formula is available when the content of each alloy’s ele-
Ms( C)=1040-33.30Al  42.90Mn(at.%) ð8Þ ment is less than 3.5at.%.

4.3. Generalization of empirical formula based on the ML model 5. Summary

The machine learning model based on physical features was In present work, we have proposed an augmentation strategy
used to predict the composition space with unknown Ms value, for empirical formula based on ML mothed. The implicit relation-
and an augmented dataset combined the predicted data and the ship between the target property and physical and chemical
experimental data was linearly fitted to obtain empirical formula. descriptors including those associated with the elements is estab-
This strategy not only augments huge amounts of data to high- lished through an ML algorithm. Based on the ML model, a huge
cost experimental data to solve the problem of insufficient data, amount of data could be obtained via prediction of the target prop-
but also combines physical features and domain knowledge so that erty of unmeasured compositions in a virtue composition space.
the resulting empirical formula contains more information of The predicted data can significantly solve the small data issue
physical characteristics. due to insufficient experimental data. We have demonstrated this
Another advantage of ML model built on physical features is augmentation strategy on the Cu-Al-based SMAs. A novel empirical
good generalization ability. In other words, the ML model builds formula (empirical formula N2) of Ms as the function of composi-
a relationship between the target property and physical factors tions was fitted based on an augmented dataset combined experi-

Fig. 7. The distribution heat map of Ms values predicted by the 25th (a) and 26th (b) ML models in composition space.

7
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

mental data with predicted data. Compared with previous empiri- [18] K. Niitsu, Y. Kimura, X. Xu, R. Kainuma, Composition dependences of entropy
change and transformation temperatures in Ni-rich Ti–Ni system, Shape
cal formula fitted by small experimental dataset, the accuracy and
Memory Superelasticity 1 (2) (2015) 124–131.
robustness of empirical formula N2 was significantly improved [19] J.L. Liu, H.Y. Huang, J.X. Xie, The roles of grain orientation and grain boundary
without additional experimental cost. This framework can also characteristics in the enhanced superelasticity of Cu71.8Al17.8Mn10.4 shape
be extended to establishing empirical relationships in other mate- memory alloys, Mater. Des. 64 (9) (2014) 427–433.
[20] S. Xu, H.Y. Huang, J. Xie, S. Takekawa, R. Kainuma, Giant elastocaloric effect
rial systems, especially in emerging material systems, which often covering wide temperature range in columnar-grained Cu71.5Al17.5Mn11
face the similar difficulties of insufficient experimental data. shape memory alloy, APL Mater. 4 (10) (2016) 439.
[21] G. Zak, A.C. Kneissl, G. Zatulskij, Shape memory effect in cryogenic Cu-Al-Mn
alloys, Scripta Mater. 34 (1996) 363–367.
6. Data availability [22] C.L.D. Castillo, B.G. Mellor, M.L. Blazpuez, C. Gómez, The influence of
composition and grain size on the martensitic transformation temperatures
of Cu-Al-Mn shape memory alloys, Scr. Metall. 21 (1987) 1711–1716.
The raw/processed data required to reproduce these findings [23] H. Wang, H.Y. Huang, Y.J. Su, Tuning the operation temperature window of the
can be obtained by contacting the corresponding author. elastocaloric effect in Cu–Al–Mn shape memory alloys by composition design,
J. Alloy. Compd. 828 (2020) 154265.
[24] Y. Liu, T. Zhao, W. Ju, et al., Materials discovery and design using machine
Declaration of Competing Interest learning, J. Materiomics 3 (2017) 159–177.
[25] R. Batra, Emerging materials intelligence ecosystems propelled by machine
learning, Nat. Rev. Mater. (2020), https://doi.org/10.1038/s41578-020-00255-
The authors declare that they have no known competing finan-
y.
cial interests or personal relationships that could have appeared [26] C. Shen, C. Wang, X. Wei, et al., Physical metallurgy-guided machine learning
to influence the work reported in this paper. and artificial intelligent design of ultrahigh-strength stainless steel, Acta
Mater. 179 (2019) 201–214.
[27] D. Xue, D. Xue, R. Yuan, et al., An informatics approach to transformation
Acknowledgements temperatures of NiTi-based shape memory alloys, Acta Mater. 125 (2017)
532–541.
[28] P. Liu, H.Y. Huang, S. Antonov, et al., Machine learning assisted design of c0 -
This research was funded by the Guangdong Province Key Area strengthened Co-base superalloys with multi-performance optimization, NPJ
R&D Program (Grant No. 2019B010940001), Guangdong Major Comput. Mater. 6 (2020) 62.
Project of Basic and Applied Basic Research (Grant No. [29] The website of MGEData platform: www.mgedata.cn
[30] T. Omori, J. Wang, Y. Sutou, et al., Two-Way Shape Memory Effect Induced by
2019B030302011), the National Key Research and Development
Bending Deformation in Ductile Cu-Al-Mn Alloys, Mater. Trans. 43 (2002)
Program of China (Grant No. 2018YFB0704300, 1676–1683.
2020YFB0704504), and Scientific and Technological Innovation [31] Y. Sutou, T. Omori, T. Okamoto, et al. Effect of grain refinement on the
Foundation of Shunde Graduate School, USTB (BK19BE030). mechanical and shape memory properties of Cu-Al-Mn base alloys. Journal de
Physique IV (Proceedings), 11(PR8): (2001) Pr8-185-Pr8-190.
[32] R. Kainuma, S. Takahashi, K. Ishida, Thermoelastic martensite and shape
References memory effect in ductile Cu-Al-Mn alloys, Metall. Mater. Trans. A 27 (1996)
2187–2195, https://doi.org/10.1007/BF02651873.
[33] Q. Yang, S.L. Wang, J. Chen, et al., Strong heating rate-dependent deterioration
[1] J. Brockner, J.Z. Rubin, Experimental Research Methods, Entrapment in
of shape memory effect in up/step quenched Cu-based alloys: A ductile
Escalating Conflicts. Springer Series in Social Psychology, Springer, New York,
CuAlMn alloy as an example, Acta Mater. 111 (2016) 348–356.
NY, 1985.
[34] Q. Yang, J. Chen, Y. Meng, J. Ge, S. Wang, H. Peng, Y. Wen, A Novel Constraint
[2] T. Lookman, F.J. Alexander, K. Rajan, Information Science for Materials
Method During Solution Treatment to Suppress Heating Rate-Dependent
Discovery. Springer Series in Materials Science, V225, Springer, New York,
Martensitic Stabilization in Cu-17.0Al-10.5Mn Alloy, Adv. Eng. Mater. 20
NY, 2016.
(2018) 1701082.
[3] B.Y. Ma, X.Y. Wei, C. Liu, X.J. Ban, H.Y. Huang, H. Wang, W.H. Xue, S. Wu, M.F.
[35] U.S. Mallik, V. Sampath, Effect of alloying on microstructure and shape
Gao, Q. Shen, M. Mukeshimana, A.O. Abuassba, H.K. Shen, Data augmentation
memory characteristics of Cu-Al-Mn shape memory alloys, Mater. Sci. Eng. A
in microscopic images for material data mining, npj Computational Materials 6
481–482 (2008) 680–683.
(2020) 125.
[36] V. Sampath, U.S. Mallik, Influence of minor additions of boron and zirconium
[4] J. Ogilvie, T.D. Drewry, M.C. Sherman, et al., Shape Memory Alloy Staple.
on shape memory properties and grain refinement of a Cu-Al-Mn shape
US6325805B1, US Patent (2000).
memory alloy, in: European Symposium on Martensitic Transformations
[5] A.L. Mckelvey, R.O. Ritchie, Fatigue-crack growth behavior in the superelastic
(ESOMAT), 2009, pp. 181–188.
and shape-memory alloy nitinol, Metallurg. Mater. Trans. A 32 (2001) 731–
[37] V. Sampath, S.V. Gayathri, R. Srinithi, Experimental and theoretical analyses of
743.
transformation temperatures of Cu-based shape memory alloys, Bull. Mater.
[6] A. Kitanovski, P.W. Egolf, Thermodynamics of magnetic refrigeration, Int. J.
Sci. 42 (2019) 229, https://doi.org/10.1007/s12034-019-1911-4.
Refrig. 29 (2006) 3–21.
[38] T.Y. Hsu, X.W. Zhuo, Thermodynamics of martensitic transformation in b-Cu
[7] S. Qian, Y. Geng, Y. Wang, et al., A review of elastocaloric cooling: Materials,
base alloys, Acta Metall. Sin. 27 (1991) 18–23.
cycles and system integrations, Int. J. Refrig. 64 (2016) 1–19.
[39] Q. Luo, H. Chen, W. Chen, et al., Thermodynamic prediction of martensitic
[8] J. Frenzel, E.P. George, A. Dlouhy, et al., Influence of Ni on martensitic phase
transformation temperature in Fe-Ni-C system, Scr. Mater. 187 (2020) 413–
transformations in NiTi shape memory alloys, Acta Mater. 58 (2007) 3444–
417.
3458.
[40] T.Y. Hsu, M. Pan, Thermodynamics of martensitic transformation in Fe-Mn-C
[9] K. Yildiz, Influence of Ta content on martensitic transformation and shape
and Fe-Ni-C alloys, Acta Metall. Sin. 25 (4) (1989) 21–27.
memory properties of Cu-Al-Ni-Mn shape memory alloy, Appl. Phys. A 126
[41] J. Frenzel, A. Wieczorek, I. Opahle, B. Maaß, R. Drautz, G. Eggeler, On the effect
(2020) 307.
of alloy composition on martensite start temperatures and latent heats in Ni–
[10] S. Niedbalski, A. Durán, M. Walczak, J.A. Ramos-Grez, Laser-Assisted Synthesis
Ti-based shape memory alloys, Acta Mater. 90 (2015) 213–231.
of Cu-Al-Ni Shape Memory Alloys: Effect of Inert Gas Pressure and Ni Content,
[42] E. Clementi, Atomic Screening Constants from SCF Functions. II. Atoms with 37
Materials 12 (2019) 794.
to 86 Electrons, J. Chem. Phys. 47 (1967) 1300–1307.
[11] Y. Sutou, R. Kainuma, K. Ishida, Effect of alloying elements on the shape
[43] L. Pauling, The nature of the chemical bond. Iv. The energy of single bonds and
memory properties of ductile Cu–Al–Mn alloys. Materials Science &
the relative electronegativity of atoms, J. Am. Chem. Soc. 54 (1932) 3570–
Engineering A, s273–275 (1999) 375-379.
3582.
[12] K. Otsuka, C.M. Wayman, Mechanism of shape memory effect and
[44] N.N. Greenwood, A. Earnshaw, Silicon Chemistry of the Elements 5 (Suppl 1)
superelasticity, Shape Memory Materials (1998).
(1997) 328–366.
[13] M. Zarinejad, Y. Liu, Dependence of transformation temperatures of NiTi-based
[45] K.M. Rabe, J.C. Phillips, P. Villars, et al., Global multinary structural chemistry
shape memory alloys on the number and concentration of valence electrons,
of stable quasicrystals, high-TC ferroelectrics, and high-Tc superconductors,
Adv. Funct. Mater. 18 (2010) 2789–2794.
Phys. Rev. B 45 (1992) 7650–7676.
[14] J. Frenzel, A. Wieczorek, I. Opahle, et al., On the effect of alloy composition on
[46] J.T. Waber, D.T. Cromer, Orbital Radii of Atoms and Ions, J. Chem. Phys. 42
martensite start temperatures and latent heats in Ni–Ti-based shape memory
(1965) 4116–4123.
alloys, Acta Mater. 90 (2015) 213–231.
[47] K. Pravir, J.A. Kumar, H. Shahadat, P. Abhishek, D. Rupa, Changes in the
[15] K. Otsuka, X.B. Ren, Factors affecting the Ms temperature and its control in
properties of Cu-Al-Mn shape memory alloy due to quaternary addition of
shape-memory alloys, Mater. Sci. Forum. 394–395 (2002) 177–184.
different elements, Matéria(Rio de Janeiro) 20 (2015) 284–292.
[16] X.B. Ren, K. Otsuka, Why does the martensitic transformation temperature
[48] C.A. Canbay, Z.K. Genc, M. Sekerci, Thermal and structural characterization of
strongly depend on composition?, Mater Sci. Forum 327–328 (2000) 429–432.
Cu–Al–Mn–X (Ti, Ni) shape memory alloys, Appl. Phys. A 115 (2014) 371–377.
[17] J. Ma, I. Karaman, R.D. Noebe, High temperature shape memory alloys, Int.
Mater. Rev. 55 (2010) 257–315.

8
B. Xiong, X. Zhao, Y. Hu et al. Materials & Design 210 (2021) 110037

[49] Y. Sutou, T. Omori, R. Kainuma, K. Ishida, N. Ono, Enhancement of Ni–Mn–Nb shape memory alloy by freeze-drying and partial sintering, J.
superelasticity in Cu-Al-Mn-Ni shape-memory alloys by texture control, Mater. Res. Technol. 9 (2020) 3676–3685.
Metallurg. Mater. Trans. A 33 (2002) 2817–2824. [58] I. Ozkul, C.A. Canbay, F. Aladağ, K. Aldasß, The effect of the aging period on the
[50] G.X. Dong, N.J. Gu, X.L. Ma, Cu-Al-Mn-Nb shape memory alloys with low phase martensitic transformation and kinetic characteristic of at%
transformation temperature and broad hysteresis, Chinese J. Nonferrous Cu68.09Al26.1Ni1.54Mn4.27 shape memory alloy, Russian J. Non-Ferrous
Metals 15 (2005) 1124–1130. Metals 58 (2017) 130–135.
[51] Q. Zhou, C.S. Xie, Effect of alloy element on microstructure and phase [59] A.G. Shivasiddaramaiah, U.S. Mallik, V. Jayanth, S. Prashantha, Evaluation of
transformation points of Cu-Al-Mn-Ti shape memory alloy, Mater. Heat Shape memory effect and Pseudo elastic effect of Cu-Al-Be-Mn Quaternary
Treatment 38 (2009) 23–25. shape memory alloys, Mater. Today. Proc. 4 (2017) 10109–10112.
[52] S.Y. Yang, S. Hong, M.P. Li, et al., Microstructure, martensitic transformation [60] J.M. Guilemany, F.J. Gil, The influence of grain boundaries on the
and shape memory effect of polycrystalline Cu-Al-Mn-Fe alloys, Sci. China transformation temperatures of Cu-Zn-Al shape memory alloys, Mater. Res.
Technol. Sci. 64 (2021) 400–406, https://doi.org/10.1007/s11431-020-1617-x. Bull. 25 (1990) 1325–1332.
[53] C.A. Canbay, S. Gudeloglu, Z.K. Genc, Investigation of the Enthalpy/Entropy [61] T.N. Raju, Synthesis and characterization of Cu–Al–Fe shape memory alloys
Variation and Structure of Cu–Al–Mn–Fe Shape Memory Alloys, Int. J. indian institute of technology, PhD thesis, Chennai, India: Indian Institute of
Thermophys. 36 (2015) 783–794. Technology, Madras, (2013)
[54] C.A. Canbay, Kinetic parameters and structural variations in Cu-Al-Mn and Cu- [62] A.M.B. Valle, Imput and interlanguage: a review of the research, Estudios
Al-Mn-Mg shape memory alloys, Turkish Physical Society International Ingleses De La Universidad Complutense 6 (1998) 129–159.
Physics Congress. Turkish Physical Society 32nd International Physics [63] X.F. Wang, T.E. Jones, W.Li, et al., Extreme Poisson’s ratios and their electronic
Congress, 2017. origin in B2 CsCl-type AB intermetallic compounds. Physical review, B 85
[55] Y.M. Han, K.Y. Ding, H.N. Ma, X.P. Lin, Microstructure and shape memory effect (2012) 134108.1–134108.9.
of CuAlMnZnZr alloy, J. Hebei Univ. Technol. 33 (2004) 73–76. [64] S.N. Saud, E. Hamzah, T. Abubakar, H.R. Bakhsheshi-Rad, Correlation of
[56] Z. Li, M.P. Wang, G.Y. Xu, J.Y. Cheng, B.L. Zhang, Stress-induced martensite microstructural and corrosion characteristics of quaternary shape memory
transformation and its reversibility in Cu-18.4Al-8.7Mn-3.4Zn-0.1Zr alloy, alloys Cu–Al–Ni–X (X=Mn or Ti), Trans. Nonferrous Metals Soc. China 25
Acta Metall. Sin. 39 (2003) 61–65. (2015) 1158–1170.
[57] M.L. Lapér, R. Guimarães, B.R. Barrioni, P.A.P. Silva, M. Houmard, E.M. Mazzer, [65] R.D. Cava, C. Bolfarini, C.S. Kiminami, E.M. Mazzer, W.J.B. Filho, P. Gargarella, J.
E.H.M. Nunes, Fabrication of porous samples from a high-temperature Cu–Al– Eckert, Spray forming of Cu–11.85Al–3.2Ni–3Mn (wt%) shape memory alloy, J.
Alloy. Compd. 615 (2014) S602–S606.

You might also like