Applsci 10 08488 v2

applied
sciences
Article
Parameters Adjustment Optimization for Prepreg of
Copper Clad Laminate Based on Virtual Metrology
Yiyo Kuo 1, * and Ssu-Han Chen 1,2
1 Department of Industrial Engineering and Management, Ming Chi University of Technology,
New Taipei City 24301, Taiwan; ssuhanchen@mail.mcut.edu.tw
2 Center for Artificial Intelligence & Data Science, Ming Chi University of Technology,
New Taipei City 24301, Taiwan
* Correspondence: Yiyo@mail.mcut.edu.tw; Tel.: +886-2-2908-9899 (ext. 3118)

Received: 28 October 2020; Accepted: 25 November 2020; Published: 27 November 2020
Abstract: In this research, virtual metrology models, which can estimate the quality characteristics
based on the given manufacturing parameters, were developed by lasso regression and support vector
regression. Then, the virtual metrology (VM) models were integrated into a mathematical model
to minimize the adjustment of manufacturing parameters and ensure the corresponding quality
characteristics that would meet the customers’ needs. According to the results of the experiment,
it was found that developing the virtual metrology model by using integration of lasso regression
and support vector regression and taking second order terms into consideration produces the best
performance for estimating the quality characteristics. Moreover, the integrated mathematical model
can provide manufacturing parameters that minimize the amount of adjustment and ensure that
the quality characteristics equal their corresponding target values. The proposed methodology can
reduce the setup times and consequently increase productivity.
Keywords: lasso regression; prepreg; printed circuit board; support vector regression; virtual metrology
1. Introduction
Trends in the printed circuit board (PCB) industry are obviously affected by the requirements of
the manufacturers of computers, communication equipment, and consumer electronics. Due to global
competition, the price pressure has increased in recent years. Therefore, reducing the manufacturing cost
is very important. This research focuses on the manufacturing process of prepreg of copper-clad laminate
(CCL) which is the core element of a PCB. Different prepregs are required for different specifications of
base material or copper clad laminate. Companies have to produce various specifications of prepreg to
satisfy customers’ needs.
Prepreg is produced by impregnating glass cloth with the epoxy resin. The prepreg can be further
laminated with multi prepregs or copper foil to form the base material or CCL. The manufacturing
process of the prepreg is illustrated by Figure 1.
Appl. Sci. 2020, 10, 8488; doi:10.3390/app10238488 www.mdpi.com/journal/applsci

Appl. Sci. 2020, 10, 8488 2 of 12
Appl. Sci. 2020, 10, 8488 2 of 12
Figure 1. Prepreg
Prepreg manufacturing process.
In Figure 1, the glass cloth is transferred from the left hand side by rollers. First, the glass cloth
was impregnated with epoxy resin. Excess Excess epoxy
epoxy resin
resin which
which attached
attached to to the
the glass
glass cloth was then
removed by two doctor rollers. The The rolling
rolling direction
direction of of the
the two
two doctor
doctor rollers was opposite to the
direction of
of motion
motionofofthe theglass
glasscloth. Then
cloth. Thenthetheglass cloth,
glass withwith
cloth, the appropriate
the appropriate volume of epoxy
volume resin,
of epoxy
was transferred into a huge oven. In the oven, the epoxy resin was heated
resin, was transferred into a huge oven. In the oven, the epoxy resin was heated and dried. The and dried. The prepreg was
finally complete
prepreg andcomplete
was finally was transferred
and wasout of the oven.
transferred out of the oven.
There are a lot of important parameters for the prepreg manufacturing process. When When different
prepregs are produced successively in the same machine, setup time is required. Several Several parameters,
parameters,
including the
the temperatures
temperaturesofofovens, ovens,density
densityofofthetheepoxy
epoxy resin,
resin, andandgelgel
time
timeof the epoxy
of the epoxy resin, are not
resin, are
easyeasy
not to adjust and they
to adjust andare theyalways changedchanged
are always while the machine
while is running.
the machine is Therefore, the engineers
running. Therefore, the
have to adjust
engineers have the other the
to adjust parameters based onbased
other parameters the current temperature
on the current of the oven,
temperature of thedensity of the
oven, density
epoxy
of resin, resin,
the epoxy and gel andtime
gel of theofepoxy
time resin,resin,
the epoxy basing theirtheir
basing judgments
judgmentson their experience.
on their experience.Once the
Once
newnew
the parameters
parameters areare
set,set,
a pilot
a pilotrun
run is is
required
requiredtotoensure
ensurethe thequality
qualityof of the
the prepregs. Two quality
characteristics, ratio
characteristics, ratioofofresin
resinandandgel gel
time, werewere
time, takentaken
into consideration
into considerationin this research. If the prepregs
in this research. If the
did not meet
prepregs did the
not quality
meet the requirements, then the parameters
quality requirements, needed toneeded
then the parameters be adjusted
to be again.
adjusted Thus the
again.
setup time was increased and the efficiency reduced.
Thus the setup time was increased and the efficiency reduced.
It would be helpful if aa prediction
prediction model
model could
could be be built
built for
for predicting
predicting thethe performance
performance of quality
quality
characteristics of the following lot of prepreg. The The concept
concept cancan be be viewed
viewed as asvirtual
virtualmetrology
metrology (VM).(VM).
VM aims to predict the performance indicators by using parameters such as process variables and
machine sensor
sensordata,
data,andand reduces
reducesthe the
usage of physical
usage measurement
of physical [1]. However,
measurement VM onlyVM
[1]. However, provides
only
the quality
provides thecharacteristics based onbased
quality characteristics the given
on the manufacturing
given manufacturingparameters. Even with
parameters. Even a VM
withmodel,
a VM
the engineers
model, need toneed
the engineers knowtohow know to how
adjust tothe current
adjust parameters
the current and make
parameters andsure
make that thethat
sure quality
the
characteristics
quality can be satisfied.
characteristics can be Therefore, an optimization
satisfied. Therefore, methodologymethodology
an optimization for parameterfor adjustment
parameter is
also required.
adjustment is also required.
In this
thisresearch,
research, a dataset
a dataset of the
of prepreg manufacturing
the prepreg processprocess
manufacturing was collected. Then the prediction
was collected. Then the
models of VM
prediction wereofdeveloped
models VM were by support vector
developed regression
by support vector(SVR) and lasso
regression (SVR) regression
and lasso (LR). Finally,
regression
(LR). Finally, mathematical models which integrate the corresponding VM models were proposedthe
mathematical models which integrate the corresponding VM models were proposed to optimize to
parameterthe
optimize adjustment.
parameter adjustment.
2. Literature
2. Literature Review
Review
According to
According to the
the survey
survey of
of VM
VM for
for this
this research,
research, the
the application
application in
in CCL
CCL isis rarely
rarely found
found inin the
the
literature. Kim
literature. Kim etetal.al.[2][2]
proposes thethe
proposes application of virtual
application metrology
of virtual for CCL
metrology formanufacturing to predict
CCL manufacturing to
product quality derived from processing data without a product quality inspection.
predict product quality derived from processing data without a product quality inspection. Four Four prediction
methods, linear
prediction regression,
methods, random forest
linear regression, (RF),forest
random neural(RF),
network (NN),
neural and (NN),
network support vector
and regression
support vector
(SVR), were adopted with three dimensional reduction methods, namely
regression (SVR), were adopted with three dimensional reduction methods, namely forward forward selection (FS),
selection (FS), backward elimination (BE), and genetic algorithm (GA). Therefore, the authors
developed 12 prediction models. The input variables included temperature, velocity, weight,
Appl. Sci. 2020, 10, 8488 3 of 12
backward elimination (BE), and genetic algorithm (GA). Therefore, the authors developed 12 prediction
models. The input variables included temperature, velocity, weight, pressure, and tension measured by
sensors, as well as manufacturing variables obtained from the opinions of domain experts. Three output
variables, treated weight, minimum viscosity, and gel time, were used. The results show that RF-GA
and SVR-GA are more accurate.
VM has been widely applied in semiconductor and thin film transistor-liquid crystal display
(TFT-LCD) manufacturing systems. Jonathan and Cheng [3] dealt with chemical vapor deposition
(CVD) machines in a 300-mm semiconductor factory. A back-propagation neural network (BPNN)
was developed to predict film thickness. Experimental results show that the maximum error was
1.7%. Hung et al. [4] dealt with CVD thickness prediction in semiconductor manufacturing. The radial
basis function network (RBFN) was adopted to develop the prediction model. Meanwhile, the model
parameter coordinator (MPC), which can adjust the values of certain parameters of the VM model
to minimize the prediction errors when equipment properties change or different equipment is used,
and auto-adjusting mechanism, which can be activated to tune the VM model when the errors between
VM and actual metrology values are out of specification, were developed for a real application.
The comparison results show that running MPC and auto-adjusting mechanism can further increase the
accuracy. Kang et al. [5] dealt with two etching processes in a Korean semiconductor manufacturing
company. Linear regression, K-nearest neighbor regression (KNN), regression tree (TR), NN, and SVR
were used to develop a prediction model for VM. Meanwhile, the stepwise linear regression and
genetic algorithm with support vector regression were employed for the variable selection method,
and principal component analysis (PCA) and kernel PCA (KPCA) were employed as variable extraction
methods. The experimental results show that linear regression and SVR are the best prediction
algorithms for different quality characteristics. Lin et al. [6] developed NN as a prediction mode.
A stepwise selection method including forward selection and backward elimination was proposed to
select a near-optimal set of variables for achieving high VM conjecture accuracy. Based on a piece of
semiconductor equipment for the etching process, the results show that the accuracy of NN is better
than multi-regression (MR), but the process time will be the issue for real applications. Zeng and
Spans [7] dealt with plasma etching operations and proposed a VM modeling sequence including data
preprocessing, outlier removal, variable selection, accommodation of process dynamics, and modeling.
In the modeling step, principal component regression, PLS, and BPNN were adopted, respectively.
The best result is from a model using BPNN. Lynn et al. [8] developed global and local VM models for a
plasma etching process by using PLS, NN, and Gaussian process regression (GPR). The global models
were developed by using all available training points to learn the behavior of a system, and local models
were trained using subsets of the available data. The results show that the local model with GPR
modeling produces the best estimation accuracy of etch rate on the dataset investigated. Susto et al. [9]
adopted LR and ridge regression for developing the VM prediction model base on a dataset collected
from CVD, thermal oxidation, coating, and lithography process. The VM target is a particular quality
performance after lithography. The results show that ridge regression outperforms LR in most cases
for the dataset used. Kang et al. [10] proposed a semi-supervised support vector regression (SS-SVR)
method based on self-training. A semiconductor manufacturing dataset was collected from the photo
process of a South Korean semiconductor manufacturing company. The experimental results show
that the proposed SS-SVR method demonstrated excellent performance for a real world semiconductor
manufacturing dataset. Jia et al. [11] proposed an adaptive methodology based on the group method
of data handling (GMDH) type polynomial neural networks for predicting the material removal rate
for the chemical-mechanical planarization process in semiconductor fabrication. The validation results
report improved accuracy in comparison with several candidate methods.
Some VM-related research deals with applications in TFT-LCD. Su et al. [12] proposed a novel
quality prognostics scheme (QPS) for plasma sputtering in TFT-LCD manufacturing processes. The QPS
consists of a conjecture model which can estimate the current-lot quality based on the current lot
processing parameters and a prediction model which can predict the quality of the next lot according
Appl. Sci. 2020, 10, 8488 4 of 12
to the current lot conjecture value and the quality measurement data of several previous lots. NN and
weighted moving average algorithms were applied to construct the QPS. The test results show that
the QPS can be feasible and can conjecture/predict the product quality efficiently and effectively.
Su et al. [13] evaluated various VM algorithms, including BPNN, simple recurrent neural networks
(SRNN), and MR. According to the test in fifth generation TFT-LCD chemical–vapor deposition
processes, the results show that both one-hidden-layered BPNN and SRNN VM algorithms achieve
acceptable conjecture accuracy and meet the real-time requirements. Cheng et al. [14] defined the
VM automation level, proposed the concept of automatic virtual metrology (AVM), and developed
an AVM system which has been successfully deployed in a fifth-generation TFT-LCD factory in Chi
Mei Optoelectronics (CMO), Taiwan. Hung et al. [15] proposed an AVMS architecture which can be a
useful reference for TFT-LCD manufacturing companies.
In addition to semiconductor and TFT-LCD manufacturing, applications of VM in other industries
can be found in the literature. Yang et al. [16] proposed an approach to apply the AVM system
factory-wide in wheel machining automation to achieve the total inspection of all the precision items
of wheel machining automation in a mass production environment. Yang et al. [17] applied VM for
inspecting machining precision of machine tools. The test results of machining standard workpieces
and cellphone shells of two three-axis CNC machines show that the proposed approach of applying
VM to accomplish total precision inspection of machine tools is promising. Hsieh et al. [18] proposed a
scheme to apply AVM for carbon-fiber manufacturing to achieve online and real-time total inspection.
Nguyen et al. [19] developed a novel co-training technique, namely partial Bayesian co-training (PBCT)
for a particular semi-supervised learning problem. PBCT was validated on three industrial applications:
Gun-drilling, inkjet printing, and low-noise amplifier. The experimental results show that under a
reduction of labeled data by up to 50%, a robust estimation is still attainable.
The problem that the present study addresses was not found in the literature. The review of the
literature shows that VM has been studied in some industrial settings, including CCL, but the further
application of integrating with parameter adjustment, which can provide an opportunity to reduce the
number of setups or setup time, is not found in the literature. Consequently, the proposed VM based
parameter adjustment optimization may be of benefit in many similar industries.
3. Virtual Metrology Modeling
3.1. Lasso Regression (LR)

Given a set of I instances with J input variables and 1 corresponding output variable, xij , indicates
the input variable j of instance i, and yi indicates the output of instance i. Linear regression aims to
seek parameters β0 , ..., βJ to minimize the residual sum of squares (RSS) in Equation (1).
 2
I 
X J
X 
β0 + β j xij − yi 

RSS = (1)

 
i j
Lasso regression presented by Tibshirani [19] is an innovative variable selection method for
J
regression by minimizing RSS + λ β j where λ is a non-negative penalty parameter that balances
P
j
the emphasis between the reducing error and using small β coefficients [20]. Some β may reduce to
zero [21], which deselects that input variable, thus endowing LR with a method of pruning predictors
in the model [22].
3.2. Support Vector Regression (SVR)

SVR was proposed by Vapnik [23]. Different from general regression, of which the goal is to find
a function that can minimize the sum of square error, SVR allows the error to be smaller than ε [24].
Figure 2 shows the difference between general regression and SVR.
Appl. Sci. 2020, 10, 8488 5 of 12
Appl. Sci. 2020, 10, 8488 5 of 12
Figure 2. General regression and support vector regression

regression (SVR).
(SVR).
For
For aa dataset
dataset of
ofI Iinput-target
input-targetpair,
pair,{(x{(x y,1),y1…
1, 1 . . .I, y(xI)},
), (x I, y I )}, Equation
Equation (2) is (2)
the iscorresponding
the corresponding
linear
linear function.
function.
f (x) = hw, xi + b (2)
f(x) =〈w, x〉+ b (2)
In Equation (2), hw, xi denotes the dot product. SVR tries to minimize the margin and keep all
data lying
In Equation (2), 〈w,
inside [25]. The margin can bethe
x〉 denotes calculated
dot product. by Equations
SVR tries(3)–(5).
to minimize the margin and keep
all data lying inside [25]. The margin can be calculated by Equations (3)–(5).
1
Minimize kwk2 (3)
Minimize2 ‖ ‖ (3)
Subject
Subject to:
to:
yi − hw, xi − b ≤ (4)
− , − ≤ (4)
hw, xi + b − yi ≤ (5)
, + − ≤ (5)
However, it may not possible that all the pairs can be kept inside the margin when the margin
However,
becomes smaller.it may
SVRnot possible
gives thattoallthe
penalties thepairs
pairsoutside
can be the
keptmargin
inside by
theslack
margin ξi+ and
when the
variables ξ−
margin
i
.
becomes smaller. SVR gives penalties to the pairs
Then Equations (3)–(5) are rewritten as Equations (6)–(9).outside the margin by slack variables and .
Then Equations (3)–(5) are rewritten as Equations (6)–(9).
I
1 X
Minimize kwk
Minimize ‖ 2 ‖+ +
C ∑ ξi+(+ ξ+ −
i ) (6)
2
i=1
Subject to:
Subject to:
− , − ≤ + (7)
yi − hw, xi − b ≤ ε + ξi+ (7)
, ++b − −
hw, xi yi ≤ ≤ ξ−
ε++i (8)
ξi+ , ξ− ≥0 (9)
,i ≥ 0 (9)
The constant C
The constant (C >
C (C 0) represents
> 0) represents the
the trade-off
trade-off between
between the the margin
margin andand the
the extent
extent to
to which
which
deviations than ε are tolerated [26]. If the constant C
deviations larger than ε are tolerated [26]. If the constant C equals to zero, the minimizing problem
larger equals to zero, the minimizing problem
goes
goes back
back to
to linear
linear regression,
regression, which
which also
also means
means that
that the
the rule
ruleisistoo
toostrict
strict[27].
[27].
3.3. Lasso Regression + Support Vector Regression (LR+SVR)
3.3. Lasso Regression + Support Vector Regression (LR+SVR)
Since LR can perform as a variable selection operator [28], this research adopted LR for input
Since LR can perform as a variable selection operator [28], this research adopted LR for input
variable selection and then developed the VM model by SVR based on the selected variables.
variable selection and then developed the VM model by SVR based on the selected variables.
4. Parameter Adjustment Optimization
4. Parameter Adjustment Optimization
In the prepreg manufacturing process, there are two important output variables: The ratio of
resin In the
and gelprepreg manufacturing
time. This process,
research adopted the there are two
prediction important
tools output
to develop variables:
VM models for The
bothratio of
output
resin and gel time. This research adopted the prediction tools to develop VM models for both output
variables, respectively. Moreover, parameters such as temperatures of ovens and density of epoxy
Appl. Sci. 2020, 10, 8488 6 of 12
variables, respectively. Moreover, parameters such as temperatures of ovens and density of epoxy resin
are not easy to adjust, and engineers always want to minimize the adjustment of these parameters,
and adjust other parameters. Based on both VM models, this research aimed to minimize the sum
of adjustments to all parameters. A mathematical model was developed for optimizing parameter
adjustment and the required notation is defined as follows:
xcj : current value of input variable j;
x j : Suggested input variable j;
xnj : The normalized adjustment amounts of input variable j;
xij : The value of input variable j of instance i;
ynk : Normalized deviation value of output variable k;
ytk : Target value of output variable k;
ylk : Lower control limit of output variable k;
yuk : Upper control limit of output variable k;
H: Set of input variables which are hard to be adjusted.
The mathematical model aimed to minimize the sum of normalization of deviation between
output
values and their corresponding target values as given in Equation (10). In Equation (11),
VMk x1 , . . . , x J indicates the VM model of output variable k, and yk is the corresponding VM value
output value k. As the scales of output variables are quite different, this research normalized the
deviation between output values and their corresponding target values by Equation (12). Equation (13)
specifies the bounds of input variables. Equation (14) indicates that the input variables which are hard
to adjust have to retain their current value.
K
X
minimize ynk (10)
k =1
Subject to:
yk = VMk x1 , . . . , x J ∀k (11)
ytk − yk
ynk = ∀k (12)
yuk − ylk

min xij ≤ x j ≤ max xij ∀j (13)
∀i ∀i
xi = xci ∀i ∈ H (14)
5. Experimental Results
5.1. Data Description

This research used real-world data collected from the prepreg manufacturing department of a
plastic company in Taiwan. The demand for different products varies widely in the company. For the
products with high demand, as the manufacturing data has been studied for a long time, the engineers
can adjust input values very well in the light of their experience. Moreover, these high demand
products are always assigned to specific production lines. Thus the number of setups can be reduced
and increase the efficiency of these production lines. There are many different types of products with
low demand, but the corresponding manufacturing data is relatively sparse. Therefore, engineers lack
experience in setting up these product types. These low demand products are also assigned to specific
production lines. The both number of setups and setup time are very high, and this reduces efficiency
of these production lines. This research focuses on one production line which always produces low
demand product types.
The collected input and output variables are listed in Table 1. There are 12 input variables and
2 output variables. Among input variables, moving speed is the transfer speed of the glass cloth,
Appl. Sci. 2020, 10, 8488 7 of 12
which affects the heating time in the oven. Epoxy resin density is the density of resin which is the
raw material of the impregnation process. There were two oven temperatures, as shown in Figure 1;
one was the temperature of the oven on the left-hand side and the other was the temperature of the
oven on the right-hand side. There were two doctor rollers, the positions of which affected the volume
of resin that adhered to the glass cloth. Moreover, there were four fan speeds, as shown in Figure 1.
The first was located on the upper left for moving air into the oven, the second was located on the
lower left for moving air out of the oven, the third was located on the upper right for moving air out of
the oven, and the fourth was located on the lower right for moving air into the oven. Revolution speed
was the revolution speed of both doctor rollers. Gel time 1 indicated the gel time of the epoxy resin.
It was the time required for resin to become solid at a certain temperature.
Table 1. The input and output variables and their corresponding minimal and maximal values.
Input Variables Output Variable

X1 Moving speed 8.2, 6.8 Y1 Ratio of resin 50.3, 38.1
X2 Epoxy resin density * 1.19, 1.15 Y2 Gel time 2 175, 134
X3 Oven temperature 1 * 170.0, 143.2
X4 Oven temperature 2 * 179.5, 168.0
X5 Position of doctor roller 1 665, 370
X6 Position of doctor roller 2 665, 370
X7 Fan speed 1 1150, 1000
X8 Fan speed 2 1700, 1166
X9 Fan speed 3 1719, 1150
X10 Fan speed 4 1800, 550
X11 Revolution speed 0.00, −2.99
X12 Gel time 1 * 346, 309
* indicates the input variables are hard to adjust.
It can be noted that among input variables, epoxy resin density, oven temperature 1, oven temperature
2, and gel time 1 were hard to adjust. Epoxy resin density and gel time 1 were the attributes of the epoxy
resin, and the epoxy resin was made before it been put into the trough. Therefore, it was impossible to
adjust epoxy resin density and gel time 1. Both oven temperatures were increased by electric energy and
cost was required, but both oven temperatures were reduced by itself and time was required. Therefore,
engineers always want the temperature stay the same.
Regarding the output variables, the ratio of resin was the ratio of resin weight in prepreg, and gel
time 2 indicated the gel time of the resin in prepregs.
All instruments were well integrated, and almost input variables could be monitored and
controlled by a central control system. However, epoxy resin density, gel time 1, ratio of resin, and gel
time 2 could only be measured by manual labors. Every time when new parameters were set, the four
variables were entered into the system manually. Moreover, all of the instruments were calibrated
regularly. This research collected the manufacturing data of a production line which focuses on
producing low demand products. The products which are produced by a certain type of glass cloth
and a certain formula of epoxy resin were collected. From January 2018 to January 2019, 216 sets
manufacturing data were collected. The maximal and minimal values of all input and out variables
are shown in Table 1. Although the collected 216 sets of data were based on a single production line,
type of glass cloth, and formula of epoxy resin, the corresponding specifications were quite different
because they respond to customer needs. Consequently, the target values of the collected data were
not the same.
According to preliminary tests, there was an obvious relationship between the square and
interaction of input variables and output variables. This research also analyzed second order terms of
all input variables. This meant that the 12 input variables produced 90 terms (12 original variables,
12 squares of variables, and 66 interactions between variables).
5.2. VM Model Development
Based
Appl. Sci.on the
2020, 10,collected
8488 data, 66.7% of data were randomly selected for training and the remaining
8 of 12
33.3% were saved for testing. According to the introduction in Section 3, λ and C were important
5.2. VM
Appl. Model
Sci. 2020, 10, Development
8488
parameters for LR and SVR, respectively. This research adopted 10-fold cross-validation to8evaluate of 12
the trained BasedVMon the collected

model data, 66.7%
with different λ orof C.
dataThatwere randomly
means selected for
the training training
data and the remaining
was divided into 10 parts.
33.3%
For each
5.2. VM were
λ or saved
C, one
Model for testing. According to the introduction in Section
model was developed based on 9 parts, and validated by the remaining part.
Development 3, λ and C were important
parameters
Following the on for
same LR and SVR, respectively.
procedure, 1066.7%
models This
were research
developed adoptedand 10-fold cross-validation
thenforvalidated. The to evaluate
level of λ or C
Based the collected data, of data were randomly
the trained VM model with different λ or C. That means the training data was divided into selected training and the remaining
10 parts.
with33.3%
the lowest
were average
saved mean
formodel
testing. absolute
According percentage
to based error
the introduction (MAPE) of the
3, λ by10 validations
C were was then
For each λ or C, one was developed on 9 parts, in andSection
validated andthe remainingimportant
part.
adopted for training
parameters LRthe
for same and VM SVR,model with all theresearch
respectively. trainingadopted
data. 10-fold cross-validation to evaluate
Following the procedure, 10 modelsThis were developed and then validated. The level of λ or C
In this
the
with theresearch,
trained VM model
lowest LR was
average with trained
different
mean λ or percentage
based
absolute C.onThat
λ between
means
error 0(MAPE)
the to 2, and
training SVR
data
of thewaswas
10 trained
divided
validations intofor 10Cparts.
was between
then
0 to adopted
30. The for
For each λ
training
or C,
training results
one model
the VM ofmodel
wasLR and
developed
with SVR fortraining
based
all the the output
on 9 data. value “ratio of resin” are illustrated in
parts, and validated by the remaining part.
Following
Figures 3Inand the same procedure, 10 models were
this research, LR was trained based on λ betweenMAPE
4. When λ = 0.1 and C = 21.1 the developed
average and
0 to thenofvalidated.
2, and the
SVR wasThe
corresponding level for
trained λCor
oftrained C with
betweenLR and
the
SVR 0were lowest
to 30.the average
Thelowest. mean absolute
trainingRegarding
results of LR percentage
LR+SVR,
and SVRboth error
for the(MAPE)
λ output of
and C valuethe 10
needed validations
to of
“ratio was
beresin”
optimized.then adopted
A fullfor
are illustrated factor
in
training
Figures 3the
and VM 4. model
When with
λ = all
0.1 the
and training
C = 21.1 data.
the average MAPE of
experiment was constructed, and the results for output value “ratio of resin” are illustrated in Figurethe corresponding trained LR and
In this theresearch, LRRegarding
was trained based onboth λ between and 0Coftoneeded
2, and SVRto bewas trained for Cfull between
5. In SVR
Figure were 5, only lowest.
the lowest averageLR+SVR, MAPE of each λ level λ are shown. optimized.
When λ = A 0.05 the factor
average
0experiment
to 30. Thewas training results
constructed, of
and LRtheand SVR
results for
for the
output output
value value
“ratio “ratio
of of
resin” resin”
are are
illustrated illustrated
in Figure in
MAPE was the lowest with C = 1.75. The final training and testing results are shown in Table 2.
Figures
5. 3 and
In Figure 5, 4. When
only the λlowest
= 0.1 and C = 21.1
average MAPE the average MAPE
of each level ofof the corresponding
λmodels.
are shown. When λtrained
= 0.05 LR and SVR
Table
were 2 shows
the lowest. the performance
Regarding LR+SVR, of the
both sixλ developed
and C needed VM to be It was
optimized. A found
full thatthe
factor using average
experiment LR+SVR
MAPE was the lowest with C = 1.75. The final training and testing results are shown in Table 2.
and was
taking second
constructed, order
and theterms into consideration
results for output value had
“ratio ofVMthe best
resin” performance
are illustrated in
in Figuretraining and
5. In Figure testing
5,
Table 2 shows the performance of the six developed models. It was found that using LR+SVR
results
only for both
the lowest output
average variables.
MAPE This research then
level of λ are had adopted the two developed
λ = 0.05 the average VM modelsthe for
and taking second order termsofintoeachconsideration shown.the When
best performance in training MAPE andwas testing
optimizing
lowest
results withthe both
for C = 1.75.
adjustment
output offinal
input
Thevariables. variables.
training
This and testingthen
research results are shown
adopted the two in Table 2.
developed VM models for
optimizing the adjustment of input variables.
Figure
Figure
Figure 3.
3. Training
Training
3. Training results for
resultsfor
results lasso
forlasso regression
lasso regression (LR)
regression(LR) with
(LR)with
withdifferent
different λ. λ.
λ.
different
Figure 4. Training different C.

Training results of SVR with different C.
Figure 4. Training results of SVR with different C.

Appl. Sci. 2020, 10, 8488 9 of 12
Appl. Sci. 2020, 10, 8488 9 of 12
Appl. Sci. 2020, 10, 8488 9 of 12
Figure
Figure 5. Validation
5. Validation resultsfor
results for LR+SVR
LR+SVR with
withdifferent λ and
different C. C.
λ and
Table The
2. 2.
Table Thetraining
training and testing
and testing results.
results.
Ratio of
Ratio ofResin
Resin Gel Time of Resin
Gel Time of in Prepregs
Resin in Prepregs
Training Testing Training Testing
1 LR Training
2.01% Testing
1.89% Training
3.60% 3.94% Testing
1 2 LR SVR 2.04%
2.01% 1.74%
1.89% 3.42%3.60% 3.31% 3.94%
2 3 SVRLR+SVR 1.85%
2.04% 1.73%
1.74% 3.07%3.42% 3.12% 3.31%
3 4 LR+SVR
2nd order+LR 2.07%
1.85% 1.98%
1.73% 3.51%3.07% 2.93% 3.12%
4 52nd order+LR
2nd order+SVR 2.13%
2.07% 1.72%
1.98% 3.30%3.51% 2.73% 2.93%
Figure 5. Validation results for LR+SVR with different λ and C.
5 6 2nd
2nd order+LR+SVR 2.13%
order+SVR 1.54% 1.41%
1.72% 2.72%3.30% 2.59% 2.73%
6 2nd order+LR+SVRTable 1.54%
2. The training1.41% 2.72%
and testing results. 2.59%
5.3. Optimizing Adjustment of Input Variables
Ratio of Resin Gel Time of Resin in Prepregs
Once the VM models were developed, the mathematical model proposed in Section 4 could be
Table 2 shows the performance of theTraining Testing VM
six developed Training
models. It wasTesting
found that using LR+SVR
adopted to optimize
1 adjustment
LR of input variables.
2.01% This
1.89% research
3.60% tested the mathematical
3.94% model on
and taking second
the 33.3% order
of test terms into consideration had the best performance in training and testing
2data usingSVRthe software Excel Solver.
2.04% 1.74% The average
3.42% normalized3.31%deviation of output
results for bothfor
variables output 3 ofvariables.
ratio resin This
and gel
LR+SVR timeresearch
21.85% then
with the1.73% adopted
test data were theand
0.227
3.07% two0.266,
developed
3.12% VMBased
respectively. models for
optimizing the
on the adjustment
adjustment 4 usingofthe
2nd input
order+LRvariables.
results 2.07%
of the 1.98% model,
mathematical 3.51%
the average 2.93%
normalized deviation of
output variables5 for 2nd order+SVR
ratio of resin and2.13%gel time1.72% 3.30%
2 in the test data were2.73%
5.642 × 10−6 and 0.117,
5.3. Optimizing 6
Adjustment
respectively. 2nd
The results order+LR+SVR
of are
Input 1.54% 1.41% 2.72%
Variablesin Figure 6 by box plots.
summarized 2.59%
Once5.3.
theOptimizing
VM models Adjustment
wereofdeveloped,
Input Variablesthe mathematical model proposed in Section 4 could be
adopted to optimize
Once theadjustment
VM models were of input variables.
developed, This research
the mathematical tested the
model proposed mathematical
in Section 4 could be model on
the 33.3% adopted to optimize
of test data usingadjustment of input
the software variables.
Excel ThisThe
Solver. research tested the
average mathematical
normalized model on of output
deviation
the 33.3% of test data using the software Excel Solver. The average normalized deviation of output
variables for ratio of resin and gel time 2 with the test data were 0.227 and 0.266, respectively. Based on
variables for ratio of resin and gel time 2 with the test data were 0.227 and 0.266, respectively. Based
the adjustment using theusing
on the adjustment results of theof mathematical
the results the mathematical model,
model, thethe average
average normalized
normalized deviationdeviation
of of
output variables for ratio for
output variables of resin and
ratio of geland
resin timegel2 in the2 test
time data
in the testwere
data 5.642 × 10−6
were 5.642 and
× 10 0.117,
−6 and respectively.
0.117,
The resultsrespectively. The results
are summarized inare summarized
Figure in Figure
6 by box plots.6 by box plots.
Figure 6. Performance of proposed mathematical model.
6. Performance
FigureFigure 6. Performanceof
ofproposed mathematical
proposed mathematical model.
model.
Appl. Sci. 2020, 10, 8488 10 of 12
On average, the proposed mathematical model provided better input variables which can produce
output variables that are closer to target values. However, as shown in Figure 6, the proposed
mathematical model
Appl. Sci. 2020, produced results for output variables of gel time 2 that were far from10the
10, 8488 target
of 12
values in some test data. This research then modified the mathematical model proposed in Section 4.
On average, the proposed mathematical model provided better input variables which can
Firstly, in order to minimize the deviation of output variables, Equation (15) was added to ensure that
produce output variables that are closer to target values. However, as shown in Figure 6, the
the resulting output variables were equal to the target variables. Thus, Equation (12) is redundant in
proposed mathematical model produced results for output variables of gel time 2 that were far from
the modified
the targetmodel.
values in some test data. This research then modified the mathematical model proposed in
Section 4. Firstly, in order to minimize the = ytk
yk deviation ∀k variables, Equation (15) was added to (15)
of output
ensure that the resulting output variables were equal to the target variables. Thus, Equation (12) is
Then the objective function was replaced by Equation (16), in order to minimize the normalization
redundant in the modified model.
of adjustment of all input variables. As the scales of input variables were quite different, this research
normalized the adjustment of input variables by = adding Equation
∀k (17). (15)
Then the objective function was replaced by Equation (16), in order to minimize the
J
normalization of adjustment of all input variables. X Asn the scales of input variables were quite
minimize x j variables by adding Equation (17).
different, this research normalized the adjustment of input (16)
j=1
(16)
xcj − x j
n
xj = ∀j (17)
max xij − min xij
∀i = ∀i ∀ (17)
∀ ∀
From Equation (16), the deviation of output variables of the modified mathematical model was
reduced toFrom
zero.Equation
The results(16),for
thethe
deviation of output
adjustment variables
of input of the
variables modified
were mathematical
illustrated in Figuremodel
7. In was
Figure 7,
reduced to zero. The results for the adjustment of input variables were illustrated
the mathematical model proposed in Section 4 is called mathematical model 1, and the mathematical in Figure 7. In
Figure 7, the mathematical model proposed in Section 4 is called mathematical model 1, and the
model modified in this section is called mathematical model 2. In Figure 7, input variables X2 , X3 , X4 ,
mathematical model modified in this section is called mathematical model 2. In Figure 7, input
and X12 were hard to adjust, and their corresponding adjustment for all instances was very close to
variables X2, X3, X4, and X12 were hard to adjust, and their corresponding adjustment for all instances
zero. was
Moreover, the average
very close to zero.adjustment
Moreover, oftheinput variables
average produced
adjustment by mathematical
of input model by
variables produced 2 were
smaller
mathematical model 2 were smaller than those of mathematical model 1; although, in rare instances,input
than those of mathematical model 1; although, in rare instances, the adjustments of
variables X1 and X11 produced
the adjustments by mathematical
of input variables X1 and Xmodel 2 were
11 produced bybigger. Mathematical
mathematical model 2model
were 2bigger.
was more
robustMathematical
in providing values
model of input
2 was variables
more robust with lessvalues
in providing adjustment.
of input variables with less adjustment.
Figure 7. Comparison of two mathematical models.

Figure 7. Comparison of two mathematical models.
6. Conclusions
6. Conclusions
VM has been adopted in many industrial settings. However, it only provides estimates of quality
VM has been adopted in many industrial settings. However, it only provides estimates of quality
characteristics based on the given manufacturing parameters. The target values of prepreg quality
characteristics based on the given manufacturing parameters. The target values of prepreg quality
characteristics are are
characteristics always different
always different and dependonon
and depend thethe requirements
requirements of customers.
of customers. The engineers
The engineers have
have to
to adjust manufacturing parameters in order to achieve the quality characteristics that that
adjust manufacturing parameters in order to achieve the quality characteristics meetmeetthe the
customers’ needs.
customers’ However,
needs. somesome
However, manufacturing parameters
manufacturing are very
parameters difficult
are very to adjust.
difficult In this
to adjust. In research,
this
research,
VM models forVM models for
all quality all quality characteristics
characteristics were developed wereusing
developed
LR and using
SVR. LRThen
and the
SVR. Then the VM
developed
Appl. Sci. 2020, 10, 8488 11 of 12
models were integrated into a mathematical model which was intended to optimize the adjustment of all
manufacturing parameters and achieve the quality characteristics that meet the demands of customers.
According to the experimental results, taking second order terms into consideration and adopting LR
for input variable selection, then developing the VM model using SVR based on the selected variables,
produces the best performance when estimating quality characteristics. The proposed mathematical
model produced values for the manufacturing parameters whose corresponding quality characteristics
were equal to the target values, without the need for adjustment of the difficult-to-adjust manufacturing
parameters. At the same time, the adjustment of the easy-to-adjust manufacturing parameters could
be minimized. Thus, the proposed methodology not only reduced the number of setups, but could
also reduce the setup time, and consequently increase productivity.
Author Contributions: Conceptualization, Y.K.; methodology, Y.K. and S.-H.C.; software, S.-H.C.; validation,
Y.K. and S.-H.C.; formal analysis, Y.K. and S.-H.C.; investigation, Y.K.; resources, Y.K.; data curation, Y.K.;
writing—original draft preparation, Y.K.; writing—review and editing, Y.K.; visualization, S.-H.C.; supervision,
Y.K.; project administration, Y.K.; funding acquisition, Y.K. All authors have read and agreed to the published
version of the manuscript.
Funding: This research received no external funding.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Nguyen, G.; Li, X.; Blanton, R.D.; Li, X. Partial Bayesian Co-training for Virtual Metrology. IEEE Trans.
Industr. Inform. 2019. [CrossRef]
2. Kim, M.; Kang, S.; Lee, J.; Cho, H.; Cho, S.; Park, J.S. Virtual metrology for copper-clad lamination
manufacturing. Comput. Ind. Eng. 2017, 109, 280–287. [CrossRef]
3. Jonathan, C.Y.C.; Cheng, F.T. Application development of virtual metrology in semiconductor industry.
In Proceedings of the 31st Annual Conference of IEEE Industrial Electronics Society 2005, Raleigh, NC, USA,
6–10 November 2005; pp. 124–129.
4. Hung, M.H.; Lin, T.H.; Cheng, F.T.; Lin, R.C. A Novel Virtual Metrology Scheme for Predicting CVD Thickness
in Semiconductor Manufacturing. IEEE ASME Trans. Mechatron. 2007, 12, 308–316. [CrossRef]
5. Kang, P.; Lee, H.J.; Cho, S.; Kim, D.; Park, J.; Park, C.K.; Doh, S. A virtual metrology system for semiconductor
manufacturing. Expert Syst. Appl. 2009, 36, 12554–12561. [CrossRef]
6. Lin, T.H.; Cheng, F.T.; Wu, W.M.; Kao, C.A.; Ye, A.J.; Chang, F.C. NN-Based Key-Variable Selection Method
for Enhancing Virtual Metrology Accuracy. IEEE Trans. Semicond. Manuf. 2009, 22, 204–211. [CrossRef]
7. Zeng, D.; Spanos, C.J. Virtual metrology modeling for plasma etch operations. IEEE Trans. Semicond. Manuf.
2009, 22, 419–431. [CrossRef]
8. Lynn, S.A.; Ringwood, J.; MacGearailt, N. Global and local virtual metrology models for plasma etch process.
IEEE Trans. Semicond. Manuf. 2012, 25, 94–103. [CrossRef]
9. Susto, G.A.; Pampuri, S.; Schirru, A.; Beghi, A.; Nicllao, G.D. Multi-step virtual metrology for semiconductor
manufacturing: A multilevel and regularization methods-based approach. Comput. Oper. Res. 2015, 53,
328–337. [CrossRef]
10. Kang, P.; Kim, D.; Cho, S. Semi-supervised support vector egression based on self-training with label
uncertainty: An application to virtual metrology in semiconductor manufacturing. Expert Syst. Appl. 2016,
51, 85–106. [CrossRef]
11. Jia, X.; Di, Y.; Feng, J.; Yang, Q.; Dai, H.; Lee, J. Adaptive virtual metrology for semiconductor chemical
mechanical planarization process using GMDH-type polynomial neural networks. J. Process Control 2018, 62,
44–54. [CrossRef]
12. Su, Y.C.; Hung, M.H.; Cheng, F.T.; Chen, Y.T. A Processing Quality Prognostics Scheme for Plasma Sputtering
in TFT-LCD Manufacturing. IEEE Trans. Semicond. Manuf. 2006, 19, 193–194. [CrossRef]
13. Su, Y.C.; Lin, H.L.; Cheng, F.T.; Wu, W.M. Accuracy and Real-Time Considerations for Implementing Various
Virtual Metrology Algorithms. IEEE Trans. Semicond. Manuf. 2008, 21, 426–434.
14. Cheng, F.T.; Huang, H.C.; Kao, C.A. Developing an Automatic Virtual Metrology System. IEEE Trans. Autom.
Sci. Eng. 2012, 9, 181–188. [CrossRef]
Appl. Sci. 2020, 10, 8488 12 of 12
15. Hung, M.H.; Tsai, W.H.; Yang, H.C.; Kao, Y.J.; Cheng, F.T. A novel automatic virtual metrology system
architecture for TFT-LCD industry based on main memory database. Robot. Comput. Integr. Manuf. 2012, 28,
559–568. [CrossRef]
16. Yang, H.C.; Tieng, H.; Chen, F.T. Automatic virtual metrology for wheel machining automation. Int. J.
Prod. Res. 2016, 54, 6367–6377. [CrossRef]
17. Yang, H.C.; Tieng, H.; Chen, F.T. Total precision inspection of machine tools with virtual metrology. J. Chin.
Inst. Eng. 2016, 39, 221–235. [CrossRef]
18. Hsieh, Y.M.; Lin, C.Y.; Yang, Y.R.; Hung, M.H.; Cheng, F.T. Automatic Virtual Metrology for Carbon Fiber
Manufacturing. IEEE Robot. Autom. Lett. 2019, 4, 2730–2737. [CrossRef]
19. Tibshiranti, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Series B Stat. Methodol. 2019, 1,
267–288.
20. Park, C.; Kim, Y.; Pare, Y.; Kin, S.B. Multitask learning for virtual metrology in semiconductor manufacturing
systems. Comput. Ind. Eng. 2018, 123, 209–219. [CrossRef]
21. Li, Y.; Li, Y.; Sun, Y. Online static security assessment of power systems based on Lasso algorithm. Appl. Sci.
2018, 8, 1442. [CrossRef]
22. Spencer, B.; Alfandi, O.; Al-Obeidat, F. A refinement of Lasso regression applied to temperature forecasting.
Procedia Comput. Sci. 2018, 130, 728–735. [CrossRef]
23. Vapnik, V. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 1995.
24. Anusha, K.S.; Ramanathan, R.; Jayakumar, M. Link distance-support vector regression (LD-SVR) based device
free localization technique in indoor environment. Eng. Sci. Technol. Int. J. 2020, 23, 483–493. [CrossRef]
25. Chanklan, R.; Kaoungku, N.; Suksut, K.; Kerdprasop, K.; Kerdprasop, N. Runoff Prediction with a combined
artificial neural network and support vector regression. Int. J. Mach. Learn. Comput. 2018, 8, 39–43. [CrossRef]
26. Cheng, Y.; Peng, J.; Gu, X.; Zhang, X.; Liu, W.; Zhou, Z.; Yang, Y.; Huang, Z. An intelligent supplier evaluation
model based on data-driven support vector regression in global supply chain. Comput. Ind. Eng. 2020,
139, 105843. [CrossRef]
27. Tao, D.W.; Ma, Q.; Li, S.L.; Xie, Z.N.; Lin, D.X.; Li, S.Y. Support vector regression for the relationships between
ground motion parameters and macroseismic intensity in the sichuan–yunnan region. Appl. Sci. 2020,
10, 3086. [CrossRef]
28. Yang, X.; Wen, W. Ridge and Lasso regression models for cross-version defect prediction. IEEE Trans. Reliab.
2018, 67, 885–896. [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional
affiliations.
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Applsci 10 08488 v2

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Applsci 10 08488 v2

Uploaded by

Copyright:

Available Formats

applied

Appl. Sci. 2020, 10, 8488; doi:10.3390/app10238488 www.mdpi.com/journal/applsci

3. Virtual Metrology Modeling

3.1. Lasso Regression (LR)

3.2. Support Vector Regression (SVR)

Figure 2. General regression and support vector regression

5.1. Data Description

Input Variables Output Variable

the trained BasedVMon the collected

Figure 4. Training different C.

Figure 4. Training results of SVR with different C.

Appl. Sci. 2020, 10, 8488 9 of 12

Figure 6. Performance of proposed mathematical model.

Figure 7. Comparison of two mathematical models.

You might also like