Professional Documents
Culture Documents
Article
Sustainable Technology Analysis Using Data
Envelopment Analysis and State Space Models
Jong-Min Kim 1 , Bainwen Sun 1 and Sunghae Jun 2, *
1 Statistics Discipline, Division of Sciences and Mathematics, University of Minnesota-Morris, Morris,
MN 56267, USA
2 Department of Big Data and Statistics, Cheongju University, Chungbuk 28503, Korea
* Correspondence: shjun@cju.ac.kr; Tel.: +82-10-7745-5677
Received: 5 May 2019; Accepted: 27 June 2019; Published: 29 June 2019
Keywords: data envelopment analysis; patent keywords; state space model; technological
sustainability; Apple technology
1. Introduction
Researchers have been trying to forecast future technology for various purposes [1]. For example,
using the results of technology forecasts, a company can undertake a variety of technological innovations
to improve its technological competitiveness in the market [2]. The ability to predict the future state of
technology can have a significant impact on a company. Thus, technology forecasting enables companies
to develop better research and development (R&D) strategies. It may also identify potential business
opportunities in the market and provide the opportunity to identify a new product. Recently, various
methods for technology forecasting have been researched in diverse fields using statistics and machine
learning algorithms [2–5]. Researchers have used graphical methods, ensemble modeling, Bayesian
inference, and copula modeling to analyze patent documents for technology analysis, because a patent
contains many examples of developed technology [6]. In this paper, we introduce a methodology for
technology forecasting by analyzing patent document data.
We consider additive model and directional distance function of data envelopment analysis
(DEA) for technology analysis. In previous research, DEA was used for constructing patent-enhancing
strategies in Korean industries [7]. It has been used to find the empirical measure of productive
efficiency for a decision-making unit (DMU) [8,9], and in this study, we use this method for patent data
analysis. In our research, this unit can be considered to be one of many items, such as patent keywords,
representing the sub-technology. Our research proposes a new method to address the technological
relation between patent keywords to support R&D planning in the management of technology (MOT).
In this study, we conducted two analyses of additive model and directional distance function and
combined these results to understand the sustainable technology in a given target domain. We also
considered the state space model (SSM) to forecast the future state of technology. This is similar to the
hidden Markov model (HMM), and the only difference from the HMM is the use of a continuous value
for the hidden state. The SSM is very well suited for time series forecasting [10].
In this paper, we used a local level model as the SSM for sustainable technology analysis.
This model is based on Gaussian distribution and provides good results for time series forecasting
of technology. The input data applied to the SSM are the international patent classification (IPC)
codes containing detailed information about developed technology. An IPC code is defined by
its corresponding technology [11]. Finally, we combine the results of DEA and the SSM to find
technological sustainability. Sustainable technology is a technology sustaining the technological
competitiveness of a nation or company in the global market [5,12–16]. The goal of this paper is to
find the sustainable technology structure of a target domain using DEA and the SSM. To illustrate the
validity and performance of our study, we conducted a case study using the patent documents used
and registered by Apple, because Apple is one of leading companies in technological innovation [17].
The remainder of this paper is organized as follows. First, we explain the research background
related to our paper in Section 2. In Section 3, we explain the proposed methodology of our research.
We also show the result of our case study of Apple’s patent data in Section 4. Lastly, in Section 5, we
present our conclusions and discuss our future work related to sustainable technology analysis.
2. Research Background
u1 y1 + u2 y2 + · · · + um ym
e= , (1)
v1 x1 + v2 x2 + · · · + vn xn
where V = (v1 , v2 , . . . , vn ) and U = (u1 , u2 , . . . , um ) are the weights for inputs and outputs. The V and
U are determined by linear programming called multiple form as follows [8,18]:
Xm Xm Xn Xn
e = Maxv,u ui yi,j∗ , s.t. ui yij − vi xij ≤ 0, j = 1, 2, . . . , j∗ , . . . , N, vi xi,j∗ , (2)
i=1 i=1 i=1 i=1
where vi (i = 1, n) and ui (i = 1, m) are larger than or equal to 0, respectively. In DEA, the highest
efficiency is set to 1, and the relative ratio to this value is calculated and used for analysis. If the value
of the objective function is equal to one or close to one, the evaluated DMU is efficient. It is used to
measure the productive efficiency of a DMU. In our study, DMU is a technology; it has control over
its inputs—the occurred frequencies of technological keywords. DMU can also be expected to have
control over its outputs—the technological performances of technological keywords. The technique of
DEA is based on linear programming to evaluate the efficiency of a technological keyword in target
Sustainability 2019, 11, 3597 3 of 19
technology. The strategy of DEA is determined by controlling the inputs and outputs of the DMU. After
After determining the strategy, DEA is performed on the observed units using linear programming
determining the strategy, DEA is performed on the observed units using linear programming [8,9,19].
[8,9,19].
2.2. Sustainable Technology
2.2. Sustainable Technology
Most companies have built and implemented enterprise-wide strategies for sustainability and are
Most companies
committed to developing have built andtechnology.
sustainable implemented enterprise-wide
Sustainable technology strategies for sustainability
is a technology and
that can sustain
are
the committed
technological to developing sustainable
competitiveness technology.
of a company Sustainable
[2]. Leading technology
companies, suchisasa technology that can
Apple or Samsung,
sustain the technological competitiveness of a company [2]. Leading companies,
have carried out technological innovation continuously with respect to their sustainable technologies. such as Apple or
Samsung, have carried out technological innovation continuously with respect
Many studies related to sustainable technology have been performed in diverse technology domains. to their sustainable
technologies.
Jun Many Bayesian
(2018) proposed studies related to sustainable
count data modeling for technology have been sustainability
finding technological performed in [2]. diverse
This
technology domains. Jun (2018) proposed Bayesian count data modeling
applied Bayesian inference and Poisson probability model to the proposed method for extractingfor finding technological
sustainability [2]. This applied Bayesian inference and Poisson probability model to the proposed
technological relations between technologies. Park et al. (2015) developed a network model for selecting
method for extracting technological relations between technologies. Park et al. (2015) developed a
sustainable technology [5]. They used graph theory and social network analysis for their proposed
network model for selecting sustainable technology [5]. They used graph theory and social network
model, and international patent classification codes as input data. Kim et al. (2015) introduced various
analysis for their proposed model, and international patent classification codes as input data. Kim et
growth curve methods for technology management of the defense industry [13]. They also combined
al. (2015) introduced various growth curve methods for technology management of the defense
the Delphi technique with surveys and applied the combined results to the growth curve model.
industry [13]. They also combined the Delphi technique with surveys and applied the combined
In addition, sustainable technology analysis using various analytical techniques has been performed in
results to the growth curve model. In addition, sustainable technology analysis using various
diverse technology fields [4,12,14–16].
analytical techniques has been performed in diverse technology fields [4,12,14–16].
3. Sustainable Technology Analysis Using Data Envelopment Analysis and State Space Models
3. Sustainable Technology Analysis Using Data Envelopment Analysis and State Space Models
In this paper, we propose a new methodology for patent data analysis for sustainable technology
In this
analysis. paper,
First, we propose
we applied a new methodology
the additive for patentdistance
model and directional data analysis
function for of
sustainable
DEA to the technology
proposed
analysis.
methodology.First,Wewealso
applied the additive
used patent model
document dataand directional
as input data fordistance
the DEAfunction of DEApatents
model, because to the
proposed methodology. We also used patent document data as input data for
have detailed and rich information about developed technology, such as the titles, abstracts, claims, the DEA model,
because
inventors’ patents
names,have detailed
applied and rich
and issued information
dates, about
figures, etc. [1,6].developed technology,
After determining the such
targetastechnology,
the titles,
abstracts,
we claims,
collected inventors’
the patent names, applied
documents related and issued
to the targetdates, figures, from
technology etc. [1,6]. After
patent determining
databases the
all over
target technology, we collected the patent documents related to the target
the world [20,21]. Next, we extracted patent keywords from the patent documents. The keywords technology from patent
databases all over the
represent various world [20,21]. related
sub-technologies Next, wetoextracted
the targetpatent keywords
technology. In thefrom the patent
process documents.
of preprocessing
The
keyword data, we use various text mining techniques based on R data language and itsprocess
keywords represent various sub-technologies related to the target technology. In the provided of
preprocessing keyword data, we use various text mining
package [22,23]. Our text mining process is shown in Figure 1. techniques based on R data language and
its provided package [22,23]. Our text mining process is shown in Figure 1.
where si and s0 are the input and output slack vectors, respectively, and e and e’ are conformable
vectors. The dual problem to the above model is shown as follows [9,19,24–28]:
As we maximize the input and output slacks simultaneously, the units in the optimal basis of
Min 𝑣𝑋𝑡 + 𝑢𝑌𝑡 , s. t., 𝑣𝑋𝑗 + 𝑢𝑌𝑗 ≥ 0; 𝑗 = 1, … , 𝑛; 𝑣 ≥ +1; 𝑢 ≤ −1. (4)
Equation (2) are always Pareto–Koopmans efficient unlike in the case of the oriented models [24].
UsingAs wemodel,
this maximize the the
we find input and output
efficiency slacks
of each simultaneously,
patent keyword. Wethe useunits in the
another optimal
method basis to
of DEA of
Equation
obtain the (2) are always
efficiency of thePareto–Koopmans
technology keyword. efficient unlike
This is in the distance
directional case of the oriented
function. Themodels [24].
directional
Using this
output model,
distance we findisthe
function theefficiency of each version
output-oriented patent keyword. We use
of the benefit another
function methodThe
[29–31]. of DEA
benefitto
obtain the efficiency of the technology keyword. This is directional distance function. The
function is applied to diverse optimization areas. The directional distance function is a generalized directional
output
form distance
based function
on a radial is the
model as output-oriented
follows [32,33]: version of the benefit function [29–31]. The benefit
function is applied to diverse optimization areas. The directional distance function is a generalized
form based on a radial model Max B,as s.follows + Bgx ≤ x0 , YA − Bg y ≥ y0 , A ≥ 0,
t., XA [32,33]: (5)
Max
where gx and g y are direction B, s. t.,
vectors 𝑋𝐴 +to𝐵𝑔
related ≤ 𝑥0 , 𝑌𝐴
x 𝑥(input) and−y𝐵𝑔 𝑦 ≥ 𝑦0 , and
(output) 𝐴 ≥ B0,is un-efficiency measure.
(5)
This
where is formulated
𝑔𝑥 and 𝑔𝑦on undesirable
are outputsrelated
direction vectors as followsto x[32,33]:
(input) and y (output) and B is un-efficiency
measure. This is formulated on undesirable outputs as follows [32,33]:
Max B, s. t., XA + Bgx ≤ x0 , YA − Bg y ≥ y0 , B0 A + Bgb ≤ b0 , A ≥ 0. (6)
Max B, s. t., 𝑋𝐴 + 𝐵𝑔𝑥 ≤ 𝑥0 , 𝑌𝐴 − 𝐵𝑔𝑦 ≥ 𝑦0 , 𝐵′ 𝐴 + 𝐵𝑔𝑏 ≤ 𝑏0 , 𝐴 ≥ 0. (6)
Equation (4) shows a directional between good (y) and bad (b) outputs. Using a direction vector
Equation (4) shows a directional between good (y) and bad (b) outputs. Using a direction vector
gb , we control the bad outputs (b). In addition, all direction vectors are non-negative. We control the
𝑔𝑏 , we control the bad outputs (b). In addition, all direction vectors are non-negative. We control the
good and bad outputs at the same time. Both have meaning in technology analysis. In the process
good and bad outputs at the same time. Both have meaning in technology analysis. In the process of
of finding the final sustainable technology area, good output strengthens the technology association,
finding the final sustainable technology area, good output strengthens the technology association,
and bad output weaken the association. In our study, we combine the results of the additive model
and bad output weaken the association. In our study, we combine the results of the additive model
and directional distance function to find the technological sustainability in the target technology. The
and directional distance function to find the technological sustainability in the target technology. The
technological structure between two keywords X and Y is shown in Figure 2.
technological structure between two keywords X and Y is shown in Figure 2.
each interaction and visualized them. We search the patent documents used and registered by Apple.
Using text mining techniques, we also extract keywords and built structured data. This data type is a
matrix of patents (rows) and keywords (columns). The element of this matrix is the frequency value of
a specific keyword in the patent document. We use this matrix for input and output measurements
of DEA.
In this paper, we evaluate the state space model as another approach to patent analysis for
sustainable technology. The SSM is similar to the hidden Markov model (HMM), and the only
difference is that the hidden state of the HMM is discrete and the state of the SSM is continuous. The
general form of the SSM is as follows [10]:
where Zt , µt , and Yt are the hidden state, input, and observation, respectively. G and H are the
transition and observation models. εt and δt are the noises of system and observation at time t. The
goal of the SSM is to estimate the belief state P(Zt Y1:t ) and predict future observation P(Yt+1 Y1:t )
using the beliefs of the hidden state. To understand the technological trend for sustainable technology,
we use the SSM for time series forecasting, because the SSM is an efficient approach to time series
forecasting [10]. The SSM makes a generative model using hidden variable integration and posterior
prediction. This is local level model based on Gaussian distribution as follows [10]:
Yt = Zt + εY Z
t , Zt = Zt−1 + εt , (8)
where Yt and Zt are already explained in Equation (5) and εY t and εt are distributed to Gaussian with
Z
same mean 0 and variances VY and VZ , respectively. The forecasting process of the local level model is
carried out as follows:
{Zt−1 } → {Zt } → {Yt }. (9)
In our research, we develop a structural time series model (local level model) to find the
technological trend. In this paper, we use the international patent classification codes by year in the
time series model, because the IPC codes contain various technological descriptions on the target
domain. We combine the results of DEA and the SSM to find technological areas for technological
sustainability of a company. Thus, the final goal of our research is to build the R&D strategy for
the technological sustainability of the target technology. A company can improve its technological
competitiveness in the market using the results of our research. Next, we illustrate a case study
implementing our proposed method. In this study, we show the technological efficiencies of five
patent keywords (system, data, device, media, user) using DEA. We also illustrate the technological
structure of high-ranked IPC codes using the SSM. Combining the results of DEA and the SSM, we
find sustainable areas of the given technology.
4. Case Study
To show the performance and validity of our research, we carried out a case study on Apple
technology. We retrieved patent documents used by Apple [20,21]. The documents used in this study
were filed between 1980 and 2010. We used R and its Package to compute the efficiencies by using
additive model and directional distance function [22,23,34]. First, we extracted the technological
keywords from the retrieved patent documents via text mining techniques [22]. Based on expert
opinions and previous studies [3,35], we identified five key keywords as follows: system, media,
data, user, and device. They represent Apple’s sub-technologies. Using them, we performed five
experiments using DEA. For each experiment, we changed the combination of keywords used as input
and output variables. We did so to understand the association between Apple’s sub-technologies
and the keywords. In this paper, we conclude that these five technology keywords are the basis of
the technology that supports Apple’s innovative products. In addition, the keywords were used
Sustainability 2019, 11, 3597 6 of 19
as variables in our models. Each keyword represents a sub-technology of Apple. For example, the
data keyword represents the Apple technology related to data collecting, integrating, preprocessing,
analyzing, visualizing, etc. We used a total of 8114 patent documents associated with DMUs. We
applied additive model and directional distance function to measure productivity for the efficiency of
Apple technological keywords. Table 1 shows the results of an additive model in which the Y variable
is system and the others are X variables.
We maintained variable Y (system) as a constant and changed variables Xs (data, device, media,
user). As the results shown in Table 1, all the Xs has a technological impact on system. However,
during the first 10 years, only media had an impact on the system, and its effect was mainly on its
stability. More precise implications occurred after 1990, except for device. The variable of data had two
peaks and its trend showed growth peak and then decline with 2010 being the most substantial time.
With respect to the variable device, the apparent impact occurred around 2000, and its roughest time
occurred approximately in 2009 then decreased in approximately 2010. For the media variable, a peak
occurred around 1992 and then reduced to zero. After 1995, it increased steadily and tended to stabilize.
In 2005, it reached its highest essential time and then decreased. For the user variable, the peak occurred
around 1993, and after 1996, the growth tended to stabilize and then reduce. Figure 3 illustrates the
results of the directional distance function for Y = system and X = (media, data, user, device).
Sustainability 2017, 9, x FOR PEER REVIEW 7 of 21
Sustainability 2017, 9, x FOR PEER REVIEW 7 of 21
Sustainability 2019, 11, 3597 7 of 19
Figure 3. Result of directional distance function: Y = system, X = (media, data, user, device).
Figure 3. Result of directional distance function: Y = system, X = (media, data, user, device).
Figure 3. Result of directional distance function: Y = system, X = (media, data, user, device).
According to the results of the directional distance function, during the first 10 years, there was
According
no impact for allto thevariables.
results of the directional distance function, duringtothe
thefirst 10 years, there was
According tothe
the results ofAfter 1990, the growth
the directional trend
distance was similar
function, during the additive
first model,
10 years, except
there was
no impact
forimpact for
the media all the variables.
variable. After
The media 1990, the growth trend was similar to the additive model, except
no for all the variables. After variable
1990, thehad no obvious
growth trend was impacts
similarbefore 2000, butmodel,
to the additive after 2000, it
except
for the media
increased variable.
rapidly and The media
arrived at its variable
peak in had before
2005 no obvious impacts
decreasing. before
Using the2000, butofafter
results the 2000, it
additive
for the media variable. The media variable had no obvious impacts before 2000, but after 2000, it
increased
model andrapidly and arrived
directional distanceatfunction,
its peak in 2005 before
a diagram decreasing.
of the Usingsustainability
technological the results of the additive
of system is
increased rapidly and arrived at its peak in 2005 before decreasing. Using the results of the additive
model and
shown and directional
in Figure 4. distance function, a diagram of the technological sustainability of system is
model directional distance function, a diagram of the technological sustainability of system is
shown in Figure 4.
shown in Figure 4.
In Table 2, the growth trends of all variables are similar. There was no impact before 1990, and the
impact increased rapidly after 1990 until it peaked in 1995. Then it decreased around 2000. There was
increased fluctuation
Sustainability 2017, 9, x after 2000.
FOR PEER This was because the rates of fluctuation are different. Figure
REVIEW 9 of521shows
the results of the directional distance function for media (Y variable).
Figure 5. Result of directional distance function: Y = media, X = (system, data, user, device).
Figure 5. Result of directional distance function: Y = media, X = (system, data, user, device).
According to the results of the directional distance function in Figure 5, the main growth trends
were similar to the additive model; the scale of efficiency was merely less than that in additive model.
From the results of Table 2 and Figure 5, we developed a technological diagram for the model in
which media was the Y variable and others were the X variables (system, media, user, device) in
Figure 6.
Sustainability 2019, 11, 3597 9 of 19
According to the results of the directional distance function in Figure 5, the main growth trends
were similar to the additive model; the scale of efficiency was merely less than that in additive model.
Figure 5. Result of directional distance function: Y = media, X = (system, data, user, device).
From the results of Table 2 and Figure 5, we developed a technological diagram for the model in which
media was the Y variable
According and others
to the results of the were the Xdistance
directional variables (system,
function media,
in Figure user,
5, the device)
main growthin trends
Figure 6.
We
were similar to the additive model; the scale of efficiency was merely less than that in additive model. and
found that the technology of media was affected by system, data, and user strongly
From the
influenced by results
deviceof Table 2 and
technology Figure 5,
weakly. we developed
Next, a technological
Table 3 provides diagram
the result for theand
of Y (data) model in
X (system,
which
media, user,media wasusing
device) the Ythe
variable andmodel.
additive others were the X variables (system, media, user, device) in
Figure 6.
In Table 3, there was no impact in the first 10 years except the media variable. The media variable
had a stability effect in the first 10 years. Then, around 1990, it arrived peak and decreased to zero.
Sustainability 2019, 11, 3597 10 of 19
In the following years, it increased steadily and declined after 2005. For the system variable, its peak
was reached in around 1995. Then it reduced but also steadily grew over the next few years. With
respect to the device variable, it increased around 1995, showing slow but steady growth. The user
variable steadily increased around 1995 and remain unchanged for around five years. It was peaked in
Sustainability
2005 and then 2017,decreased
9, x FOR PEER
toREVIEW 11 of 21
zero. Another result of the data keyword (variable Y) by the directional
distance function is shown in Figure 7.
Figure 7. Result of the directional distance function: Y = data, X = (system, media, user, device).
Figure 7. Result of the directional distance function: Y = data, X = (system, media, user, device).
Figure 7. Result of the directional distance function: Y = data, X = (system, media, user, device).
According to the results of the directional distance function in Figure 7, the main impact was
According
similar that oftothe
toAccording the results
toadditive
the of the
model
results of thedirectional
except withdistance
directional function
respectfunction
distance to inin
the media Figure 7, the
variable.
Figure 7, the Themain
main impact
wasofwas
efficiency
impact the
similar
media to that
began atof the
zero additive
and stayedmodel
in except
this with
situation respect
for around to the
20 media
years. variable.
After 2000, The
it
similar to that of the additive model except with respect to the media variable. The efficiency of the efficiency
increased of the
rapidly
media began
media
and peaked at2005,
began
in zero and
at zero stayed
and
before stayedininthis
decreasing thissituation
tosituation for around
for around
zero. Using 2020years.
years.
the results After
ofAfter
Table 32000,
2000, it increased
it increased
and Figure werapidly
5,rapidly
built a
and peaked
and peaked
diagram in 2005,
of the before
in 2005, decreasing
before
sustainable decreasing
technology totodata,
of zero. Using
zero.as
Using
shown thein
the results
results ofof
Figure 8.Table
Table 3 and
3 and Figure
Figure 5, we5,built
we built
a a
diagram
diagram of sustainable
of the the sustainable technologyofofdata,
technology data, as
as shown
shown in inFigure
Figure8. 8.
Figure Diagram
8. 8.
Figure Diagramof
ofthe
the technological sustainability
technological sustainability of data.
of data.
Based on Figure 6, we
Figure knew thatofdata
8. Diagram technology was
the technological most affected
sustainability by media technology.
of data.
Moreover, media technology had the least impact on data technology. Thus, the system and user
have influenced
Based on Figure data technology
6, we continuously
knew that since 1990.was
data technology Table 4 represents
most affected the
by additive model
media technology.
results media
Moreover, with respect to user technology.
technology had the least impact on data technology. Thus, the system and user
have influenced data technology continuously since 1990. Table 4 represents the additive model
Sustainability 2019, 11, 3597 11 of 19
Based on Figure 6, we knew that data technology was most affected by media technology.
Moreover, media technology had the least impact on data technology. Thus, the system and user have
influenced data technology continuously since 1990. Table 4 represents the additive model results with
respect to user technology.
According to the additive model result in Table 4, comparing data with the system, the primary
growth trends were similar. There was no efficiency in the first 10 years, but after 1990, it increased and
peaked in 1995, before decreasing to zero. The main difference was that the data variable increased
stability, but the system variable showed increased fluctuation from 2000 to 2005; after 2005, the data
variable multiplied, but the system variable increased slowly. The device variable appeared to show
inefficiency before 2010 before demonstrating a sudden increase in influence in 2010. For the media
variable, the first 10 years were efficient, and the growth rate fluctuated until 2010; the peak occurred
around 2005. In addition, we visualized the result of the directional distance function for the user
variable in Figure 9.
According to the results of the directional distance function in Figure 9, the main trends were
similar to the results of the additive model except for the media variable. There was no significant
impact before 2000. After that, the fluctuation increased until 2010, and the peak occurred around 2005.
Figure 10 shows the technological diagram for the sustainability of user technology.
Sustainability 2017, 9, x FOR PEER REVIEW 13 of 21
Sustainability 2019, 11, 3597 12 of 19
Figure 9. Result of the directional distance function: Y = user, X = (system, media, data, device).
According to the results of the directional distance function in Figure 9, the main trends were
similar to the results of the additive model except for the media variable. There was no significant
impact before 2000. After that, the fluctuation increased until 2010, and the peak occurred around
Figure10
2005. Figure Result of
9. shows thethe directional distance
technological diagramfunction: = user, X = (system,
for theY sustainability media,
of user data, device).
technology.
Figure 9. Result of the directional distance function: Y = user, X = (system, media, data, device).
According to the results of the directional distance function in Figure 9, the main trends were
similar to the results of the additive model except for the media variable. There was no significant
impact before 2000. After that, the fluctuation increased until 2010, and the peak occurred around
2005. Figure 10 shows the technological diagram for the sustainability of user technology.
Figure 11. Result of the directional distance function: Y = device, X = (system, media, data, user).
Figure 11. Result of the directional distance function: Y = device, X = (system, media, data, user).
According to the results of the directional distance function in Figure 11, the main growth trends
According to the
were similar to the results
additive of thefor
model directional distanceSo,
all the variables. function in Figurea11,
we constructed the main
diagram growth
of the trends
sustainable
were similar
technology fortothethe additive(device)
Y variable model andfor Xallvariables
the variables. So,media,
(system, we constructed a Figure
data, user) in diagram 12. of the
sustainable technology for the Y variable (device) and X variables (system, media, data, user) in
Figure 12.
Figure 11. Result of the directional distance function: Y = device, X = (system, media, data, user).
According to the results of the directional distance function in Figure 11, the main growth trends
were similar to the additive model for all the variables. So, we constructed a diagram of the
Sustainability 2019, 11, 3597 14 of 19
sustainable technology for the Y variable (device) and X variables (system, media, data, user) in
Figure 12.
Y X
system media (1st), data (2nd), user (2nd), device (4th)
media system (1st), data (1st), user (1st), device (4th)
data media (1st), system (2nd), user (3rd), device (4th)
user media (1st), system (2nd), data (2nd), device (4th)
device system (1st), data (1st), user (1st), media (4th)
In this paper, we determined that Apple’s technology consists of five sub-technologies (system,
media, data, user, and device), and we computed the mutual efficiency of the five technologies.
In Table 6, we found that the media, system, and data technologies are important for the technological
sustainability of Apple. Next, we performed the SSM using R and its packages related to SSM [23,36–39].
We selected top four IPC codes from Apple’s patent documents.
We identified the technological definition of each IPC code in the World Intellectual Property
Organization (WIPO) [11]. Table 7 represents the major technologies of Apple by IPC code. Thus,
we used these four IPC codes for the SSM. Figure 13 shows the technological trends of the observed,
smoothed, and forecasted levels, as well as 50% probability limits.
The forecasted levels of G06F and G09G were smaller than the observed levels. In addition, the
forecasted level of H04N did not reached the observed level. In comparison, the forecasted level of
G06K tended to increase compared to the observed level. From the result show in Figure 13, we know
smoothed, and forecasted levels, as well as 50% probability limits.
The forecasted levels of G06F and G09G were smaller than the observed levels. In addition, the
forecasted level of H04N did not reached the observed level. In comparison, the forecasted level of
G06K tended to increase compared to the observed level. From the result show in Figure 13, we know
that Apple should keep sustaining technology based on G06K to increase technological
competitiveness. We also show the prediction errors of the four IPC codes in Figure 14.
Figure 13. Forecasts and probability limits of the four IPC codes.
Figure 13. Forecasts and probability limits of the four IPC codes.
We know that prediction errors have been getting bigger in recent years than in the past. A large
error means that the risk is high. However, from a sustainability point of view, it is necessary to
constantly manage technological areas with a large error (risk). In the results of Figure 14, we found
that the technological risks of H04N and G06K were larger than those of the other IPC codes. So, we
Sustainability 2019, 11, 3597 16 of 19
We know that prediction errors have been getting bigger in recent years than in the past. A large
error means that the risk is high. However, from a sustainability point of view, it is necessary to
constantly manage technological areas with a large error (risk). In the results of Figure 14, we found
that the technological risks of H04N and G06K were larger than those of the other IPC codes. So, we
should manage technologies based on H04N and G06K. Lastly, we present the sustainability forecasts
Sustainability
for the four2017,IPC
9, x FOR PEER
codes REVIEW
using the forecasts and probability limits via the SSM in Figure 15. 18 of 21the
From
results of Figures
Sustainability 2017, 9, x 13–15, weREVIEW
FOR PEER found that technologies based on H04N and G06K are important18 forofthe
21
sustainability of Apple’s technologies.
In Figure 16, the DEA results provide the technological relationship between patent keywords
of Apple. We considered only the first relation between technological keywords from the efficiency
ranking of the DEA results. It can be seen that the media keyword has a technological relation with
the user, data, and system keywords. In addition, the system and device keywords are connected
to each other. From the DEA perspective, we have confirmed that media and system technologies
are important factors in Apple’s technological sustainability. From the perspective of the SSM, we
found that two IPC codes, H04N and G06K, are important for Apple’s sustainable technology. The IPC
codes of H04N and G06F represent the technologies of ‘pictorial communication’ and ‘recognition and
presentation of data’, respectively. Therefore, Apple needs to continue to research and develop ‘image
and data technologies based on media and system technologies’ for its technological sustainability.
5. Conclusions
In this study, we proposed a methodology consisting of two models, DEA and the SSM, to
analyze the technological relationship of the target technology. We used patent keywords and IPC
codes extracted from patent documents for the proposed methodology. The target domain of our
case study is Apple technology, because Apple is one of the leading companies in the technological
innovation markets. We considered Apple’s top five keywords (system, data, user, device, media)
and four IPC codes (G06F, G09G, H04N, G06K) in our case study. Of course, the list of keywords
used by the analyst can be changed, but the method of constructing the technological structure
between given keywords can follow the proposed steps in this paper. Using the results of DEA,
we found technological relationships between sub-technologies (patent keywords). In addition, we
forecasted the future trends of sub-technologies (IPC codes) via the result of the SSM. In the DEA
result, we could see that technologies based on ‘media’ and ‘system’ are connected to each other
and others. This means that Apple needs to research and develop these areas for the sustainability
of Apple’s technology. Furthermore, via the SSM result, we confirmed that technologies based on
H04N (pictorial communication) and G06K (recognition and presentation of data) are important to
Apple’s technological sustainability. Finally, by combining the results of the two analyses, we were
able to identify ‘Image and data technologies based on media and system technologies’ as Apple’s
sustainable technology.
In this paper, we extended the usage of traditional DEA to sustainable technology analysis. We
used technological keywords extracted from patent documents as DEA input and output variables.
This is a new attempt, different from existing DEA. For a technological keyword that becomes the
output variable, all other keywords are input variables. The relative efficiency of the input keywords
with respect to the technological keyword corresponding to the output variable was evaluated, and
the sub-technologies for constructing the sustainable technology were defined by searching for the
keywords corresponding to the high-ranked input keywords. The frequency values of the technological
keywords were used to measure the consumption and gain for DEA at the same time. In addition,
in this paper, sustainable technologies were presented comprehensively, considering both the results
of the SSM, as well as the DEA. In our future research, we plan to apply Bayesian inference to the
traditional SSM to improve the predictive performance of the general SSM and to consider non-Gaussian
distributions for more sophisticated modeling. We will also study more advanced and hybrid models
using statistics and machine learning algorithms, such as Bayesian inference with Markov chain Monte
Carlo (MCMC) and deep learning with convolutional neural network (CNN).
Author Contributions: J.-M.K. and B.S. designed this research and collected the data set for the experiment. S.J.
analyzed the data, wrote the paper, and performed all the research steps. All the authors cooperated to revise
the paper.
Funding: This research was supported by Basic Science Research Program through the National Research
Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2017R1D1A3B03031152).
Conflicts of Interest: The authors declare no conflicts of interest.
Sustainability 2019, 11, 3597 18 of 19
References
1. Roper, A.T.; Cunningham, S.W.; Porter, A.L.; Mason, T.W.; Rossini, F.A.; Banks, J. Forecasting and Management
of Technology, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2011.
2. Jun, S. Bayesian Count Data Modeling for Finding Technological Sustainability. Sustainability 2018, 10, 3220.
[CrossRef]
3. Kim, J.; Jun, S. Graphical causal inference and copula regression model for Apple keywords by text mining.
Adv. Eng. Inf. 2015, 29, 918–929. [CrossRef]
4. Lee, J.; Kang, J.; Jun, S.; Lim, H.; Jang, D.; Park, S. Ensemble Modeling for Sustainable Technology Transfer.
Sustainability 2018, 10, 2278. [CrossRef]
5. Park, S.; Lee, S.; Jun, S. A network analysis model for selecting sustainable technology. Sustainability 2015, 7,
13126–13141. [CrossRef]
6. Hunt, D.; Nguyen, L.; Rodgers, M. Patent Searching Tools & Techniques; Wiley: Hoboken, NJ, USA, 2007.
7. Lee, B.; Won, D.; Park, J.; Kwon, L.; Moon, Y.; Kim, H. Patent-Enhancing Strategies by Industry in Korea
Using a Data Envelopment Analysis. Sustainability 2016, 8, 901. [CrossRef]
8. Chames, A.; Cooper, W.W.; Rhodes, E. Measuring the efficiency of decision making units. Eur. J. Oper. Res.
1978, 2, 429–444.
9. Forsund, F.R.; Sarafoglou, N. On the origins of data envelopment analysis. J. Product. Anal. 2002, 17, 23–40.
[CrossRef]
10. Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012.
11. International Patent Classification (IPC); World Intellectual Property Organization. Available online:
http://www.wipo.int/classifications/ipc/en (accessed on 30 October 2018).
12. Choi, J.; Jun, S.; Park, S. A Patent Analysis for Sustainable Technology Management. Sustainability 2016, 8,
688. [CrossRef]
13. Kim, S.; Jang, D.; Jun, S.; Park, S. A Novel Forecasting Methodology for Sustainable Management of Defense
Technology. Sustainability 2015, 7, 16720–16736. [CrossRef]
14. Kim, J.; Jun, S.; Jang, D.; Park, S. Sustainable Technology Analysis of Artificial Intelligence Using Bayesian
and Social Network Models. Sustainability 2018, 10, 115. [CrossRef]
15. Park, S.; Jun, S. Statistical Technology Analysis for Competitive Sustainability of Three Dimensional Printing.
Sustainability 2017, 9, 1142. [CrossRef]
16. Uhm, D.; Ryu, J.; Jun, S. An Interval Estimation Method of Patent Keyword Data for Sustainable Technology
Forecasting. Sustainability 2017, 9, 2025. [CrossRef]
17. Jun, S.; Park, S. Examining Technological Innovation of Apple Using Patent Analysis. Ind. Manag. Data Syst.
2013, 113, 890–907. [CrossRef]
18. Marinho, J.F.M.A.; Laurencel, L.C.; Amaral, M.R.S. Implementing DEA models in the R program. In
Proceedings of the 11th International Conference on Data Envelopment Analysis, Samsun, Turkey, 28–30
June 2013; pp. 1–6.
19. Banker, R.D.; Cooper, W.W.; Seiford, L.; Thrall, M.; Zhu, J. Returns to Scale in different DEA models. Eur. J.
Oper. Res. 2004, 154, 345–362. [CrossRef]
20. The United States Patent and Trademark Office (USPTO). Available online: http://www.uspto.gov (accessed
on 1 November 2018).
21. WIPSON. WIPS Corporation. Available online: http://www.wipson.com (accessed on 15 October 2018).
22. Feinerer, I.; Hornik, K. Package ‘tm’ Ver. 0.7-5, Text Mining Package, CRAN of R Project. Available online:
https://cran.r-project.org/web/packages/tm/tm.pdf (accessed on 1 August 2018).
23. R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for
Statistical Computing: Vienna, Austria. Available online: http://www.R-project.org (accessed on 1 July 2018).
24. Chames, A.; Cooper, W.W.; Golany, B.; Stutz, J. Foundations of Data Envelopment Analysis for
Pareto-Koopmans efficient empirical production functions. J. Econom. 1985, 30, 91–107.
25. Chames, A.; Cooper, W.W.; Thrall, R.M. A structure for classifying and characterizing efficiency and
inefficiency in Data Envelopment Analysis. J. Prod. Anal. 1991, 2, 197–237.
26. Chames, A.; Haag, S.; Jaska, P.; Semple, J. Sensitivity of efficiency classifications in the additive model of data
envelopment analysis. Int. J. Syst. Sci. 1992, 23, 789–798.
Sustainability 2019, 11, 3597 19 of 19
27. Chames, A.; Rousseau, J.J.; Semple, J.H. Sensitivity and stability of efficiency classifications in data
envelopment analysis. J. Prod. Anal. 1996, 7, 5–18.
28. Fried, H.O.; Lovell, C.A.K.; Schmidt, S.S. The Measurement of Productive Efficiency and Productivity Growth;
Oxford University Press: New York, NY, USA, 2008.
29. Luenberger, D.G. Benefit Functions and Duality. J. Math. Econ. 1992, 21, 461–486. [CrossRef]
30. Luenberger, D.G. New Optimality Principles for Economic Efficiency and Equilibrium. J. Optim. Theory Appl.
1992, 75, 221–264. [CrossRef]
31. Luenberger, D.G. Dual Pareto Efficiency. J. Econ. Theory 1994, 62, 70–85. [CrossRef]
32. Cheng, G.; Zervopoulos, P. A Generalized Directional Distance Function in Data Envelopment Analysis and
Its Application to a Cross-Country Measurement of Health Efficiency. Available online: http://mpra.ub.uni-
muenchen.de/42068 (accessed on 1 July 2018).
33. Chung, Y.H.; Färe, R.; Grosskopf, S. Productivity and Undesirable Outputs: A Directional Distance Function
Approach. J. Environ. Manag. 1997, 51, 229–240. [CrossRef]
34. Oh, D.H.; Suh, D. Package ‘nonparaeff’ Ver. 0.5-8, Nonparametric Methods for Measuring Efficiency and
Productivity Package, CRAN of R Project. Available online: https://cran.r-project.org/web/packages/tm/tm.
pdf (accessed on 1 August 2018).
35. Korea Intellectual Property Strategy Agency (KISTA). Available online: http://www.kista.or.kr (accessed on
16 November 2018).
36. Commandeur, J.J.F.; Koopman, S.J.; Ooms, M. Statistical Software for State Space Methods. J. Stat. Softw.
2011, 41. [CrossRef]
37. Petris, G.; Petrone, S. State Space Models in R. J. Stat. Softw. 2011, 41. [CrossRef]
38. Hyndman, R.; Athanasopoulos, G.; Bergmeir, C.; Caceres, G.; Chhay, L.; O’Hara-Wild, M.; Petropoulos, F.;
Razbash, S. Package ‘forecast’ Ver. 8.7, Forecasting Functions for Time Series and Linear Models, CRAN of R
Project. 2019. Available online: https://cran.r-project.org/web/packages/forecast/forecast.pdf (accessed on
1 May 2019).
39. Petris, G.; Gilks, W. Package ‘dlm’ Ver. 1.1-5, Bayesian and Likelihood Analysis of Dynamic Linear Model,
CRAN of R Project. 2019. Available online: https://cran.r-project.org/web/packages/dlm/dlm.pdf (accessed
on 30 December 2018).
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).