You are on page 1of 6

Proceedings,16th

Proceedings,16th IFAC
IFAC Symposium
Symposium on on
Proceedings,16th
Information Control
Proceedings,16th
Information ControlIFAC
IFAC Symposium
Problems
Problems in on
in Manufacturing
Symposium on
Manufacturing
Proceedings,16th
Information
Bergamo, Control
Italy,
Information IFAC
June
Control Symposium
Problems
11-13,
Problems in
2018
in on
Manufacturing
Available online at www.sciencedirect.com
Manufacturing
Bergamo, Italy,
Proceedings,16th
Information June
Control 11-13,
IFAC 2018
Symposium
Problems in on
Manufacturing
Bergamo, Italy, June 11-13, 2018
Bergamo, Italy, June 11-13, 2018
Information Control
Bergamo, Italy, JuneProblems in Manufacturing
11-13, 2018
Bergamo, Italy, June 11-13, 2018 ScienceDirect
IFAC PapersOnLine 51-11 (2018) 316–321
Data Mining-Based Prediction of Manufacturing Situations
Data Mining-Based Prediction of Manufacturing Situations
Data Mining-Based Prediction of Manufacturing Situations
Data Mining-Based
A.
A. Dolgui
*
Dolgui**,, N.
Prediction
N. Bakhtadze
**
Bakhtadze**
of Manufacturing
, V. Pyatetsky *** ****Situations
***, R. Sabitov****
**, V. Pyatetsky***, R. Sabitov****,
,
A.
A.G. Dolgui
Dolgui *, N. Bakhtadze
*, N. Bakhtadze
Smirnova ***** **, V. Pyatetsky
**, V. Pyatetsky
*****, D. Elpashev ****** ***, R. Sabitov
******, E. *** , R. Sabitov ****,
****,
*******
A.G. Smirnova
Dolgui , N. ***** , D. Elpashev
Bakhtadze , V. Pyatetsky
****** , E. Zakharov
Zakharov
, R.
*******
Sabitov
******* ,
G. Smirnova ***** , D. Elpashev ****** , E. Zakharov *******
A.G. Smirnova
*
, N. *****, D. Elpashev
Bakhtadze **
, V. ******, E. ***
Pyatetsky Zakharov
G. Smirnova*****, D. Elpashev******, E. Zakharov******* ,
Dolgui , R. Sabitov ****
*******
*
*IMT
* IMTG. Smirnova
Atlantique , D. Elpashevde
Atlantique Bretagne-Pays
Bretagne-Pays de la , E. Zakharov
la Loire,
Loire, Nantes,
Nantes, France,
France,
*IMT Atlantique Bretagne-Pays de la Loire, Nantes, France,
*IMT Atlantique Bretagne-Pays de la Loire, Nantes, France,
IMT Atlantique(e-mail: Dolgui@imt-atlantique.fr)
Bretagne-Pays de la Loire,
(e-mail: Dolgui@imt-atlantique.fr) Nantes, France,
*
**IMT
** ******Atlantique(e-mail:
(e-mail:
******* Dolgui@imt-atlantique.fr)
Dolgui@imt-atlantique.fr)
Bretagne-Pays de la Loire, Nantes, France,
** ,, ******,, *******
******
**, ******
V.A. Trapeznikov
(e-mail:
*******
V.A. Trapeznikov Institute of
Dolgui@imt-atlantique.fr)
Institute of Control
Control Sciences,
Sciences,
, *******
**, ******, *******
V.A.
V.A.
(e-mail:Trapeznikov
Trapeznikov Institute
Institute
Dolgui@imt-atlantique.fr) of
of Control
Control Sciences,
Sciences,
, , 65
65 Profsoyuznaya,
V.A. Trapeznikov
Profsoyuznaya, Moscow 117997,
Institute
Moscow Russia
of Control
117997, RussiaSciences,
, , 65
65 Profsoyuznaya,
** ****** *******
Profsoyuznaya,
V.A.
** Trapeznikov
** (e-mail:
Moscow
Moscow 117997,
117997,
Institute
sung7@yandex.ru), of Russia
RussiaSciences,
Control
65 Profsoyuznaya,
** (e-mail: Moscow 117997,
sung7@yandex.ru),
** (e-mail: sung7@yandex.ru),
Russia
65 ******
******
** (e-mail:
Profsoyuznaya, sung7@yandex.ru),
Moscow 117997,
(e-mail:den.elpshv@gmail.com), Russia
****** (e-mail: sung7@yandex.ru),
(e-mail:den.elpshv@gmail.com),
**(e-mail:den.elpshv@gmail.com),
******
******* (e-mail:den.elpshv@gmail.com),
(e-mail: sung7@yandex.ru),
******* (e-mail: eddyzaharov@gmail.com)
******
******* (e-mail:den.elpshv@gmail.com),
(e-mail: eddyzaharov@gmail.com)
******(e-mail: eddyzaharov@gmail.com)
*******
***
*** National ******* (e-mail: eddyzaharov@gmail.com)
(e-mail:den.elpshv@gmail.com),
*** National University
(e-mail:
University of
of Science and and Technology
Technology “MISIS”,
eddyzaharov@gmail.com)
Science “MISIS”,
*** National University of Science and
*** National *******
Moscow, University
(e-mail:
Russia, of Science and Technology
Technology “MISIS”,
eddyzaharov@gmail.com)
(e-mail: 7621496@gmail.com) “MISIS”,
National
Moscow, University
Russia, of Science
(e-mail: and Technology “MISIS”,
7621496@gmail.com)
*** Moscow, Russia, (e-mail: 7621496@gmail.com)
****
****
****
,,Moscow,
National
*****
***** Kazan
Moscow,Kazan
Russia,
University (e-mail:
of
National
Russia,
National
Science 7621496@gmail.com)
Research
(e-mail:
andTechnical
Technology “MISIS”,
Technical University
7621496@gmail.com)
Research University
****, *****
***** Kazan National Research Technical University
****, Moscow,
***** Kazan National
Russia, Research
(e-mail: Technical
7621496@gmail.com) University
, named
Kazan
named after
after A.N.Tupolev,
National Research
A.N.Tupolev, Kazan,
Technical
Kazan, Russia
University
Russia
**** *****named after A.N.Tupolev, Kazan, Russia
**** , named
Kazan after A.N.Tupolev,
National Research Kazan,
Technical
*****
**** (e-mail: r.a.sabitov@mail.ru), *****(e-mail: seyl@mail.ru) Russia
University
**** (e-mail: named after A.N.Tupolev,
r.a.sabitov@mail.ru),
**** (e-mail: r.a.sabitov@mail.ru), *****
Kazan,
(e-mail:Russia
seyl@mail.ru)
*****(e-mail: seyl@mail.ru)
**** (e-mail: r.a.sabitov@mail.ru),
named after A.N.Tupolev, Kazan, (e-mail: seyl@mail.ru)
Russia
****
(e-mail: r.a.sabitov@mail.ru), ***** *****
(e-mail: seyl@mail.ru)
(e-mail: r.a.sabitov@mail.ru), (e-mail: seyl@mail.ru)
Abstract: The
Abstract: The paper
paper proposes
proposes an an approach
approach to to the
the early
early detection
detection of of factors
factors implying
implying the
the need
need in
in
Abstract:
Abstract:
production The
The paper
paper
schedule proposes
proposes
update. an
an
Resource approach
approach
state to
to the
the
prediction early
early
methods detection
detection
are basedof on
of factors
factors
the implying the
implying
developmentthe
of aneed
need in
in
binary
production
Abstract: schedule
The update.
paperupdate.
proposes Resource state
an approach prediction methods
to the early are
detectionbased on
of on the
factors development of
implying the a binary
need in
production
production
model and
andTheschedule
schedule update. Resource
Resource state
state prediction
prediction methods
methods are
are based
based on the
the development
development of
of a
a binary
binary
Abstract:
model
production aa machine
machine learning
paper learning techniques
proposestechniquesan approach called
called association
toassociation
the early rules search.
detection
rules search.of on
factors implying the
of aneed in
model and aaschedule
model and machineupdate.
machine learningResource
learning techniques
techniques
state prediction
called
called
methods
association
association rulesare
rules
based
search.
search.
the development binary
production
model and a schedule
machine update.
learning Resource
techniques state prediction
called methods
association rulesare based
search. on the
© 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.development of a binary
model and adata
Keywords:
Keywords: machine
data learning
mining,
mining, techniques called
knowledgebase,
knowledgebase, association
association
association rule rules
rule search.
learning,
learning, production scheduling,
production scheduling, productive
productive
Keywords:
Keywords:
resources data mining,
data mining, knowledgebase,
knowledgebase, association
association rulerule learning,
learning, production
production scheduling,
scheduling, productive
productive
resources
Keywords: data mining, knowledgebase, association rule learning, production scheduling, productive
resources
resources
Keywords:
resources data mining, knowledgebase, association rule learning, production scheduling, productive

resources 


  production
production schedule
schedule update
update for for aa short
short term
term inin view
view ofof
1.  production
production schedule
schedule update
update for
foraaaaperiod
short
short term
term in
in view
view of
of
1. INTRODUCTION
INTRODUCTION  the
the actual
actual
 production operations
operations data
data
schedule update over
over period
foraaperiod compliant
compliant
short term in view with
with
of
1.
1. INTRODUCTION
INTRODUCTION the current
actual
actual operations data
data over compliant with
1. INTRODUCTION  theproduction
the current
actual
operations
planning
schedule horizon,
planning
operations update over
horizon,
data overforaaaperiod
period
compliant
short term in view
compliant
with
of
with
Production the
the current
current planning
planning horizon,
horizon,
Production process
process 1. planning
planning is the
is the distribution
INTRODUCTION distribution ofof resources
resources the
 the actual
current
on-line operations
planningand
monitoring data over
horizon,
control a period compliant with
Production
Production process planning is
is the distribution of
of resources on-line monitoring and control of of operations,
operations,
needed for process
needed for
Production performing
performing
process
planning
certain
certain
planning
the
is the
distribution
operations
operations within
within
distribution
resources
aa certain
certain
of resources  the current
on-line
on-line planningand
monitoring
monitoring horizon,
and control
control of
of operations,
operations,
needed
needed
time for
for
period. performing
performing
The planning certain
certain
process operations
operations
aims at thewithin
within a
a
optimization certain
certain
of  on-line monitoring and control of operations,
Production
needed for process
time period. performing planning
The planning is the
process
certain distribution
aims of resources
at thewithin
operations optimization
a certain of  production
production output
output and
and material
material consumption
consumption recording.
recording.
time
time period. The
The planning process aims
aims atat the optimization of  on-line
productionmonitoring
output
output andandmaterial
control of operations,recording.
consumption
someperiod.
some
needed
time productivity
for performing
productivity
period. The
planning process
measure.
measure.
planning Typically,
certain operations
Typically,
process aims at
the
the optimization
resources
thewithin
the a certain
resources
optimization
of
are
are
of production
 production output
and material
and material
consumption
consumption
recording.
recording.
some
some productivity
productivity
equipment and measure.
measure. Typically,
Typically, the resources
the optimization
resources are
are Generally,
Generally, optimal
optimal routine
routine planning
planning and
and scheduling
scheduling is
is
time
some period.
equipment Theworkplaces,
and
productivity planning
workplaces,
measure. while
process
while the
the execution
aims
Typically, at the
execution time
time is
the resources is the
of Generally,
the
are  production
Generally, output
optimal
optimal and material
routine
routine planning
planning consumption
and
and recording.
scheduling
scheduling is
is
equipment
equipment
most frequentand
and workplaces,
workplaces,
productivity while
while
measure the
the execution
execution
(Leung, 2004). time
time is
is the
the currently
currently
Generally, based
based
optimalon
on detailed
detailed
routine mathematical
mathematical
planning and models
models
scheduling of
of
is
some productivity
most frequent
equipment measure.
andproductivity Typically,
workplaces,measure
while the(Leung, the resources are
2004).time is the currently
execution currently based on (Al-Otabi
detailed mathematical models of
most frequent
most frequent
equipment and
productivity
productivity
workplaces,
measure
measure
while the
(Leung,
(Leung, 2004).
2004).
execution timepredict production based
production
is the Generally,
currently optimalon
processes
processes
based
detailedplanning
detailed and
on routine
(Al-Otabi mathematical
and Stewart,
Stewart,
mathematical
models
2004;
and scheduling
2004;
models
of
Kelly,
Kelly,is
of
most frequent
Production productivity
scheduling measure
systems (Leung,
would 2004).
usually
Production scheduling systems would usually predict currently production
production
2004). processes
processes
Rescheduling (Al-Otabi
(Al-Otabi
requires and
and
model Stewart,
Stewart,
update 2004;
2004;
subject Kelly,
Kelly,
to the
most frequentscheduling
Production productivity measure (Leung,
systems would 2004).
usually predict 2004). based
Rescheduling
production on detailed
requires
processes requires
(Al-Otabimodel mathematical
model update
and Stewart, models
subject
2004; to to of
the
Kelly,
Production
operations
operations at
Production atscheduling
various
scheduling
systems
various deviations
systems from
deviations wouldthe
from
wouldthe
usually
plan.
plan. This
usually
predict
This
predictis 2004).
is 2004).
current Rescheduling
Rescheduling
current production
production
2004). Rescheduling
requires
(Al-Otabimodel
information.
processes
production information.
requires
update
update subject
and Stewart,
model update
subject
2004; to
subject to
the
the
Kelly,
the
operations
operations
necessary at
at
for various
various
timely deviations
deviations
rescheduling from
from
in the
the
case ofplan.
plan. This
This
contingency. is
is current
current production
production information.
information.
Production
necessary for
operations atscheduling
timely systems from
variousrescheduling
deviations would
in casetheusually
ofplan. predict
contingency.
This is 2004).
current Rescheduling
production requires
information. model update subject to the
necessary
necessary for
for timely rescheduling in case of contingency. Present-day industrial sites feature interrelated multi-variable
Present-day industrial sites feature interrelated multi-variable
(Grossmann
operations
(Grossmann
necessary et timely
atet al.,
al., 2002)
various
2002)rescheduling
deviations fromin casetheof contingency.
ofplan. This is current production
Present-day information.
industrial sites feature interrelated multi-variable
(Grossmannforet
(Grossmann et timely
al., 2002)
al., 2002)rescheduling in case contingency. Present-day
production industrial
production
Present-day processessites
processes
industrial andfeature
and
sites
interrelatedmaterial
sophisticated
sophisticated
feature
multi-variable
material
interrelatedmaterial flow
flow
multi-variable
necessary
(Grossmann
Development foret timely
al., 2002)
and rescheduling
implementationin case of
of contingency.
predictive production
production processes
processes and
and sophisticated
sophisticated material flow
flow
Development and implementation of networks;
predictive Present-day
networks; scheduling
industrial
scheduling
productionscheduling at
at such
sites
such
processesat and sites
feature
sites poses nonlinear
interrelated
poses
sophisticated NP-hard
multi-variable
nonlinear
material NP-hard
flow
(Grossmann
Development et al.,and
2002) implementation of predictive
Development
(identification)
(identification) models
Development
and
and under
models implementation
under parametric
implementation
of
parametric uncertainty
uncertainty
of
predictive
is
predictivekey networks;
is aa key networks;
optimization
production
optimization
networks;
scheduling
problems.at
problems.
processes
scheduling at
such
such sites
and
such
sites poses
poses nonlinear
sophisticated
sites poses
nonlinear
material
nonlinear
NP-hard
NP-hard
flow
NP-hard
(identification)
(identification)
area models
models under
under parametric
parametric uncertainty
uncertainty is a key
is a key optimization problems.
area for
for production
Development
production
(identification) scheduling
and
scheduling
models under systems
systems improvement.
implementation
parametric of
improvement.
uncertainty predictive
is a key optimization
networks;
optimization
problems.at such sites poses nonlinear NP-hard
scheduling
problems.
area
area for
for production
production scheduling
scheduling systems
systems improvement.
improvement. Therefore,
Therefore, further
further complication
complication of of existing
existing models
models aimed
aimed at at
(identification) models under parametric uncertainty is a key optimization problems.
area
The for production
The difficulties
difficulties in scheduling
in implementing
implementing systems
this improvement.
this approach,
approach, especially
especially for for Therefore,
Therefore,
allowing
allowing
Therefore, for
for
further
further complication
complication
specific
specific
further manufacturing
manufacturing
complication
of
of existing
of
existing models
models
situations
situations
existing is
is
models
aimed
aimed at
undesirable
undesirable
aimed
at
at
area
The
The for production
difficulties
difficulties
nonlinear and in scheduling
implementing
in implementing systems
this improvement.
approach,
this approach, especially
especially for
for allowing for specific manufacturing situations is
is undesirable
nonlinear
The and non-stationary
difficulties non-stationary
in implementing objects,
objects, and,
and, on
on the
this approach, the other
other hand,
especially for allowing
hand, (Kelly,
Therefore,
allowing
forfurther
(Kelly, 2004).
2004). specific manufacturing
Besides computational
complication
Besides
for specific manufacturing
situations
of existing
computational models undesirable
complexity,
complexity,
situations aimed such
at
such
is undesirable
nonlinear
nonlinear
increased and
and non-stationary
non-stationary
possibilities of plantobjects,
objects,
history and,
and, on
on
analysis the
the other
other
resulted hand,
hand,
in the (Kelly,
(Kelly, 2004).
2004). Besides
Besides computational
computational complexity,
complexity, such
such
The difficulties
increased
nonlinear and in implementing
possibilities
non-stationary this approach,
of plantobjects,
history analysis
and, on theespecially
resulted
other in
hand, complication
for allowing
the complicationfor may
specific
may result in
manufacturing
result in the
the deterioration
situations
deterioration is of model
undesirable
of model
increased
increased
developmentpossibilities
possibilities
andof
of
of plant history
plant methods
identification analysis
history and,
analysis
based resulted
resulted
on on data in
in
othermining the (Kelly,
the 2004).may
complication Besides
result computational
in
in the
complexity,
the deterioration of such
of model
nonlinear
development
increased
development
ofnon-stationary
possibilities
of of plantobjects,
identification
identification
methods based
history analysis
methods based
the
on data
resulted
on data in the complication
hand,
mining
mining accuracy.2004).may
accuracy.
(Kelly,
complication may
result computational
Besides
result in the
deterioration
complexity,
deterioration of
model
such
model
development
(Peretzki
(Peretzki et
increased et of2014).
al., identification
possibilities
al., 2014). of plant methods based on
history analysis data mining
resulted accuracy.
in the accuracy.
development of identification methods based on data mining complication
accuracy.
The may result in the deterioration of model
(Peretzki et
(Peretzki
development
et al.,
al., 2014).
2014).
of2014).
identification methods based on data mining The existing
existing rescheduling
rescheduling techniques
techniques based based onon model
model update
update
(Peretzki
Production
Production et al.,
scheduling
scheduling seeks
seeks to
to ensure
ensure well-organized
well-organized and and accuracy.
The
The existing
existing
typically
typically do
do
rescheduling
rescheduling
not
not address
address
techniques
techniques
the
the needs
needs
based
based
of
of
on
on model
dynamic
dynamic model update
update
scheduling
scheduling
(Peretzki
Production
Production
smooth
smooth flow
et al., 2014). production
scheduling
scheduling
of
of all
seeks
seeks to
to ensure
ensure well-organized
well-organized
operations with
with efficient and The
and existing
typically do rescheduling
not address
address the
techniques
needs based
of on model
dynamic update
scheduling
Production
smooth
flow
flow
all production
scheduling
of all
operations
seeks to ensure
production well-organized
operations with and typically
efficient
efficient
because
The
typically
do rescheduling
the
existing
because the
do
not
rescheduling
rescheduling
not address
the
time
time
the
needs
techniquesis
needs
of
is stronglydynamic
based
strongly
of
scheduling
onlimited
model
limited
dynamic (Kelly,
update
(Kelly,
scheduling
smooth
adaptive flow
adaptive
Production
smooth flow
of all production
management
scheduling
management
of all and
and rational
seeks operations
to ensure
rational
production use of with
of
usewell-organized
operations
efficient
resources.
resources.
with It because
It
and
efficient because
2004,
2004,
the
the
2006).
typically do
2006).
rescheduling
rescheduling
not address
time
time
the
is
is strongly
needs strongly
of
limited
limited
dynamic
(Kelly,
(Kelly,
scheduling
adaptive
adaptive
comprises management
management
the
the following
and
and rational
rational
activities: use
use ofof resources.
resources. It because
It the rescheduling time is strongly limited (Kelly,
2004, 2006).
smooth
comprisesflow
adaptive of all production
following
comprises management
the following
activities:
and rational
following activities:
activities:
operations
use of with efficient
resources. It 2004,
because2006).
the rescheduling time is strongly limited (Kelly,
2004, 2006).
comprises
adaptive the
comprises management and rational use of resources. It 2004, 2006).
the following activities:
comprises©
Copyright
2405-8963
Copyright ©the
© 2018
2018,
2018 following
IFAC activities: Federation of Automatic Control)
IFAC (International
IFAC 316
316Hosting by Elsevier Ltd. All rights reserved.
Copyright
Peer review©
Copyright 2018
©under IFAC
2018 responsibility
IFAC 316
of International Federation of Automatic
316Control.
Copyright © 2018 IFAC
10.1016/j.ifacol.2018.08.302 316
Copyright © 2018 IFAC 316
IFAC INCOM 2018
Bergamo, Italy, June 11-13, 2018 A. Dolgui et al. / IFAC PapersOnLine 51-11 (2018) 316–321 317

For example, an updated schedule for a continuous Production resources may be described differently.
production should take effect prior to the model is reloaded
1) Some have qualitative characteristics which take on
with the current production data (such as material balances
specific values that may be checked against norms at
made late at night before the first day of the new schedule).
any moment.
The revision of nonlinear models is particularly difficult
2) The state of others such as certain equipment pieces,
(Vani, 2015). The state of manufacturing resources should be
may be exclusively either “working” or “not working”.
nevertheless assessed and predicted both to improve control
Remaining life time may be known or not for such
agility and to foresee the situations where schedule execution
resources. The process historian may however keep
becomes problematic or impossible. Such situations will be
failure statistics for a specific equipment piece;
further referred to as incidents.
maintenance downtime statistics may be also available
It may make sense to develop intelligent predictive models for a specific piece or similar kind of equipment.
describing the overall current state of resources employed to
3) One more resource type (including human resources) is
execute all production operations of a specific production
not subject to maintenance. In case of outage, such
process.
resources should be immediately replaced from the
As far as all available resources are described differently and backlog. The replacement process is typically fast;
have different operation patterns, unified models are offered therefore, no values other than 1 (OK) and 0 (not OK)
enabling the prediction of a manufacturing situation as a should be assigned to such resource.
whole using a knowledgebase and associative rules generated
Assume a model of a specific manufacturing situation as a
purposefully for a specific production process.
dynamic schedule fragment comprises the following
Further consideration will comprise manufacturing components:
processes, production scheduling, supply management and
��� ��� � �� �� �� �� �� �� �� �� �� �� ���� ,
other industrial areas with their specific operations.
The further content of this paper is organized as follows. where:

The next section examines a development technique for � �� ��� ��� �


production process’s resource state model. Section 3 is a resource identifier including the resource number, the
describes the resource state prediction algorithms based on operation number and the time stamp (the number of
data mining techniques. Section 4 provides more details of an characteristics may be increased).
algorithm based on associative rules. An application case
from an engineering plant in discussed in Section 5. The Other components of the resource state vector at the time
Conclusion formulates the paper’s summary and offers an moment � may be represented by a binary code.
application outlook. � �� � is the code of the numerical value of a state variable;
this code is different for each of the above-listed resource
2. MANUFACTURING RESOURCE STATE MODELS types.

The term “production resources” will hereinafter mean the � �� �, � �� �, and � �� � will be discussed further.
following: Consider the resources whose state may be described by
 input flows characterized by formal properties some quantitative characteristic, such as inlet flowrate or
dependent on production specificity temperature for chemical processes or an average equipment
failure number.
 production equipment
The solution to the problem described above is based on the
���������� ��� � �� � � �� � � �� � � � possibility of representing numerical values of resource
state’s characteristics as a set of nested self-similar structures
where i is a number of production equipment unit and (Fink, 2007).
other facilities used for performing the j-th operation;
For a specific resource, we assume that the characteristic if its
 human resources state possesses the values on the half-interval [0; 1) (this half-
���������� ��� � �� � � �� � � �� � � � interval was chosen as an example for simplicity, the results
can be easily spread to any other).
involved in the j-the operation; i is number of human
resources; This half-interval can be represented as the union

 other factors [0; 0.5 )∪[0.5; 1).

��������� ��� � �� � � �� � � �� � � � We will further correspond the symbols {0; 1} to the left and
right half-intervals respectively, namely, 0 to the left half-
affecting the j-the operation such as energy resources interval, and 1 to the right one.
and a variety of formal indices and factors related with
the production process. i is number of such factors.

317
IFAC INCOM 2018
318
Bergamo, Italy, June 11-13, 2018 A. Dolgui et al. / IFAC PapersOnLine 51-11 (2018) 316–321

Each of the two subintervals can be further split in the same situation at the current time instant (from the viewpoint of the
way, and, again, the values 0 and 1 can be assigned to the left state of its resources).
and the right parts respectively.
To a great extent (under the assumption of insignificant
In that way, a finite chain of symbols from {0; 1} has a one- influence of external factors on the system) this state
to-one correspondence with a half-interval embedded in [0; determines the possibility of plan execution according to
1). schedule.
For example, the chain 00 corresponds to [0; 0.25), and the
3. RESOURCE STATE PREDICTION FOR PRODUCTION
chain 100 to [0.5; 0.625) respectively. For a binary partition a
PROCESSES BY MEANS OF ASSOCIATIVE RULES
chain of n symbols corresponds to a half-interval with the

length � (Fig. 1). For the developed binary chain, a forecast may be obtained

using data mining techniques. It makes sense to apply the
methods named association rules search.
As against the associative search technique (Bakhtadze et al.,
2007), where:
(i) associations of dynamic system’s states are developed
with the help of a database and cluster analysis,
(ii) parameter values are further calculated by means of least
square technique, and
(iii) the system’s output is finally predicted,

Fig. 1. A code for the numerical value of a quantity on the the association rules search allows to (Agrawal et al., 1993;
half-interval [0;1) Agrawal and Srikant, 1994):
 reveal the hidden statistical relationship between the
This way, for each value of a numerical characteristic at the state of various resources at different time instants
current time moment we obtain a code of zeros and ones. The without any preliminary statistical analysis,
number of positions, as we show further, will determine the
accuracy of prediction.  operate with state variables of various resources
differing in their characteristics the ways of
For the resources from the categories 2 and 3, the respective formalization.
codes will have the same value in all positions (either 1 or 0).
A forecast of a state described by a binary chain with an
� �� � is the code of the time before the maintenance end. If identifier can be obtained by revealing the most probable
a resource is available and operated, the respective code combination of two binary sets of values at a fixed time
consists of ones. instant and at the next instant (a one-step forecast).
� �� � is the code of the time before the equipment piece A more distant prediction horizon is also possible.
fails with the probability close to 1 (remaining life).
In the scheduling practice, this time is not less than the 4. ASSOCIATION RULES SEARCH TECHNIQUES
operating time. However, resource replacement just during
the operation may be sometimes more cost-effective. Association rules search techniques as a method used in
Moreover, the equipment piece may fail unexpectedly. knowledge discovering databases (KDD) was developed for
searching the regularities of relationship between events from
For resource types from categories 1 and 3, � �� � has ones the same set Z (Agrawal et al., 1993; Agrawal and Srikant,
in all positions. 1994).
� �� � is the time before the scheduled end of the operation. In the process of solving the problem of searching for
In real-life manufacturing situations, time may be wasted associative rules, regularities are found between the related
(with the need in schedule update) for the reasons neither events in the data set. The search for regularities is not based
stipulated in the production model nor caused by equipment on the properties of the analyzed object, but between several
failures. events that occur at the same time.
Generally, it is hardly possible to formalize all such causes of In particular, these regularities can be expressed by the
schedule disruption. Therefore, their consolidation as the implication “if X then Y”, �,∈Z.
“remaining plan execution time” is a way to allow for these
hidden factors in the production state model. Assume that for a specific operation j at a fixed time instant t
there are sets of elements
So, for each resource with reference to a specific production
operationб we have defined a binary code, having its time � � ��� �� � � �� � � �
identifier. By combining resource codes with equal values of
this identifier, we obtain a description of the manufacturing

318
IFAC INCOM 2018
Bergamo, Italy, June 11-13, 2018 A. Dolgui et al. / IFAC PapersOnLine 51-11 (2018) 316–321 319

under investigation. These sets supplemented by the Rule confidence is the probability of the implication
identifiers � �� ��� ��� � are called transactions:
“XY” (in our case, the probability of forecast correctness).
�� ���={��� ���|��� ����R, ��� },
The rule “X implies Y” is correct with the confidence C, if
T={�� ���� � � �� � � �� � � ���� � �� }. C% of transactions from the total, which contain the set of
elements X contain also the set of elements Y.
Data analysis based on association rules search aims at the
search of combinations commonly encountered together. Subjective metrics are determined by task specificity. Those
may include the following.
We introduce the term transaction of transition:
Lift (L) is the ratio of the rate of certain condition appearance
�� ���={��� ���� ��� �� � ��|��� ����R, ��� }, in transactions which also contain the consequence, to the
rate of appearance of the whole consequence.
T={�� ���� � � �� � � �� � � ���� � �� }.
L > 1 means that the condition appears in transactions
Such transactions contain combined sets of resource state
containing also the consequence more often than in the rest of
data at a certain time instant and at the next one. The
transactions.
collection of transactions forms the content of the
knowledgebase of the specific production process. Lift can be understood as a generalized measure of the
relation between two sets of objects: for L > 1 the relation is
Assume further that X is a set of elements R, ��R. positive, for L < 1 it is negative, and for L = 0 it lacks.
An implication ���, where ���, ���, �∩�=∅ is called Significance metric called leverage is a difference between
association rule. the rate of simultaneous appearance of the condition and the
Each association rule is characterized by two features: consequence (i.e., the association support) and the product of
support and confidence. appearance rates (supports) of the condition and the
consequence separately.
In our case, the association rule consists of two resource sets
called an antecedent (X) and a consequent (Y) with the The leverage is based on the assumption that if the cause and
relationship “X implies Y”. the consequence are independent then the rule support will
not differ significantly from the product of the condition’s
In particular, X can be the current set of resource state values, and the consequence’s supports (i.e., the cause and the effect
and Y can be the same set at the next time step. appear together with approximately the same frequency as
The objective of production process state prediction is the separately).
establishment of the fact that the production equipment is The above mentioned metrics are used for restricting
serviceable and available for operation, and the operating association sets by setting a lower significance threshold for
time for each operation will not go beyond the schedule. associations rejection.
According to standard models proposed for various resource Support and confidence of association rules is limited by
types this means revealing the following relationships: if a preassigned minimum and maximum values. The thresholds
transaction contains a certain set of elements (consisting of for support and confidence are assigned in such a way that
zeros and ones) then a conclusion can be made that a new set the number of rules found would be limited.
of elements Y will appear at the next time step.
For too high support value there is no need in applying rules
In the association rules search, the metrics describing the search algorithms because the rules will be rather obvious.
significance of the rule are used. Objective and subjective Instead, too low support value requires the development of
metrics should be discerned. The objective ones include too many new rules with respective high consumption of
support and confidence. computational resource.
The rule “X implies Y” is characterized by the support S, if However, at low support threshold rather interesting though
S% of transactions (in our case, pairs of transactions: at the groundless rules can be sometimes found.
current time step and at the next one) of their total number
contain the sets of elements X and Y. After assigning support and confidence thresholds, we select
among all associations that can be pulled from the database,
Let TXY be a number of transactions containing X and Y; those ones for which these two metrics exceed the thresholds.
T is the total number of transactions (both containing and not However, the number of selected associations is too large that
containing X and/or Y). poses significant computational challenges.
Then the support (S) of an association rule is defined as Therefore, special association rules search algorithms are
used in practice enabling search space reduction (Agrawal
S = TXY / P. and Srikant, 1994).
The support is construed as the rule appearance rate. The A priori algorithm based on the popular (frequently
confidence (C) of the association rule is defined as: encountered) set concept is among the most commonly used.
C = ТXY / PX. A frequently encountered set is the one whose appearance

319
IFAC INCOM 2018
320
Bergamo, Italy, June 11-13, 2018 A. Dolgui et al. / IFAC PapersOnLine 51-11 (2018) 316–321

rate in the total of transactions exceeds the preassigned level rule, the color of the histogram changes from blue to red in
(Witten et al., 2011). accordance with the level of confidence (from the smaller to
the larger).
The a priori algorithm comprises the following phases:
 revealing popular sets,
 development (search) of associative rules meeting
specified support and confidence constraints.
We propose a modified algorithm for revealing commonly
encountered sets. Its features are as follows:
1) The sets are revealed with coincidences in all positions
within subsets ൏ ʠଶ ൐ǡ ൏ ʠଷ ൐ǡ ൏ ʠସ ൐ǡ ൏ ʠହ ൐;
2) The sets are revealed with coincidences in all positions
within subsets ൏ ʠଶ ൐ǡ ൏ ʠଷ ൐ǡ ൏ ʠସ ൐ǡ ൏ ʠହ ൐ except
for the farthest “left” positions corresponding to the
farthest right subintervals of the values of resources’
characteristics;
3) Coincidences over all other positions are further
considered. Fig. 2. Visualization of association rules search procedure

On the algorithm’s second and the third steps, the sets are 5. APPLICATIONS
considered that correspond to the rougher “similarity”
conforming to an extended interval chosen at the coding. Some figures from the truck assembly process at KAMAZ
To search for associative rules, standard software can be JSC works in Russia, argue for the practical value of the
used, for example XELOPES LIBRARY developed by the technique proposed.
Prudsys AG company. Assembly on the line means that a truck is moved to the next
The library architecture conforms to the MDA (Model Driven assembly cell after all operations associated with the previous
Architecture) standard. cell are completed. The assembly process comprises of the
following operations:
 chassis frame is transported to the production line
 brackets and the driving bogie are mounted on the frame
 brake system is mounted, tested and tuned
 electrics, engine, cooling and feed systems, fuel and air-
take equipment, truck cab, and platform are mounted.
The assembly process employs the following resources:
 equipment (> 10 000 pieces)
 tools (>180 000)
 utilities (electricity, process gases, instrument air,
superheated water)
 process materials such as lube oils, special liquids,
chemicals, paints and paint bases, metals and nonmetals
(> 6 000 items altogether)
 design and engineering documentation (> 2 000 variants).
The model of associative rules in the XELOPES LIBRARY Abnormal situations resulting in assembly process
is represented in the form of three-dimensional histograms disruptions with the need for rescheduling or even mid-term
(Fig. 2). re-planning may happen if the abovementioned resources are
unavailable or in poor condition, as well as in the absence of
The subsets of the frequent sets are laid along the axes of the human resources (70 positions for each of the two production
plane. On the left axis, the transactions are represented lines), the lack of component parts or their wrong form
corresponding to the accidents, on the right – to the factors (78 000 form factors for KAMAZ own production and
consequent. A histogram is drawn at their intersection. The 23 000 purchased ones).
height of the histogram reflects the level of support for the

320
IFAC INCOM 2018
Bergamo, Italy, June 11-13, 2018 A. Dolgui et al. / IFAC PapersOnLine 51-11 (2018) 316–321 321

6. CONCLUSIONS Bakhtadze, N., Lototsky, V., Maximov, E., and B. Pavlov


(2007). Associative Search Models in Industrial
An intelligent approach was presented to the development of systems, IFAC Proceedings Volumes, vol. 40, no. 3, pp.
algorithms for production process state prediction at complex 105-108.
multiproduct plants featuring a large number of
Fink, Jiří (2007). Perfect matchings extend to Hamiltonian
manufacturing operations. Various production resource types
cycles in hypercubes, Journal of Combinatorial Theory,
were described by a single unified expression.
Series B 97, pp. 1074-1076.
For predicting resource state represented by a binary code,
Grossmann, I.E., Van den Heever, S.A., and I. Harjunkoski
the association rules search method is applied. A modified a
(2002). Discrete optimization methods and their role in
priori algorithm provides for the selection of associations
the integration of planning and scheduling. AIChE
ensuring gradual decrease of resource coding accuracy.
Symposium Series, vol. 98, no. 326, pp.150-168.
The technique proposed enables effective analysis of a
Kelly, J.D. (2004). Formulating Production Planning models,
manufacturing situation, bottlenecks detection, and
Chemical Engineering Progress, January, pp. 43-50.
prevention of events that may result in production
rescheduling. Kelly, J.D. (2006). Logistics: the missing link in blend
scheduling optimization, Hydrocarbon Processing, June,
Timely detection and prediction of contingencies, such as
pp. 45-51.
components delivery delays, may prevent an abnormal
situation and avoid major losses caused by schedule Leung, J.Y.T. (2004). Handbook of Scheduling, Boca Raton,
disruptions. Florida, Chapman & Hall/CRC.
System training enables the development of a knowledgebase Peretzki, D., Isaksson, A., Carvalho Bittencourt, A. and K.
underlying further contingency predictions. Forsman (2014). Data mining of historic data for
process identification. Linköping University Electronic
The approach proposed can be also used in production
Press, http://manualzz.com/doc/8482583/modeling-
scheduling and business management.
anddiagnosis-of-friction-and-wear-in-industrial .
REFERENCES Vani, K. ( 2015). Comparative Analysis of Association Rule
Mining Algorithms Based on Performance Survey,
Agrawal, R., Imielinski, T., and A. Swami (1993). Mining International journal of Computer Science and
Associations between Sets of Items in Massive Information Technologies, vol. 6, no. 4, pp. 3980-3985.
Databases, Proc. of the 1993. ACM-SIGMOD Witten, Ian H., Frank, Eibe, and Mark A. Hall, (2011). Data
International Conference on Management of Data, pp. Mining: Practical Machine Learning Tools and
207-216.
Techniques, 664 p., 3rd Edition, Morgan Kaufmann.
Agrawal, R., and R. Srikant (1994). Fast Discovery of
Association Rules, Proc. of the 20th International
Conference on VLDB, Santiago, Chile.
Al-Otabi, G.A., and M.D. Stewart (2004). Simulation model
determines optimal tank farm design, Oil & Gas Journal,
vol. 102, no. 7, pp. 50-55.

321

You might also like