Professional Documents
Culture Documents
ABSTRACT
The paper presents research into use of data mining methods for knowledge
discovery in mining databases. The data was collected using VIMS system of
Caterpillar installed on several trucks operating in a surface mine. It was mined
with IBM Intelligent Miner for data. Data mining was found to allow for
identification and quantification of relations between the various types of
VIMS data. As such it offers the potential for development of a truck model
that can be used for prognosticating truck condition and performance.
Development of this capability requires further research.
INTRODUCTION
Modern mining equipment is fitted with numerous sensors that monitor its condition
and performance. The data collected by these sensors is used to alert the operator to
existence of abnormal operating conditions and to perform emergency shut-own if the
pre-set values of the monitoring parameters are exceeded. This data is also used for post-
failure diagnostics and for reporting and analysis of equipment performance.
It is believed that availability of this voluminous data, together with availability of
sophisticated data processing methods and tools, may allow for extraction of a variety of
additional information contained in the data. One method that may be of value is data
mining (Golosinski, 2001).
The research presented in this paper analyzes data collected from various sensors
installed on several mining trucks with the purpose to develop a model of truck operation
that may facilitate reliable projection of truck performance and its condition into the
future. Data was collected from Caterpillar 789B trucks equipped with VIMS (Vital
Information Management Systems, during the period of January to October 2000. IBM
Intelligent Miner for Data was used to conduct data mining.:
VIMS OPERATION
Caterpillar's Vital Information Management System (VIMS) is installed on selected
CAT mining equipment. It is intended to assist with machine management by informing
operators, service personnel and supervisors of the status of selected machine functions
and by providing information on equipment production and performance. VIMS monitors
and records parameters of numerous sensors that are integrated into the vehicle design. It
has the capacity to alert the operator if these parameters exceed the pre-set critical values.
In addition it can conduct emergency equipment shut-down if so programmed
(Caterpillar, 2000).
On-board VIMS unit records the collected data as well as occurrence of certain
VIMS events. The recorded data can be downloaded into a notebook computer.
Alternately it can be sent to the central control unit via radio (VIMS Wireless).
VIMS DATA
VIMS records data in seven different formats. These are:
Event Summary List (ESL). A VIMS event is recorded when the measured value of a
monitored parameter exceeds that considered acceptable. Event List is a record of events
that are occurring on the machine. It is limited to the last 500 events, listed in a
chronological order.
Snapshot. Snapshot stores a segment of machine history that consists of values of all
monitored parameters recorded at one-second interval. The snapshot is triggered by
VIMS event and as such it is related to abnormal condition or emergency situation of the
machine.
Data Logger. Data Logger records values of all the machine parameters that are
monitored by VIMS and sampled at one-second intervals. The logger is started and
stopped by the operator command and can record data for up to 30 minutes.
Trends. Trends record the minimums, maximums and averages of the selected
machine condition parameters for a pre-selected period of time.
Cumulative. Cumulative records the number of occurrences of specific events over a
pre-set period of time. An example of cumulative information can be the engine
revolutions or fuel consumption over the life of the machine, or its component.
Histogram. Histogram records the performance history of a selected parameter since
last reset. For example a histogram of the engine speed would indicate the percentages of
time that the engine operated within a pre-specified speed ranges.
Payload. Payload carried by the machine can be recorded if so specified and
providing that the machine is equipped with an appropriate sensor.
Four different data types are recorded. These are:
Sensed Data. This data contains values of sensor parameters and position of
switches installed on the machine.
Internal Data. This data is generated internally within VIMS main module. It
includes records of date and time.
Communicated Data. This data is acquired through the data links to various
machine components, including non-CAT components. For example the engine speed
may be monitored and recorded through the data link to the electronic engine control
system.
Calculated Data. This data is calculated by the VIMS main module as a function of
other data that is being collected. As an example event duration may be calculated based
on internal data and stored in the event list.
INTELLIGENT MINER
Variety of data mining software is available from numerous vendors. It includes
Intelligent Miner of International Business Machines Corporation, MineSet of Silicon
Graphic Inc., Clementine of Integral Solutions Limited of U.K. and other (Westphal and
Blaxton, 1998). The IBM Intelligent Miner (IM) version 6.1 was used for data mining
reported in this paper (IBM, 2000). It offers a choice of algorithms, is easy to use, and
has proven itself useful in many commercial applications.
Following mining and statistical functions are included in Intelligent Miner:
1. Mining functions: associations, demographic and neural clustering, sequential
patterns and similar sequences, tree and neural classification, and neural and RBF (Radial
Basis Function) prediction.
2. Statistics functions: bivariate statistics, linear regression, principal component
analysis, univariate curve fitting and factor analysis.
The IM allows modeling of events and processes that can be either usual or unusual.
Usual events describe the situation that is considered normal and for which the relations
between different attributes are sought. For example, relations between truck operating
and mechanical attributes can be defined such as a relation between engine load and truck
payload. Definition and quantification of these relations may be of help in improving
efficiency of truck operation or help with operator training.
The unusual events are failures of the monitored machine or its component. Data
mining of these events may allow for definition of algorithms that would facilitate
modeling of truck operation to help with planning of its maintenance and reduction of
downtime.
To facilitate data mining of VIMS databases with IM the data format has to be
adapted to that acceptable to the IM. The original VIMS data, downloaded from an on-
board VIMS unit, can be easily merged into MS Access 97 database using the VIMS
PC99 software. However, IM does not accept Access data format and to facilitate its use
data has to be transferred to DB2 database that is compatible with version 6.1 of
Intelligent Miner.
DISCUSSION
Work presented in this paper indicates that data mining of VIMS generated data
bases allows for discovery of knowledge contained in these databases. In particular
relations that exist between various VIMS-collected data can be identified, described and
quantified. This can be achieved through use of two IM statistical functions: linear
regression and factor analysis. While both these functions can serve this purpose, Factor
Analysis allows to significantly reduce the number of variables that need to be considered
and groups all related parameters into factors.
The clustering can be used to segment VIMS database through grouping of data that
have similar characteristics. This allows to idnetify the paramterers that are of key
importance to truck performance. Thus, as an example, the parameters that influence
truck fuel consumption rate can be defined. Further work is needed to fully define
applicability of custering to the problem at hand. It appears that clear definition of
relations or goals sought may be needed to realize the full potential of this method.
The classification was applied to interpret clustering results. It allowed for
quantification of impact that various VIMS parameters have on truck fuel consumption. It
was further proven that classification can be used to build a model that describes the
behaviour of VIMS parameters that are of interest. The research indicates that
classification alone does not yield meaningfull results. It does, however, yield these
results if used in conjunction with clustering.
Sequential patterns are usually used to find predictable patterns of behavior for a
given phenomenon over a period of time. In relation to VIMS parameters the intent was
to be able to predict occurrence of a specific event based on occurrence of similar event
in the past. However, this approach was unsuccessfull. While a multitude of similar
patterns was identified to exist in the data, its wariability was to large to permit draving
of valid conclusions on their repeatibility. It is believed that more complete VIMS
database, collected over extended period of time and in a variety of truck operating
condition may overcome this problem, and permit development of predictive models of
truck behaviour.
Ultimate goal of this work is to construct a model of truck operation that would
allow projection of truck condition and performance into future. The relations discovered
to exist between various VIMS parameters lay a foundation for this modeling work. Of
particular importance is use of Factor Analysis that reduces the number of parameters to
manageable level. Classification and clustering allows for analysis of truck performance
and indirectly its optimization. Use of sequential patterns need further study.
CONCLUSIONS
The investigations presented above prove that data mining can be used to analyze
performance of mining equipment. In particular the relations between its various
Bernson, A. and Smith, S.J. 1997. “Data Warehousing, Data Mining and OLAP”. McGraw-Hill.
Westphal, C. and Blaxton, T. 1998. “Data Mining Solutions”. John Wiley and Sons, Inc.
Caterpillar, Inc. 1999. “Vital Information management System (VIMS): System Operation Testing
and Adjusting”. Company publication.
IBM (International Business Machines Corporation). 2000. Manual: “Using the Intelligent Miner for
Data”. Company publication.
Golosinski. 2001. “Data Mining Uses in Mining”. Proceedings, APCOM 2001, Beijing, China.
Golosinski, T.S., Hu, Hui and Elias, R. 2001. “Data Mining VIMS for Information on Truck
Condition”. APCOM 2001, Beijing, China.
Ataman, K. 2001. M.S. Thesis: “Data Mining for Prediction of Condition and Performance of Mine
Machinery”. University of Missouri-Rolla publication.Madiba E. 2001. M.S. Thesis:
“Application of IBM DB2 Intelligent Miner for data to mine Vital Information Management
System (VIMS) data”. University of Missouri-Rolla publication.
operating, condition and performance parameters can be defined, and quantified using
regression and factor analysis methods.
VIMS provides variety of data that quantify truck condition and performance. To
maximize use of this data it needs to be collected continuously over expended periods of
time and under a variety of operating and climatic conditions.
Intelligent Miner contains a variety of data mining tools, many of which can be used
to successfully data mine VIMS databases. However, the input data format of IM is not
compatible with that of VIMS generated data. Therefore an interface between the two is
needed that facilitates fast data reformatting.
More investigations are needed to fully define the applicability of data mining to
knowledge discovery in VIMS databases. Use of databases collected over extended
period of time and containing data external to VIMS that define truck operating condition
may be the key factor in determining this applicability.
ACKNOWLEDGEMENT
Financial support of investigations presented in this paper by Univeristy of Missouri
Research Board is gratefully acknowledged.
REFERENCES