Professional Documents
Culture Documents
fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000.
Digital Object Identifier 10.1109/ACCESS.2019.DOI
ABSTRACT In real-world applications – to minimize the impact of failures – machinery is often monitored
by various sensors. Their role comes down to acquiring data and sending it to a more powerful entity, such
as an embedded computer or cloud server. There have been attempts to reduce the computational effort
related to data processing in order to use edge computing for predictive maintenance. The aim of this paper
is to push the boundaries even further by proposing a novel architecture, in which processing is moved to
the sensors themselves thanks to decrease of computational complexity given by the usage of compressed
recurrent neural networks. A sensor processes data locally, and then wirelessly sends only a single packet
with the probability that the machine is working incorrectly. We show that local processing of the data on
ultra-low power wireless sensors gives comparable outcomes in terms of accuracy but much better results
in terms of energy consumption that transferring of the raw data. The proposed ultra-low power hardware
and firmware architecture makes it possible to use sensors powered by harvested energy while maintaining
high confidentiality levels of the failure prediction previously offered by more powerful mains-powered
computational platforms.
INDEX TERMS IoT, internet of things, smart sensors, predictive maintenance, compressed recurrent neural
networks, RNN, edge computing, induction motors, bearing faults, vibration signature, electric motors.
VOLUME 4, 2019 1
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
• In section III, we describe artificial intelligence tech- manually engineered, but rely on data gained by automated
niques that can be used to predict bearing faults. learning. While this does give good results, it also typically
• In section IV, we introduce the dataset on which is used leads to a loss in interpretability, i.e. human ability to under-
to train the neural networks designated to work on low stand the network’s reasoning process [16]. Expert systems
power devices, are one approach to remediating this issue. In their simplest
• In section V, we describe techniques that can be used to form, they are a set of rules in the form of IF this THEN
minimize the power consumption and memory footprint that. This introduces clear interpretability, but may come at a
of said techniques. loss of accuracy. Fuzzy logic, an extension of Boolean logic
• In section VI, we propose ultra-low power architecture in which truth values can be anywhere between 0 and 1, is
as a computational platform for running proposed artifi- also used due to its flexibility [17]. Finally, there are support
cial intelligence algorithm, vector machines, which – for a given input – map it to a vector
• Finally, in section VII, we show the results of our exper- in such a way that vectors corresponding to inputs belonging
iments including tests of wireless sensor with ultra-low to different categories (in this case: fault or lack thereof) are
power consumption used for predictive maintenance of separated by as wide gap as possible. The use of support
induction motors, vector machines for bearing fault detection is extensively
• In section VIII, we discuss the obtained results. described in [15]. In this paper, we focus on artificial neural
network techniques.
II. STATE OF THE ART
Research shows that 75% of all induction motor failures are III. RECURRENT NEURAL NETWORKS
caused by stator or bearing faults, with about 40% of those A. INTRODUCTION TO NEURAL NETWORKS
being bearing faults [11] (41% – according to IEEE, 42% Neural networks are a popular machine learning approach
– according to Electric Power Research Institute EPRI). It to pattern recognition which does not require hand-crafting
follows that rolling elements are critical parts of motors. of features to analyze, and usually makes creating a model
Bearing faults can be divided into two categories: localized for data analysis without domain-specific knowledge much
and distributed [12]. The former group contains cracks, pits easier. They typically consist of a number of layers, each of
and spalls defects on the rolling surfaces mainly caused by which is fed with an input vector and produces an output
fatigue cracks below the surface which may happen after vector, known as the activation. The input vector of each
overloading or shock loading the bearings when a small piece layer is some combination of the activations of previous lay-
of metal is removed due to this shock, resulting in a small ers and the network’s inputs, with some optional additional
pit or spall. Defects from the latter group are mainly caused components discussed later. The output of a neural network
by a manufacturing error, improper installation or abrasive is simply the output of its last layer. Each layer performs
wear [13]. A varying contact force between rolling elements some mathematical operations, like multiplication by the
and raceways due to defects results in an increased vibration weight matrix followed by addition of the bias vector and the
level. Despite the fact that vibrations in bearings always application of a (typically) point-wise function known as the
occur (because of a finite number of rolling elements to carry activation function. Any differentiable function can be used
the load, which – during rotation – change their position as an activation function.
in the load zone, causing a periodical variation of the total Neural networks can be further categorized by their archi-
stiffness of the bearing assembly [14]), their measurements tecture, i.e. the way the layers in them are interconnected.
and analysis are very often used in condition monitoring of The simplest case is the feed-forward neural network, in
bearings [15]. In the next section we describe the techniques which each of layer operates on the output of the previous
used for that. one. The applications the simples architecture are limited, in
part because it has a fixed input and output size, meaning
A. DIAGNOSIS METHODS that it would be unable to process a sequence of arbitrary
In order to perform automatic fault detection, the raw sig- length. This is also one of the reasons why we decided
nal is often preprocessed in a procedure known as feature to use recurrent neural network (RNN), which additionally
extraction [15]. This aims to reduce the amount of data that are specifically suited for the task of time series analysis.
a diagnostic system would otherwise have to deal with, and Conceptual comparison of selected neural network types are
instead focus on the relevant aspects thereof, represented by shown in Fig. 1.
these features. For example, the Fourier transform of the
signal could be used as a feature. The following paragraphs B. INTRODUCTION TO RECURRENT NEURAL
provide an overview of methods which can be used with or NETWORKS
without feature extraction. In contrast to other approaches, recurrent neural networks
Artificial neural networks, described in more detail in (RRN) operate on the assumption that additional information
section III, are a machine learning method which has been needed to solve the given task can be extracted from tem-
widely explored with different configurations and features poral dependencies between observations. They operate in
[15]. To a significant extent, the neural networks are not timesteps, with each timestep t bringing in a new observation
2 VOLUME 4, 2019
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
FIGURE 1. Architectures of a standard feed-forward neural network (NN) (top), RNN (recurrent NN) (middle) and mapping of outputs of RNN (bottom).
x(t) . Crucially, they are also stateful – they have a way of to that, the cell is able to control what information is stored
“memorizing” some information, in the form of a hidden for further use.
state h(t) , to utilize later. In timestep t, the network has access There are several variants of LSTM architectures, some of
to h(t 1) and produces h(t) . In fact, the network operates which containt peephole connections [21]. However, they are
recursively on its own output from the precious timestep – not used in this work.
and hence the name. The simplest setup of RNN is formally Formally, the operations of LSTM cells is governed by the
described by equations (1) and (2). following vector equations:
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
x(t) , h(t 1)
x(t) , h(t 1)
x(t) , h(t 1)
LSTM
f (t)
x(t) , h(t 1)
h(t)
i(t) C (t) o(t)
function (11), which produces a vector whose elements are This dataset consists of the results of three similar experi-
non-negative and add up to 1 – each can be interpreted as ments carried out on electric induction motors, which were
the probability that the observed input sequence belongs to driven for long periods of time until a bearing fault occurred.
its corresponding class. In our case, we have two classes – In each experiment, there were four bearings, each with
‘normal functioning’ and ‘anomalous functioning’. one or two single axis accelerometer attached. With minor
exceptions, measurements were gathered over a period of one
y (t) = softmax(Wyh h(t) + by ) (10) second every 10 minutes, at a sampling frequency of 20kHz.
exi The duration of these experiments ranges from less than a
softmax(x)i = Pn
j=1 e
xj week to over a month.
We labelled each data sample as coming from a machine
Note that the softmax function is not point-wise, it operates that is normal functioning, anomalous functioning (a bearing
on an entire vector. fault), or unknown. We labelled the first half of each experi-
ment as normal functioning and the last few days (3 in experi-
E. TRAINING NEURAL NETWORKS ment No.2, 10% of experiment duration in experiments No.1
Neural networks are typically trained using an algorithm and 3) as anomalous functioning. The samples in between
called backpropagation [19]. To used it, a loss function is we labelled as unknown. We split the dataset into three parts:
required – it is a measure of how well the neural network one used for training the AI, the second for validating the
is performing, with greater values indicating poorer results. AI’s progress during training, and the third one for testing.
The derivatives of this function with respect to every trainable The training set consists of about 50% of the data, with the
parameter in the network (weight matrix, bias vector) are other two having about 25% of data each.
calculated, and gradient descent is performed in an attempt
to minimize the loss. A. TAILORING THE DATASET TO WSN LIMITATIONS
In recurrent neural networks a variation of backpropaga- The dataset was acquired using advanced sensors. Wireless
tion known as backpropagation through time [20] is used, sensor nodes that have very constrained power budget so it
and number of timesteps for which the gradient is back is not possible to use high sampling frequencies for longer
propagated is called lookback [18], [20]. intervals without a risk of draining the battery. For that
reason we downsampled the data to create learning datasets
IV. DATASET for RNN with sampling frequencies of 100Hz, 200Hz and
We demonstrate the capabilities of the proposed artificial 400Hz to explore tradeoffs between computational efficiency
neural network system on the NASA bearing dataset [22]. of processing the data and accuracy of the results. The other
4 VOLUME 4, 2019
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
FIGURE 3. Spectrogram of accelerations from test No. 2 with original 20kHz FIGURE 4. Spectrogram of the accelerations from the same experiment as
sampling frequency. It is easy to see the moment the bearing fault occurred. shown in Fig. 3 sampled with frequency of 100Hz. The increase in vibration
Acceleration scale is not linear: ⌅ – 20g, ⌅ – 2g, ⌅ – 0.4g, ⌅ – 0g. due to bearing faults remains visible despite much lower sampling frequency.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
Hiperbolic tangent quantization preserves precision for have been developed as a part of PRIME project (Ultra-Low
lower absolute values: Power Technologies and Memory Architectures for IoT).
✓ ⌫ ◆
tanh(w) + 1 1
thq(w) = arctanh sthq · + 1 A. WIRELESS COMMUNICATION
sthq 2
2 Multiple wireless sensor networks use IEEE 802.15.4 proto-
sthq = col for data communication using low-data-rate, low-power
2bits 1
radio transmission [32]. One of them is GreenNet, which
The process described in this section that makes neural
has been initially developed by STMicroelectronics and then
network smaller and less demanding in terms of computa-
by Université de Grenoble and Cezamat [33]. We use it in
tional power is shown in Fig. 5. In the next section we will
our design at 2.4GHz frequency band. To improve energy
present the ultra-low power wireless platform on which the
efficiency and robustness, it heavily rely on beacon-enabled
compressed neural networks will be running.
mode. This helps in minimization of the activity time of
the sensor node: it only wakes up at regular time intervals
Quantized (8-bits) (according to beacons sent by a network coordinator), spend-
(0000.0000)2 ing the remaining time in deep sleep mode. This aggressive
Original Pruned
(0011.1111)2
energy conservation scheme helps in achieving an average
(0001.11110101)2 (0000.00000000)2 energy consumption at a level of tens of µW – despite the
(0011.11110101)2 (0011.11110101)2
Quantized (4-bits) fact that, during normal activity, nodes consume tens of mW.
(0000)2 Power consumption profile during activity period, i.e. node
(0011)2
wakeup, beacon reception, data transmission and transition
to sleep mode is presented in Fig. 9.
FIGURE 5. The process of compressing neural network parameters using
prunning and quantization techniques to speed up computations on low power Having insight into the power requirements of radio trans-
devices such as wireless sensor nodes. mission, we might consider when it is more energy efficient
to analyze the data locally and when to transmit them as they
are. Data transmission of a single packet as shown in Fig. 9
VI. WIRELESS SENSOR NODES WITH AI
consumes about 0.5mJ of energy. Suppose that the sensor
The general hardware architecture of a WSN node can be
(three-axis accelerometer) is configured to have sampling
divided into four subsystems [29], [30]:
frequency of 100Hz. It means that the sensor generates one
• Communication subsystem for wireless data transmis-
hundred triplets describing acceleration along X, Y and Z
sion, consisting of radio transceiver and antenna, axis (plus one hundred and one if temperature is also reg-
• Computing subsystem for data processing and man-
istered) every second. Even if only the 8 most significant bits
aging of the node, consisting of microcontroller unit of each number are taken, one second of sensing generates
(MCU) with processor core, memory, timers and input- almost a kilobyte of data to be send. Increasing the sampling
output ports, frequency to 400Hz gives about 32kB. The IEEE 802.15.4
• Sensing subsystem, for acquiring data about the envi-
PHY standard only supports frames of up to 127 bytes, of
ronment, consisting of a set of sensors communicating which up to 25 bytes (depending on the addressing scheme)
with MCU, is reserved. This means that transmitting a single sample
• Power subsystem, for providing and regulating power
supply voltage, consisting of energy storage, voltage
regulation and optionally an energy harvester.
The most fundamental requirement in design of WSN
is its energy efficiency, because replacement or recharging
of batteries is difficult due to large number of nodes or
limited access to the place where they are deployed [31].
Applicability of a WSN is considered adequate if its lifetime
is at last several months [30]. This is one of that reasons
why sensors used in nodes are mostly low-power, small-sized
and factory-calibrated, it helps to unload MCU and keep its
computational time at minimum.
The current paradigm of wireless sensor nodes is about
gathering raw data and then transmitting it to a more pow-
erful device for further processing. Our approach pushes the
boundaries of this paradigm by performing actual processing FIGURE 6. Measurements of the power consumption of a WSN node during
sensor activity. At first the node wakes up before expected time of arrival of
on the sensor itself in an energy-efficient way, and transmit- beacon, and prepares the embedded radio for transmission. Then it goes a
ting only a single number reflecting probability of fault. The lower power mode for a while, just to make transition to a run mode, perform
firmware and software architecture we used for that purpose ratio reception and transmission and then goes back to a deep sleep mode.
6 VOLUME 4, 2019
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
B. HARDWARE PLATFORM
Well designed wireless node should spend about 99% of the FIGURE 8. IEEE 802.15.4-compatible wireless sensor nodes with ultra-low
time in sleep mode [30]. It means that overall energy budget power accelerometer and detachable power supply. On the right hand side
there is a WSN node with a battery attached.
is heavily influenced by energy consumed in sleep mode.
For that reason we selected a microcontroller (MCU) that
supports low-leakage deep sleep modes. The MCU of choice C. FIRMWARE
(MKW41Z512) is a system on a Chip (SoC) with a IEEE The firmware running on the sensor node is governed by a
802.15.4 radio transceiver, and 48 MHz ARM Cortex-M0+ real time operating system (RTOS) for resource-constrained
processor core. It has several sleep modes, with some of devices in the Internet of Things (IoT) that contains of a low-
them characterised by extremely low power consumption. In power IPv6 communication stack [35]. The main activities of
a very-low-power stop mode (VLPS) current drawn by the the sensor could be presented in a form of infinite loop:
MCU is as small as 3.58µA (buck mode operation for 3V
• Device wake-up (timer-triggered interrupt),
input at 25°C). In a very-low-leakage stop mode (VLLSx)
• Data acquisition from accelerometer,
current drawn by the MCU goes to 0.46µA (buck mode
• LSTM calculations,
operation for 3V input at 25°C) [34].
• Feedforward layer calculations,
The sensor used for gathering data about vibrations –
• Waiting for a beacon for network synchronization,
ADXL363 – is an ultra-low power sensor consisting of a 3-
• Sending AI computation’s results,
axis microelectromechanical systems (MEMS): accelerom-
• Going back into deep sleep mode.
eter and temperature sensor. It consumes less than 2µA at a
100 Hz output data rate. ADXL363 communicates with MCU Wireless sensor nodes are not the best devices for execution
via a serial port interface (SPI) and provides 12-bit output of advanced algorithms due to reasons given in the previous
resolution for accelerometer and temperature. Its sampling sections. What we would like to show, is that they are good
frequency could be increased up to 400Hz. enough to do the processing of some artificial intelligence al-
The nodes have two variants of power supply. The first gorithms like compressed recurrent neural network (CRNN),
variant has a power subsystem which contains ultra-low while consuming amount of energy comparable with energy
power boost regulator with MPPT and charge management spent on maintaining wireless radio connectivity. To prove
which was suitable for photovoltaic (PV) and thermoelectric that, we have to analyze CRNN processing shown as Algo-
(TEG) energy harvesting (ADP5090). The second variant rithm 1.
of nodes has a connector which allows attaching battery Algorithm 1 CRNN data processing algorithm
or cables from a laboratory power supply. For energy con-
1: sprev 0
sumption measurements we used the second variant to have
2: hprev 0
better stability of measurements due to lack of dependency on
3: a getM easurementsF romAccelerometer()
the availability of ambient light or variations in temperature
4: for it < MEASUREMENTS_COUNT do
gradient. The hardware architecture is presented on Fig. 7.
5: x ax [i]
The nodes are presented on Fig. 8.
6: g tanh(Wgx ⇥ x + Wgh ⇥ hprev + bg )
7: i (Wix ⇥ x + Wih ⇥ hprev + bi )
Energy 32 kHz crystal
8: f (Wf x ⇥ x + Wf h ⇥ hprev + bf )
harvester oscillator 9: o (Wox ⇥ x + Woh ⇥ hprev + bo )
10: s g i + sprev f
Power
Microcontroller Accelerometer
11: h tanh(s) o
management
12: sprev s
13: hprev h
Temperature 14: it it + 1
Battery Radio sensor
15: end for
16: y sof tmax(Wyh ⇥ h + by )
FIGURE 7. Hardware architecture of ultra low power wireless sensor nodes.
17: enqueueResultF orT ransmission(y)
VOLUME 4, 2019 7
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
The accelerometer is configured to collect the data for D. ENERGY BUDGET AND DUTY CYCLE
one second periods with sampling frequency of 100Hz. At The rhythm of sleep and activity of a WSN node is strictly
first MCU reads 100 triplets from accelerometer’s buffer related to its network activity. IEEE 802.15.4 defines two
containing of the values of acceleration along X, Y and Z axis kind of devices with different connectivity capabilities. Low-
(line 3). Then MCU iterates over the readouts (line 4), taking complexity and low-cost Reduced Function Device (RFD),
the value of acceleration along one of its axis (X in this case) that connect to a single more advanced Full Function Device
and performing calculations which correspond to equations (FFD), which in contrary can establish multiple connections
(3)-(6) described earlier in Section III-C. Finally, the result to RFDs and perform network routing. The wireless net-
is processed using softmax function and enqueued for wire- work working in a beacon-enabled mode has two parameters
less transmission. Time consuming calculations of hiperbolic 0 SO BO 14 that define duration and frequency of
tangent has been avoided by using of lookup table with pre- RFD and FFD network activity. The time between beacons –
computed values. The rest of the computations are mostly beacon interval BI is defined as:
adding and multiplications of elements of matrices. And their
computational complexity depends on their dimensions. BI = Ts ⇥ 2BO (14)
To illustrate the power consumption during CRNN pro- for a single radio symbol lasting for ts = 16µs in the 2.4GHz
cessing we performed several experiments for different net- frequency band, and duration of base super frame equals
work sizes (we altered the number of LSTMs while keeping to 960 symbols, we obtain Ts = 960 ⇥ ts = 15.36ms. BI
number of samples and at the same level). The measurements determines the periods for which RFD could remain in a
of power consumption for one hundred samples processed sleep mode. The length of super frame duration – SD is
by 64, 32 and 16 LSTMs are shown if Fig. 9. For better defined as:
presentation of the results we wrote an auxiliary program SD = Ts ⇥ 2SO (15)
which altered the state of one of general purpose input
outputs (GPIOs) at the beginning and end of each block of and determines the time the FFD devices have to be active
computations, so we can clearly determine duration of each. to handle network activity of RFD devices connected to it.
Table 1 contains the summary of the energy consumption From our perspective, more important parameter is beacon
interval, because we want our device to be a simple RFD and
TABLE 1. Time, current and energy consumption for neural network to conserve energy as much as possible, delegating routing
processing on WSN node with different number of LSTM.
activities to other elements of the WSN.
Duty cycle, i.e. the fraction of one beacon period in which
Number of LSTMs T [ms] Iavg [mA] E [mJ]
64 500 6 8.19 the node is active is determined by BI and time spent on
32 150 6 2.46 data processing. Assuming for simplicity, that every time
16 44 8 0.52 the node performs network activity it also launches neural
network computations, for BO = 7 energy for that two
for different sizes of CRNN. We see that for 16 LSTMs, activities is spent every 27 ⇥ 15.36ms = 1.98s for BO = 8
the energy consumption spent on computations is at the – for every 3.93s, and so on. As BO increases, dominating
same level as energy spent on maintaining regular network component in energy consumption equation is not neural
connectivity of within WSN. network processing and wireless transmission but energy
spend during sleep time. The summary of energy expenses
of a RFD is presented in Table 2. The given values apply
to a normal network activity. Energy expenses for device
registration within the network during installation is not taken
in into account in estimations of average power consumption
because it is performed only once.
8 VOLUME 4, 2019
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
VOLUME 4, 2019 9
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
accuracy of detection at 96.4%. Then we decreased by half B. TESTS WITH RUNNING INDUCTION MOTOR
the number of LSTMs and lookback, and reduced the number We installed the sensors in machinery similar to one de-
of channels to one to match the power capabilities of the scribed in NASA dataset [22]. Sensors wirelessly transmitted
designed WSN node. We compared the performance of this raw data read from embedded accelerometers and temper-
neural network with the results given by other classification ature sensor for further analysis. Fig. 11 shows test site.
methods (presented in Table 3). Sensors were powered by a laboratory power supply, to
With 91.97% of accuracy we found this result acceptable, ensure stable power supply. The test shown that the power
but to satisfy our curiosity we also tested several compression consumption is at designed levels, nodes work stable and
methods of this recurrent neural network. We found the hiper- wireless communication is reliable. However, we were not
bolic tangent 8-bits quantization (thq) the most interesting. able to confirm accuracy of the neural network because
The comparisons of confusion matrices of uncompressed and our machine has bearings without oil circulation system to
thq-quantized NN is presented in Table 4. force lubricate them which also regulates the flow and the
temperatures of the lubricant. Having such a system would
TABLE 4. Confusion matrix for uncompressed RNN with 16 LSTMs, lookback allow us using of a magnetic plug installed in the oil feedback
of 32 and one-channel data input, and CRNN with the same parameters
compressed using thq 8-bits quantization. pipe to collect debris from the oil as evidence of bearing
degradation. Once the accumulated debris adhered to the
Ground truth Classified as OK Classified as anomaly magnetic plug exceeds a certain level and causes an electrical
Original Thq 8-bits Original Thq 8-bits switch to close, we would conclude the machinery broken.
OK 74463 74586 3584 4324
Anomaly 4447 3640 17639 17583
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
maintenance and eliminating installation of cables. What we [11] A. Choudhary, D. Goyal, S. L. Shimi, and A. Akula, “Condition
also proved that, the sensor with recurrent neural network monitoring and fault diagnosis of induction motors: A review,” Archives
of Computational Methods in Engineering, DOI 10.1007/s11831-
could process the raw data, taken from the sensor as it is, 018-9286-z, Sep. 2018. [Online]. Available: https://doi.org/10.1007/
without any filtering, preprocessing nor feature extraction. s11831-018-9286-z
In the future we would like to investigate possibilities of [12] J. Liu and Y. Shao, “Overview of dynamic modelling and analysis
of rolling element bearings with localized and distributed faults,”
learning healthy condition in early part of machine’s life Nonlinear Dynamics, vol. 93, DOI 10.1007/s11071-018-4314-y, no. 4,
similar to the way described in [10]. We would like also to pp. 1765–1798, Sep. 2018. [Online]. Available: https://doi.org/10.1007/
prepare a fully-functional energy-harvesting wireless sensor s11071-018-4314-y
[13] C. Sunnersjö, “Rolling bearing vibrations - the effects of geometrical
node capable of autonomously analize acquired data from imperfections and wear,” Journal of Sound and Vibration, vol. 98, DOI
motion sensor. Moreover, we want to learn how accuracy of https://doi.org/10.1016/0022-460X(85)90256-1, no. 4, pp. 455 – 474,
fault prediction changes when additional data is provided to 1985. [Online]. Available: http://www.sciencedirect.com/science/article/
pii/0022460X85902561
the neural network like temperature or current consumption [14] N. Tandon and A. Choudhury, “A review of vibration and acoustic mea-
of an induction motor. surement methods for the detection of defects in rolling element bearings,”
Tribology International, vol. 32, DOI https://doi.org/10.1016/S0301-
679X(99)00077-8, no. 8, pp. 469 – 480, 1999. [Online]. Available:
REFERENCES http://www.sciencedirect.com/science/article/pii/S0301679X99000778
[1] I. Sudhakar, S. AdiNarayana, and M. AnilPrakash, “Condition monitoring [15] I. El-Thalji and E. Jantunen, “A summary of fault modelling and predictive
of a 3-Ø induction motor by vibration spectrum anaylsis using fft health monitoring of rolling element bearings,” Mechanical Systems and
analyser- a case study,” Materials Today: Proceedings, vol. 4, DOI Signal Processing, vol. 60-61, DOI 10.1016/j.ymssp.2015.02.008, pp.
https://doi.org/10.1016/j.matpr.2017.01.125, no. 2, Part A, pp. 1099 252–272, 2015.
– 1105, 2017, 5th International Conference of Materials Processing [16] Z. C. Lipton, “The mythos of model interpretability,” CoRR, vol.
and Characterization (ICMPC 2016). [Online]. Available: http://www. abs/1606.03490, 2016. [Online]. Available: http://arxiv.org/abs/1606.
sciencedirect.com/science/article/pii/S2214785317301256 03490
[2] D. Goyal, Vanraj, B. S. Pabla, and S. S. Dhami, “Condition monitoring [17] L. A. Zadeh, “Fuzzy logic,” IEEE Computer, vol. 21, no. 4, pp. 83–93,
parameters for fault diagnosis of fixed axis gearbox: A review,” 1988.
Archives of Computational Methods in Engineering, vol. 24, DOI [18] S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” Neural
10.1007/s11831-016-9176-1, no. 3, pp. 543–556, Jul. 2017. [Online]. Comput., vol. 9, DOI 10.1162/neco.1997.9.8.1735, no. 8, pp. 1735–1780,
Available: https://doi.org/10.1007/s11831-016-9176-1 Nov. 1997.
[3] W. Zhang, M.-P. Jia, L. Zhu, and X.-A. Yan, “Comprehensive overview on [19] A. Graves, Neural Networks. Springer Berlin Heidelberg, 2012.
computational intelligence techniques for machinery condition monitoring [20] Z. C. Lipton, J. Berkowitz, and C. Elkan, “A Critical Review of Recurrent
and fault diagnosis,” Chinese Journal of Mechanical Engineering, vol. 30, Neural Networks for Sequence Learning,” 2015.
DOI 10.1007/s10033-017-0150-0, no. 4, pp. 782–795, Jul. 2017. [Online]. [21] K. Greff, R. K. Srivastava, J. Koutník, B. R. Steunebrink, and J. Schmid-
Available: https://doi.org/10.1007/s10033-017-0150-0 huber, “LSTM: A Search Space Odyssey,” 2015.
[4] G. S. Palmero, J. J. Santamaria, E. M. de la Torre, and J. P. González, [22] A. Djebala, N. Ouelaa, and N. Hamzaoui, “Detection of rolling bearing
“Fault detection and fuzzy rule extraction in ac motors by a neuro-fuzzy defects using discrete wavelet analysis,” Meccanica, vol. 43, DOI
art-based system,” Engineering Applications of Artificial Intelligence, 10.1007/s11012-007-9098-y, no. 3, pp. 339–348, Jun. 2008. [Online].
vol. 18, DOI https://doi.org/10.1016/j.engappai.2005.02.005, no. 7, pp. Available: https://doi.org/10.1007/s11012-007-9098-y
867 – 874, 2005. [Online]. Available: http://www.sciencedirect.com/ [23] M. Wielgosz and M. Karwatowski, “Mapping neural networks to
science/article/pii/S0952197605000321 fpga-based iot devices for ultra-low latency processing,” Sensors,
[5] N. Gindy and A. Al-Habaibeh, Condition Monitoring of Cutting vol. 19, DOI 10.3390/s19132981, no. 13, 2019. [Online]. Available:
Tools Using Artificial Neural Networks, pp. 299–304. London: https://www.mdpi.com/1424-8220/19/13/2981
Macmillan Education UK, 1997. [Online]. Available: https://doi.org/10. [24] Y. LeCun, J. S. Denker, and S. A. Solla, “Optimal brain damage,” in
1007/978-1-349-14620-8_47 Advances in Neural Information Processing Systems 2, D. S. Touretzky,
[6] Y.-R. Hwang, K.-K. Jen, and Y.-T. Shen, “Application of cepstrum Ed., pp. 598–605. Morgan-Kaufmann, 1990.
and neural network to bearing fault detection,” Journal of Mechanical [25] B. Hassibi and D. G. Stork, “Second order derivatives for network pruning:
Science and Technology, vol. 23, DOI 10.1007/s12206-009-0802-9, Optimal brain surgeon,” in Advances in neural information processing
no. 10, p. 2730, Oct. 2009. [Online]. Available: https://doi.org/10.1007/ systems, pp. 164–171, 1993.
s12206-009-0802-9 [26] S. Han, J. Pool, J. Tran, and W. J. Dally, “Learning both weights and
[7] V. Sugumaran and K. I. Ramachandran, “Effect of number of features on connections for efficient neural networks,” in Proceedings of the 28th
classification of roller bearing faults using svm and psvm,” Expert Syst. International Conference on Neural Information Processing Systems -
Appl., vol. 38, DOI 10.1016/j.eswa.2010.09.072, no. 4, pp. 4088–4096, Volume 1, ser. NIPS’15, pp. 1135–1143. Cambridge, MA, USA: MIT
Apr. 2011. [Online]. Available: http://dx.doi.org/10.1016/j.eswa.2010.09. Press, 2015.
072 [27] W. Wen, C. Wu, Y. Wang, Y. Chen, and H. Li, “Learning structured
[8] R. Milne, “Artificial intelligence for online diagnosis,” IEE Proceedings sparsity in deep neural networks,” in Proceedings of the 30th International
D (Control Theory and Applications), vol. 134, pp. 238–244(6), Jul. 1987. Conference on Neural Information Processing Systems, ser. NIPS’16, pp.
[Online]. Available: https://digital-library.theiet.org/content/journals/10. 2082–2090. USA: Curran Associates Inc., 2016.
1049/ip-d.1987.0040 [28] J.-H. Luo, J. Wu, and W. Lin, “Thinet: A filter level pruning method for
[9] B. K. N. Rao, P. S. Pai, and T. N. Nagabhushana, “Failure deep neural network compression,” in The IEEE International Conference
diagnosis and prognosis of rolling - element bearings using artificial on Computer Vision (ICCV), Oct. 2017.
neural networks: A critical overview,” Journal of Physics: Conference [29] V. Raghunathan, C. Schurgers, S. Park, and M. B. Srivastava, “Energy-
Series, vol. 364, DOI 10.1088/1742-6596/364/1/012023, p. 012023, aware wireless microsensor networks,” IEEE Signal processing magazine,
May. 2012. [Online]. Available: https://doi.org/10.1088%2F1742-6596% vol. 19, no. 2, pp. 40–50, 2002.
2F364%2F1%2F012023 [30] M. Kuorilehto, M. Kohvakka, J. Suhonen, P. Hamalainen, M. Hannikainen,
[10] S. K. Bose, B. Kar, M. Roy, P. K. Gopalakrishnan, and A. Basu, and T. D. Hamalainen, Ultra-Low Energy Wireless Sensor Networks in
“Adepos: Anomaly detection based power saving for predictive Practice: Theory, Realization and Deployment. Wiley Publishing, 2008.
maintenance using edge computing,” in Proceedings of the 24th Asia and [31] G. Lu, B. Krishnamachari, and C. S. Raghavendra, “An adaptive energy-
South Pacific Design Automation Conference, ser. ASPDAC 19, DOI efficient and low-latency mac for data gathering in wireless sensor net-
10.1145/3287624.3287716, pp. 597–602. New York, NY, USA: ACM, works,” in 18th International Parallel and Distributed Processing Sympo-
2019. [Online]. Available: http://doi.acm.org/10.1145/3287624.3287716 sium, 2004. Proceedings., p. 224. IEEE, 2004.
VOLUME 4, 2019 11
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2019.2953019, IEEE Access
Markiewicz et al.: Predictive maintenance of induction motors using ultra-low power wireless sensors...
12 VOLUME 4, 2019
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.