Introduction
CNC Machine introduction
Literature Survey
Problem Statement
Objectives
Methodology
RNN & LSTM
Implementation
Database creation, splitting and training of LSTM
System testing
Automation testing
SNAPSHOTS
Demo
INNOVATION
REFERENCES
ABSTRACT
• Real-time condition monitoring of a CNC machine has been a challenging
problem for many researchers and analysts, most of whom invest a lot of money
in hardware resources to determine whether the machine is worn or fresh.
• To avoid this heavy expenditure and future problems, this paper proposes a
new real-time condition monitoring approach for a CNC machine using deep
learning techniques, namely LSTM Recurrent Neural Networks (RNN).
• Long Short-Term Memory [LSTM] can learn from sequence data without manual
extraction of complex features, and it is well suited to handling complex,
high-dimensional, massive data.
• Experimental results show that this approach achieved an accuracy of
99.7%, which is higher than that of other neural network algorithms.
Brief working of the model
[Figure: Accelerometer connected to Arduino Uno; Wi-Fi; real-time data; classification (fresh / worn)]
2. Abdelkabir Bacha, Jamal Benhra, Ahmed Haroun Sabry
   Title: A CNC Machine Fault Diagnosis Methodology based on Bayesian Networks and Data Acquisition
   Methods: Data Acquisition and Bayesian Networks
   Limitations: 1. Initial training required for all programmers and users. 2. Damage to the database affects virtually all application programs.
3. A D Patange, Jegadeeshwaran R and N C Dhobale
   Title: Milling cutter condition monitoring using machine learning approach
   Methods: Signal acquisition, Decision Tree, Random Forest, Logistic Model Tree Classifier
   Limitations: 1. Database systems are complex, difficult, and time-consuming to design. Substantial hardware and software start-up costs.
4. Mohan Krishna K and Prashanth Kannadaguli
   Title: IoT Based CNC Machine Condition Monitoring System Using Machine Learning Techniques
   Methods: Artificial Neural Networks, LM algorithm
   Limitations: 1. Heavy computational time.
Figure 2: Backpropagation and time complexity
Figure 3: Dropout and Connected complexity
4.2 Proposed System
2) Software: MATLAB R2020a and above
2) GPU: Nvidia GTX 1050 Ti and above
• Let X = (x_1, ..., x_T) be the input sequence. The RNN calculates the hidden vector sequence
H = (h_1, ..., h_T) as it processes the input X through time, and gives the output
vector sequence O = (o_1, ..., o_T) for t = 1 to T as follows:
$$h_t = \sigma(W_{xh} x_t + W_{hh} h_{t-1} + b_h) \tag{1}$$
$$o_t = W_{ho} h_t + b_o \tag{2}$$
Figure 7.1.2: RNN working methodology
• Here \sigma is a non-linear function, parameterized by the weight matrices W, and b is
the bias term.
• The target output is obtained by minimizing the loss L internally, and the final output y_t is
calculated from the output o_t with a softmax layer:
$$y_t = \mathrm{softmax}(o_t) \tag{3}$$
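The recurrence described above can be sketched in a few lines of base MATLAB. This is only an illustrative sketch: the sizes, the random weights, the choice of tanh as the non-linear function, and all variable names (Wxh, Whh, Who, bh, bo) are assumptions, not values from the project.

```matlab
% Forward pass of a simple RNN over one sequence (illustrative values only).
rng(0);                          % reproducible random weights
numFeatures = 3;                 % e.g. three accelerometer axes (assumed)
numHidden   = 4;                 % hidden state size (assumed)
numClasses  = 2;                 % fresh / worn
T           = 5;                 % sequence length (assumed)

X   = randn(numFeatures, T);     % input sequence, one column per time step
Wxh = 0.1*randn(numHidden, numFeatures);
Whh = 0.1*randn(numHidden, numHidden);
Who = 0.1*randn(numClasses, numHidden);
bh  = zeros(numHidden, 1);
bo  = zeros(numClasses, 1);

h = zeros(numHidden, 1);         % initial hidden state
O = zeros(numClasses, T);        % output vector sequence
for t = 1:T
    h      = tanh(Wxh*X(:,t) + Whh*h + bh);   % hidden-state update
    O(:,t) = Who*h + bo;                      % output at time t
end

% Softmax over the last output gives the class probabilities
z = O(:,T) - max(O(:,T));        % subtract the max for numerical stability
p = exp(z) / sum(exp(z));
```

The vector p holds one probability per class and sums to 1; the larger entry would be taken as the predicted condition.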
7.2 LSTM
• A Long Short-Term Memory [LSTM] network is a type of RNN that can learn
long-term dependencies between time steps of sequence data. LSTM is
an RNN architecture proposed by Hochreiter and Schmidhuber in
1997 [11].
• The vanishing gradient and exploding gradient problems are
addressed by introducing a new gate structure and a memory cell. The
vanishing gradient problem occurs when the gradient is small: multiplied
repeatedly over many time steps, it shrinks exponentially until it has no
effect on the output [2]. Conversely, when the gradient is large,
repeated multiplication makes it grow exponentially, which leads to a
gradient explosion.
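A small numeric illustration of this effect, assuming for simplicity that the gradient is scaled by the same factor at every time step (the factors 0.9 and 1.1 and the 100 steps are arbitrary assumptions):

```matlab
% Repeated multiplication of a gradient over many time steps.
steps       = 100;   % number of time steps (assumed)
smallFactor = 0.9;   % per-step factor below 1 (assumed)
largeFactor = 1.1;   % per-step factor above 1 (assumed)

vanishing = smallFactor^steps;   % shrinks towards zero
exploding = largeFactor^steps;   % grows without bound

fprintf('0.9^100 = %.3g, 1.1^100 = %.3g\n', vanishing, exploding);
```

Even factors this close to 1 give a gradient that is either negligible or very large after 100 steps, which is why a plain RNN struggles with long sequences.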
Components Purpose of the component Formula
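For reference, a commonly used formulation of the LSTM cell components is given below. It is the standard textbook form and is an assumption here, not necessarily the notation of the original table. The symbols W, U and b denote input weights, recurrent weights and biases, \sigma is the logistic sigmoid, and \odot is element-wise multiplication. Note that o_t in this block denotes the output gate.

```latex
\begin{align*}
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) && \text{forget gate: what to discard from the cell state} \\
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) && \text{input gate: what new information to store} \\
g_t &= \tanh(W_g x_t + U_g h_{t-1} + b_g)  && \text{cell candidate: proposed new content} \\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o) && \text{output gate: what to expose as output} \\
c_t &= f_t \odot c_{t-1} + i_t \odot g_t   && \text{cell state update} \\
h_t &= o_t \odot \tanh(c_t)                && \text{hidden state}
\end{align*}
```

The additive form of the cell state update is what lets gradients flow over many time steps without shrinking as quickly as in a plain RNN.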
Fresh tool 50 50
Worn tool 50 50
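% Build the validation set from the first 25 testing sequences of machine{1,1}
for i = 1:25
    cncValidateX{i,1} = machine{1,1}.testing{i}';   % transpose each sequence
    cncValidateY(i,1) = categorical(1);             % class label 1
end
% cncTestX and cncTestY must already exist in the workspace
save cncTestData cncTestX cncTestY
Figure 8.3: Data cells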
8.4 Training of BiLSTM
• After splitting the dataset into sequences of vectors, training is performed with a
BiLSTM deep neural network architecture. The network consists of five
layers: a SequenceInputLayer with inputsize=3, a BiLSTM layer with
hiddenlayersize=20, a fully connected layer with 2 classes, a softmax layer, and a
classification layer, as shown in Fig. 8.4.1.
• Since the condition monitoring system has a huge dataset of sequence vector cells,
the network has many parameters to train. For more computing power, we attached
a CUDA-enabled GPU to the CPU.
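A minimal sketch of how such a network could be defined with the MATLAB Deep Learning Toolbox is given below. The input size, hidden size, class count, Adam optimizer and learning rate come from the slides; the epoch and mini-batch values are taken from one row of the results table; the 'OutputMode' setting, the remaining options and the names cncTrainX and cncTrainY are assumptions.

```matlab
% Layer stack for sequence classification (requires Deep Learning Toolbox).
inputSize      = 3;    % features per time step (from the text)
numHiddenUnits = 20;   % BiLSTM hidden size (from the text)
numClasses     = 2;    % fresh / worn

layers = [
    sequenceInputLayer(inputSize)
    bilstmLayer(numHiddenUnits, 'OutputMode', 'last')   % one label per sequence (assumed)
    fullyConnectedLayer(numClasses)
    softmaxLayer
    classificationLayer];

% Training options; 'auto' uses a supported GPU when one is available.
options = trainingOptions('adam', ...
    'InitialLearnRate', 0.001, ...
    'MaxEpochs', 100, ...
    'MiniBatchSize', 10, ...
    'ExecutionEnvironment', 'auto', ...
    'Shuffle', 'every-epoch', ...
    'Verbose', false);

% Training call, left commented out because cncTrainX and cncTrainY
% are hypothetical names for the training cells and labels:
% net = trainNetwork(cncTrainX, cncTrainY, layers, options);
```

Setting 'OutputMode' to 'last' makes the BiLSTM layer return only its final time step, so the network produces one fresh or worn label per input sequence.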
Fig. 8.5: Code for training the BiLSTM with the layers mentioned above and
training options such as the Adam optimizer
8.6 Training Progress and Loss during Training of the Model
Figure 8.6: Training progress (left image) and loss during training (right image)
• The training progress of the model is shown in the figure above: the validation accuracy
increases from 80% to 90% by iteration 500 with a learning rate of 0.001. After iteration 750,
the accuracy increases from 90% to 100%.
• Figure 8.6 also shows that there is no training loss at iteration 1000, which illustrates that
the loss gradually decreases as the learning rate increases.
9. SYSTEM TESTING
9.1 Automation testing
which can be defined as the ratio of the total number of misclassified samples to the total number of testing samples. This misclassification ratio is called the Machine Error Rate [MER]. This is as shown in Table 12.1.

Network   | Training time          | Input size | Classes | Hidden layer size | Mini-batch size | Epochs | Accuracy | MER
Net1.net  | 56 minutes, 32 seconds | 3          | 2       | 10                | 10              | 100    | 54%      | 46%
Net2.net  | 30 minutes, 15 seconds | 3          | 2       | 20                | 20              | 100    | 64%      | 36%
Net3.net  | 56 minutes, 40 seconds | 3          | 2       | 20                | 10              | 100    | 99.7%    | 0.3%
Net4.net  | 57 minutes, 59 seconds | 3          | 2       | 15                | 10              | 100    | 60%      | 40%
Net6.net  | 29 minutes, 37 seconds | 3          | 2       | 20                | 10              | 150    | 97.7%    | 3.3%
Net7.net  | 70 minutes, 38 seconds | 3          | 2       | 20                | 10              | 200    | 86%      | 14%
Net8.net  | 30 minutes, 11 seconds | 3          | 2       | 25                | 20              | 100    | 99.1%    | 0.9%
Net9.net  | 29 minutes, 05 seconds | 3          | 2       | 30                | 20              | 100    | 92%      | 8%
Net10.net | 39 minutes, 07 seconds | 3          | 2       | 30                | 20              | 200    | 86%      | 14%
Net11.net | 45 minutes, 13 seconds | 3          | 2       | 20                | 30              | 200    | 99%      | 1%

• The above table 10.1 shows that the experiment has been ... class, with a fewer number of iterations [Epoch size x Mini batch size] then learning time also increases.
• Along with this, if the learning rate increases, then the loss during training will decrease. If the hidden layer size is too small, then the accuracy will decrease.
• Hence the BiLSTM model yielded a good result with a hidden layer size of 20 and 1500 iterations.
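The Machine Error Rate defined above can be computed as in the following sketch. The label vectors are made-up examples, and the encoding 1 = fresh, 2 = worn is an assumption for illustration only.

```matlab
% Hypothetical true and predicted labels for 10 test sequences
% (1 = fresh, 2 = worn; the encoding is an assumption).
yTrue = [1 1 1 1 1 2 2 2 2 2];
yPred = [1 1 1 2 1 2 2 2 2 1];

numMisclassified = sum(yPred ~= yTrue);        % count of wrong predictions
mer      = numMisclassified / numel(yTrue);    % Machine Error Rate
accuracy = 1 - mer;                            % accuracy is the complement

fprintf('MER = %.1f%%, accuracy = %.1f%%\n', 100*mer, 100*accuracy);
```

With these example labels, two of the ten samples are misclassified, so the script prints MER = 20.0%, accuracy = 80.0%.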
Scalability: can be extended to further monitoring systems
Cloud-based, open-source, real-time system
14.1 Conclusion
• The purpose of this work was to show that the Long Short-Term
Memory [LSTM] deep learning method, combined with the
characteristics of RNNs, is useful and cost-effective in the field
of CNC machine condition monitoring.
• Here the LSTM was modeled with a hidden layer size of 20 and a mini-batch
size of 10, and the trained model achieved an accuracy of 99.7%.
• Due to hardware resource limitations, we were not able to go beyond
a hidden layer size of 30, a mini-batch size of 30, and an epoch size of 200.
14.2 Future Extensions
• Hence this work can be extended by increasing these parameters
on a machine with more capable hardware, which would also lead to faster
computation.
• To make the work more flexible and portable, an iOS or Android
app can be developed. Email and SMS notifications can be sent to the
user based on the condition of the machine.
Any Questions??
REFERENCES
[1] Z. Tian, "An artificial neural network method for remaining useful life prediction of equipment subject to condition monitoring," 2009.
[2] A. Bacha, J. Benhra and A. Sabry, "A CNC Machine Fault Diagnosis Methodology based on Bayesian Networks and Data Acquisition," Communications on Applied Electronics, vol. 5, pp. 41-48, 2016, doi: 10.5120/cae2016652353.
[3] P. Girhe, S. Yenkar and A. Chirde, "Arduino Based Cost Effective CNC Plotter Machine."
[4] A. D. Patange et al., "Milling cutter condition monitoring using machine learning approach," IOP Conf. Ser.: Mater. Sci. Eng., vol. 624, 012030, 2019.
[5] M. K. K and P. Kannadaguli, "IoT Based CNC Machine Condition Monitoring System Using Machine Learning Techniques," 2020 IEEE 9th International Conference on Communication Systems and Network Technologies (CSNT), Gwalior, India, 2020, pp. 61-65, doi: 10.1109/CSNT48778.2020.9115762.
[6] M. Paluszek and S. Thomas, "Practical MATLAB Deep Learning."
[7] C. Y. Lai, R. Chen and R. E. Caraka, "Prediction Stock Price Based on Different Index Factors Using LSTM," 2019 International Conference on Machine Learning and Cybernetics (ICMLC), 2019, pp. 1-6, doi: 10.1109/ICMLC48188.2019.8949162.
[8] M. Paluszek and S. Thomas, "Practical MATLAB Deep Learning: A Project-Based Approach," 2020, doi: https://doi.org/10.1007/978-1-4842-5124-9.
[9] J. L. Elman, "Finding structure in time," Cognitive Science, vol. 14, no. 2, pp. 179-211, 1990, ISSN 0364-0213, https://doi.org/10.1016/0364-0213(90)90002-E.
[10] Y. Su, "Research on Website Phishing Detection Based on LSTM RNN," 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), 2020, pp. 284-288, doi: 10.1109/ITNEC48623.2020.9084799.
[11] S. Hochreiter and J. Schmidhuber, "Long Short-term Memory," Neural Computation, vol. 9, pp. 1735-80, 1997, doi: 10.1162/neco.1997.9.8.1735.
[12] M. A. Istiake Sunny, M. M. S. Maswood and A. G. Alharbi, "Deep Learning-Based Stock Price Prediction Using LSTM and Bi-Directional LSTM Model," 2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES), 2020, pp. 87-92, doi: 10.1109/NILES50944.2020.9257950.
[13] P. Kannadaguli and V. Bhat, "Comparison of Artificial Neural Network and Gaussian Mixture Model Based Machine Learning Techniques Using DDMFCC Vectors for Emotion Recognition in Kannada," 2019 3rd International Conference on Electronics, Materials Engineering & Nano-Technology (IEMENTech), Kolkata, India, 2019, pp. 1-6.
[14] V. Mishra, T. Kumar, K. Bhalla and M. M. Patil, "SuJAL: Design and Development of IoT-Based Real-Time Lake Monitoring System," 2018 3rd International Conference on Circuits, Control, Communication and Computing (I4C), 2018, pp. 1-4, doi: 10.1109/CIMCA.2018.8739474.
[15] Y. Wang, "A new concept using LSTM Neural Networks for dynamic system identification," 2017 American Control Conference (ACC), 2017, pp. 5324-5329, doi: 10.23919/ACC.2017.7963782.
[16] S. D. Kumar and D. Subha, "Prediction of Depression from EEG Signal Using Long Short Term Memory (LSTM)," 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), 2019, pp. 1248-1253, doi: 10.1109/ICOEI.2019.8862560.
[17] P. Kannadaguli, "FCOS Based Seatbelt Detection System Using Thermal Imaging for Monitoring Traffic Rule Violations," 2020 4th International Conference on Electronics, Materials Engineering & Nano-Technology (IEMENTech), 2020, pp. 1-6, doi: 10.1109/IEMENTech51367.2020.9270058.
[18] S. Chakraborty, J. Banik, S. Addhya and D. Chatterjee, "Study of Dependency on number of LSTM units for Character based Text Generation models," 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA), 2020, pp. 1-5, doi: 10.1109/ICCSEA49143.2020.9132839.
[19] P. Kannadaguli, "FCOS Based Human Detection System Using Thermal Imaging for UAV Based Surveillance Applications," 2020 IEEE Bombay Section Signature Conference (IBSSC), 2020, pp. 79-83, doi: 10.1109/IBSSC51096.2020.9332157.
[20] M. Hajiaghayi and E. Vahedi, "Code Failure Prediction and Pattern Extraction Using LSTM Networks," 2019 IEEE Fifth International Conference on Big Data Computing Service and Applications (BigDataService), 2019, pp. 55-62, doi: 10.1109/BigDataService.2019.00014.
[21] P. Kannadaguli, "Microscopic Blood Smear RBC Classification using PCA and SVM based Machine Learning," 2020 Third International Conference on Multimedia Processing, Communication & Information Technology (MPCIT), 2020, pp. 82-86, doi: 10.1109/MPCIT51588.2020.9350389.