The data come from ToyADMOS and the MIMII Dataset consisting of the
normal/anomalous operating sounds of six types of toy/real machines.
To visualize the sequence, the waveform plot for these signals is shown in the
next slide.
Data exploration: wave form
Data exploration: wave form
However, in deep learning models the common practice is to convert the
audio into a spectrogram, which is:
a concise snapshot of an audio wave;
an image → well suited to being input to CNN-based architectures
developed for handling images.
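A minimal sketch of how a waveform becomes a spectrogram "image", using only NumPy. The 16 kHz sample rate, the frame length, the hop size and the synthetic 440 Hz tone are all assumptions chosen for illustration; they are not the settings used in this work.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram in dB from a short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Cut the signal into overlapping, windowed frames.
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Transpose so rows are frequency bins and columns are time frames.
    return 20.0 * np.log10(magnitude + 1e-10).T

# Synthetic stand-in for a machine sound: 1 second of a 440 Hz tone.
sr = 16000  # assumed sample rate
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 440.0 * t)

spec = spectrogram(tone)
print(spec.shape)  # (frequency bins, time frames)
```

The resulting 2-D array can be treated like a single-channel image, which is what makes it a natural input for a CNN.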
The spectrograms of the two signals (normal and anomalous) are shown in the
next two slides.
Data exploration: spectrogram
Data exploration: spectrogram
Data exploration: chromagram
Another way to see the differences between the two kinds of signals
is the chromagram.
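A chromagram folds the energy of each frequency bin into the 12 pitch classes (C, C#, ..., B). The sketch below is a simplified NumPy version of that idea, not the routine used to produce the slide; the function name, frame sizes, the 27.5 Hz lower cutoff and the test tone are assumptions.

```python
import numpy as np

def chromagram(signal, sr, frame_len=4096, hop=2048):
    """Fold short-time power spectra into 12 pitch classes (0 = C, 9 = A)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)

    valid = freqs >= 27.5  # skip DC and very low bins (assumed cutoff)
    # MIDI-style pitch number of each bin, reduced to a pitch class.
    midi = 69 + 12 * np.log2(freqs[valid] / 440.0)
    pitch_class = np.round(midi).astype(int) % 12

    chroma = np.zeros((12, n_frames))
    for pc in range(12):
        chroma[pc] = power[:, valid][:, pitch_class == pc].sum(axis=1)
    # Scale each frame so its strongest pitch class is close to 1.
    return chroma / (chroma.max(axis=0, keepdims=True) + 1e-12)

sr = 16000  # assumed sample rate
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 440.0 * t)  # 440 Hz is the note A
chroma = chromagram(tone, sr)
print(chroma.shape, int(np.argmax(chroma.mean(axis=1))))
```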
CNN Autoencoder
2 flavours
Model selection: Autoencoder
In this challenge the difficulty lies in learning from unlabeled data.
Same step as in typical neural networks;
Faster convergence.
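One common way to use an autoencoder on unlabeled data is to fit it on normal samples only and then score new samples by reconstruction error. The sketch below illustrates that idea with a linear autoencoder solved in closed form through an SVD, as a stand-in for a trained network. It is not the model from these slides; the synthetic features, the 2-dimensional code and the 99th-percentile threshold are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical features: "normal" samples lie close to a 2-D subspace of R^8.
basis = rng.normal(size=(2, 8))
normal_train = rng.normal(size=(500, 2)) @ basis + 0.01 * rng.normal(size=(500, 8))
normal_test = rng.normal(size=(50, 2)) @ basis + 0.01 * rng.normal(size=(50, 8))
anomalous_test = rng.normal(size=(50, 8))  # does not follow the normal structure

# "Training": the best linear encoder/decoder uses the top principal directions.
mean = normal_train.mean(axis=0)
_, _, vt = np.linalg.svd(normal_train - mean, full_matrices=False)
components = vt[:2]  # shape (2, 8)

def reconstruction_error(x):
    code = (x - mean) @ components.T  # encode to 2 dimensions
    recon = code @ components + mean  # decode back to 8 dimensions
    return np.mean((x - recon) ** 2, axis=1)

# Assumed decision rule: flag samples whose error exceeds the
# 99th percentile of the error seen on normal training data.
threshold = np.percentile(reconstruction_error(normal_train), 99)
flagged = reconstruction_error(anomalous_test) > threshold
print(flagged.mean())
```

No anomaly labels are needed to fit the model; they are only needed afterwards to measure how well the score separates the two classes.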
Model selection: Autoencoder
Batch size
When we optimize, we could use the whole dataset for every update.
This can be very costly. Instead, we can use only a portion of
the data for each update. This portion is what we call a mini-batch.
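A minimal sketch of how a dataset can be split into mini-batches for one pass over the data. The helper name `iterate_minibatches`, the toy data and the batch size of 4 are assumptions for illustration.

```python
import numpy as np

def iterate_minibatches(data, batch_size, rng):
    """Yield shuffled mini-batches that together cover the data once."""
    order = rng.permutation(len(data))
    for start in range(0, len(data), batch_size):
        yield data[order[start:start + batch_size]]

rng = np.random.default_rng(0)
data = np.arange(10, dtype=float).reshape(10, 1)  # 10 toy samples

batches = list(iterate_minibatches(data, batch_size=4, rng=rng))
print([len(b) for b in batches])
```

Each optimization step would then compute the loss and gradient on one batch instead of on all samples; the last batch is smaller when the dataset size is not a multiple of the batch size.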
Accuracy:
Autoencoder 76%
CNN Autoencoder 88%
CNN Autoencoder Batch Normalization 92%
THE TEAM