In this study, we divide the system into three separate modules, each of which is implemented by a set of procedures. These components perform a chain of actions on the input to generate the end product (anomaly-based detection).

A. IMPLEMENTATION OF THE DATA PREPARATION (DP) UNIT
The N-BaIoT 2021 dataset [21], [22], [23] contains raw IoT traffic data that must be preprocessed before it can be fed into the components of the LP module. This is the responsibility of the data preparation (DP) module, which is implemented by the following sequence of steps.

1. Data Hosting Process (DHP)
The data hosting process (DHP) stores the data on a dependable, always-available server. In this study, we host the data, train the model, and assess its performance all within the MATLAB environment. This phase is in charge of receiving the data records in comma-separated value (CSV) format and loading them into MATLAB tables for subsequent preprocessing steps. With this hosting, each record of IoT traffic is delivered in a raw table format, with the data characteristics shown in columns.

2. Data Cleansing Process (DCP)
The data cleansing process (DCP) entails inspecting the data to determine its actual state and correcting any problems found. DCP focuses on removing defects and inconsistencies to boost data quality [24], [25], [26]. In this research, we used DCP to search for null-value cells and replace them with numerical zero, search for corrupted-value cells and replace them with numerical zero, fix the attribute values (typically, attributes are simple and filled wi…), and fix the attribute names. Output classes are encoded with values from 0 to 1 in the binary classifier and from 0 to 9 in the multi-class classifier. In Table 1 below, we detail the steps for inputting labels and fixing any mistakes or incorrect data entries.
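The hosting and cleansing steps described above can be sketched in a few lines. The sketch below is written in Python with the standard library only, not in the MATLAB environment used in this study, and it is not the authors' implementation. The sample CSV, the column names, the class names, and the helper functions `host_records`, `clean_name`, `clean_value`, and `cleanse` are all hypothetical and chosen for illustration. It assumes that a "corrupted" cell is one that is non-numeric, NaN, or infinite.

```python
import csv
import io
import math

# Hypothetical sample standing in for one CSV file of IoT traffic records;
# the column names and values are invented for illustration only.
RAW_CSV = """ MI_dir weight ,H_L5 mean,label
1.0,60.5,benign
,74.0,attack
abc,NaN,attack
"""

def host_records(csv_text):
    """Hosting step: read CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(csv_text)))

def clean_name(name):
    """Fix an attribute name: trim it and join inner words with '_'."""
    return "_".join(name.split())

def clean_value(cell):
    """Return the cell as a float, or 0.0 if it is null or corrupted."""
    if cell is None or cell.strip() == "":
        return 0.0  # null-value cell
    try:
        value = float(cell)
    except ValueError:
        return 0.0  # corrupted (non-numeric) cell
    # Assumption: NaN and infinite values also count as corrupted.
    return value if math.isfinite(value) else 0.0

def cleanse(rows, label_column, label_codes):
    """Cleansing step: fix names, zero-fill bad cells, encode the labels."""
    cleaned = []
    for row in rows:
        out = {}
        for name, cell in row.items():
            key = clean_name(name)
            if key == label_column:
                out[key] = label_codes[cell.strip()]
            else:
                out[key] = clean_value(cell)
        cleaned.append(out)
    return cleaned

# Assumed class names for the binary case (codes 0 and 1).
BINARY_CODES = {"benign": 0, "attack": 1}

table = cleanse(host_records(RAW_CSV), "label", BINARY_CODES)
```

For the multi-class case, the same `cleanse` call would take a mapping with ten entries coded 0 to 9 in place of `BINARY_CODES`.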
with one another, and that does not hold true in many practical
contexts.