Professional Documents
Culture Documents
1.9-A Binaryzation
1.9-A Binaryzation
10 Binaryzation :
By known that Data Mining process may require many algorithms to deal with
multiple tasks and also each of these algorithms require data to be presented
in a particular format. When the data is not in the desired format, then it
needs to be transformed by applying some conversion process. Binaryzation
is such kind of it.
Usually, best Binaryzation approach is the one that produces the best result
for the data mining algorithm that will be used to analyse the data.
It can be defined as, “the process of converting both continuous and discrete
attributes into binary attributes”.
This conversion process uses 3 steps, such as:
- Assigning numerical value
- Finding number of binary attributes required
- Conversion into binary
Ex.:
Suppose our algorithm uses a categorical attribute with ‘m’ number of
values.
Step 1 : Assigning numerical value
If it is nominal type, then numbers assigned would be between [0, m – 1 ].
If it is ordinal type as it has order, then first assignment has to follow the
order.
Step 2 : Finding number of Binary attributes required
Suppose n is the number of binary attributes required, then it can be
calculated using the formula as:
Attribute Integer X1 X2 X3
values value
Awful 0 0 0 0
Poor 1 0 0 1
Ok 2 0 1 0
Good 3 0 1 1
great 4 1 0 0
Attribute Integer X1 X2 X3 X4 X5
values value
Awful 0 1 0 0 0 0
Poor 1 0 1 0 0 0
Ok 2 0 0 1 0 0
Good 3 0 0 0 1 0
Great 4 0 0 0 0 1