Professional Documents
Culture Documents
The variation of precision, recall, TNR, accuracy Approach 1. Our approach using modified Jenson-
with the various thresholds, namely 𝛿1, 𝛿2, фUT , фLT Shanon distance and modified Jaccard index.
that were defined in section 3 is shown in the
Approach 2. Using unmodified Jaccard index with
following figures:
Jenson-Shanon distance.
Fig 6(c) shows the variation of Precision, recall, TNR,
Approach 3. Using Euclidean distance with
accuracy with 𝛿1. It can be observed from the graph
unmodified Jaccard index.
that Precision, TNR and Accuracy increase with the
S.No. PERFORMANCE FORMULA
MEASURE
1 TNR TN
TN + FP
2 Precision TP
TP + FP
3 Accuracy TP + TN
TN + FP + TP + FN
4 F1 Score 2 ∗ Precision ∗ Recall
Precision + Recall
5 PPV TP
TP + FP
6 ACC TP + TN
TP + TN + FP + FN
7 NPV TN
TN + FN
8 FDR FP
FP + TP
9 FOR FN
TN + FN
10 BM TPR + TNR – 1
11 FPR FP
FP + TN
12 FNR FN
FN + TP
13 MK PPV + NPV – 1
14 MCC TP × TN − FP × FN
√(TP + FP)(TP + FN)(TN + FP)(TN + FN)
Table 1 ( Performance Measures)
In table 2 we have compared the three approaches If we compare Approach 1 with Approach 3, we
with each other. observe that:-
Sensitivity Approach 1 Approach 2 Approach 3 TNR and precision of Approach 1 is a lot better
Measures
than the TNR and precision for Approach 3
PPV 0.96 0.73 0.74
.It has also got better accuracy as compared to
TPR 0.81 0.95 1.00
Approach 3.
ACC 0.89 0.80 0.83
Approach 1 also has a much lower FPR and FDR
F1 Score 0.88 0.83 0.85 score as compared to Approach 3.
NPV 0.83 0.93 1.00 Amongst other performance measures, MK
FDR 0.04 0.27 0.26 and MCC values of Approach 1 are also slightly
FOR 0.17 0.07 0.00 better than that of Approach 3.
BM 0.77 0.60 0.65 Approach 3, on the other hand has got better
FPR 0.03 0.34 0.34 TPR, NPV and FOR measures as compared to
TNR 0.96 0.65 0.65 Approach 1. In fact, it has the best values for
FNR 0.19 0.05 0.00 these parameters in the entire table.
MK 0.79 0.66 0.74 Also, both Approach 1 and Approach 3 have got
MCC 0.78 0.63 0.70 somewhat similar F1 score.
Table 2 (Comparison of our approaches) In the measures like TNR and precision,where
Approach 1 has one of the best score in the entire
From the table, following observations can be
table, Approach 3 performs rather poorly. Also,
made:-
Approach 3 lags far behind in measures like FPR and
If we compare Approach 1 with Approach 2, we can FDR score. On the other hand, in the measures in
observe that:- which Approach 3 performs better than Approach
1, Approach 1 is also performing quiet nicely. For
TNR and FPR of Approach 1 is a lot better than example, in case of NPV, both the approaches have
the TNR and precision for Approach 2. good scores, with Approach 3 performing better.
Approach 1 has also got better accuracy as similar trends are observed in case of all other
compared to Approach 2. measures except FNR, where Approach 3 has is far
Approach 1 has a much lower FPR and FDR superior. Considering all the above scenario, we can
score as compared to Approach 2. say that the overall even though Approach 3 has the
Amongst other performance measures, MK best values for some performance measures, its
and MCC values of approach 1 are also better poor performance in other measures are clearly a
than that of Approach 2. disadvantage due to which Approach 1 is better
Approach 2, on the other hand has got better than Approach 3.
TPR, NPV and FOR measures as compared to
Approach 1.
Both Approach 1 and Approach 2 have got Table 3 shows a comparison of our approaches with
somewhat similar F1 score. various other related works. If we compare our
approach with other related approaches, we
In the measures like FPR and TNR where Approach
observe that:-
1 has good performnce, Approach 2 performs
rather poorly. However, in measures like TPR and In comparison to HU Panda, our approach
NPV, where Approach 2 performs better, Approach works better with respect to all the
1 also has good performance. For example, both performance measures considered for the
Approach 1 and Approach 2 have similar NPV scores purpose of comparison.
with Approach 2 performing slightly better As In comparison to the work of Mostafa et al. our
Approach 1 performs far better than Approach 2 in approach performs better with respect to all
most of the measures, we can conclude that the the performance measures that are considered
overall performance of Approach 1 is better than for comparison.
Approach 2.
Sensitivity Approach Approach Approach HU Panda Hashemi Mostafa Mina Majumdar EliSa UP Rao et
Measures 1 2 3 et al. et al. Sohrabi et al. (2006) Bertino al.(2016)
et al. et al.
PPV 0.96 0.73 0.74 0.88 0.97 0.94 0.93 0.88 0.94 0.61
TPR 0.81 0.95 1.00 0.73 0.71 0.75 0.66 0.70 0.91 0.70
ACC 0.89 0.80 0.83 0.81 0.84 0.85 0.80 0.80 0.93 0.64
F1 Score 0.88 0.83 0.85 0.79 0.82 0.83 0.77 0.78 0.92 0.65
NPV 0.83 0.93 1.00 0.77 0.77 0.79 0.73 0.75 0.91 0.68
FDR 0.04 0.27 0.26 0.12 0.03 0.06 0.07 0.13 0.06 0.39
FOR 0.17 0.07 0.00 0.23 0.23 0.21 0.27 0.25 0.09 0.32
BM 0.77 0.60 0.65 0.63 0.69 0.70 0.60 0.60 0.85 0.35
FPR 0.03 0.34 0.34 0.10 0.02 0.05 0.05 0.10 0.06 0.45
TNR 0.96 0.65 0.65 0.90 0.98 0.95 0.94 0.90 0.94 0.65
FNR 0.19 0.05 0.00 0.28 0.29 0.25 0.35 0.30 0.09 0.30
MK 0.79 0.66 0.74 0.65 0.74 0.73 0.66 0.63 0.85 0.29
MCC 0.78 0.63 0.70 0.63 0.72 0.71 0.63 0.61 0.85 0.29
In comparison to the work of Hashemi et al. measures as well, better TNR and precision
even though our approach scores just a little scores can easily cover up lower recall values.
less in measures like TNR and precision, it 7. Analysis and Conclusion
scores a lot better with respect to rest of the
In this paper we have tried to detect malicious
performance measures.
transactions keeping in mind that certain data
If we consider the work of Mina Sohrabi et al.
elements hold more critical information. We also
our approach performs better with respect to
take into consideration user behaviour pattern in
all the performance measures that are present
this approach. A user regularly behaving as a
in the table.
normal user will be gradually improving his
In comparison to the work of Majumdar et al.
suspicion score. We then analyse the approach
our approach performs better with respect to
w.r.t different parameters by conducting
all the performance measures that we have
experiments. Finally, we conclude that the
considered for the purpose of comparison.
approach works efficiently in determining the
With comparison to the work of UP Rao et al.
nature of a transaction.
our approach performs better in context to all
the measures that are considered in the table
for comparison.
In comparison to the work of Elisa Bertino, our
approach gives better TNR and precision
scores. It also gives comparatively better FDR
and FPR scores. In other measures, except TPR
and recall, both approaches have somewhat
similar score. Since our work is mostly related
to finding Critical Data Items in a dataset,
higher TNR and precision scores are more
desirable as compared to other performance
measures. Since our approach performs quiet
well with respect to other performance