Confusion Matrix, Scoring (Accuracy, Recall, Precision)

Introduction to Confusion Matrix

A confusion matrix is a table that illustrates the performance of a classification model. It compares
the actual outcomes with the model's predictions. For instance, consider a Covid prediction model:

 The model predicts one of two categories: the patient has the disease (Predicted: YES) or does not (Predicted: NO).

 Among 165 tested patients, the classifier predicted that 55 patients don't have the disease,
and 110 do.

 In reality, 60 patients don't have the disease, and 105 do.

Categories of Predictions

 Correct Predictions:

 True Positive (TP): Cases where the model correctly predicted the patient has the
disease.

 True Negative (TN): Cases where the model correctly predicted the patient doesn't
have the disease.

 Incorrect Predictions:

 False Positive (FP) or Type 1 Error: Cases where the model incorrectly predicted the
patient has the disease.

 False Negative (FN) or Type 2 Error: Cases where the model incorrectly predicted the
patient doesn't have the disease.

Categories of Predictions in the Covid Example:

 True Negative (TN): 50 patients correctly predicted as not having the disease.

 True Positive (TP): 100 patients correctly predicted as having the disease.

 False Positive (FP): 10 patients incorrectly predicted as having the disease.

 False Negative (FN): 5 patients incorrectly predicted as not having the disease.
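In practice these four counts are usually read off a confusion matrix computed by a library rather than tallied by hand. Below is a minimal sketch assuming scikit-learn and two hypothetical 0/1 label lists (1 = has the disease); the example labels are illustrative, not the 165-patient data above.

```python
# Minimal sketch: extracting TP, TN, FP, FN with scikit-learn.
# y_actual / y_predicted are hypothetical, illustrative label lists.
from sklearn.metrics import confusion_matrix

y_actual    = [1, 1, 0, 0, 1, 0, 1, 1]   # ground truth (1 = has the disease)
y_predicted = [1, 0, 0, 1, 1, 0, 1, 1]   # model output

# For binary labels, ravel() returns the cells in the order TN, FP, FN, TP.
tn, fp, fn, tp = confusion_matrix(y_actual, y_predicted).ravel()
print(f"TP={tp}, TN={tn}, FP={fp}, FN={fn}")
```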

Metrics Derived from the Confusion Matrix:

1. Prevalence:

 How often the actual condition appears in our sample.

 Actual YES / n = 105 / 165 = 64%

2. Accuracy:

 How often the classifier makes correct predictions.

 (TP + TN) / n = (100 + 50) / 165 = 91%

3. Precision:

 When the model predicts YES, how often is the actual outcome YES?

 TP / (TP + FP) = 100 / (100 + 10) = 91%

4. Recall (Sensitivity, True Positive Rate):

 Of all patients who actually have the disease (YES), how many were correctly
predicted by the model?

 TP / (TP + FN) = 100 / (100 + 5) = 95%

5. Specificity (Selectivity, True Negative Rate):

 Of all patients who actually don't have the disease (NO), how many were correctly
predicted by the model?

 TN / (TN + FP) = 50 / (50 + 10) = 83%

6. False Positive Rate (Fall-out):

 Among the patients who actually don't have the disease (NO), how often did the
model predict they have the disease (YES)?

 FP / (FP + TN) = 10 / (10 + 50) = 17%

7. False Negative Rate (Miss Rate):

 Among the patients who actually have the disease (YES), how often did the model
predict they don't have the disease (NO)?

 FN / (FN + TP) = 5 / (5 + 100) = 5%
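To make the arithmetic above concrete, here is a short plain-Python sketch that recomputes all seven metrics from the Covid example's counts (TP = 100, TN = 50, FP = 10, FN = 5); no library is needed.

```python
# Recompute the seven metrics above from the Covid example's counts.
tp, tn, fp, fn = 100, 50, 10, 5
n = tp + tn + fp + fn                      # 165 tested patients

prevalence  = (tp + fn) / n                # 105 / 165 ≈ 64%
accuracy    = (tp + tn) / n                # 150 / 165 ≈ 91%
precision   = tp / (tp + fp)               # 100 / 110 ≈ 91%
recall      = tp / (tp + fn)               # 100 / 105 ≈ 95%  (sensitivity, TPR)
specificity = tn / (tn + fp)               #  50 /  60 ≈ 83%  (TNR)
fpr         = fp / (fp + tn)               #  10 /  60 ≈ 17%  (fall-out)
fnr         = fn / (fn + tp)               #   5 / 105 ≈  5%  (miss rate)

for name, value in [("Prevalence", prevalence), ("Accuracy", accuracy),
                    ("Precision", precision), ("Recall", recall),
                    ("Specificity", specificity), ("FPR", fpr), ("FNR", fnr)]:
    print(f"{name:<12} {value:.0%}")
```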


Figure sources: https://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/#:~:text=A%20confusion%20matrix%20is%20a,the%20true%20values%20are%20known

Churn example in Orange


Dataset is here: https://www.kaggle.com/datasets/blastchar/telco-customer-churn

 Results are derived from a decision tree.

 Total observations: n=22,490.
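For readers who want to reproduce this kind of result outside Orange, the sketch below trains a decision tree with scikit-learn on the same Kaggle CSV. It assumes the file WA_Fn-UseC_-Telco-Customer-Churn.csv with its "Churn" (Yes/No), "customerID", and "TotalCharges" columns; preprocessing is deliberately simplified, and the exact counts will differ from the Orange workflow's.

```python
# Rough sketch, not the original Orange workflow: a decision tree on the Telco CSV.
# Assumes the Kaggle file WA_Fn-UseC_-Telco-Customer-Churn.csv is in the working directory.
import pandas as pd
from sklearn.model_selection import cross_val_predict
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import confusion_matrix

df = pd.read_csv("WA_Fn-UseC_-Telco-Customer-Churn.csv")
df["TotalCharges"] = pd.to_numeric(df["TotalCharges"], errors="coerce").fillna(0)

y = (df["Churn"] == "Yes").astype(int)                        # 1 = churned
X = pd.get_dummies(df.drop(columns=["customerID", "Churn"]))  # one-hot encode categoricals

# Cross-validated predictions, then the raw confusion matrix counts.
tree = DecisionTreeClassifier(max_depth=5, random_state=0)
y_pred = cross_val_predict(tree, X, y, cv=10)
print(confusion_matrix(y, y_pred))   # rows: actual NO/YES, columns: predicted NO/YES
```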

Categories of Predictions:

1. True Negative (TN): 14,967 observations - the model correctly predicted the customer would
stay.

2. True Positive (TP): 2,971 observations - the model correctly predicted the customer would
churn.

3. False Positive (FP): 1,759 observations - the model incorrectly predicted the customer would
churn.

4. False Negative (FN): 2,793 observations - the model incorrectly predicted the customer
would stay.

Scoring:
Accuracy: Reflects the proportion of correct predictions made by the classifier.

 (TP + TN) / n = (14,967 + 2,971) / 22,490 = 79.8%

Precision: When the model predicts a customer will churn (YES), how often is this prediction correct?

 TP / (TP + FP) = 2,971 / (2,971 + 1,759) = 62.8%

Now switch the option in the top right corner of the Confusion Matrix widget to "Proportion of actual".
Recall (Sensitivity, True Positive Rate): Of all customers who actually churned (YES), how many were
correctly predicted by the model?

 TP / (TP + FN) = 2,971 / (2,971 + 2,793) = 51.5%

Specificity (Selectivity, True Negative Rate): Of all customers who actually stayed (NO), how many
were correctly predicted by the model?

 TN / (TN + FP) = 14,967 / (14,967 + 1,759) = 89.5%

False Positive Rate (Fall-out):

 Among the customers who actually stayed (NO), how often did the model predict they would
churn (YES)?

 FP / (FP + TN) = 1,759 / (1,759 + 14,967) = 10.5%. In 10.5% of cases where a customer
stays, the model predicts the customer will churn.
False Negative Rate (Miss Rate):

 Among the customers who actually churned (YES), how often did the model predict they
would stay (NO)?

 FN / (FN + TP) = 2,793 / (2,793 + 2,971) = 48.5%. In 48.5% of cases where a customer
churns, the model predicts the customer will stay.
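As a quick sanity check of these hand calculations, one can rebuild synthetic label arrays from the four counts and let scikit-learn compute the same scores (a sketch assuming NumPy and scikit-learn; the arrays only reproduce the counts, not the original dataset):

```python
# Rebuild synthetic label arrays from the churn counts and verify the scores.
# The arrays reproduce only the four counts, not the original Telco data.
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score

tn, tp, fp, fn = 14_967, 2_971, 1_759, 2_793

# One (actual, predicted) pair per cell, repeated as often as the cell occurred.
y_true = np.repeat([0, 1, 0, 1], [tn, tp, fp, fn])
y_pred = np.repeat([0, 1, 1, 0], [tn, tp, fp, fn])

print(f"Accuracy:  {accuracy_score(y_true, y_pred):.1%}")   # ≈ 79.8%
print(f"Precision: {precision_score(y_true, y_pred):.1%}")  # ≈ 62.8%
print(f"Recall:    {recall_score(y_true, y_pred):.1%}")     # ≈ 51.5%
```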

Observations:

 The "Proportion of Actual" mode displays percentages as a proportion of the actual cases.
This means:

 The percentage (Predicted_No, Actual_No) of the total Actual_No is Specificity.

 The percentage (Predicted_Yes, Actual_No) of the total Actual_No is the False


Positive Rate.

 The percentage (Predicted_No, Actual_Yes) of the total Actual_Yes is the False


Negative Rate.

 The percentage (Predicted_Yes, Actual_Yes) of the total Actual_Yes is Recall.

 The "Test & Score" component verifies these results. Ensure "Target Class: Yes" is selected.
You will see that the Accuracy, Precision, and Recall match the manually calculated values
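In scikit-learn terms, the "Proportion of actual" view corresponds to normalizing the confusion matrix over its rows (the actual classes). Continuing the sketch above, and assuming its y_true / y_pred arrays:

```python
# Row-normalized confusion matrix = "Proportion of actual" view.
# Assumes the y_true / y_pred arrays built in the previous sketch.
from sklearn.metrics import confusion_matrix

cm = confusion_matrix(y_true, y_pred, normalize="true")
# Row "actual NO":  [specificity, false positive rate] ≈ [0.895, 0.105]
# Row "actual YES": [miss rate,   recall]              ≈ [0.485, 0.515]
print(cm.round(3))
```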

Differences between Recall and Precision:


 Recall: Deteriorates with an increasing number of false negatives. A false negative
means the customer churned, but the model predicted they would stay.
 Precision: Deteriorates with an increasing number of false positives. A false positive
means the customer stays, but the model predicted they would churn.
From the "Test & Score" analysis, we observe that Precision is higher than Recall. This
suggests our model suffers more from the count of false negatives than false positives. This
is confirmed by the earlier table showing a false negative rate of 48.5% and a false positive
rate of only 10.5%. Hence, Precision is higher.
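A small numeric sketch of this trade-off, reusing the churn counts: adding false negatives drags recall down but leaves precision untouched, and adding false positives does the opposite.

```python
# Illustrative only: how FN and FP pull recall and precision apart.
def precision(tp, fp): return tp / (tp + fp)
def recall(tp, fn):    return tp / (tp + fn)

tp, fp, fn = 2_971, 1_759, 2_793
print(precision(tp, fp), recall(tp, fn))        # baseline: ≈ 0.628, ≈ 0.515
print(precision(tp, fp), recall(tp, fn * 2))    # more FNs: recall drops, precision unchanged
print(precision(tp, fp * 2), recall(tp, fn))    # more FPs: precision drops, recall unchanged
```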
